Hi,
As per the subject: what is the charset encoding of the text string returned by Range.getText() when parsing a Word document?
Is it the system default Java charset, or is it UTF-8?
Are there any other methods that can be used to grab the text from a Word document in a specific encoding?
(Something similar to the extractText(java.lang.String encoding) method from the pdfextractor class.)
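
To make the question concrete, here is a minimal sketch of the kind of conversion I am after. It uses only the standard library: the `text` variable is just a stand-in for whatever Range.getText() returns, and the class name `EncodingSketch` and the helper `encode` are hypothetical names made up for this example, not part of any library. As far as I understand, a java.lang.String holds Unicode characters and a byte encoding only comes into play when the string is converted to bytes, so this may be all that is needed, but I would like to confirm that.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

public class EncodingSketch {

    // Hypothetical helper: converts already-extracted text to bytes in the requested charset.
    public static byte[] encode(String text, Charset charset) {
        return text.getBytes(charset);
    }

    public static void main(String[] args) {
        // Stand-in for the String returned by Range.getText(); "caf\u00e9" is "café".
        String text = "caf\u00e9";

        byte[] utf8 = encode(text, StandardCharsets.UTF_8);
        byte[] latin1 = encode(text, StandardCharsets.ISO_8859_1);

        // The same four characters take a different number of bytes per encoding.
        System.out.println(utf8.length);   // prints 5
        System.out.println(latin1.length); // prints 4
    }
}
```

If the extracted String is already correct Unicode, then something like encode(range.getText(), StandardCharsets.UTF_8) would give me the bytes I need; my question is whether that assumption holds.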