Hi there,
We were intending to replace POI with Aspose Words in our document indexing routines. The problem is that while extracting text and indexing a large number of Word and Excel documents, Aspose Words and Aspose Excel occasionally freeze for 1 minute. It seems Aspose Words is somehow contacting the network. Is it performing a license check or fetching a resource? Furthermore, Aspose Excel is practically unusable for text extraction: we are seeing deadlocks all over the place.
This is what we are using to extract text:
    protected void write(File file, int saveFormat) throws IOException {
        InputStream is = null;
        try {
            is = data.getInputStream();
            Document doc = new Document(is);
            if (saveFormat == SaveFormat.HTML) {
                // Embed images inline so the HTML output is a single file.
                HtmlSaveOptions options = new HtmlSaveOptions(saveFormat);
                options.setExportImagesAsBase64(true);
                doc.save(file.getPath(), options);
            } else {
                doc.save(file.getPath(), saveFormat);
            }
        } catch (Exception e) {
            // Keep the original exception as the cause so the stack trace is not lost.
            throw new IOException("failed to extract text from office file: {" + data.getName() + "}" + e.getMessage(), e);
        } finally {
            StreamUtil.closeWithLogging(is);
        }
    }
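To narrow down where the stall happens, something along these lines might help. It is only a minimal sketch: it runs a task on a worker thread with a time limit and, if the limit is exceeded, prints the worker's stack so the blocking call (network or otherwise) is visible. `TimedExtraction` and `runWithTimeout` are hypothetical names for illustration, not part of Aspose or of our code base.

```java
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.TimeoutException;

public class TimedExtraction {

    /**
     * Hypothetical helper: runs the task on a worker thread and returns true
     * if it finished within timeoutMillis. On timeout it prints the worker's
     * current stack to stderr, interrupts the task, and returns false.
     */
    public static boolean runWithTimeout(Callable<Void> task, long timeoutMillis) throws Exception {
        final Thread[] worker = new Thread[1];
        ExecutorService executor = Executors.newSingleThreadExecutor(r -> {
            Thread t = new Thread(r, "extraction-worker");
            t.setDaemon(true); // do not keep the JVM alive if the task never returns
            worker[0] = t;
            return t;
        });
        try {
            Future<Void> future = executor.submit(task);
            try {
                future.get(timeoutMillis, TimeUnit.MILLISECONDS);
                return true;
            } catch (TimeoutException e) {
                // Show where the worker is currently blocked.
                Thread t = worker[0];
                if (t != null) {
                    System.err.println("Task exceeded " + timeoutMillis + " ms; worker stack:");
                    for (StackTraceElement frame : t.getStackTrace()) {
                        System.err.println("    at " + frame);
                    }
                }
                future.cancel(true); // interrupt the worker
                return false;
            } catch (ExecutionException e) {
                // Rethrow the task's own failure where possible.
                Throwable cause = e.getCause();
                if (cause instanceof Exception) {
                    throw (Exception) cause;
                }
                throw e;
            }
        } finally {
            executor.shutdownNow();
        }
    }

    public static void main(String[] args) throws Exception {
        // Stand-in tasks; a real call would wrap the write(...) method above.
        boolean fast = runWithTimeout(() -> null, 1000);
        boolean slow = runWithTimeout(() -> { Thread.sleep(5000); return null; }, 100);
        System.out.println("fast finished: " + fast + ", slow finished: " + slow);
    }
}
```

In our case the task would be something like `() -> { write(file, saveFormat); return null; }`. Note that interrupting the worker only helps if the blocked call responds to interruption; the stack dump is the useful part for diagnosis.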
Are there any other options we must supply to stop Aspose from contacting the network?
Also, may I ask which versions of Excel and Word the Aspose libraries support?
Please advise.
Jamie
Hi Jamie,
Hi