DOC file taking too long to process

Hi,
I'm attaching a document that takes too long to convert. We are using Java and the latest version of Aspose.Words, 23.4.
1681285766770-Jessica Morgan_11449637_cv-library.docx.zip (502.8 KB)

@RChilli_Nidhi Your document is actually corrupted, although MS Word can recover it. The .NET version of Aspose.Words throws an exception when loading the document, while the Java version hangs.

We have opened the following new ticket(s) in our internal issue tracking system and will deliver their fixes according to the terms mentioned in Free Support Policies.

Issue ID(s): WORDSNET-25354,WORDSJAVA-2848

You can obtain Paid Support Services if you need support on a priority basis, along with the direct access to our Paid Support management team.

Yes, the document is actually corrupted, but Aspose.Words should return an error instead of hanging.

Okay Thanks

Hi,

The same problem also occurs with a few more files.

One more attached file takes a long time when getting the page count. Please check this one too:

int docPageCount = doc.getPageCount();

1681289933704-638152551735949891.zip (82.5 KB)

@RChilli_Nidhi
We have opened the following new ticket(s) in our internal issue tracking system and will deliver their fixes according to the terms mentioned in Free Support Policies.

Issue ID(s): WORDSJAVA-2849

Hi,

Is there any update on this issue?

We are receiving multiple files with the same problem.
Additionally, is there a way to extract images from only a limited set of pages?

@RChilli_Nidhi The WORDSNET-25354 issue (the exception on the .NET platform) is under development and scheduled to be fixed in the 23.9 (September 2023) version of Aspose.Words. Please note that this is a rough estimate and may shift. We will notify you once a version with the fix is available. The issues WORDSJAVA-2848 (hang on load) and WORDSJAVA-2849 (hang on the updatePageLayout call) are still pending analysis.
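
While the two Java hang issues are open, one generic way to turn a hang into an error in calling code is to run the slow call on a worker thread with a timeout. The sketch below is plain JDK code, not an Aspose.Words feature. The names `TimeoutGuard` and `callWithTimeout` are hypothetical, and the sleeping task only stands in for a call such as loading the document or `doc.getPageCount()`.

```java
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.TimeoutException;

public class TimeoutGuard {

    // Hypothetical helper: runs the task on a worker thread and throws
    // TimeoutException if it does not finish within the given time.
    public static <T> T callWithTimeout(Callable<T> task, long timeout, TimeUnit unit)
            throws Exception {
        ExecutorService executor = Executors.newSingleThreadExecutor(r -> {
            Thread t = new Thread(r, "guarded-call");
            t.setDaemon(true); // a stuck worker must not keep the JVM alive
            return t;
        });
        Future<T> future = executor.submit(task);
        try {
            return future.get(timeout, unit);
        } catch (TimeoutException e) {
            // cancel(true) only interrupts the worker; code that ignores
            // interrupts keeps running in the background.
            future.cancel(true);
            throw e;
        } catch (ExecutionException e) {
            // Rethrow the task's own exception where possible.
            Throwable cause = e.getCause();
            if (cause instanceof Exception) {
                throw (Exception) cause;
            }
            throw e;
        } finally {
            executor.shutdownNow();
        }
    }

    public static void main(String[] args) throws Exception {
        try {
            // Stand-in for a slow library call.
            callWithTimeout(() -> {
                Thread.sleep(5_000);
                return 0;
            }, 200, TimeUnit.MILLISECONDS);
        } catch (TimeoutException e) {
            System.out.println("Call timed out and was reported as an error");
        }
    }
}
```

The caller gets an exception after the timeout, but this does not guarantee that the underlying work stops. If the hung call never reacts to thread interruption, the worker thread keeps consuming CPU, so for batch processing it may be safer to isolate such files in a separate process.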

To extract images from specific document pages, you can use the Document.extractPages() method to get a temporary document containing only the specified pages, and then extract images from it as usual.
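
The approach above could look roughly like the following sketch. It assumes Aspose.Words for Java is on the classpath. The class name, the `extractImages` helper, the `input.docx` path and the output file names are placeholders chosen for illustration.

```java
import com.aspose.words.Document;
import com.aspose.words.FileFormatUtil;
import com.aspose.words.NodeCollection;
import com.aspose.words.NodeType;
import com.aspose.words.Shape;

import java.io.File;

public class ExtractImagesFromPages {

    // Hypothetical helper: saves every image found on the given page range
    // into outDir and returns the number of images saved.
    public static int extractImages(Document doc, int firstPageIndex, int pageCount, String outDir)
            throws Exception {
        // extractPages takes a zero-based page index and a page count and
        // returns a temporary document that holds only those pages.
        Document pages = doc.extractPages(firstPageIndex, pageCount);

        // Collect all Shape nodes, including ones nested in tables or groups.
        NodeCollection shapes = pages.getChildNodes(NodeType.SHAPE, true);
        int saved = 0;
        for (Shape shape : (Iterable<Shape>) shapes) {
            if (!shape.hasImage()) {
                continue; // text boxes and other shapes without image data
            }
            String ext = FileFormatUtil.imageTypeToExtension(shape.getImageData().getImageType());
            shape.getImageData().save(new File(outDir, "image_" + saved + ext).getPath());
            saved++;
        }
        return saved;
    }

    public static void main(String[] args) throws Exception {
        Document doc = new Document("input.docx"); // placeholder path
        // Pages 1 and 2 of the document: index 0, count 2.
        int saved = extractImages(doc, 0, 2, ".");
        System.out.println("Saved " + saved + " image(s)");
    }
}
```

Note that extractPages() needs the page layout to be built, so for documents affected by the layout hang (WORDSJAVA-2849) this call may take just as long as getPageCount().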