Extremely Slow Conversion from PDF to DocX (Java)

Hello,

Converting a ~200 page document using 1 core and 3gb memory takes more than 5 minutes. We are using the Java version 22.2.

Any recommendations on how to make it run faster? is the .NET or C++ version expected to run faster?

@draftwise

Please try to use the 22.10 version of the API as well as try to increase the Java Heap Size in -xms/-xmx parameters. In case issue still persists, please share your sample source file with us. We will test the scenario in our environment and address it accordingly.

Hi!

Sorry for the delay with providing examples. Below are 2 of many files that are either taking a very long time to convert (above 10 minutes) or are causing OOMEs.

Gaziantep CTA.pdf (2.0 MB)
FinTech Acquisition Corp. IV_20201231_EX-2.1_18930314_4090978.pdf (2.4 MB)

Please advise!

@draftwse

Below tickets have been logged in our issue tracking system for your files:

  • PDFJAVA-42379
  • PDFJAVA-42380

We will further look into details of the ticket and let you know once the tickets are resolved. Please be patient and spare us some time.

We are sorry for the inconvenience.