We have implemented PDF-to-Word document conversion using the Aspose library. Initially, during testing, the conversion worked without any delays or timeout issues. However, we are now observing failures during the conversion process.
To resolve this, we attempted the following approaches, but the issue still persists:
-
Upgraded the package to the latest version.
-
Adjusted conversion parameters (e.g., image resolution, format) based on file size.
-
Implemented page-wise PDF-to-Word conversion followed by merging into a single document.
please help us to resolve this issue
@MadanTest
Can you please share your sample document and code to reproduce the issue? We will further proceed to assist you accordingly.
6mb (1).pdf (6.0 MB)
please check the sample document which is failing
6mb (1).pdf (6.04 MB)
@MadanTest
I’m forwarding your request to the forum section on the Aspose PDF library.
You can also use a cloud-based solution to convert documents from PDF to Word, an on-premise solution is also available. Client SDKs are available for various languages, including CSharp.
var pdfFilename = "6mb.pdf";
var docFilename = "output.docx";
var configuration = new Configuration(client_secret, client_id);
var api = new PdfApi(configuration);
using (var file = File.OpenRead(pdfFilename))
{
api.UploadFile(pdfFilename, file);
}
api.PutPdfInStorageToDoc(pdfFilename, docFilename, format: DocFormat.DocX.ToString());
@MadanTest
We have tested the scenario in our environment using 25.11 version of the API and noticed that the program kept running while consuming up to 2.3G of memory. Is it the same issue that you are facing right now?
Document pdfDocument = new Document(dataDir + "6mb (1).pdf");
DocSaveOptions saveOptions = new DocSaveOptions();
saveOptions.Format = DocSaveOptions.DocFormat.DocX;
saveOptions.Mode = DocSaveOptions.RecognitionMode.Flow;
saveOptions.RelativeHorizontalProximity = 2.5f;
saveOptions.RecognizeBullets = true;
pdfDocument.Save(dataDir + $"6mb (1).docx", saveOptions);
yes same issue , how we can reduce the memory issues or slowness while converting the document from pdf to word