When converting a PDF to Word, the text gets scattered and letters disappear from the words. Please refer to the attached video.
I have also attached the PDF for reference. ConvertIssue.zip (4.7 MB)
@tweakingtech Sorry, I cannot reproduce your issue. Could you please attach your MS Word output file?
This is the code that I used:
LoadOptions opt = new LoadOptions()
{
    LoadFormat = LoadFormat.Pdf,
};
Document doc = new Document(@"C:\Temp\input.pdf", opt);
doc.Save(@"C:\Temp\output.docx");
output.docx (404.4 KB)
@tweakingtech Are you using Aspose.Words or Aspose.PDF to convert your PDF document to Word? Could you please share the code you use for the conversion?
I am using Aspose.PDF.
Code:
Document pdfDocument = new Document(input);
Aspose.Pdf.DocSaveOptions docsave = new DocSaveOptions
{
    Format = format,
    CustomProgressHandler = ConversionProgressCallback,
    Mode = DocSaveOptions.RecognitionMode.Flow,
    ExtractOcrSublayerOnly = false
};
// Save the output in Docx format
pdfDocument.Save(output, docsave);
@tweakingtech I will move the topic to the Aspose.PDF forum. My colleagues from the Aspose.PDF team will help you shortly.
The code provided by @eduardo.canal uses Aspose.Words for the conversion, and its output looks correct.
Is there an Aspose.Words DLL for .NET Framework 4.5.2?
@tweakingtech Sure, there is a version of Aspose.Words for .NET 4:
https://www.nuget.org/packages/Aspose.Words
However, loading PDF documents is supported only in the .NET Framework 4.6.1, .NET Standard 2.0, and .NET 6 versions of Aspose.Words.
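If the project can be retargeted to .NET Framework 4.6.1 or later, the Aspose.Words conversion shown earlier in this thread could be wrapped roughly as follows. This is only a sketch: the class name and the aliases are hypothetical, and it assumes `LoadOptions` lives in the `Aspose.Words.Loading` namespace, as it does in recent Aspose.Words releases. The aliases are there to avoid a clash with `Aspose.Pdf.Document` when both libraries are referenced.

```csharp
using System;

// Aliases (hypothetical names) keep Aspose.Words types distinct from
// Aspose.Pdf.Document when both packages are referenced in one project.
// Assumption: LoadOptions is in Aspose.Words.Loading (recent releases).
using WordsDocument = Aspose.Words.Document;
using WordsLoadFormat = Aspose.Words.LoadFormat;
using WordsLoadOptions = Aspose.Words.Loading.LoadOptions;
using WordsSaveFormat = Aspose.Words.SaveFormat;

internal static class PdfToDocxSketch
{
    private static void Main(string[] args)
    {
        // Default paths match the earlier post; pass arguments to override them.
        string inputPdf = args.Length > 0 ? args[0] : @"C:\Temp\input.pdf";
        string outputDocx = args.Length > 1 ? args[1] : @"C:\Temp\output.docx";

        // Tell Aspose.Words explicitly that the input is a PDF.
        var options = new WordsLoadOptions { LoadFormat = WordsLoadFormat.Pdf };

        // Loading a PDF requires the .NET Framework 4.6.1, .NET Standard 2.0,
        // or .NET 6 build of Aspose.Words, as noted above.
        var doc = new WordsDocument(inputPdf, options);

        // Save as DOCX; the format is stated explicitly instead of being
        // inferred from the file extension.
        doc.Save(outputDocx, WordsSaveFormat.Docx);

        Console.WriteLine("Saved " + outputDocx);
    }
}
```

With both packages installed, the existing Aspose.PDF code can stay in the same project and keep using `Aspose.Pdf.Document` by its full name.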
We were able to reproduce the issue using Aspose.PDF for .NET 23.3 in our environment. It has therefore been logged as PDFNET-54078 in our issue tracking system. We will look into the details and keep you posted on the status of the fix. Please be patient and give us some time.
We are sorry for the inconvenience.
Is there any way to use the Aspose.Words API with .NET Framework 4.5.2?
@tweakingtech Sure, simply add a NuGet reference to the Aspose.Words package:
https://www.nuget.org/packages/Aspose.Words