Below issues I’m experiencing after converting pdf to word:
- Spacing between words are being changed.
- Style for headings and other properties are not being preserved. For example, style of heading from Heading1, Heading2,…,Heading6 becomes normal.
- Some special characters are not being converted properly or missing after conversion.
- Font style is also not preserving.
PFA for more information.
Converted File:
Output (1).docx (10.0 KB)
Original File:
sample (1)-6.pdf (6.7 KB)
@bhavikahirr First of all, please note, Aspose.Words is designed to work with MS Word documents. MS Word documents are flow documents and they have structure very similar to Aspose.Words Document Object Model. On the other hand PDF documents are fixed page format documents . While loading PDF document, Aspose.Words converts Fixed Page Document structure into the Flow Document Object Model. Unfortunately, such conversion does not guaranty 100% fidelity.
- This occurs because the document is converted to flow format and words spacing is handled by MS Word rules.
- There is no information about heading in PDF document. So Aspose.Words simply preserves formatting of the text.
- The problem is logged as WORDSNET-25696.
- The problem is logged as WORDSNET-25697.
The issues you have found earlier (filed as WORDSNET-25697,WORDSNET-25696) have been fixed in this Aspose.Words for .NET 23.11 update also available on NuGet.