PDF to DOCX conversions contain unusual font names with 6-character prefixes, e.g. NJNRFM+TimesNewRomanPS-BoldMT and JUORIA+Helvetica
The prefixes in the font names can be seen when viewing or editing the converted document in Word. The Word document content looks fine, but the prefixes are unwanted because they cause issues when the same document is then converted to HTML (using other 3rd party libraries).
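For reference, the prefixes look like PDF font subset tags: six uppercase letters followed by a plus sign, which the PDF format uses to name embedded font subsets. Below is a minimal sketch of stripping them from a font name as a post-processing step before the HTML conversion. It is only an illustration, not part of Aspose.PDF: the helper name `strip_subset_prefix` is hypothetical, and it assumes the font names have already been extracted from the DOCX by some other means. The same regular expression should carry over to .NET unchanged.

```python
import re

# Assumption: a subset tag is exactly six uppercase ASCII letters
# followed by '+', at the very start of the font name.
SUBSET_PREFIX = re.compile(r"^[A-Z]{6}\+")


def strip_subset_prefix(font_name: str) -> str:
    """Return font_name without a leading subset tag, if one is present.

    Hypothetical helper for illustration only; names that do not start
    with a subset tag are returned unchanged.
    """
    return SUBSET_PREFIX.sub("", font_name)


if __name__ == "__main__":
    # The two prefixed names are the ones seen in the converted document.
    for name in ("NJNRFM+TimesNewRomanPS-BoldMT", "JUORIA+Helvetica", "Arial"):
        print(name, "->", strip_subset_prefix(name))
```

This only cleans up the names after the fact; it would still be preferable for the conversion itself to emit the plain font names.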
Steps:
- This issue can be reproduced using the latest version of the aspose.pdf.dll (.NET) and also using the online test converter website: Convert PDF | Online and Free
- This example CV (found online) exhibits the issue, but various other sample PDF documents I have tested show similar issues.
- After conversion, click on various parts of the document and notice that the font has an unusual prefix, e.g. in the sample doc linked above, the line of text "An example of a good CV" has the font NJNRFM+TimesNewRomanPS-BoldMT
An older, similar issue was reportedly fixed.
Any guidance or help is greatly appreciated.