Symbol not getting rendered when we convert DOCX to PDF

Hello Team,
We are trying to convert Docx to PDF,
We have HOP symbol as text in Docx which is not getting converted to PDF
When we convert Word to PDF by using Microsoft Word export functionality, it is converting that symbol but through Aspose Word to PDF online converter, conversion is not happening
below is the text
“Zeitschrift FR Anorganische Und Allgemeine Chemie”
I have attached the input Word document, Converted PDF document through Microsoft Word Export Functionality, and Converted PDF document through Aspose online Word to PDF converter.
InputOutputDocuments.zip (662.2 KB)

I don’t know much about it. But I would like to learn more. I hope someone will help here. Thank you so much!

@pthube
We have opened the following new ticket(s) in our internal issue tracking system and will deliver their fixes according to the terms mentioned in Free Support Policies.

Issue ID(s): WORDSNET-27721

You can obtain Paid Support Services if you need support on a priority basis, along with the direct access to our Paid Support management team.

@pthube We have completed analyzing the issue. The problematic character, as you mentioned, is a U+0081 HOP control character. As it is a control character, it is not actually expected to be present in text and rendered. MS Word has inconsistent behavior regarding it. In UI and XPS export MS Word displays it as .notdef glyph, in PDF export MS Word replaces it with bullet, in printing MS Word replaces it with space. So Aspose.Words rendering as a .notdef glyph seems to be acceptable for us. We are going to close this issue as “Won’t Fix”.

As a workaround you can replace HOP control character with some proper character (like bullet) in Aspose.Words DOM.

we are getting “\u0081” and “\u0082” this characters in the Text. and we are sending it to aspose.words and getting this result from aspose.words ( “Zeitschrift FR Anorganische Und Allgemeine Chemie”)
these are Unicode characters Link : “Unicode Character '<control>' (U+0081)”.
we may get some more characters like this in the user request.
how do we handle this?
or do we need to add any font?

@pthube

Could you please elaborate the problem? If possible, please attach your input document and code that will allow us to reproduce the problem. Do you mean these characters are not rendered properly in PDF?

we are using aspose .word to generate a Word document then word document we convert in to PDF, while generating Word we have Unicode characters like “\u0081” and “\u0082”, and we pass it to aspose. but in the generated document we are getting below output.
“Zeitschrift FR Anorganische Und Allgemeine Chemie” (Square) here we passed - “\u0082” .
expected Output - “Zeitschrift F,R Anorganische Und Allgemeine Chemie”
is there any font required for those Unicode characters? (“\u0081” and “\u0082”)
we already attached the Input document in attachment that you can consider as output here in this scenario

for your second question - Docx document also not getting correct so when we convert it to PDF we are getting the same result as Docx

@pthube I managed to reproduce the problem using the following code:

Document doc = new Document();
DocumentBuilder builder = new DocumentBuilder(doc);
builder.Write("inserted as boxes: \x0081 and \x0082");
doc.Save(@"C:\Temp\out.docx");
doc.Save(@"C:\Temp\out.pdf");

We have opened the following new ticket(s) in our internal issue tracking system and will deliver their fixes according to the terms mentioned in Free Support Policies.

Issue ID(s): WORDSNET-27768

You can obtain Paid Support Services if you need support on a priority basis, along with the direct access to our Paid Support management team.

after checking we found that the behavior is for the below Unicode as well
from \u0080 to \u0099
these symbols we are not getting on both docx and pdf

@pthube Thank you for additional information. I have added it to the issue description. We will keep you informed and let you know once it is resolved or we have more information for you.

The issues you have found earlier (filed as WORDSNET-27721) have been fixed in this Aspose.Words for .NET 25.2 update also available on NuGet.

Hello Team,

Still facing the same issue.
I tried with below document
InputDocSymbolIssue.docx (136.1 KB)
on this link
https://products.aspose.app/words/conversion/word-to-pdf
you can search for this text “Zeitschrift FR Anorganische Und Allgemeine Chemie”

@pthube As it was mentioned above the WORDSNET-27721 issue has been closed as Won't Fix. So Aspose.Words behavior was not changed in this case. The WORDSNET-27768 is not resolved yet.