Loosing text when converting to Word

Hi together,

when converting the attached pdf to a doc or docx document, some content will be lost.

8580-8689-[06.11.18 15-33-37-413].PDF (146.1 KB)

Code:
Document pdfDocument = new Document(pathToPDF); pdfDocument.save(target, SaveFormat.Doc);

Now look into the PDF an search for the text maris. You will find two matches (marisstart an marisende). As you see, the text has a white font color. When converting the pdf to a doc/docx you won’t find the text maris anymore.

I have tested this with more than 300 documents that have the text maris inside and every document lost exactly this two words.

Is there a problem with the color of the text? Can you reproduce it? Is it a bug?

Kind regards,
Jochen

@hsp2000

Thanks for contacting support.

We were able to replicate the issue in our environment and have logged it as PDFJAVA-38128 in our issue tracking system for the sake of correction. We will further look into details of the issue and keep you posted with the status of its correction. Please be patient and spare us little time.

We are sorry for the inconvenience.