Convert PDF to DOC/DOCX - Words in transparent shapes are lost when converted

Hi Aspose team,

We found some words in PDFs are lost when converted to Doc in the case that the words are in ransparent shapes
image.png (231.1 KB)

We tested Foxit to convert pdf to word, it performs well and can recognize the words in the transparent shapes.
We are using Aspoe.pdf v19.12. Here’s our code
using (Document pdf = new Document(“sample.pdf”))
{
// Save using save options
// Create DocSaveOptions object
DocSaveOptions docSaveOptions = new DocSaveOptions();
// Set the recognition mode as Flow
docSaveOptions.Mode = DocSaveOptions.RecognitionMode.Flow;
pdf.Save(“sample.doc”, docSaveOptions);
}

Here’s our test file
addTranslucentShape .pdf (138.2 KB)

@Glority_Developer,

Thanks for contacting support.

We have tested this scenario in our environment and have been able to observe the issue that you have mentioned. We have logged issue with ID PDFNET-47502 in our issue tracking system. We will share with you as soon as logged issue is fixed.

Hi Aspose team,

Did you have any progress on it? Will this issue be fixed within 1-2 months? We are going to have the code freeze date of our product.

@Glority_Developer

Sadly, the earlier logged ticket is still pending for a resolution as there are other pending issues in the queue which were logged previously than this issue. We have recorded your concerns and will surely inform you once we have some updates regarding issue fix or ETA. Please give us some time.

We are sorry for the inconvenience.