Free Support Forum - aspose.com

PdfExtractor returns words with spaces and breaks inbetween

I have a PDF document that looks fine and also if I copy and paste the text into word it all looks good.

But if I use the PdfExtractor function, most words are split up into individual letters.

I've attached the PDF and the results from the extracts. I only care about the number of words in the file, so any solution is fine that counts somewhat correct.

Thanks

Remy

Hi,

Thank you for considering Aspose.

I have logged this issue as PDFKITNET-3547. I will discuss this with the developers and we will let you know as soon as solution is found.

Thanks.

Can I track the progress of this somewhere? Or do you post the news in this thread again?

Thanks

Remy

Dear Remy,

We will notify you in this thread when we resolved this issue.

Dear Remy,

We hope this bug could be fixed in end of this month.

Best Regards.

I just downloaded the newest version of Pdf.Kit (2.6.0.0) from 8/29 but it looks like the issue has not been fixed yet?

Thanks

Remy

Hi,

We have meet some technology problems and delay the publishing of fixing this bug. We hope it can be available in the end of this month now.

Regards.

We have published hotfix 2.6.1. Please try it.

Hey Guys,

Thanks a lot, looks much better now! Sorry for the slow response from my side.

Keep up the good work.

Remy