PdfExtractor returns words with spaces and breaks inbetween

rblaettler · August 8, 2007, 4:14am

I have a PDF document that looks fine and also if I copy and paste the text into word it all looks good.

But if I use the PdfExtractor function, most words are split up into individual letters.

I've attached the PDF and the results from the extracts. I only care about the number of words in the file, so any solution is fine that counts somewhat correct.

Thanks

Remy

AdeelTaseer · August 8, 2007, 7:44am

Hi,

Thank you for considering Aspose.

I have logged this issue as PDFKITNET-3547. I will discuss this with the developers and we will let you know as soon as solution is found.

Thanks.

rblaettler · August 9, 2007, 3:54am

Can I track the progress of this somewhere? Or do you post the news in this thread again?

Thanks

Remy

forever · August 9, 2007, 7:40am

Dear Remy,

We will notify you in this thread when we resolved this issue.

GeorgieYuan · August 9, 2007, 9:33pm

Dear Remy,

We hope this bug could be fixed in end of this month.

Best Regards.

rblaettler · September 17, 2007, 2:46pm

I just downloaded the newest version of Pdf.Kit (2.6.0.0) from 8/29 but it looks like the issue has not been fixed yet?

Thanks

Remy

GeorgieYuan · September 18, 2007, 10:58am

Hi,

We have meet some technology problems and delay the publishing of fixing this bug. We hope it can be available in the end of this month now.

Regards.

forever · September 25, 2007, 2:27am

We have published hotfix 2.6.1. Please try it.

rblaettler · October 23, 2007, 2:41pm

Hey Guys,

Thanks a lot, looks much better now! Sorry for the slow response from my side.

Keep up the good work.

Remy