Extract text from PDF using Aspose.PDF for .NET - numbers disappear

Hi. When I upload this file and extract its text, some numbers disappear.

I have attached the code example and the file.

Gabriel_Shehtman_-_product_Manager (1).pdf (37.5 KB)

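    ' Requires "Imports Aspose.Pdf.Text" at the top of the file (for TextAbsorber).
    ' Load the uploaded PDF from its input stream.
    Dim docHtml As New Aspose.Pdf.Document(f.InputStream)
    Dim textAbsorber As New TextAbsorber()
    ' Run the absorber over every page to collect the text.
    docHtml.Pages.Accept(textAbsorber)
    Dim extractedText As String = textAbsorber.Text
    Response.Write(extractedText)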

thank you :slight_smile:

@eranlipi

Would you please elaborate on the issue and tell us which numbers are disappearing from the extracted text? We were unable to notice any issue in our environment.

@asad.ali

Under the line "Product Manager, EL - AL", the PDF file shows this line: “Dec 2018 · Present”

When I run the code, I get “Dec 201— · Present”

The number 2018 became 201–
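
To pin down exactly which character was replaced, a small comparison like the one below may help. It is only a sketch: `FirstDifference` is a hypothetical helper, not part of Aspose.PDF, and the two sample strings are copied from this post. In practice `actual` would be the matching line taken from `textAbsorber.Text`.

```vbnet
Imports System
Imports Microsoft.VisualBasic

Module ExtractionCheck
    ' Hypothetical helper (not part of Aspose.PDF): returns the index of the
    ' first position where the two strings differ, or -1 if they are identical.
    Function FirstDifference(expected As String, actual As String) As Integer
        Dim common As Integer = Math.Min(expected.Length, actual.Length)
        For i As Integer = 0 To common - 1
            If expected(i) <> actual(i) Then Return i
        Next
        ' No mismatch in the shared prefix: either equal, or one string is longer.
        If expected.Length = actual.Length Then Return -1
        Return common
    End Function

    Sub Main()
        ' Sample strings copied from this post; in practice "actual" would be
        ' the relevant line taken from textAbsorber.Text.
        Dim expected As String = "Dec 2018 · Present"
        Dim actual As String = "Dec 201— · Present"

        Dim pos As Integer = FirstDifference(expected, actual)
        If pos < 0 Then
            Console.WriteLine("Strings match.")
        ElseIf pos >= expected.Length OrElse pos >= actual.Length Then
            Console.WriteLine("Strings differ in length from index " & pos)
        Else
            ' Print the code points so the substituted character is unambiguous.
            Console.WriteLine("Index " & pos & ": expected '" & expected(pos) &
                              "' (U+" & AscW(expected(pos)).ToString("X4") &
                              "), got U+" & AscW(actual(pos)).ToString("X4"))
        End If
    End Sub
End Module
```

Printing the code point rather than the character itself makes it easier to report which glyph the extraction produced in place of the digit.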

thank you :slight_smile:

@eranlipi

Thanks for sharing details.

We were able to reproduce the issue in our environment with Aspose.PDF for .NET 20.4 and have logged it as PDFNET-48101 in our issue tracking system. We will look into it further and keep you posted on the status of the fix. Please be patient and give us some time.

We are sorry for the inconvenience.
