PDF Embedded text data overwrites on the image


#1

hi,

we have converted a pdf(OCR data embedded) to tiff image. after converting it to tiff the OCR character are printed slightly above the text with different font size. really those are junk characters which produced by OCR

This issue is occurring on random cases.

Here is the code snippet.

using (Aspose.Pdf.Document pdfDocument = pdfDoc)
{
Resolution resolution = new Resolution(300);

TiffSettings tiffSettings = new TiffSettings();
tiffSettings.Compression = Aspose.Pdf.Devices.CompressionType.CCITT4;
tiffSettings.Depth = ColorDepth.Default;
tiffSettings.SkipBlankPages = false;
TiffDevice tiffDevice = new TiffDevice(resolution, tiffSettings);
tiffDevice.Process(“d:\test\input.pdf”, “d:\test\output.tif”);
}


#2

@netadmin.edmamericas

Thank you for contacting support.

Would you please share source and generated file with us by zipping them so that we may try to reproduce and investigate it in our environment. Before sharing requested data, please ensure using Aspose.PDF for .NET 18.11.