Converting PDF to Office with OCR, and PDF to Markdown Format

Hi,

I want to know whether your product support the converting from PDF to Office(word, excel, ppt) with OCR which help to convert uneditable pdf to editable docx, etc, and especially in Chinese charaters.

Another question is whether your product could convert pdf to markdown.

Thank you very much.

@magicsword,
Thank you for contacting support.

Aspose.Slides APIs support the Chinese language and allow you to import PDF documents to PowerPoint presentations.
More details: Import PowerPoint from PDF or HTML|Aspose.Slides Documentation

As for Word, Excel, and PDF documents, my colleagues from Aspose.Words, Aspose.Cells, and Aspose.PDF teams will answer you shortly.

@alexey.noskov, @amjad.sahi, @asad.ali FYI

@magicsword You can use Aspose.Words for .NET and for Python to convert PDF to Word. Please see our documentation for more information:
https://docs.aspose.com/words/net/convert-pdf-to-other-document-formats/

@magicsword,

1). You may use Aspose.PDF to convert PDF to DOCX, XLSX and PPT, see the documents for your reference.
https://docs.aspose.com/pdf/net/convert-pdf-to-word/
https://products.aspose.com/pdf/net/conversion/pdf-to-docx/

https://docs.aspose.com/pdf/net/convert-pdf-to-excel/
https://products.aspose.com/pdf/net/conversion/pdf-to-xlsx/

https://docs.aspose.com/pdf/net/convert-pdf-to-powerpoint/
https://products.aspose.com/pdf/net/conversion/pdf-to-pptx/

2). For converting PDF to MD (markdown), you may try to use Aspose.Words for the task.

Hello,

I have attached a non-searchable PDF document. I need to convert this PDF into a Word document, using OCR during the conversion process to generate a searchable Word document, not just displaying images in the Word file. I hope the converted text can be as accurate as possible. Could your product help me accomplish this task? If so, could I perform a simple test? This feature is crucial for our product.

Thank you for your time!

Alexey Noskov via Free Support Forum - aspose.comforum@aspose.com 在 2024年5月28日 周二 17:37 写道:

| alexey.noskov
May 28 |

  • | - |

@magicsword You can use Aspose.Words for .NET and for Python to convert PDF to Word. Please see our documentation for more information:
https://docs.aspose.com/words/net/convert-pdf-to-other-document-formats/

焊缝超声探伤检测记录.pdf (87.5 KB)

@magicsword Recognition of images to text is out of Aspose.Words scope. I think you should consider using Aspose.OCR.