Extract text from Excel spreadsheets via Aspose.Cells for Java

Hi,

Does Aspose.Cells API provides any text extraction facility like Apachi POI? Apache POI - Text Extraction

Also, if a worksheet has a word, excel, powerpoint or visio document embedded in it, can we extract text from those embedded objects as well?

@kashif.iqbal,
We are working on your requirement and will provide our feedback soon.

@kashif.iqbal,

Thanks for your query.

Well, you can save the workbook to .txt format, then try using the APIs like e.g File.ReadAllText() from the file for your needs.

Anyways, we have logged the following two tickets for investigations for your requirements:
CELLSJAVA-42963 - Extract text from Excel spreadsheets via Aspose.Cells
CELLSJAVA-42964- Extract text from embedded objects in Excel spreadsheets via Aspose.Cells APIs

Once we have an update on any of the features, we will let you know.

1 Like

@kashif.iqbal,
We have investigated CELLSJAVA-42964 a bit more and would like to update you that Aspose.Cells does not support parsing embedded objects, please use Aspose.Words and other products of Aspose to parse embedded objects.

1 Like

@kashif.iqbal,

Regarding issue “CELLSJAVA-42963”, we have no plan to support it. If you want to get plain text, please save the file as .csv or txt format, then get the text from .csv or txt file.