Hi, I am looking at various Java libraries that can extract text from .doc and .docx files. Is Aspose capable of that? If so, can you point me to a suitable example?
Thanks for your inquiry. Yes, you can extract content from DOC and DOCX files. Please see the following documentation links for reference.
Please note that Aspose.Words differs considerably from the Microsoft Word object model: it represents the document as a tree of objects, much like an XML DOM tree. When you load a Word document into Aspose.Words, it builds this DOM, and all document elements and formatting are loaded into memory. Please read the following articles for more information on the DOM:
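To make the tree idea concrete, here is a minimal sketch that uses only the JDK's standard XML DOM, not the Aspose.Words API. The element names (document, section, paragraph, run) and the class name DomTextSketch are assumptions made up for illustration. It only shows the general pattern of walking a document tree and collecting the text held in its leaf nodes; the actual Aspose.Words classes and calls are described in the documentation referenced above.

```java
import java.io.StringReader;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Node;
import org.xml.sax.InputSource;

// Illustrative analogy only: a plain XML DOM standing in for a document tree.
// None of these names come from Aspose.Words.
public class DomTextSketch {

    // Parse the XML into a DOM tree and return all text it contains,
    // with a newline appended after each <paragraph> element.
    static String extractText(String xml) throws Exception {
        Document doc = DocumentBuilderFactory.newInstance()
                .newDocumentBuilder()
                .parse(new InputSource(new StringReader(xml)));
        StringBuilder out = new StringBuilder();
        collect(doc.getDocumentElement(), out);
        return out.toString();
    }

    // Depth-first walk: text nodes contribute their value,
    // every other node just recurses into its children.
    private static void collect(Node node, StringBuilder out) {
        if (node.getNodeType() == Node.TEXT_NODE) {
            out.append(node.getNodeValue());
            return;
        }
        for (Node child = node.getFirstChild(); child != null; child = child.getNextSibling()) {
            collect(child, out);
        }
        if ("paragraph".equals(node.getNodeName())) {
            out.append('\n');
        }
    }

    public static void main(String[] args) throws Exception {
        // Hypothetical, hand-written structure; a real Word file is not plain XML like this.
        String xml = "<document><section>"
                + "<paragraph><run>Hello, </run><run>world.</run></paragraph>"
                + "<paragraph><run>Second paragraph.</run></paragraph>"
                + "</section></document>";
        System.out.print(extractText(xml));
    }
}
```

Because the whole tree is in memory, extracting text is just a traversal. The same walk could be limited to a single paragraph or section by starting from that node instead of the root.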
Please let us know if you have any more queries.