Free Support Forum - aspose.com

Text from .doc / .docx file

Hi, I am looking at various Java libraries that can extract text from .doc and .docx files. Is Aspose capable of that? If so, can you point me to the right example?

Thanks


Hi,


Thanks for your inquiry. Yes, you can extract content from DOC/DOCX files. Please see the following documentation links for reference.
http://www.aspose.com/docs/display/wordsjava/Extract+Content+Overview+and+Code
http://www.aspose.com/docs/display/wordsjava/How+to++Extract+Selected+Content+Between+Nodes+in+a+Document

Please note that Aspose.Words differs from the Microsoft Word object model in that it represents the document as a tree of objects, much like an XML DOM tree. When you load a Word document into Aspose.Words, it builds this DOM, and all document elements and formatting are loaded into memory. The Aspose.Words documentation has further articles on the DOM.
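
To make the tree idea above concrete, here is a minimal, library-free sketch in plain Java. It is not Aspose.Words code: the `Node` class, the node type names, and `extractText` are hypothetical stand-ins chosen for illustration. It only shows how text extraction from a DOM-style document amounts to a depth-first walk that collects the text of the leaf runs.

```java
import java.util.ArrayList;
import java.util.List;

public class DomSketch {

    /** Hypothetical node: a container (text == null) or a leaf run holding text. */
    static class Node {
        final String type;   // e.g. "Document", "Body", "Paragraph", "Run"
        final String text;   // non-null only for leaf runs
        final List<Node> children = new ArrayList<>();

        Node(String type, String text) {
            this.type = type;
            this.text = text;
        }

        Node add(Node child) {
            children.add(child);
            return this;
        }
    }

    /** Depth-first walk: concatenates run text and ends each paragraph with a newline. */
    static String extractText(Node root) {
        StringBuilder out = new StringBuilder();
        collect(root, out);
        return out.toString();
    }

    private static void collect(Node node, StringBuilder out) {
        if (node.text != null) {
            out.append(node.text);
        }
        for (Node child : node.children) {
            collect(child, out);
        }
        if (node.type.equals("Paragraph")) {
            out.append('\n');
        }
    }

    /** Builds a tiny two-paragraph document tree by hand (assumed sample data). */
    static Node sampleDocument() {
        return new Node("Document", null)
            .add(new Node("Body", null)
                .add(new Node("Paragraph", null)
                    .add(new Node("Run", "Hello "))
                    .add(new Node("Run", "world")))
                .add(new Node("Paragraph", null)
                    .add(new Node("Run", "Second paragraph"))));
    }

    public static void main(String[] args) {
        System.out.print(extractText(sampleDocument()));
    }
}
```

In Aspose.Words for Java itself you would not build the tree by hand; loading a file with `new Document(path)` and then calling `getText()` or `toString(SaveFormat.TEXT)` on it should give you the text, but please check the documentation links above for the exact usage in your version.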

Please let us know if you have any more queries.