I have just read the very nice article covering word document content extraction using IDocumentVisitor. This brought back to my mind that there is a small (small for me - it might be harder to implement though, I admit) piece missing in Aspose.Word at the moment: Omitting extraction of left over “Track Changes” pieces of text when running the IDocumentVisitor approach. Currently I do not have any means of determining whether or not the extracted text is a left over of Track Changes or whether it’s just normal text.
Any plans/dates for this? I know it’s somewhere on your list, but any rough estimation, yet?
Thanks a lot in advance
Very good idea! I think we can easily add to IDocumentVisitor flags that will identify runs of text were deleted or inserted.
sounds good to me, looking forward to it.
Check out Document.AcceptAllRevisions in Aspose.Word 2.2.3 http://aspose.com/blogs/Roman.Korchagin/archive/2005/03/09/575.aspx
great stuff. I have just checked it using my test application. Looks to be working like a charm. I’ll create a couple of test cases now and see what gives.
Thanks a lot for your support