Can you please make any recommendations on parsing out the info from the attached file? I had originally thought going node by node would work. However, the requirements got tougher. Basically, I need to save the main object names (style: heading 2), the item names (style: heading 3), and then each of the sub-items (style: normal). The trickiest part is parsing out the rows under each category. I have to have a list of rows by category, per item. I decided on using a dictionary<string, row class>, where string is the category name and row class is a class I made to hold the values under each of the headings. I don’t need to store the headings themselves.
I have looked at the examples Aspose has online, but they don’t seem to fit my situation. Can this be parsed linearly, node by node, or would you recommend another approach? The hardest thing I am facing is keeping track of my cursor while I do this linearly over a NodeCollection.
Additional notes: If there is more than one main object next to each other, I skip to the last one. The headings in the category tables cannot be used for finding the text, because they are dynamic. My requirements are that I look for the 4 columns merged together.
Any suggestions you can provide will be greatly appreciated.
NodeCollection rows =