Extracting text from the attached file shows text that isn’t in the document. It doesn’t happen on any other files, so I’d be interested to know if something is going wrong with Aspose.
I use code along these lines (not showing actual code but alternative code so you can replicate the issue):
NodeCollection nodeColl = doc.GetChildNodes(NodeType.Paragraph, true);
for (int i = 0; i <nodeColl.Count; i++)
{
Paragraph para = (Paragraph) nodeColl[i];
string strParagraphText = para.Range.Text
}
When that gets to paragraph number 101, the text should be “Conclusion”. Instead, it’s “Consultation Include views of relevant Overview and Scrutiny Committee, regulatory committee(s), Area Forum(s), Ward Member(s). Proposals relating to the budget and policy framework must include details of the nature and extent of consultation with stakeholders and relevant overview and scrutiny committees and outcome thereof.”
There are no comments, tracked changes, footnotes or other items that I can see which would cause this. Where can it be getting all that extra text from?
Thanks,
Daniel