I am evaluating the Words .NET component for possible purchase. I’m simply trying to take a Word document and convert it to plain text. I have two questions.
- The original Word document contains auto-numbers in the body. If I just read the text into memory using aw.Document.GetText, I lose those auto-numbers. If I save the document to a text file, they are preserved. Is there a flag or setting I need to use to keep them in memory?
- The Word document has some data that looks like this:
RE: Rosa Gutierrez Amezquita
DOB: March 30, 1940
MR#: 55555
REASON FOR CONSULTATION: I was asked to see this patient by Dr. Maribel Flores for evaluation and management of endstage renal disease.
When I view it in memory or in the saved text file, I don’t get newlines between the first three lines above. Instead I see weird characters, as in the attached file.
How can I remove these characters or convert them to carriage returns so the text looks like the sample above?
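To show the kind of post-processing I have in mind, here is a minimal sketch in plain C#. It rests on a guess I have not confirmed: that the weird characters are Word's manual line breaks coming through as vertical tabs (U+000B), and that paragraph ends come through as lone carriage returns (U+000D). The `TextCleanup` class and `NormalizeLineBreaks` method are names I made up for illustration; they are not part of the component.

```csharp
using System;

public static class TextCleanup
{
    // Hypothetical helper, not part of any library.
    // Assumption: manual line breaks arrive as vertical tabs (U+000B)
    // and paragraph marks arrive as lone carriage returns (U+000D).
    public static string NormalizeLineBreaks(string text)
    {
        return text
            .Replace("\r\n", "\n")   // collapse any existing CR+LF pairs first
            .Replace('\v', '\n')     // vertical tab (assumed manual line break) -> LF
            .Replace('\r', '\n')     // lone CR (assumed paragraph mark) -> LF
            .Replace("\n", "\r\n");  // write everything back as CR+LF
    }
}
```

I would call it on whatever string the component hands back, for example `TextCleanup.NormalizeLineBreaks(doc.GetText())`, where `doc` is the loaded document. If the characters in the attached file turn out to be something other than U+000B, the second `Replace` would need a different character.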
Thanks,
Mike