We are in the IT placement business.
We are trying to automate the processing of incomming resumes sent in by people applying for jobs.
We need to do two things for now...
1: Try to programatically get a vaild first and last name of the applicant by scraping it out of their resume. (I have a fairly successful method worked out.)
2: Pull out the resume text and convert it to an RTF document to store in a database. (At some point in the future we will not do this, we will just store the Word document.)
When I use the GetText() method on the Document object or on individual Paragraph objects I get a lot of non-text items. I end up with header, footer, embedded graphic items.
Is there a way to just get the TEXT?