Aspose Slides is giving unexpected results for a particular PowerPoint slide that we have.
We are using the following code to get text from power point slides (so that we can search them for specific words).
Dim pptxPresentation As Presentation = New Presentation(_strFilePath) Dim textFramesPPTX As ITextFrame() = Util.SlideUtil.GetAllTextFrames(pptxPresentation, True) For i As Integer = 0 To textFramesPPTX.Length - 1 For Each port As IPortion In From para In textFramesPPTX(i).Paragraphs From port1 In para.Portions Select port1 sb.Append(port.Text) sb.AppendLine() Next Next
We found a problem with the linked Powerpoint which has only one word in it.
The word Associate for some reason gets split into the following pieces which then creates problems with our Search.
“Ass” & vbCrLf & “o” & vbCrLf & “c” & vbCrLf & “ia” & vbCrLf & “t” & vbCrLf & “e”
The code words with most slides but we seem to have some hidden characters here that Aspose is picking up.
NOTE: If I copy and paste the word into a brand new Powerpoint document, the same problem exists with the new document but… If I delete the word in the slide and retype the same word over it, the problem is fixed. It appears there are some hidden characters that Aspose is picking up.
The problem was discovered by a user and I am hoping that there is some way of getting to to work on his slide instead of asking him to retype (in case it also occurs on other documents).
Any help would be greatly appreciated.
Thanks in advance.