Splitting a large PDF into multiple PDFs

Laksh · June 23, 2016, 2:07pm

I am having an issue splitting a large PDF into multiple PDF files.

Details

Ten PDF files are concatenated into a single large PDF file, with a separator page between each pair of files.

In production, the client will give us the large PDF, built with the separator page that we provide. For this sample application, though, I have created “Sample.pdf”, which is included in the attached solution.

I want to split this large PDF into individual PDF files. The separator page marks the end of each document. I use TextAbsorber to read the text from each page and detect the separator.
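
The separator-based grouping described above can be sketched independently of any PDF library. This is a minimal illustration in Python, not the code from the attached solution. It assumes the text of every page has already been extracted into a list of strings (for example with TextAbsorber). The function name `group_pages` and the marker text `SEPARATOR` are hypothetical.

```python
from typing import List

# Hypothetical marker text printed on every separator page.
SEPARATOR_TEXT = "SEPARATOR"


def group_pages(page_texts: List[str], separator: str = SEPARATOR_TEXT) -> List[List[int]]:
    """Group 1-based page numbers into documents, dropping separator pages."""
    groups: List[List[int]] = []
    current: List[int] = []
    for number, text in enumerate(page_texts, start=1):
        if separator in text:
            # A separator page closes the current document, if it has any pages.
            if current:
                groups.append(current)
            current = []
        else:
            current.append(number)
    # The last document may not be followed by a separator page.
    if current:
        groups.append(current)
    return groups
```

Each resulting group of page numbers could then be copied into its own output document. Collecting the page text first and grouping afterwards also makes it easy to log which pages returned no text at all.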

Issue

In the code, I loop through all the pages of “Sample.pdf”. While debugging, I can see that the "Pages" collection contains all the pages. However, when I run the application without debugging, every page from the 3rd page onwards is null.

See the attached screenshots

Executing the Application

1>Extract attached zip

2>Open the PDfSplitter solution. Add your own license file to the "Lib" folder.

3>Set the license file's build Action to "Embedded Resource"

4>Execute the application.

I would also like to know whether this is the recommended way to detect and split multiple PDF documents.

codewarior · June 24, 2016, 4:56pm

Hi Laxmikant,

Thanks for using our APIs.

I have tested the scenario and managed to reproduce the same problem. I have logged it as PDFNET-41024 in our issue tracking system so that it can be fixed. We will look further into the details of this problem and keep you posted on the status of the fix. Please be patient and allow us a little time. We are sorry for the inconvenience.

timg · December 20, 2017, 10:32pm

I had a similar issue when using TextAbsorber version 11.5 to split a large PDF document. It would fail from the 3rd page onward. Interestingly, when I reverted to version 10.2 it worked with no problem (I did not try any other versions in between). I don’t know if this is related, but if you have time to experiment with an earlier version of TextAbsorber, you may find that it works for you as well.

I still have problems when I use version 11.5, but no problem with the earlier 10.2 version I had been using before. Hope that helps somehow.

imran.rafique · December 21, 2017, 12:17am

@timg,

Kindly share all the details of the scenario, including the PDF document and the code. We will investigate and share our findings with you.