Splitting Double-Page Scanned PDFs into Single Pages using Aspose.PDF in C#

Hi,

Pamphlets are often scanned two pages at a time, resulting in each PDF page representing two pages from the original document. Can Aspose.PDF in C# split the pages of such a PDF so that, for instance, a PDF that initially had 100 pages will be transformed into 200 pages, with each PDF page representing only one page from the original hard copy document?

@asemalsaif

Yes, you can use Aspose.PDF in C# to split a double-page scanned PDF into single pages. The process involves iterating through each page of the original PDF and saving each one as a separate document. Below is a sample code snippet that demonstrates how to achieve this:

using System.IO;
using Aspose.Pdf;

namespace Aspose.Pdf.Examples.CSharp.AsposePDF.Pages
{
    public class SplitToPages
    {
        public static void Run()
        {
            // Specify the directory where the PDF is located
            string dataDir = "path_to_your_pdf_directory/";

            // Open the original PDF document
            Document pdfDocument = new Document(dataDir + "SplitToPages.pdf");

            int pageCount = 1;

            // Loop through all the pages in the original document
            foreach (Page pdfPage in pdfDocument.Pages)
            {
                // Create a new document for each page
                Document newDocument = new Document();
                newDocument.Pages.Add(pdfPage);
                
                // Save the new document with a unique name
                newDocument.Save(dataDir + "page_" + pageCount + "_out.pdf");
                pageCount++;
            }
        }
    }
}

In this code:

  • You first load the original PDF document.
  • Then, you loop through each page of the document.
  • For each page, you create a new document and add the current page to it.
  • Finally, you save the new document with a unique filename.

Make sure to replace "path_to_your_pdf_directory/" with the actual path where your PDF file is located. This will result in each page of the original document being saved as a separate PDF file, effectively splitting the double-page scanned PDF into single pages.

For more details, you can refer to the example provided in the Aspose documentation here.