Convert MSG to PDF Raise System.NullReferenceException

Hello

When I’m trying to convert the MSG File I’ve got a System.NullReferenceException when I try to manipulate Aspose.Words.Document (property PageCount). But only when mhtOptions.SkipInlineImages is true.

I’m using Aspose .NET (Aspose.Email 24.11.0.0 / Aspose.Words 24.12.0.0)

Here is the code I use to convert a MSG File to PDF :

  • firstPageIndex & pageCount = 0
  • withImagesFromMsg = 1

COUTURIER XL - Adhésion PRO BTP.7z (2.3 MB)

public static int Convert(string inputFile, string outputFile, ref string errmsg, int firstPageIndex, int pageCount, int withImagesFromMsg)
{
    errmsg = "";
    try
    {
        FileStream docStream = new FileStream(inputFile, FileMode.Open, FileAccess.Read, FileShare.ReadWrite);

        try
        {

            MemoryStream ms = new MemoryStream();
            MhtSaveOptions mhtOptions = Aspose.Email.SaveOptions.DefaultMhtml; //new Aspose.Email.MhtSaveOptions();
                                                                               //                    mhtOptions.MhtFormatOptions = MhtFormatOptions.WriteHeader | MhtFormatOptions.WriteCompleteEmailAddress | MhtFormatOptions.RenderCalendarEvent | MhtFormatOptions.DisplayAsOutlook | MhtFormatOptions.RenderTaskFields | MhtFormatOptions.RenderVCardInfo;
            mhtOptions.MhtFormatOptions = MhtFormatOptions.WriteHeader | MhtFormatOptions.WriteCompleteEmailAddress | MhtFormatOptions.RenderCalendarEvent | MhtFormatOptions.RenderTaskFields | MhtFormatOptions.RenderVCardInfo;
            mhtOptions.SkipInlineImages = (withImagesFromMsg != 1);

            // If mail date is to be printed according to the local system timezone, 
            // subtract the mail timezone offset from mail date and add your local system timezone 
            TimeZone localZone = TimeZone.CurrentTimeZone;
            TimeSpan ts = localZone.GetUtcOffset(DateTime.Now);
            CultureInfo culture = new CultureInfo("fr-FR");

            string ext = Path.GetExtension(inputFile).ToLower();

            if (ext == ".eml")
            {
                var message = MailMessage.Load(docStream); // eml
                message.TimeZoneOffset = localZone.GetUtcOffset(message.Date.ToLocalTime());
                message.Save(ms, mhtOptions);
            }
            else
            {
                var mapiMessage = MapiMessage.Load(docStream, new MsgLoadOptions());
                string date = (mapiMessage.ClientSubmitTime.ToUniversalTime() + ts).ToString("ddd, dd MMM yyyy HH:mm:ss", CultureInfo.InvariantCulture);
                string newDate = ResetUTCOffset(date, ts);
                mapiMessage.Headers["Date"] = newDate;
                mapiMessage.Save(ms, mhtOptions);
            }

            FontSettings fontSettings = new FontSettings();

            Aspose.Words.Loading.HtmlLoadOptions lo = new Aspose.Words.Loading.HtmlLoadOptions { FontSettings = fontSettings };

            lo.LoadFormat = LoadFormat.Mhtml;
            lo.WebRequestTimeout = 1000;
            Document doc = new Document(ms, lo);

            DocumentBuilder dBuilder = new DocumentBuilder(doc);
            dBuilder.PageSetup.LeftMargin = ConvertUtil.MillimeterToPoint(10);
            dBuilder.PageSetup.RightMargin = ConvertUtil.MillimeterToPoint(10);
            dBuilder.PageSetup.TopMargin = ConvertUtil.MillimeterToPoint(10);
            dBuilder.PageSetup.BottomMargin = ConvertUtil.MillimeterToPoint(10);
            foreach (Shape shape in doc.GetChildNodes(NodeType.Shape, true))
            {
                ResizebigImage(shape);
            }
            int docPageCount = doc.PageCount;

            PdfSaveOptions pdfOptions = new PdfSaveOptions();
            PageRange pageRange;
            if ((pageCount == -1) || ((firstPageIndex + pageCount) > docPageCount))
                pageRange = new PageRange(firstPageIndex + 1 > docPageCount ? 0 : firstPageIndex, Int32.MaxValue);
            else
                pageRange = new PageRange(firstPageIndex + 1 > docPageCount ? 0 : firstPageIndex, pageCount);

            pdfOptions.PageSet = new PageSet(pageRange);

            // optimisation de la taille du pdf généré
            pdfOptions.ImageCompression = PdfImageCompression.Jpeg;
            pdfOptions.JpegQuality = 75;
            pdfOptions.TextCompression = PdfTextCompression.Flate;

            //save the document to PDF file
            doc.Save(outputFile, pdfOptions);
            doc.Cleanup();

            ms.Dispose();
            //log.Close();
            return (0);
        }
        finally
        {
            // Close stream
            docStream.Close();
            docStream.Dispose();
            GC.Collect();
        }
    }
    catch (Exception ex)
    {
        errmsg = ex.Message + " | " + ex.StackTrace;
        return (1);
    }
}

@tparassin

It seems that you are encountering a System.NullReferenceException when trying to access the PageCount property of an Aspose.Words.Document object, specifically when the SkipInlineImages option is set to true. This issue may arise due to the way the MHTML content is being processed when inline images are skipped.

To troubleshoot and potentially resolve this issue, consider the following steps:

  1. Check MHTML Options: Ensure that the MHTML options you are using are correctly configured. The SkipInlineImages property being set to true means that any inline images in the email will not be included in the MHTML output. This could lead to issues if the document relies on those images for layout or rendering.

  2. Load Options: When creating the LoadOptions for the Aspose.Words.Document, ensure that you are setting the LoadFormat to LoadFormat.Mhtml. This is crucial for correctly interpreting the MHTML content.

  3. Debugging: Add additional logging or debugging statements to check the state of the MemoryStream before it is passed to the Document constructor. This can help identify if the stream is empty or if there are any issues with the content being loaded.

  4. Update Libraries: Ensure that you are using the latest versions of Aspose.Email and Aspose.Words. Sometimes, bugs are fixed in newer releases, so updating to the latest version may resolve the issue.

  5. Sample Code: Here is a simplified version of how you might structure your code to convert an MSG file to PDF, ensuring that the MHTML options are set correctly:

    public static int Convert(string inputFile, string outputFile, ref string errmsg)
    {
        errmsg = "";
        try
        {
            using (FileStream docStream = new FileStream(inputFile, FileMode.Open, FileAccess.Read))
            {
                MemoryStream ms = new MemoryStream();
                MhtSaveOptions mhtOptions = new MhtSaveOptions
                {
                    MhtFormatOptions = MhtFormatOptions.WriteHeader | MhtFormatOptions.WriteCompleteEmailAddress,
                    SkipInlineImages = true // Adjust this as needed
                };
    
                var message = MailMessage.Load(docStream);
                message.Save(ms, mhtOptions);
    
                var loadOptions = new Aspose.Words.LoadOptions { LoadFormat = LoadFormat.Mhtml };
                Document doc = new Document(ms, loadOptions);
    
                // Proceed with PDF conversion
                doc.Save(outputFile, new PdfSaveOptions());
                return 0;
            }
        }
        catch (Exception ex)
        {
            errmsg = ex.Message + " | " + ex.StackTrace;
            return 1;
        }
    }
    

By following these steps and ensuring that your options are set correctly, you should be able to avoid the NullReferenceException and successfully convert your MSG file to PDF. If the problem persists, consider reaching out to Aspose support with detailed information about your issue.

Sources:
[1]: SaveEmailAsPDF.cs

Effectively using Aspose.Words.Loading.LoadOptions (Mhtml) instead of Aspose.Words.Loading.HtmlLoadOptions corrected the violation.

Is there a way to know by code, witch option that I need to use ?

Hello @tparassin,

We didn’t find any issues on the Aspose.Email side. We are forwarding this request to our colleagues on the Aspose.Words forum for further investigation.

@tparassin Unfortunately, I am unable to reproduce the problem on my side using the latest 25.1 version of Aspose.Words and 24.12 version of Aspose.Email.
Could you please also share ResizebigImage method sources? Probably, the problem is caused by preprocessing the document using this method.