Words running together when converting from PDF to Word

I’m using a licensed version of Aspose.PDF and ran into an interesting issue. We create a PDF using a MemoryStream based on a byte[] stored in our database and random blocks of words are having the spaces between them removed. For instance, if the PDF has the sentence “This is a sample sentence with many words in it.” the Word output may show “Thisisa sample sentence withmanywordsinit.” Any help would be greatly appreciated!

@rickyh
Please attach the data and code used so that we can reproduce and investigate the issue.

HLV (2).pdf (262.8 KB)

HLV (2).docx (680.1 KB)

var stream = new MemoryStream(vaultFile)
{
				Position = 0L
};

var document = new Document(stream);
var file = new MemoryStream();

document.Save(file, SaveFormat.DocX);

file.Position = 0L;

return File(file, "application/vnd.openxmlformats-officedocument.wordprocessingml.document", $"{fso.Name[..^3]}docx");

@rickyh
We have opened the following new ticket(s) in our internal issue tracking system and will deliver their fixes according to the terms mentioned in Free Support Policies.

Issue ID(s): PDFNET-59048

You can obtain Paid Support Services if you need support on a priority basis, along with the direct access to our Paid Support management team.

@rickyh
When I try to reproduce this situation, an exception occurs (library version 25.01, Windows, .Net 6). Accordingly, I created a task for the development team.

We’re currently using library version 24.11.0, Windows .NET 8

@rickyh
In my environment, the exception also occurs with library version 24.11.0, Windows .NET 8. Therefore, it is necessary to wait for the solution of the task that was created.

@sergei.shibanov do you have any updates you can provide at this time?

@rickyh
Nothing new for this task yet. Created tasks are solved in the order they are received, taking into account priorities.
The highest priority is for tasks with paid support, followed by tasks from users who have purchased a license.
The time it takes to solve problems can also vary. Therefore, unfortunately, it is not even possible to give ETA.

@sergei.shibanov I assume there’s still no updates you can provide? Is there even an idea of if the task has been started or where it sits in the priority list?

@rickyh
Unfortunately, the regiment has nothing new on this issue.

@sergei.shibanov we’re discussing internally getting paid support. Would you be able to provide a timeline if we purchased the Developer Small Business license? Or, could you provide an idea of how much of a priority this would be with the paid support?

@rickyh
With paid support, the task priority will be the highest and this will significantly speed up the task solution. With all this, I will not undertake to give a time frame - the task may turn out to be difficult (although it does not look like it).

I will ask the development team to give an estimate and write to you.

1 Like

@rickyh
The development team has taken the task for study, in the coming days I expect to write whether they have any certainty.

1 Like