Extract image from word document


#1

Hi Team,
Extract image from word document and convert to pdf, has some issues like:

  • Axis numbers present in the image charts are modified.
  • Text is over lapped.
  • Text alignment are modified.
  • Text alignment mismatch.

Issues: issues.zip (370 Bytes)

Sample 1: sample1.zip (770.0 KB)

Sample 2: sample2.zip (39.0 KB)

Sample 3: Sample3.zip (54.8 KB)

Sample 4: Sample4.zip (46.6 KB)

Sample 5: sample5.zip (41.7 KB)

Sample 6: sample6.zip (40.7 KB)

Sample 7:sample7.zip (16.7 KB)

Help us to resolve those issues.Many Thanks in advance.
Regards,
Suruthy


#2

@suruthyb

Thanks for your inquiry. Please ZIP and attach the following resources here for testing.

  • Input Word document(s).
  • working source code to reproduce this issue.

We will then investigate the issue on our end and provide you more information.

P.S. If your file size is big then you may upload the ZIP file to Dropbox or any other file hosting service and share the download link here for testing.


#3

@mannanfazil
Please find the source and input:

Input: document.zip (5.5 MB)

source:imageExtraction.zip (1.6 KB)


#4

@suruthyb

Thanks for your inquiry. We have tested the scenarios and have managed to reproduce the same issues at our side. For the sake of correction, we have logged these problems in our issue tracking system as

WORDSNET-17862: Text overlapping in rendered PDF
WORDSNET-17865: DOCX to PDF conversion issue with chart rendering
WORDSNET-17866: Incorrect rendering of Axis labels after converting to PDF
WORDSNET-17867: Axis labels changed during Docx to PDF conversion
WORDSNET-17868 : Range of Y-Axis of chart is changed in output PDF
WORDSJAVA-1955 : Image is missing in output PDF

You will be notified via this forum thread once these issues are resolved.

We apologize for your inconvenience.