The requirement is extracting the images and saved into new document.For the extraction process using paragraph node and fig caption as keyword. In my code i have separate the image handling in following ways
Section A-handling figures with caption as previous
Section B-handling images with caption as nextsibling
Section C-handling images inside the table
Section D-handling images landscape mode
Section E-handling label images
In input document having table images and fig caption in next sibling images . It extracted the images .
please kindly help me to resolve the issues
Issue 1-In section A -some empty documents is created along wtih output. How to delete empty documents created during the execution
Issue 2-In section section D-some fig captions are extracted along with output.How to delete fig captions.
The source code Source.zip (8.4 KB)
The input test.zip (1.9 MB)
The actual output Actual Output.zip (2.1 MB)
The expected output Expected Output.zip (1.9 MB)
Thank you very much,