I am using MarkLogic to generate XML files for PDF documents which has images, formatted text (italic and bold), tables etc. Can you please provide some guidelines for the best conversion. I am using normal conversion with following pipelines:
- Conversion Processing
- DocBook Conversion
- HTML Conversion
- PDF Conversion
- PDF Conversion (Page Layout, Image Batching)
- Status Change Handling
The images are not maintained with their title and format also not maintained. Tables are appearing as normal paragraph in the generated XML.