SDK and CLI for parsing PDF, DOCX, HTML, and more, to a unified document representation for powering downstream workflows such as gen AI applications.
- docling
- convert
- document
- pdf
- docx
- html
- markdown
- layout
- model
- segmentation
- table
- structure
- former
- ai
- document-parser
- document-parsing
- documents
- pdf-converter
- pdf-to-json
- pdf-to-text
- pptx
- tables
- xlsx