Docling is an open-source document parsing and understanding library designed for generative-AI workflows. It parses a range of formats, including PDF, DOCX, PPTX, HTML, images, audio, and WebVTT, and converts them into a unified document representation that can be exported to multiple formats. Its PDF understanding covers page layout, tables, code, and formulas, and it supports OCR for scanned content and ASR for audio. Docling can run entirely locally, which suits sensitive data, and it ships with a command-line interface (CLI), integrations with popular agent and LLM frameworks, and an MCP server for agentic usage.
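To make the conversion flow concrete, here is a minimal sketch of turning one document into Markdown. It assumes Docling is installed and uses the `DocumentConverter` class with its `convert` and `export_to_markdown` calls as shown in Docling's quick-start. The helper name `convert_to_markdown` and its optional `converter` parameter are hypothetical, added only for illustration.

```python
from typing import Any, Optional


def convert_to_markdown(source: str, converter: Optional[Any] = None) -> str:
    """Convert a document (local path or URL) to Markdown.

    `convert_to_markdown` and the `converter` parameter are hypothetical
    names for this sketch; they are not part of Docling itself.
    """
    if converter is None:
        # Imported lazily so this module still loads where Docling is absent.
        from docling.document_converter import DocumentConverter

        converter = DocumentConverter()

    # convert() returns a result whose `.document` holds the unified
    # document representation; export it as a Markdown string.
    result = converter.convert(source)
    return result.document.export_to_markdown()
```

With Docling installed, `print(convert_to_markdown("report.pdf"))` would print the Markdown rendering of a local file, where `report.pdf` is a placeholder path. Passing in an existing converter lets one instance be reused across many documents rather than constructing a new one for each call.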