A powerful Model Context Protocol (MCP) server built with FastMCP that provides comprehensive PDF processing capabilities including text extraction, image extraction, and OCR for reading text within ...
Python orchestrator – Single entry point (python -m pipeline) manages OCR, cleaning, AI refinement, and EPUB generation with shared configuration. Page-aware workflow – Every stage preserves page ...