Intelligent Document Processing (IDP) - it is enterprise level solutionfor end-to-end intelligent document processing. IDP Solution is designed to intelligently process scanned or digitally generated documents (images) of different format.
High-level IDP diagram
Supported OCR Engines
The elDoc IDP system supports multiple OCR engines, allowing flexibility in accuracy, performance, and deployment scenarios.
Tesseract elDoc IDP includes a built-in OCR engine based on the latest version of Tesseract, enhanced to deliver optimal recognition accuracy. (See Supported Languages below)
Google Vision API elDoc IDP can be configured to use Google Vision API for OCR processing, providing high accuracy and robust language support. For more details and languages support, refer to Google Vision API Supported Languages.
PaddleOCR API elDoc IDP can integrate with PaddleOCR, a high-performance open-source OCR framework optimized for multilingual text detection and recognition. PaddleOCR offers strong accuracy for complex layouts, supports a wide range of languages, and is particularly effective for structured documents and dense text scenarios. PaddleOCR can be also deployed on-prem.
VL Model API Enables AI/LLM-based OCR using OpenAI API, leveraging vision-language models for advanced document understanding, including complex layouts, context-aware extraction, and semantic interpretation. VL model can be deployed on-prem using ollama, vLLM, llama.cpp, etc.