OCR software, free and offline
Accurate × Fast × Comprehensive
Contexts Optical Compression
PDF to Markdown with vision models
Visual Causal Flow
Formula recognition based on LaTeX-OCR and ONNXRuntime
OCRmyPDF adds an OCR text layer to scanned PDF files
Awesome multilingual OCR toolkits based on PaddlePaddle
Enhances Tesseract OCR output using LLMs (local or API)
Ready-to-use OCR with 80+ supported languages
Library for OCR-related tasks powered by Deep Learning
A high-quality tool for convert PDF to Markdown and JSON
OCR expert VLM powered by Hunyuan's native multimodal architecture
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
Multilingual Document Layout Parsing in a Single Vision-Language Model
Convert AI papers to GUI
PDF scientific paper translation with preserved formats
Math OCR model that outputs LaTeX and markdown
Open Source Document Management System for Digital Archives
A framework to enable multimodal models to operate a computer
A simple tool for reading in poorly redacted documents
Get your documents ready for gen AI
A Repo For Document AI
Document content and metadata extraction microservice
OCR model for complex documents with layout-aware structured outputs