OCR

6 articles in this category

AI NewsOCRLanguage Model

Baidu Qianfan-OCR: A 4B-Parameter Unified Document Intelligence Model for End-to-End Parsing

Baidu Qianfan Team releases Qianfan-OCR, a 4B-parameter model achieving 93.12 on OmniDocBench v1.5 through a unified vision-language architecture.

Mar 18, 2026

AI NewsArtificial IntelligenceOCR

Zhipu AI Unveils GLM-OCR: A High-Efficiency 0.9B Multimodal Model for Document Parsing and KIE

Zhipu AI and Tsinghua University launch GLM-OCR, a 0.9B multimodal model achieving 5.2 tokens per step via Multi-Token Prediction for high-speed document understanding and structured data extraction.

Mar 15, 2026

AI NewsOCRArtificial Intelligence

FireRed-OCR-2B: Solving Table and LaTeX Hallucinations with GRPO

FireRed-OCR-2B achieves a 92.94% SOTA score on OmniDocBench v1.5 by using Format-Constrained GRPO to eliminate structural hallucinations in tables and LaTeX.

Mar 1, 2026

AI NewsOCRMachine Learning

Mistral Releases OCR 3 with Improved Accuracy on Handwritten and Structured Documents

Mistral OCR 3 achieves a 74% win rate over its predecessor, significantly improving accuracy on forms, handwriting, and tables.

Jan 15, 2026

AI NewsDocument AIOCR

Mistral AI Releases OCR 3: A Smaller Optical Character Recognition (OCR) Model for Structured Document AI at Scale

Mistral AI released OCR 3, achieving a 74% win rate over its previous version on key document types and offering pricing as low as $1 per 1,000 pages.

Dec 19, 2025

AI NewsArtificial IntelligenceOCR

Comparing the Top 6 OCR Models in 2025: A Comprehensive Analysis

A detailed comparison of six leading OCR systems in 2025, including Google Cloud Document AI, AWS Textract, Azure AI Document Intelligence, ABBYY, PaddleOCR 3.0, and DeepSeek OCR, with focus on performance, deployment, and use cases.

Nov 2, 2025