PaddleOCR 3.5 Integrates Transformers Backend for OCR and Document Parsing

digital · 2026-05-18

PaddleOCR 3.5 now supports Transformers as an inference backend for its OCR and document parsing models, including PP-OCRv5 and PaddleOCR-VL 1.5. This release introduces a flexible inference-engine interface, allowing developers to select the backend via the engine parameter and configure options like dtype, device placement, and attention implementation through engine_config. The integration aims to reduce friction for developers building RAG, Document AI, search, analytics, or agent applications within Hugging Face-centered stacks. While the Transformers backend offers a familiar development experience and Hub-compatible model discovery, the default paddle_static backend remains recommended for maximum throughput. A live demo is available on Hugging Face Spaces. The release was supported by Hugging Face engineers, including Anton Vlasjuk, Raushan Turganbay, and Yoni Gozlan.

Key facts

PaddleOCR 3.5 supports Transformers as an inference backend.
Models include PP-OCRv5 and PaddleOCR-VL 1.5.
Developers can configure backend options via engine_config.
Transformers backend is suitable for Hugging Face-centered stacks.
Default paddle_static backend is recommended for maximum throughput.
Live demo available on Hugging Face Spaces.
Integration supported by Hugging Face engineers Anton Vlasjuk, Raushan Turganbay, and Yoni Gozlan.
PaddleOCR manages OCR and document parsing pipelines behind the scenes.

PaddleOCR 3.5 Integrates Transformers Backend for OCR and Document Parsing

Key facts

Entities

Institutions

Sources