PaddlePaddle

PaddleOCR

未分类

PaddlePaddle

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

59.5k
Stars
9.0k
Forks
151
Issues
304
Contributors
484
Watchers
ocrchineseocrpdf2markdownpp-ocrpp-structuredocument-parsingdocument-translationkieai4sciencepdf-extractor-ragpdf-parserrag
Python
{"name":"Apache License 2.0","spdxId":"Apache-2.0"}

Project Description

A rich, leading and practical OCR tool library

© 2025 GitHub Fun. All rights reserved.