Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model
PaddlePaddle
company
Verified
AI & ML interests: Deep Learning Framework
Papers
GraphNet: A Large-Scale Computational Graph Dataset for Tensor Compiler Research
PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model
Organization Card
spaces 5
pinned
📈 PaddleOCR-VL Online Demo • Running • 160 • Recognize text and elements in images
🌍 PP-OCRv5 Online Demo • Running • 68 • Universal-Scene Text Recognition Model with High-Accuracy
📊 PP-StructureV3 Online Demo • Running • 24 • Next-Gen High-Precision Doc Parsing Solution
⚡ PaddleOCR • Running • 127 • Extract text from images in multiple languages
models 76
PaddlePaddle/PaddleOCR-VL • Image-Text-to-Text • 1.0B • Updated • 25.8k • 1.18k
PaddlePaddle/PP-DocLayoutV2 • Updated • 12.4k • 5
PaddlePaddle/devanagari_PP-OCRv5_mobile_rec • Image-to-Text • Updated • 205
PaddlePaddle/latin_PP-OCRv5_mobile_rec • Image-to-Text • Updated • 40.7k • 1
PaddlePaddle/ta_PP-OCRv5_mobile_rec • Image-to-Text • Updated • 80
PaddlePaddle/te_PP-OCRv5_mobile_rec • Image-to-Text • Updated • 70
PaddlePaddle/el_PP-OCRv5_mobile_rec • Image-to-Text • Updated • 137
PaddlePaddle/cyrillic_PP-OCRv5_mobile_rec • Image-to-Text • Updated • 134
PaddlePaddle/arabic_PP-OCRv5_mobile_rec • Image-to-Text • Updated • 595
PaddlePaddle/th_PP-OCRv5_mobile_rec • Image-to-Text • Updated • 2.05k