Image-Text-to-Text
PaddleOCR
Safetensors
English
Chinese
multilingual
paddleocr_vl
ERNIE4.5
PaddlePaddle
image-to-text
ocr
document-parse
layout
table
formula
chart
conversational
custom_code
Eval Results
Instructions to use PaddlePaddle/PaddleOCR-VL with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- PaddleOCR
How to use PaddlePaddle/PaddleOCR-VL with PaddleOCR:
# See https://www.paddleocr.ai/latest/version3.x/pipeline_usage/PaddleOCR-VL.html to installation from paddleocr import PaddleOCRVL pipeline = PaddleOCRVL(pipeline_version="v1") output = pipeline.predict("path/to/document_image.png") for res in output: res.print() res.save_to_json(save_path="output") res.save_to_markdown(save_path="output") - Notebooks
- Google Colab
- Kaggle
Improve model card: Add paper, code, and project page links
#15
by nielsr HF Staff - opened
This PR enhances the model card by adding prominent links to key resources:
- Paper: Added an explicit link to the Hugging Face Papers page for "PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model".
- Code: Included a direct link to the official GitHub repository, "PaddlePaddle/PaddleOCR GitHub", for easier access to the code.
- Project Page: Added a link to the comprehensive PaddleOCR-VL documentation, "PaddleOCR-VL Documentation", which serves as the project's main documentation page.
These additions improve the discoverability and completeness of information for users interested in the model.
Thank you for your contribution.
ChengCui changed pull request status to merged