pinned
Running on L40S
569
MinerU OCR
📚
A data extraction tool to convert PDF to Markdown and JSON
OpenDataLab provides high-quality open datasets and tools for large models. China Large model corpus Data Alliance open source data service designated platform
MinerU2.5-Pro: Pushing the Limits of Data-Centric Document Parsing at Scale
The Trinity of Consistency as a Defining Principle for General World Models
A data extraction tool to convert PDF to Markdown and JSON
demo of MinerU-Diffusion
Convert table images into HTML tags with TRivia-3B
Evaluate formula recognition accuracy
Demo for DocLayout-YOLO
Recognize math equations from images