rootsautomation/MUSTARD
VLMs and long context, document processing and understanding, confidence, calibration, alignment, and decision making.
GutenOCR: A Grounded Vision-Language Front-End for Documents
PubMed-OCR: PMC Open Access OCR Annotations