olmOCR Collection olmOCR is a document recognition pipeline for efficiently converting documents into plain text. olmocr.allenai.org โข 12 items โข Updated Dec 23, 2025 โข 147