CaMMT: Benchmarking Culturally Aware Multimodal Machine Translation Paper • 2505.24456 • Published May 30, 2025
AfroXLMR-Social: Adapting Pre-trained Language Models for African Languages Social Media Text Paper • 2503.18247 • Published Mar 24, 2025
Afri-MCQA: Multimodal Cultural Question Answering for African Languages Paper • 2601.05699 • Published Jan 9 • 2
Accept or Deny? Evaluating LLM Fairness and Performance in Loan Approval across Table-to-Text Serialization Approaches Paper • 2508.21512 • Published Aug 29, 2025
Ethio-ASR: Joint Multilingual Speech Recognition and Language Identification for Ethiopian Languages Paper • 2603.23654 • Published 20 days ago
AfrIFact: Cultural Information Retrieval, Evidence Extraction and Fact Checking for African Languages Paper • 2604.00706 • Published 13 days ago
LLM2Vec-Gen: Generative Embeddings from Large Language Models Paper • 2603.10913 • Published Mar 11 • 44
Untied Ulysses: Memory-Efficient Context Parallelism via Headwise Chunking Paper • 2602.21196 • Published Feb 24 • 7
ColBERT-Zero: To Pre-train Or Not To Pre-train ColBERT models Paper • 2602.16609 • Published Feb 18 • 6
Economies of Open Intelligence: Tracing Power & Participation in the Model Ecosystem Paper • 2512.03073 • Published Nov 27, 2025 • 7
The German Commons - 154 Billion Tokens of Openly Licensed Text for German Language Models Paper • 2510.13996 • Published Oct 15, 2025 • 9