Scaling Low-Resource MT via Synthetic Data Generation with LLMs Paper • 2505.14423 • Published May 20, 2025 • 2
CaMMT: Benchmarking Culturally Aware Multimodal Machine Translation Paper • 2505.24456 • Published May 30, 2025
AfroXLMR-Social: Adapting Pre-trained Language Models for African Languages Social Media Text Paper • 2503.18247 • Published Mar 24, 2025
Afri-MCQA: Multimodal Cultural Question Answering for African Languages Paper • 2601.05699 • Published Jan 9 • 2
Accept or Deny? Evaluating LLM Fairness and Performance in Loan Approval across Table-to-Text Serialization Approaches Paper • 2508.21512 • Published Aug 29, 2025
Ethio-ASR: Joint Multilingual Speech Recognition and Language Identification for Ethiopian Languages Paper • 2603.23654 • Published 19 days ago
AfrIFact: Cultural Information Retrieval, Evidence Extraction and Fact Checking for African Languages Paper • 2604.00706 • Published 11 days ago
RbtAct: Rebuttal as Supervision for Actionable Review Feedback Generation Paper • 2603.09723 • Published Mar 10 • 7
view post Post 205 Good news! Ulysses Sequence Parallelism from the Snowflake AI Research and the Deepspeed teams has been integrated into HuggingFace Trainer, Accelerate and TRLFor extensive details please see this writeup:https://huggingface.co/blog/ulysses-spThanks a lot to Kashif Rasul for helping make it happen. Also the others in the HF team who helped with integration. See translation 🤗 1 1 + Reply
Blending LLMs into Cascaded Speech Translation: KIT's Offline Speech Translation System for IWSLT 2024 Paper • 2406.16777 • Published Jun 24, 2024 • 1