X-MuTeST: A Multilingual Benchmark for Explainable Hate Speech Detection and A Novel LLM-consulted Explanation Framework Paper • 2601.03194 • Published about 17 hours ago • 1
X-MuTeST: A Multilingual Benchmark for Explainable Hate Speech Detection and A Novel LLM-consulted Explanation Framework Paper • 2601.03194 • Published about 17 hours ago • 1
EthicsMH: A Pilot Benchmark for Ethical Reasoning in Mental Health AI Paper • 2509.11648 • Published Sep 15, 2025 • 1
D-HUMOR: Dark Humor Understanding via Multimodal Open-ended Reasoning Paper • 2509.06771 • Published Sep 8, 2025 • 5
Query Attribute Modeling: Improving search relevance with Semantic Search and Meta Data Filtering Paper • 2508.04683 • Published Aug 6, 2025
DSBC : Data Science task Benchmarking with Context engineering Paper • 2507.23336 • Published Jul 31, 2025 • 2
NeurIPS 2025 E2LM Competition : Early Training Evaluation of Language Models Paper • 2506.07731 • Published Jun 9, 2025 • 2
Falcon-H1: A Family of Hybrid-Head Language Models Redefining Efficiency and Performance Paper • 2507.22448 • Published Jul 30, 2025 • 68
A Technical Study into Small Reasoning Language Models Paper • 2506.13404 • Published Jun 16, 2025 • 8
Adaptive Retrieval Without Self-Knowledge? Bringing Uncertainty Back Home Paper • 2501.12835 • Published Jan 22, 2025 • 4
LLM-Independent Adaptive RAG: Let the Question Speak for Itself Paper • 2505.04253 • Published May 7, 2025 • 14
Will It Still Be True Tomorrow? Multilingual Evergreen Question Classification to Improve Trustworthy QA Paper • 2505.21115 • Published May 27, 2025 • 140
SAEs $\textit{Can}$ Improve Unlearning: Dynamic Sparse Autoencoder Guardrails for Precision Unlearning in LLMs Paper • 2504.08192 • Published Apr 11, 2025 • 3
CoRAG: Collaborative Retrieval-Augmented Generation Paper • 2504.01883 • Published Apr 2, 2025 • 9
Position: Mechanistic Interpretability Should Prioritize Feature Consistency in SAEs Paper • 2505.20254 • Published May 26, 2025 • 5
Uncovering Cultural Representation Disparities in Vision-Language Models Paper • 2505.14729 • Published May 20, 2025 • 1
A Survey of NL2SQL with Large Language Models: Where are we, and where are we going? Paper • 2408.05109 • Published Aug 9, 2024
SQL-R1: Training Natural Language to SQL Reasoning Model By Reinforcement Learning Paper • 2504.08600 • Published Apr 11, 2025 • 32
Robust and Fine-Grained Detection of AI Generated Texts Paper • 2504.11952 • Published Apr 16, 2025 • 12
Improving Multilingual Capabilities with Cultural and Local Knowledge in Large Language Models While Enhancing Native Performance Paper • 2504.09753 • Published Apr 13, 2025 • 6