Talker-T2AV: Joint Talking Audio-Video Generation with Autoregressive Diffusion Modeling Paper • 2604.23586 • Published 16 days ago • 3
From Storage to Experience: A Survey on the Evolution of LLM Agent Memory Mechanisms Paper • 2605.06716 • Published 5 days ago • 5
From Storage to Experience: A Survey on the Evolution of LLM Agent Memory Mechanisms Paper • 2605.06716 • Published 5 days ago • 5
Talker-T2AV: Joint Talking Audio-Video Generation with Autoregressive Diffusion Modeling Paper • 2604.23586 • Published 16 days ago • 3
Towards Comprehensive Stage-wise Benchmarking of Large Language Models in Fact-Checking Paper • 2601.02669 • Published Jan 6 • 4
Towards Comprehensive Stage-wise Benchmarking of Large Language Models in Fact-Checking Paper • 2601.02669 • Published Jan 6 • 4
DiffCoT: Diffusion-styled Chain-of-Thought Reasoning in LLMs Paper • 2601.03559 • Published Jan 7 • 14
DiffCoT: Diffusion-styled Chain-of-Thought Reasoning in LLMs Paper • 2601.03559 • Published Jan 7 • 14
DiffCoT: Diffusion-styled Chain-of-Thought Reasoning in LLMs Paper • 2601.03559 • Published Jan 7 • 14
REFLEX: Self-Refining Explainable Fact-Checking via Disentangling Truth into Style and Substance Paper • 2511.20233 • Published Nov 25, 2025 • 3 • 2
MM-CRITIC: A Holistic Evaluation of Large Multimodal Models as Multimodal Critique Paper • 2511.09067 • Published Nov 12, 2025 • 2
MM-CRITIC: A Holistic Evaluation of Large Multimodal Models as Multimodal Critique Paper • 2511.09067 • Published Nov 12, 2025 • 2 • 2
ScreenSpot-Pro: GUI Grounding for Professional High-Resolution Computer Use Paper • 2504.07981 • Published Apr 4, 2025 • 5
AdamMeme: Adaptively Probe the Reasoning Capacity of Multimodal Large Language Models on Harmfulness Paper • 2507.01702 • Published Jul 2, 2025 • 4
FACT-AUDIT: An Adaptive Multi-Agent Framework for Dynamic Fact-Checking Evaluation of Large Language Models Paper • 2502.17924 • Published Feb 25, 2025
AmbiGraph-Eval: Can LLMs Effectively Handle Ambiguous Graph Queries? Paper • 2508.09631 • Published Aug 13, 2025
EvolProver: Advancing Automated Theorem Proving by Evolving Formalized Problems via Symmetry and Difficulty Paper • 2510.00732 • Published Oct 1, 2025 • 6 • 2
EvolProver: Advancing Automated Theorem Proving by Evolving Formalized Problems via Symmetry and Difficulty Paper • 2510.00732 • Published Oct 1, 2025 • 6
EvolProver: Advancing Automated Theorem Proving by Evolving Formalized Problems via Symmetry and Difficulty Paper • 2510.00732 • Published Oct 1, 2025 • 6
AdamMeme: Adaptively Probe the Reasoning Capacity of Multimodal Large Language Models on Harmfulness Paper • 2507.01702 • Published Jul 2, 2025 • 4