AgentVista: Evaluating Multimodal Agents in Ultra-Challenging Realistic Visual Scenarios Paper • 2602.23166 • Published 12 days ago • 37
DARE: Aligning LLM Agents with the R Statistical Ecosystem via Distribution-Aware Retrieval Paper • 2603.04743 • Published 5 days ago • 45
Heterogeneous Agent Collaborative Reinforcement Learning Paper • 2603.02604 • Published 7 days ago • 157
Timer-S1: A Billion-Scale Time Series Foundation Model with Serial Scaling Paper • 2603.04791 • Published 5 days ago • 14
Memex(RL): Scaling Long-Horizon LLM Agents via Indexed Experience Memory Paper • 2603.04257 • Published 6 days ago • 18
Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-GGUF Text Generation • 27B • Updated 2 days ago • 58.2k • 149
Jackrong/Qwen3.5-2B-Claude-4.6-Opus-Reasoning-Distilled-GGUF Text Generation • 2B • Updated 3 days ago • 15k • 80
Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled Text Generation • 28B • Updated 2 days ago • 15.7k • 330
ARLArena: A Unified Framework for Stable Agentic Reinforcement Learning Paper • 2602.21534 • Published 13 days ago • 23
PyVision-RL: Forging Open Agentic Vision Models via RL Paper • 2602.20739 • Published 14 days ago • 29