Hindsight Credit Assignment for Long-Horizon LLM Agents Paper • 2603.08754 • Published Mar 7 • 5
Build error Agents 2 ChinaTravel 🚢 2 Evaluate and compare AI model performance on ChinaTravel benchmark tasks
A Theoretical Study on Bridging Internal Probability and Self-Consistency for LLM Reasoning Paper • 2510.15444 • Published Oct 17, 2025 • 151
ChinaTravel: A Real-World Benchmark for Language Agents in Chinese Travel Planning Paper • 2412.13682 • Published Dec 18, 2024 • 7
Step Back to Leap Forward: Self-Backtracking for Boosting Reasoning of Language Models Paper • 2502.04404 • Published Feb 6, 2025 • 25