Thomas Ferraz
thomas-ferraz
AI & ML interests
NLP in Portuguese
Recent Activity
updated a collection 3 days ago: Retrieve-Reasoning
Reasoning LLMs
- Step Back to Leap Forward: Self-Backtracking for Boosting Reasoning of Language Models
  Paper • 2502.04404 • Published • 25
- Learning Adaptive Parallel Reasoning with Language Models
  Paper • 2504.15466 • Published • 44
- TTRL: Test-Time Reinforcement Learning
  Paper • 2504.16084 • Published • 122
- THOUGHTTERMINATOR: Benchmarking, Calibrating, and Mitigating Overthinking in Reasoning Models
  Paper • 2504.13367 • Published • 26
Retrieve-Reasoning
- TreeHop: Generate and Filter Next Query Embeddings Efficiently for Multi-hop Question Answering
  Paper • 2504.20114 • Published • 4
- MSA: Memory Sparse Attention for Efficient End-to-End Memory Model Scaling to 100M Tokens
  Paper • 2603.23516 • Published • 41
- MemMA: Coordinating the Memory Cycle through Multi-Agent Reasoning and In-Situ Self-Evolution
  Paper • 2603.18718 • Published • 8
- UI-Voyager: A Self-Evolving GUI Agent Learning via Failed Experience
  Paper • 2603.24533 • Published • 41
Reinforcement Learning