oguzhanercan's Collections
Reasoning
Paper • 2506.10910 • Published • 66
Fractional Reasoning via Latent Steering Vectors Improves Inference Time Compute
Paper • 2506.15882 • Published • 2
MiroMind-M1: An Open-Source Advancement in Mathematical Reasoning via Context-Aware Multi-Stage Policy Optimization
Paper • 2507.14683 • Published • 134
The Invisible Leash: Why RLVR May Not Escape Its Origin
Paper • 2507.14843 • Published • 85
Stabilizing Knowledge, Promoting Reasoning: Dual-Token Constraints for RLVR
Paper • 2507.15778 • Published • 20
GEPA: Reflective Prompt Evolution Can Outperform Reinforcement Learning
Paper • 2507.19457 • Published • 28
Beyond Pass@1: Self-Play with Variational Problem Synthesis Sustains RLVR
Paper • 2508.14029 • Published • 118
Reasoning Vectors: Transferring Chain-of-Thought Capabilities via Task Arithmetic
Paper • 2509.01363 • Published • 58
Which Heads Matter for Reasoning? RL-Guided KV Cache Compression
Paper • 2510.08525 • Published • 22
Paper • 2510.06557 • Published • 30
A Theoretical Study on Bridging Internal Probability and Self-Consistency for LLM Reasoning
Paper • 2510.15444 • Published • 147
LightMem: Lightweight and Efficient Memory-Augmented Generation
Paper • 2510.18866 • Published • 111
Scaling Latent Reasoning via Looped Language Models
Paper • 2510.25741 • Published • 221
Reasoning with Sampling: Your Base Model is Smarter Than You Think
Paper • 2510.14901 • Published • 47
OpenSIR: Open-Ended Self-Improving Reasoner
Paper • 2511.00602 • Published • 20