🔄 In a Training Loop

Pratyay Banerjee

Neilblaze

·

https://neilblaze.live

AI & ML interests

IR, NLP, Pattern Recognition, xAI, Interpretability, Evals

Recent Activity

upvoted a paper about 16 hours ago

Blind-Spots-Bench: Evaluating Blind Spots in Multimodal Models

upvoted a paper about 16 hours ago

KronQ: LLM Quantization via Kronecker-Factored Hessian

upvoted a paper about 16 hours ago

DSpark: Confidence-Scheduled Speculative Decoding with Semi-Autoregressive Generation

View all activity

Organizations

upvoted 14 papers about 16 hours ago

Blind-Spots-Bench: Evaluating Blind Spots in Multimodal Models

Paper • 2607.08317 • Published 8 days ago • 26

KronQ: LLM Quantization via Kronecker-Factored Hessian

Paper • 2607.07964 • Published 9 days ago • 28

DSpark: Confidence-Scheduled Speculative Decoding with Semi-Autoregressive Generation

Paper • 2607.05147 • Published 11 days ago • 35

TurboServe: Serving Streaming Video Generation Efficiently and Economically

Paper • 2606.19271 • Published about 1 month ago • 37

Wan-Streamer v0.2: Higher Resolution, Same Latency

Paper • 2607.04443 • Published 12 days ago • 39

Multi-Block Diffusion Language Models

Paper • 2606.29215 • Published 17 days ago • 42

ResearchStudio-Idea: An Evidence-Grounded Research-Ideation Skill Suite from ML Conference Outcomes

Paper • 2607.04439 • Published 12 days ago • 58

Search Beyond What Can Be Taught: Evolving the Knowledge Boundary in Agentic Visual Generation

Paper • 2607.05382 • Published 8 days ago • 74

ABot-AgentOS: A General Robotic Agent OS with Lifelong Multi-modal Memory

Paper • 2607.10350 • Published 6 days ago • 77

Hierarchical Sparse Attention Done Right: Toward Infinite Context Modeling

Paper • 2607.02980 • Published 14 days ago • 78

DOPD: Dual On-policy Distillation

Paper • 2606.30626 • Published 18 days ago • 111

Weak-to-Strong Generalization via Direct On-Policy Distillation

Paper • 2607.05394 • Published 9 days ago • 122

Harness Handbook: Making Evolving Agent Harnesses Readable,Navigable, and Editable

Paper • 2607.13285 • Published 3 days ago • 179

The Mirage of Optimizing Training Policies: Monotonic Inference Policies as the Real Objective for LLM Reinforcement Learning

Paper • 2606.29526 • Published 19 days ago • 165

upvoted a collection about 16 hours ago

Nemotron-Labs-TwoTower

Diffusion Language Modeling with Pretrained Autoregressive Nemotron 3 Models • 1 item • Updated about 14 hours ago • 8

upvoted 2 articles about 23 hours ago

Article

A brief history of distillation in AI

sergiopaniego

•

14 days ago

• 3

Article

Distillation in 2026 (so far): which frontier models use it and how

sergiopaniego

•

9 days ago

• 17

upvoted an article about 24 hours ago

Article

Model Routing Is Simple. Until It Isn’t.

ibm-research

•

2 days ago

• 30

upvoted 2 collections 1 day ago

Qwen-AgentWorld

3 items • Updated 23 days ago • 67

Bonsai 27B

10 items • Updated 2 days ago • 153