Turbo Pascal

TurboPascal

AI & ML interests

None yet

Recent Activity

updated a collection 10 days ago

LLM

3 items • Updated 10 days ago

upvoted a paper 16 days ago

Revisiting On-Policy Distillation: Empirical Failure Modes and Simple Fixes

Paper • 2603.25562 • Published Mar 26 • 19

updated a collection 28 days ago

LLM

3 items • Updated 10 days ago

upvoted a collection about 1 month ago

Marco-MoE

A suite of multilingual MoE models with highly sparse architectures • 5 items • Updated Apr 8 • 17

liked a model about 2 months ago

AIDC-AI/Marco-Mini-Global-Base

Text Generation • 17B • Updated Apr 3 • 52 • 7

liked a model 2 months ago

AIDC-AI/Marco-Nano-Base

Text Generation • 8B • Updated Apr 3 • 53 • 15

upvoted 2 papers 2 months ago

LongCat-Next: Lexicalizing Modalities as Discrete Tokens

Paper • 2603.27538 • Published Mar 29 • 147

QwenLong-L1.5: Post-Training Recipe for Long-Context Reasoning and Memory Management

Paper • 2512.12967 • Published Dec 15, 2025 • 113

liked 4 datasets 2 months ago

MaziyarPanahi/Nemotron-Cascade-2-SFT-Data-Small

Viewer • Updated Mar 22 • 4.9M • 941 • 4

nvidia/Nemotron-Cascade-2-SFT-Data

Viewer • Updated Mar 19 • 15.9M • 10.4k • 68

stepfun-ai/Step-3.5-Flash-SFT

Viewer • Updated Mar 14 • 1.62M • 11.1k • 336

nohurry/Opus-4.6-Reasoning-3000x-filtered

Viewer • Updated Mar 31 • 2.33k • 3.82k • 609

upvoted 2 papers 2 months ago

GLM-5: from Vibe Coding to Agentic Engineering

Paper • 2602.15763 • Published Feb 17 • 151

Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text

Paper • 2601.22975 • Published Jan 30 • 113

liked a model 3 months ago

google/gemma-3-27b-it

Image-Text-to-Text • 27B • Updated Mar 21, 2025 • 1.44M • 1.98k

New activity in Alibaba-NLP/new-impl 3 months ago

torch.AcceleratorError: CUDA error: device-side assert triggered

#14 opened 3 months ago by

liked a model 8 months ago

HuggingFaceTB/SmolVLM-256M-Instruct

Image-Text-to-Text • 0.3B • Updated Apr 8, 2025 • 916k • 363

upvoted an article 9 months ago

Training and Finetuning Reranker Models with Sentence Transformers

Article • tomaarsen • Mar 26, 2025 • 195

liked a model 9 months ago

ByteDance-Seed/Seed-OSS-36B-Instruct

Text Generation • 36B • Updated Aug 26, 2025 • 39.7k • 499

upvoted a collection 9 months ago

BGE

31 items • Updated Feb 4 • 161