Roman Ivanov

perelmanych

14 2 6

AI & ML interests

None yet

Recent Activity

new activity 12 days ago

WeiboAI/VibeThinker-3B:Really looking forward for 9B or 12B variants

liked a model 12 days ago

WeiboAI/VibeThinker-3B

upvoted an article 17 days ago

GLM-5.2: Built for Long-Horizon Tasks

Organizations

None yet

New activity in WeiboAI/VibeThinker-3B 12 days ago

Really looking forward for 9B or 12B variants

#20 opened 12 days ago by perelmanych

liked a model 12 days ago

WeiboAI/VibeThinker-3B

Text Generation • 3B • Updated 5 days ago • 77.3k • 776

upvoted an article 17 days ago

Article

GLM-5.2: Built for Long-Horizon Tasks

zai-org • 18 days ago • 120

New activity in ubergarm/Kimi-K2-Thinking-GGUF 8 months ago

Definitely interested in this one!

🚀 2

#1 opened 8 months ago by mtcl

upvoted a collection 10 months ago

Qwen3-Coder

Collection

5 items • Updated Dec 31, 2025 • 180

New activity in xai-org/grok-2 11 months ago

Incorrect Model Uploaded

🤗👍 18

#8 opened 11 months ago by noteventhrice

New activity in zai-org/CC-Bench-trajectories 11 months ago

Qwen3 coder version

#1 opened 11 months ago by perelmanych

New activity in sleepdeprived3/Llama-3.3-T3 12 months ago

Difference from other presets

#1 opened 12 months ago by perelmanych

liked a model 12 months ago

sleepdeprived3/Llama-3.3-T3

Updated May 18, 2025 • 1

liked 3 models about 1 year ago

liked a model over 1 year ago

bartowski/Qwen_QwQ-32B-GGUF

Text Generation • 33B • Updated Mar 5, 2025 • 3.54k • 167

New activity in bartowski/DeepSeek-R1-Distill-Qwen-32B-GGUF over 1 year ago

R1 32b is much worse than QwQ ...

#6 opened over 1 year ago by mirek190

New activity in deepseek-ai/DeepSeek-R1-Distill-Llama-70B over 1 year ago

SFT (Non-RL) distillation is this good on a sub-100B model?

#2 opened over 1 year ago by KrishnaKaasyap

New activity in mradermacher/Llama3-ChatQA-1.5-70B-GGUF about 2 years ago

IQ2_XS variant

#2 opened about 2 years ago by perelmanych

New activity in lmsys/vicuna-33b-v1.3 over 2 years ago

When we can expect vicuna variant of CodeLlama-2 34b model?

👍 1

#10 opened over 2 years ago by perelmanych

New activity in TheBloke/WizardCoder-Guanaco-15B-V1.1-GGML almost 3 years ago

Can't load q5_1 model

#1 opened almost 3 years ago by perelmanych

New activity in anon8231489123/vicuna-13b-GPTQ-4bit-128g about 3 years ago

Error when using with web-ui "KeyError: 'model.layers.39.self_attn.q_proj.wf1'"

❤️ 1

#7 opened about 3 years ago by TheFairyMan

New activity in anon8231489123/gpt4-x-alpaca-13b-native-4bit-128g about 3 years ago

Error using ooba-gooba

👍 1

#6 opened about 3 years ago by blueisbest
