Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
71.2
TFLOPS
Roman Ivanov
perelmanych
14
2
6
Follow
0 followers
·
1 following
AI & ML interests
None yet
Recent Activity
new
activity
12 days ago
WeiboAI/VibeThinker-3B:
Really looking forward for 9B or 12B variants
liked
a model
12 days ago
WeiboAI/VibeThinker-3B
upvoted
an
article
17 days ago
GLM-5.2: Built for Long-Horizon Tasks
View all activity
Organizations
None yet
perelmanych
's activity
All
Models
Datasets
Spaces
Buckets
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
New activity in
WeiboAI/VibeThinker-3B
12 days ago
Really looking forward for 9B or 12B variants
4
#20 opened 12 days ago by
perelmanych
liked
a model
12 days ago
WeiboAI/VibeThinker-3B
Text Generation
•
3B
•
Updated
5 days ago
•
77.3k
•
•
776
upvoted
an
article
17 days ago
view article
Article
GLM-5.2: Built for Long-Horizon Tasks
zai-org
•
18 days ago
•
120
New activity in
ubergarm/Kimi-K2-Thinking-GGUF
8 months ago
Definitely interested in this one!
🚀
2
25
#1 opened 8 months ago by
mtcl
upvoted
a
collection
10 months ago
Qwen3-Coder
Collection
5 items
•
Updated
Dec 31, 2025
•
180
New activity in
xai-org/grok-2
11 months ago
Incorrect Model Uploaded
🤗
👍
18
6
#8 opened 11 months ago by
noteventhrice
New activity in
zai-org/CC-Bench-trajectories
11 months ago
Qwen3 coder version
#1 opened 11 months ago by
perelmanych
New activity in
sleepdeprived3/Llama-3.3-T3
12 months ago
Difference from other presets
#1 opened 12 months ago by
perelmanych
liked
a model
12 months ago
sleepdeprived3/Llama-3.3-T3
Updated
May 18, 2025
•
1
liked
3 models
about 1 year ago
agentica-org/DeepScaleR-1.5B-Preview
Text Generation
•
2B
•
Updated
Apr 9, 2025
•
4.04k
•
•
584
bartowski/open-thoughts_OpenThinker2-32B-GGUF
Text Generation
•
33B
•
Updated
Apr 5, 2025
•
617
•
11
open-thoughts/OpenThinker2-32B
Text Generation
•
33B
•
Updated
Jun 5, 2025
•
10.8k
•
•
57
liked
a model
over 1 year ago
bartowski/Qwen_QwQ-32B-GGUF
Text Generation
•
33B
•
Updated
Mar 5, 2025
•
3.54k
•
167
New activity in
bartowski/DeepSeek-R1-Distill-Qwen-32B-GGUF
over 1 year ago
R1 32b is much worse than QwQ ...
22
#6 opened over 1 year ago by
mirek190
New activity in
deepseek-ai/DeepSeek-R1-Distill-Llama-70B
over 1 year ago
SFT (Non-RL) distillation is this good on a sub-100B model?
3
#2 opened over 1 year ago by
KrishnaKaasyap
New activity in
mradermacher/Llama3-ChatQA-1.5-70B-GGUF
about 2 years ago
IQ2_XS variant
1
#2 opened about 2 years ago by
perelmanych
New activity in
lmsys/vicuna-33b-v1.3
over 2 years ago
When we can expect vicuna variant of CodeLlama-2 34b model?
👍
1
#10 opened over 2 years ago by
perelmanych
New activity in
TheBloke/WizardCoder-Guanaco-15B-V1.1-GGML
almost 3 years ago
Can't load q5_1 model
3
#1 opened almost 3 years ago by
perelmanych
New activity in
anon8231489123/vicuna-13b-GPTQ-4bit-128g
about 3 years ago
Error when using with web-ui "KeyError: 'model.layers.39.self_attn.q_proj.wf1'"
❤️
1
16
#7 opened about 3 years ago by
TheFairyMan
New activity in
anon8231489123/gpt4-x-alpaca-13b-native-4bit-128g
about 3 years ago
Error using ooba-gooba
👍
1
39
#6 opened about 3 years ago by
blueisbest
Load more