Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
62.8
TFLOPS
102
3
38
hai
cloudyu
Follow
Seawolf2003's profile picture
distantquant's profile picture
Sirclavin's profile picture
170 followers
·
45 following
yu-hai-52a1702a
AI & ML interests
Long-horizon Agentic Harness
Recent Activity
new
activity
about 13 hours ago
yuxinlu1/gemma-4-12B-coder-fable5-composer2.5-v1-GGUF:
Title: v1 results don't match the claimed 30-40% (base) vs 80-100% (this model) accuracy — sharing my eval
updated
a model
about 16 hours ago
cloudyu/gpt-oss-120b-Fable-5-Distilled-GGUF
liked
a model
about 18 hours ago
autotrust/gpt-oss-120b-Fable-5-Distilled-GGUF
View all activity
Organizations
cloudyu
's activity
All
Models
Datasets
Spaces
Buckets
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
New activity in
yuxinlu1/gemma-4-12B-coder-fable5-composer2.5-v1-GGUF
about 13 hours ago
Title: v1 results don't match the claimed 30-40% (base) vs 80-100% (this model) accuracy — sharing my eval
1
#47 opened about 13 hours ago by
cloudyu
New activity in
WeiboAI/VibeThinker-3B
3 days ago
这不是套壳的qwen 2 3B吗?
👍
1
1
#8 opened 3 days ago by
cloudyu
New activity in
stepfun-ai/Step-3.7-Flash
19 days ago
Supers Found in model
❤️
2
9
#11 opened 19 days ago by
tcclaviger
New activity in
unsloth/DeepSeek-V4-Flash
24 days ago
Worse than (smaller) MiniMax M2.7??
17
#2 opened about 2 months ago by
deleted
New activity in
cloudyu/GPT-OSS-120B-2experts-MLX-q4-Claude-4.6-Opus-Reasoning-Distilled
about 1 month ago
Hard time using it
2
#3 opened about 2 months ago by
Vinpolar
New activity in
tencent/Hy3-preview
about 2 months ago
very nice model
2
#8 opened about 2 months ago by
cloudyu
New activity in
deepseek-ai/DeepSeek-V4-Pro
about 2 months ago
Base都开源了 太大方了 致敬
#73 opened about 2 months ago by
cloudyu
New activity in
cloudyu/GPT-OSS-120B-MLX-q4-Claude-4.6-Opus-Reasoning-Distilled
about 2 months ago
Request Gguff or safetensor version??
1
#1 opened about 2 months ago by
Rubertigno
New activity in
MiniMaxAI/MiniMax-M2.7
2 months ago
1/14 非常糟糕的测试结果。
3
#16 opened 2 months ago by
cloudyu
New activity in
cloudyu/GPT-OSS-120B-2experts-MLX-q4-Claude-4.6-Opus-Reasoning-Distilled
2 months ago
Vllm support
1
#2 opened 2 months ago by
cse2011
New activity in
Qwen/Qwen3.5-397B-A17B
4 months ago
感谢老铁除夕坚持工作
❤️
6
#12 opened 4 months ago by
cloudyu
New activity in
inclusionAI/LLaDA2.1-mini
4 months ago
error report to run example
3
#3 opened 4 months ago by
cloudyu
New activity in
Qwen/Qwen3-Coder-Next
5 months ago
"num_experts_per_tok": 10 这个设置是领导拍脑袋拍出来的吗?
1
#12 opened 5 months ago by
cloudyu
New activity in
cloudyu/Mixtral_34Bx2_MoE_60B
6 months ago
Update README.md
#17 opened 6 months ago by
cherry0328
New activity in
deepseek-ai/DeepSeek-V3.2-Exp
9 months ago
咱这个模型是非得国庆前更新吗??
😔
👍
113
31
#1 opened 9 months ago by
luckjone
New activity in
deepseek-ai/DeepSeek-V3.1-Terminus
9 months ago
国庆是休息日,请给我们关注的同学一点休息时间
👀
👍
64
1
#10 opened 9 months ago by
luckjone
New activity in
deepseek-ai/DeepSeek-V3.2-Exp
9 months ago
Transformers does not recognize this architecture
6
#6 opened 9 months ago by
eva20150932-atlascloud
New activity in
unsloth/grok-2-GGUF
9 months ago
mac studio : loading model vocabulary: unknown pre-tokenizer type: 'grok-2'
#5 opened 9 months ago by
cloudyu
New activity in
Wan-AI/Wan2.2-T2V-A14B-Diffusers
10 months ago
demo能不能亲自跑一下,成功了再发出来?
#8 opened 10 months ago by
cloudyu
New activity in
ByteDance-Seed/Seed-OSS-36B-Instruct
10 months ago
Why is the chat_template mixed with Chinese and English?
👍
2
5
#8 opened 10 months ago by
Daucloud
Load more