25 67

Suraj

ghishadow

AI & ML interests

None yet

Recent Activity

liked a dataset 3 days ago

HuggingFaceFW/fineweb

liked a model 10 days ago

bharatgenai/Param2-17B-A2.4B-Thinking

upvoted a paper 21 days ago

Unified Latents (UL): How to train your latents

View all activity

Organizations

liked a dataset 3 days ago

HuggingFaceFW/fineweb

Viewer • Updated Jul 11, 2025 • 52.5B • 189k • 2.71k

liked a model 10 days ago

bharatgenai/Param2-17B-A2.4B-Thinking

Text Generation • 17B • Updated 8 days ago • 2.77k • 58

upvoted a paper 21 days ago

Unified Latents (UL): How to train your latents

Paper • 2602.17270 • Published about 1 month ago • 58

liked a model 22 days ago

facebook/sam3

Mask Generation • 0.9B • Updated Nov 20, 2025 • 2.4M • 1.73k

upvoted an article 23 days ago

Article

Small Language Models (SLM): A Comprehensive Overview

Feb 22, 2025

•

140

liked a Space 23 days ago

QED-Nano: Teaching a Tiny Model to Prove Hard Theorems

📝

Who needs 1T parameters? Olympiad proofs with a 4B model

upvoted an article 24 days ago

Article

Bamba: Inference-Efficient Hybrid Mamba2 Model

Dec 18, 2024

•

upvoted an article 27 days ago

Article

GGML and llama.cpp join HF to ensure the long-term progress of Local AI

about 1 month ago

•

488

liked 3 models 3 months ago

upvoted a collection 4 months ago

Ministral 3

Collection

Mistral Ministral 3: new multimodal models in Base, Instruct, and Reasoning variants, available in 3B, 8B, and 14B sizes. • 36 items • Updated 10 days ago • 31

liked a model 4 months ago

litert-community/Gemma3-1B-IT

Text Generation • Updated Jan 9 • 24.4k • 555

liked a model 5 months ago

maya-research/maya1

Text-to-Speech • Updated Nov 12, 2025 • 47.1k • 872

upvoted a paper 5 months ago

Latent Diffusion Model without Variational Autoencoder

Paper • 2510.15301 • Published Oct 17, 2025 • 50

liked 2 models 5 months ago

rednote-hilab/dots.ocr

Image-Text-to-Text • 3B • Updated Oct 31, 2025 • 260k • 1.28k

openai/gpt-oss-20b

Text Generation • 22B • Updated Aug 26, 2025 • 7.26M • • 4.47k

upvoted an article 7 months ago

Article

The Hacker's Guide to Building an AI Supercluster

Aug 31, 2025

•

liked a Space 7 months ago

The Ultra-Scale Playbook

🌌

3.75k

The ultimate guide to training LLM on large GPU Clusters

upvoted a collection 7 months ago

Gemma 3-270m

Collection

Collection of models for Gemma 3-270m • 4 items • Updated Dec 16, 2025 • 21

Suraj

AI & ML interests

Recent Activity

Organizations

ghishadow's activity

Small Language Models (SLM): A Comprehensive Overview

QED-Nano: Teaching a Tiny Model to Prove Hard Theorems

Bamba: Inference-Efficient Hybrid Mamba2 Model

GGML and llama.cpp join HF to ensure the long-term progress of Local AI

The Hacker's Guide to Building an AI Supercluster

The Ultra-Scale Playbook