Hugging Face
Nishanth K R
itsme-nishanth
5 followers · 46 following
AI & ML interests
AI, ML, Data intelligence
Recent Activity
New activity, 1 day ago: TeichAI/Qwen3-4B-Thinking-2507-Gemini-3-Pro-Preview-High-Reasoning-Distill: Requesting benchmarks
Liked a model, 7 days ago: Nanbeige/Nanbeige4.1-3B
Reacted to Sunny111's post, about 1 month ago:
Are you familiar with reverse residual connections or looping in language models? Excited to share my Looped-GPT blog post and codebase: https://github.com/sanyalsunny111/Looped-GPT TL;DR: looping during pre-training improves generalization. Plot shows GPT2 LMs pre-trained with 15.73B OWT tokens. P.S. This is my first post here. I have ~4 followers and zero expectations for reach.
Organizations
itsme-nishanth's datasets (5)
itsme-nishanth/mini-gemma-finewik-tokenized · Viewer · Updated Dec 31, 2025 · 49.6k · 11
itsme-nishanth/mini-gemma-finewiki-tokenized · Viewer · Updated Dec 31, 2025 · 49.6k · 5
itsme-nishanth/JAT-GPT-pretrain_v2_tokenized · Viewer · Updated Jul 19, 2025 · 40k · 103
itsme-nishanth/JAT-GPT-pretrain_v2 · Viewer · Updated Jul 19, 2025 · 40k · 105
itsme-nishanth/JAT-GPT-pretrain · Viewer · Updated Jul 18, 2025 · 10k · 109