Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Fugaku-LLM
community
Activity Feed
Follow
83
AI & ML interests
None defined yet.
Recent Activity
Taishi-N324
authored
a paper
3 days ago
On the Optimal Reasoning Length for RL-Trained Language Models
Taishi-N324
authored
a paper
5 months ago
MixtureVitae: Open Web-Scale Pretraining Dataset With High Quality Instruction and Reasoning Data Built from Permissive-First Text Sources
Taishi-N324
authored
a paper
6 months ago
Optimal Sparsity of Mixture-of-Experts Language Models for Reasoning Tasks
View all activity
Team members
10
Fugaku-LLM
's datasets
None public yet