Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
909.9
TFLOPS
Hanz
hanzceo
3
4
23
Follow
Phyte987's profile picture
1 follower
ยท
9 following
hanzceo
hanzceo
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 16 hours ago
Morphing into Hybrid Attention Models
replied
to
Banaxi-Tech
's
post
1 day ago
A new model is coming! Its going to take a long time on my 5070 Ti so expect a release in ~1 month. We think this model is going to be SOTA For its size. Our Mini Version will be 25M Parameters and Pro with 140M. The Pro version has a 3072 Context Window (Extensible to up to 6K with RoPE) And the Mini version has a context window of 4096 (Up to 8K with RoPE) Meanwhile we are currently working on a Instruct Version of our BananaMind 1.5 Base. The training will start this weekend We are very exited to release it when its done!
reacted
to
Banaxi-Tech
's
post
with ๐ฅ
1 day ago
A new model is coming! Its going to take a long time on my 5070 Ti so expect a release in ~1 month. We think this model is going to be SOTA For its size. Our Mini Version will be 25M Parameters and Pro with 140M. The Pro version has a 3072 Context Window (Extensible to up to 6K with RoPE) And the Mini version has a context window of 4096 (Up to 8K with RoPE) Meanwhile we are currently working on a Instruct Version of our BananaMind 1.5 Base. The training will start this weekend We are very exited to release it when its done!
View all activity
Organizations
hanzceo
's datasets
1
Sort:ย Recently updated
hanzceo/sts-en-en
Viewer
โข
Updated
17 days ago
โข
1.18k
โข
46