cg
percisestretch
ยท
AI & ML interests
None yet
Recent Activity
liked
a model about 2 months ago
tencent/HY-MT1.5-1.8B liked
a model 3 months ago
ibm-granite/granite-4.0-350m new activity
6 months ago
tencent/Hunyuan-MT-7B:why are you using query/key layer norm AFTER rotary Organizations
None yet