Gonçalo Faria
graf
AI & ML interests
NLP
Recent Activity
updated a model 1 day ago: graf/sbon8-c87baa82-qwen2.5
updated a model 1 day ago: graf/bt_oracle-c0a966a3-qwen2.5
published a model 2 days ago: graf/sbon8-c87baa82-qwen2.5
models (91)
graf/sbon8-c87baa82-qwen2.5 • 2B • Updated • 167
graf/bt_oracle-c0a966a3-qwen2.5 • 2B • Updated • 147
graf/qwen2.5-1.5b-instruct-sft-test-gtx-lr1e-5-overfit • 2B • Updated • 676
graf/qwen2.5-1.5b-instruct-sft-test-gtx-lr1e-5 • 2B • Updated • 1.67k
graf/qwen2.5-1.5b-instruct-sft-test-gtx-lr1e-6 • 2B • Updated • 845
graf/qwen2.5-1.5b-instruct-sft-test-gtx-lr1e-7 • 2B • Updated • 1.25k
graf/qwen2.5-1.5b-instruct-sft-test-gt2-lr1e-5 • 2B • Updated • 596
graf/qwen2.5-1.5b-instruct-sft-test-gt2-lr1e-6 • 2B • Updated • 11
graf/qwen2.5-1.5b-instruct-sft-test-gt2-lr1e-7 • 2B • Updated • 168
graf/qwen2.5-1.5b-instruct-sft-test-gt-lr1e-7 • 2B • Updated • 270
datasets (47)
graf/qwen_deepsr_train_no_tags • Viewer • Updated • 24.3k • 105
graf/qwen_deepsr_math_test_no_tags • Viewer • Updated • 418 • 33
graf/qwen_deepsr_gsm8k_test_no_tags • Viewer • Updated • 1.28k • 12
graf/qwen_deepsr_fix_train_no_tags • Updated • 13
graf/qwen_deepsr_fix_train • Viewer • Updated • 24.3k • 64
graf/qwen_deepsr_train • Viewer • Updated • 24k • 125
graf/qwen_deepsr_gsm8k_test • Viewer • Updated • 1.27k • 24
graf/qwen_deepsr_math_test • Viewer • Updated • 415 • 55
graf/DeepScaleR-Preview-Dataset.gt.1.20000.ancestral.128.Qwen2.5-1.5B-Instruct.bon • Viewer • Updated • 12.6k • 24
graf/DeepScaleR-Preview-Dataset.gt.4.20000.ancestral.128.Qwen2.5-1.5B-Instruct.rwmv.0.5 • Viewer • Updated • 80k • 8