mlx-community/Dolci-Think-DPO-32B-Flat • 200k • 20 • 1
mlx-community/Josiefied-Qwen3-dpo-v1-flat • 500 • 59 • 1
mlx-community/dolma3_mix-common_crawl-art_and_design-160k • 160k • 21 • 1
mlx-community/Dolci-Instruct-SFT-No-Tools-400K • 402k • 17
mlx-community/Dolci-Instruct-SFT-No-Tools-200K • 202k • 11
mlx-community/Dolci-Instruct-SFT-No-Tools-100K • 102k • 25
mlx-community/Dolci-Think-RL-7B-2k • 2.2k • 196 • 2
mlx-community/ultrafeedback-prompts-flat-rlhf • 37.9k • 12 • 1
mlx-community/recycling_the_web-400K • 400k • 30
mlx-community/recycling_the_web-1k • 1.1k • 147 • 1
mlx-community/medfit-dataset • 6.44k • 25 • 3
mlx-community/recycling_the_web-100K • 100k • 90
mlx-community/recycling_the_web-200K • 200k • 33
mlx-community/recycling_the_web-1m • 1M • 46
mlx-community/mlx_lm_calibration_v5 • 1 • 12
mlx-community/Intermediate-Thinking-130k • 135k • 74 • 3
mlx-community/hermes-reasoning-tool-use • 51k • 75 • 4
959k • 44 • 5
mlx-community/dhanishtha-2.0-superthinker • 11.7k • 37 • 2
8.57k • 43
mlx-community/dclm-baseline-1.0-138k • 138k • 25 • 1
mlx-community/orpo-dpo-mix-40k-flat-mlx • 44.2k • 12
mlx-community/Human-Like-DPO • 972 • 101 • 4
mlx-community/orpo-dpo-mix-40k-mlx • 44.2k • 38
mlx-community/fineweb-200k • 200k • 32 • 1
mlx-community/qwen3_dwq_calibration_1332_235b • 1.33k • 19 • 2
mlx-community/qwen3_dwq_calibration_5328 • 5.33k • 25
mlx-community/qwen3_dwq_calibration_2664 • 2.66k • 8
mlx-community/qwen3_dwq_calibration_1332 • 1.33k • 15 • 2
1k • 46