Edit Models filters
Apps
Inference Providers
Active filters:
QuestionAnswering
JamieAi33/Phi-2-QLora
JamieAi33/Phi-2_PEFT
KakashiH/BashExplainer_Gemma
2KKLabs/Kaleidoscope_small_v1
2KKLabs/Kaleidoscope_large_v1
SEGAgentRL/LLDS-A-GRPO-Qwen2.5-7B-Ins
Reinforcement Learning
•
8B
•
Updated
•
4
•
2
SEGAgentRL/LLDS-A-GRPO-Qwen2.5-7B-Base
Reinforcement Learning
•
8B
•
Updated
•
11
•
2
SEGAgentRL/LLDS-A-GSPO-Qwen2.5-3B-Ins
Reinforcement Learning
•
3B
•
Updated
•
4
•
1
SEGAgentRL/LLDS-R-GSPO-Qwen2.5-3B-Ins
Reinforcement Learning
•
3B
•
Updated
•
3
•
1
SEGAgentRL/LLDS-R-GRPO-Qwen2.5-3B-Base
Reinforcement Learning
•
3B
•
Updated
•
5
•
1
SEGAgentRL/LLDS-A-GRPO-Qwen2.5-3B-Base-MA
Reinforcement Learning
•
3B
•
Updated
•
3
•
1
SEGAgentRL/LLDS-A-GRPO-Qwen2.5-3B-Base
Reinforcement Learning
•
3B
•
Updated
•
10
SEGAgentRL/LLDS-R-GRPO-Qwen2.5-3B-Ins
Reinforcement Learning
•
3B
•
Updated
•
3
•
1
mradermacher/LLDS-A-GSPO-Qwen2.5-3B-Ins-GGUF
3B
•
Updated
•
81
mradermacher/LLDS-A-GRPO-Qwen2.5-7B-Base-GGUF
8B
•
Updated
•
1.62k
•
1
SEGAgentRL/LLDS-A-GRPO-Qwen2.5-3B-Ins
Reinforcement Learning
•
3B
•
Updated
•
4
mradermacher/LLDS-A-GRPO-Qwen2.5-7B-Base-i1-GGUF
8B
•
Updated
•
762
•
2
mradermacher/LLDS-A-GRPO-Qwen2.5-7B-Ins-GGUF
8B
•
Updated
•
79
•
1
mradermacher/LLDS-A-GRPO-Qwen2.5-3B-Base-GGUF
3B
•
Updated
•
41
mradermacher/LLDS-A-GRPO-Qwen2.5-7B-Ins-i1-GGUF
8B
•
Updated
•
89
•
1
mradermacher/LLDS-A-GRPO-Qwen2.5-3B-Ins-GGUF
3B
•
Updated
•
536
mradermacher/LLDS-R-GRPO-Qwen2.5-3B-Ins-GGUF
3B
•
Updated
•
267
•
1
mradermacher/LLDS-R-GRPO-Qwen2.5-3B-Base-GGUF
3B
•
Updated
•
62
•
1
mradermacher/LLDS-A-GRPO-Qwen2.5-3B-Base-MA-GGUF
3B
•
Updated
•
56
•
1
mradermacher/LLDS-R-GSPO-Qwen2.5-3B-Ins-GGUF
3B
•
Updated
•
716
•
1
mradermacher/LLDS-R-GRPO-Qwen2.5-3B-Base-i1-GGUF
3B
•
Updated
•
270
•
1
mradermacher/LLDS-A-GRPO-Qwen2.5-3B-Base-i1-GGUF
3B
•
Updated
•
364
mradermacher/LLDS-A-GRPO-Qwen2.5-3B-Ins-i1-GGUF
Updated
•
268
mradermacher/LLDS-R-GRPO-Qwen2.5-3B-Ins-i1-GGUF
Updated
•
198
•
2