ProgramTrace
non-profit
models 8
PTPReasoning/Llama-3.1-8B-RL-Clean-V2
8B • Updated
PTPReasoning/Llama-3.1-8B-RL-Baseline-V2
8B • Updated
PTPReasoning/Llama-3.1-8B-SFT-Baseline
Text Generation • 8B • Updated
PTPReasoning/Llama-3.1-8B-SFT-Clean-V2
Text Generation • 8B • Updated
PTPReasoning/Qwen2.5-7B-Base-RL-Clean-V2
Text Generation • 8B • Updated
PTPReasoning/Qwen2.5-7B-Base-RL-Baseline
Text Generation • 8B • Updated
PTPReasoning/Qwen2.5-7B-Base-SFT-Clean-V2
Text Generation • 8B • Updated
PTPReasoning/Qwen2.5-7B-Base-SFT-Baseline-V2
Text Generation • 8B • Updated
datasets 12
PTPReasoning/finqa
Viewer • Updated • 1.15k • 52
PTPReasoning/hotpot_qa
Viewer • Updated • 500 • 48
PTPReasoning/PubMedQA
Viewer • Updated • 1.5k • 8
PTPReasoning/MedCalc-Bench-v1.0
Viewer • Updated • 22.5k • 14 • 2
PTPReasoning/PTP-RL-ITL-Final-Clean-V2
Viewer • Updated • 19k • 6
PTPReasoning/PTP-SFT-ITL-Final-Baseline-V2
Viewer • Updated • 4.12k • 8
PTPReasoning/PTP-SFT-ITL-Final-Clean-V2
Viewer • Updated • 4.21k • 5
PTPReasoning/PTP-RL-MedCalc-Bench
Viewer • Updated • 9.34k • 7
PTPReasoning/PTP-RL-DAPO-EN
Viewer • Updated • 14.1k • 6
PTPReasoning/mmlu_pro_biology
Viewer • Updated • 717 • 6