Aloïs Thomas
alothomas
·
AI & ML interests
None yet
Organizations
models 12
alothomas/radbert-rad-verifier-context
Text Classification • 0.1B • Updated • 2
alothomas/radbert-rad-verifier-single
Text Classification • 0.1B • Updated • 3
alothomas/deberta-rad-verifier-context
Text Classification • 0.2B • Updated • 4
alothomas/deberta-rad-verifier-single
Text Classification • 0.2B • Updated • 2
alothomas/Qwen2.5-0.5B-PRM-RAD-balanced-150k-LastStepOnly
Token Classification • 0.5B • Updated • 1
alothomas/Qwen2.5-0.5B-PRM-RAD-seq
Updated
alothomas/ppo-LunarLander-v2
Reinforcement Learning • Updated • 2
alothomas/Qwen2.5-3B-PRM-RAD-balanced-150k
Token Classification • 3B • Updated • 1
alothomas/Qwen2.5-0.5B-PRM-RAD-balanced-150k
Token Classification • 0.5B • Updated • 1
alothomas/Qwen2.5-0.5B-PRM-RAD-balanced-V4
Token Classification • 0.5B • Updated • 1
datasets 0
None public yet