cuong1692001/Math12K_high_3B_lr1.25e-6_bs1_gas_1_2GPU Text Generation • 242k • Updated 3 days ago • 16
cuong1692001/Math12K_low_3B_lr1.25e-6_bs1_gas_1_2GPU Text Generation • 242k • Updated 3 days ago • 22