·
AI & ML interests
None yet
Organizations
None yet
splusminusx/SmolLM2-FT-MyDataset
Text Generation
• 0.1B • Updated
• 2
splusminusx/SmolLM2-FT-ORPO
Text Generation
• 0.1B • Updated
• 2
splusminusx/SmolLM2-FT-DPO
Text Generation
• 0.1B • Updated
• 2
splusminusx/Starling-LM-7B-beta-GGUF
7B • Updated
• 11
splusminusx/a2c-PandaReachDense-v2
Reinforcement Learning
• Updated
• 3
splusminusx/rl_course_vizdoom_health_gathering_supreme
Reinforcement Learning
• Updated
splusminusx/ppo-CartPole-v1-unit-8
Updated
splusminusx/LunarLander-v2-unit-8
Reinforcement Learning
• Updated
splusminusx/poca-SoccerTwos
Reinforcement Learning
• Updated
• 2
splusminusx/a2c-AntBulletEnv-v0
Reinforcement Learning
• Updated
• 1
Reinforcement Learning
• Updated
• 47
splusminusx/ppo-SnowballTarget
Reinforcement Learning
• Updated
• 7
splusminusx/Reinforce-PixelCopter
Reinforcement Learning
• Updated
splusminusx/Reinforce-CartPole-v1
Reinforcement Learning
• Updated
splusminusx/dqn-SpaceInvadersNoFrameskip-v4
Reinforcement Learning
• Updated
• 2
Reinforcement Learning
• Updated
splusminusx/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
• Updated
Reinforcement Learning
• Updated
• 14
splusminusx/ppo-LunarLander-v2
Reinforcement Learning
• Updated
• 1