Running 165 The ultimate guide to RL environments: building and scaling them in the LLM era 📝 165 Building and scaling RL environments for LLM training
unsloth/NVIDIA-Nemotron-3-Nano-Omni-30B-A3B-Reasoning Text Generation • 33B • Updated 23 days ago • 1.77k • 14
DavidAU/Qwen3.6-40B-Claude-4.6-Opus-Deckard-Heretic-Uncensored-Thinking Image-Text-to-Text • 40B • Updated 7 days ago • 6.45k • 49