Solved classic RL environments
Nitish Pandey
nitishpandey04
AI & ML interests
LLMs, Translation
Recent Activity
upvoted an article 10 days ago: Deriving the PPO Loss from First Principles
updated a collection 25 days ago: Classic Reinforcement Learning
updated a model 25 days ago: nitishpandey04/CarRacing-v3