solved classic rl environments
Nitish Pandey
nitishpandey04
AI & ML interests
LLMs, Translation
Recent Activity
upvoted
an
article
about 13 hours ago
Deriving the PPO Loss from First Principles
updated
a collection
16 days ago
Classic Reinforcement Learning
updated
a model
16 days ago
nitishpandey04/CarRacing-v3