Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
933.5
TFLOPS
1
33
8
Nitish Pandey
nitishpandey04
Follow
Gargaz's profile picture
Savioureke25's profile picture
21world's profile picture
3 followers
·
18 following
_nitish_pandey_
nitishpandey04
AI & ML interests
LLMs, Translation
Recent Activity
updated
a collection
7 days ago
Reading List
upvoted
a
paper
7 days ago
On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification
updated
a collection
26 days ago
Reading List
View all activity
Organizations
nitishpandey04
's models
5
Sort: Recently updated
nitishpandey04/Reinforce-PixelCopter
Reinforcement Learning
•
Updated
Feb 25
nitishpandey04/Reinforce-CartPole-v1
Reinforcement Learning
•
Updated
Feb 24
nitishpandey04/q-Taxi-v3
Reinforcement Learning
•
Updated
Feb 4
nitishpandey04/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
Feb 4
nitishpandey04/PPO-LunarLander-v2
Reinforcement Learning
•
Updated
Jan 22