Perusha Moodley
moodlep
·
AI & ML interests
RL, DRL, Decision Transformers, Auxiliary signals, self-supervised methods
Recent Activity
liked
a dataset
10 days ago
Anthropic/hh-rlhf
upvoted
a
paper
22 days ago
HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs
updated
a model
29 days ago
moodlep/smollm2-17b-dpo-cai-v1
Organizations
Collections
1
models
9
moodlep/smollm2-17b-dpo-cai-v1
Updated
•
11
moodlep/smollm2-1.7b-instr-sft-cai-v1
Updated
moodlep/smollm2-1.7b-instr-sft-cai
Updated
•
12
moodlep/mistral-7b-sft-constitutional-ai
Updated
•
3
moodlep/rl_course_vizdoom_health_gathering_supreme
Reinforcement Learning
•
Updated
moodlep/output
Updated
moodlep/a2c-AntBulletEnv-v0
Reinforcement Learning
•
Updated
•
1
moodlep/ppo-Huggy
Reinforcement Learning
•
Updated
•
24
moodlep/ppo-LunarLander-v2
Reinforcement Learning
•
Updated
•
6