suchen
suc16
ยท
AI & ML interests
LLM
Recent Activity
liked
a model
19 days ago
moonshotai/Moonlight-16B-A3B
upvoted
an
article
about 1 month ago
Proximal Policy Optimization (PPO)
upvoted
a
paper
2 months ago
REINFORCE++: A Simple and Efficient Approach for Aligning Large Language
Models
Organizations
None yet
models
None public yet
datasets
None public yet