Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
fristrup
's Collections
MCTS-free RL reasoning
MCTS-free RL reasoning
updated
Jan 25
Upvote
-
Kimi k1.5: Scaling Reinforcement Learning with LLMs
Paper
•
2501.12599
•
Published
Jan 22
•
111
Upvote
-
Share collection
View history
Collection guide
Browse collections