Ariel Kwiatkowski
RedTachyon
AI & ML interests
RL, MARL, Crowd Simulation
Recent Activity
upvoted a paper 7 days ago
Soft Tokens, Hard Truths
upvoted a paper 8 months ago
PILAF: Optimal Human Preference Sampling for Reward Modeling
authored a paper 8 months ago
PILAF: Optimal Human Preference Sampling for Reward Modeling