Yu Yang's picture

3 8 2

Yu Yang

yuyangy

·

https://sites.google.com/g.ucla.edu/yuyang/home

AI & ML interests

None yet

Recent Activity

authored a paper about 1 month ago

AI Risk Categorization Decoded (AIR 2024): From Government Regulations to Corporate Policies

authored a paper about 1 month ago

DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails

upvoted a paper about 1 month ago

DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails

View all activity

Organizations

yuyangy's activity

upvoted a paper about 1 month ago

DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails

Paper • 2502.05163 • Published Feb 7 • 22

upvoted 3 papers 5 months ago

Flow-DPO: Improving LLM Mathematical Reasoning through Online Multi-Agent Learning

Paper • 2410.22304 • Published Oct 29, 2024 • 18

SmallToLarge (S2L): Scalable Data Selection for Fine-tuning Large Language Models by Summarizing Training Trajectories of Small Models

Paper • 2403.07384 • Published Mar 12, 2024 • 1

SecCodePLT: A Unified Platform for Evaluating the Security of Code GenAI

Paper • 2410.11096 • Published Oct 14, 2024 • 13

upvoted a paper 9 months ago

MIRAI: Evaluating LLM Agents for Event Forecasting

Paper • 2407.01231 • Published Jul 1, 2024 • 18

upvoted a paper 10 months ago

Weak-to-Strong Extrapolation Expedites Alignment

Paper • 2404.16792 • Published Apr 25, 2024 • 11

upvoted a collection 10 months ago

Weak-to-Strong Extrapolation Expedites Alignment

Better aligned models obtained by weak-to-strong model extrapolation (ExPO) • 25 items • Updated 18 days ago • 17

upvoted a paper about 1 year ago

Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models

Paper • 2401.01335 • Published Jan 2, 2024 • 65