SDPO: Segment-Level Direct Preference Optimization for Social Agents Paper • 2501.01821 • Published Jan 3 • 18
Enhancing Human-Like Responses in Large Language Models Paper • 2501.05032 • Published 29 days ago • 49