Revealing Fine-Grained Values and Opinions in Large Language Models Paper • 2406.19238 • Published Jun 27, 2024 • 14
Self-Exploring Language Models: Active Preference Elicitation for Online Alignment Paper • 2405.19332 • Published May 29, 2024 • 15
LLMs achieve adult human performance on higher-order theory of mind tasks Paper • 2405.18870 • Published May 29, 2024 • 17
Guiding a Diffusion Model with a Bad Version of Itself Paper • 2406.02507 • Published Jun 4, 2024 • 16