Probing LLMs for Joint Encoding of Linguistic Categories Paper • 2310.18696 • Published Oct 28, 2023 • 1
How far can bias go? -- Tracing bias from pretraining data to alignment Paper • 2411.19240 • Published Nov 28, 2024
CIVICS: Building a Dataset for Examining Culturally-Informed Values in Large Language Models Paper • 2405.13974 • Published May 22, 2024 • 9