LLM Alignment faking in large language models Paper • 2412.14093 • Published Dec 18, 2024 • 7 fka/awesome-chatgpt-prompts Viewer • Updated Jan 6 • 203 • 12.2k • 7.62k