Evaluating Copyright Takedown Methods for Language Models Paper โข 2406.18664 โข Published Jun 26, 2024 โข 1
Assessing the Brittleness of Safety Alignment via Pruning and Low-Rank Modifications Paper โข 2402.05162 โข Published Feb 7, 2024 โข 1