Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Cem Anil
anilcem
Follow
Drzero0's profile picture
1 follower
ยท
1 following
AI & ML interests
None yet
Recent Activity
authored
a paper
4 days ago
Constitutional Classifiers: Defending against Universal Jailbreaks across Thousands of Hours of Red Teaming
authored
a paper
about 1 year ago
Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training
authored
a paper
over 1 year ago
Studying Large Language Model Generalization with Influence Functions
View all activity
Organizations
Papers
3
arxiv:
2501.18837
arxiv:
2401.05566
arxiv:
2308.03296
models
None public yet
datasets
None public yet