Generate a leaderboard for evaluating language models
Evaluate LLM cybersecurity risks
Measuring Security Aptitude Across LLMs