Output Guardrail Policy Datasets Collection Groups both benchmark and training datasets for guardrail models • 28 items • Updated 6 days ago
Output Guardrail Processed Datasets Collection Datasets created from processing `Output Guardrail Policy Datasets` through `instruction_tuning_prepare.py` • 13 items • Updated Dec 19, 2024
GroUSE: A Benchmark to Evaluate Evaluators in Grounded Question Answering Paper • 2409.06595 • Published Sep 10, 2024 • 38
PrimeGuard: Safe and Helpful LLMs through Tuning-Free Routing Paper • 2407.16318 • Published Jul 23, 2024 • 7
PrimeGuard: Safe and Helpful LLMs through Tuning-Free Routing Paper • 2407.16318 • Published Jul 23, 2024 • 7
Does fine-tuning GPT-3 with the OpenAI API leak personally-identifiable information? Paper • 2307.16382 • Published Jul 31, 2023