Haebin Seong's picture

4 3 1

Haebin Seong

hbseong

·

https://imnotkind.github.io/

haebin-seong-95097615a

AI & ML interests

None yet

Recent Activity

authored a paper 6 days ago

SafeRoute: Adaptive Model Selection for Efficient and Accurate Safety Guardrails in Large Language Models

updated a model 6 days ago

hbseong/peft-starcoder-lora-a100-adalflow

upvoted a paper 6 days ago

SafeRoute: Adaptive Model Selection for Efficient and Accurate Safety Guardrails in Large Language Models

View all activity

Organizations

None yet

hbseong's activity

upvoted a paper 6 days ago

SafeRoute: Adaptive Model Selection for Efficient and Accurate Safety Guardrails in Large Language Models

Paper • 2502.12464 • Published 7 days ago • 27

upvoted 2 papers 4 months ago

Do LLMs Have Political Correctness? Analyzing Ethical Biases and Jailbreak Vulnerabilities in AI Systems

Paper • 2410.13334 • Published Oct 17, 2024 • 13

HarmAug: Effective Data Augmentation for Knowledge Distillation of Safety Guard Models

Paper • 2410.01524 • Published Oct 2, 2024 • 3