garak: A Framework for Security Probing Large Language Models Paper • 2406.11036 • Published Jun 16, 2024 • 1
Semantic Consistency for Assuring Reliability of Large Language Models Paper • 2308.09138 • Published Aug 17, 2023 • 2
Representation noising effectively prevents harmful fine-tuning on LLMs Paper • 2405.14577 • Published May 23, 2024 • 1
SemScore: Automated Evaluation of Instruction-Tuned LLMs based on Semantic Textual Similarity Paper • 2401.17072 • Published Jan 30, 2024 • 25
Intrinsic Sliced Wasserstein Distances for Comparing Collections of Probability Distributions on Manifolds and Graphs Paper • 2010.15285 • Published Oct 28, 2020 • 1