Submitted by akhaliq 20 On-Policy Distillation of Language Models: Learning from Self-Generated Mistakes · 7 authors
Submitted by akhaliq 15 Bring Your Own Data! Self-Supervised Evaluation for Large Language Models · 9 authors