The Differences Between Direct Alignment Algorithms are a Blur Paper • 2502.01237 • Published 11 days ago • 112
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-4_max_lr-0.0001_neftune_alpha-10 Text Generation • Updated Aug 14, 2024 • 104
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-4_max_lr-0.0001_neftune_alpha-5 Text Generation • Updated Aug 14, 2024 • 102
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-4_max_lr-0.0001_neftune_alpha-0 Text Generation • Updated Aug 14, 2024 • 99
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-4_max_lr-3e-05_neftune_alpha-10 Text Generation • Updated Aug 14, 2024 • 107
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-4_max_lr-3e-05_neftune_alpha-5 Text Generation • Updated Aug 14, 2024 • 99
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-4_max_lr-3e-05_neftune_alpha-0 Text Generation • Updated Aug 14, 2024 • 106
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-4_max_lr-1e-05_neftune_alpha-10 Text Generation • Updated Aug 14, 2024 • 106
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-4_max_lr-1e-05_neftune_alpha-5 Text Generation • Updated Aug 13, 2024 • 102
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-4_max_lr-1e-05_neftune_alpha-0 Text Generation • Updated Aug 13, 2024 • 103
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-2_max_lr-0.0001_neftune_alpha-10 Text Generation • Updated Aug 13, 2024 • 106
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-2_max_lr-0.0001_neftune_alpha-5 Text Generation • Updated Aug 13, 2024 • 99
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-2_max_lr-0.0001_neftune_alpha-0 Text Generation • Updated Aug 13, 2024 • 99
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-2_max_lr-3e-05_neftune_alpha-10 Text Generation • Updated Aug 13, 2024 • 106
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-2_max_lr-3e-05_neftune_alpha-5 Text Generation • Updated Aug 13, 2024 • 99
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-2_max_lr-3e-05_neftune_alpha-0 Text Generation • Updated Aug 13, 2024 • 103
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-2_max_lr-1e-05_neftune_alpha-10 Text Generation • Updated Aug 13, 2024 • 102
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-2_max_lr-1e-05_neftune_alpha-5 Text Generation • Updated Aug 13, 2024 • 104
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-2_max_lr-1e-05_neftune_alpha-0 Text Generation • Updated Aug 13, 2024 • 103
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-1_max_lr-0.0001_neftune_alpha-5 Text Generation • Updated Aug 13, 2024 • 99