Mikhail Budnikov

Mlxa

AI & ML interests

None yet

Recent Activity

upvoted a paper 10 days ago

The Differences Between Direct Alignment Algorithms are a Blur

updated a model 6 months ago

Mlxa/deepseek-coder-1.3B-kexer_num_epochs-4_max_lr-0.0001_neftune_alpha-10

updated a model 6 months ago

Mlxa/deepseek-coder-1.3B-kexer_num_epochs-4_max_lr-0.0001_neftune_alpha-5

Mlxa's activity

upvoted a paper 10 days ago

The Differences Between Direct Alignment Algorithms are a Blur

Paper • 2502.01237 • Published 11 days ago • 112

upvoted 2 papers 8 months ago

Long Code Arena: a Set of Benchmarks for Long-Context Code Models

Paper • 2406.11612 • Published Jun 17, 2024 • 25

XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning

Paper • 2406.08973 • Published Jun 13, 2024 • 87

upvoted a paper 10 months ago

Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences

Paper • 2404.03715 • Published Apr 4, 2024 • 61