3 7

Alexey Gorbatovski

Myashka

Myashka

AI & ML interests

NLP Alignment

Recent Activity

commented on a paper 9 days ago

BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping

new activity 21 days ago

agentica-org/DeepScaleR-Preview-Dataset:There are no answers for 6 samples

updated a model 2 months ago

Myashka/Qwen2.5-7B-UltraChat200K_EMA_SFT-Lr_3e_6-Alpha_0.01

View all activity

Organizations

None yet

Papers 4

models 37

datasets 11

Myashka/CryptoNews_50_50

Viewer • Updated Mar 23, 2024 • 1.15k • 6

Myashka/CryptoNews

Viewer • Updated Mar 17, 2024 • 1.15k • 6

Myashka/gpt2-imdb-constractive

Viewer • Updated Dec 4, 2023 • 59.1k • 8

Myashka/SO_Python_basics_QA_human_pref

Viewer • Updated Nov 5, 2023 • 185k • 28

Myashka/SO-Python_basics_QA-filtered-2023-T5_paraphrased-tanh_score

Viewer • Updated Aug 23, 2023 • 117k • 11

Myashka/SO_Python_basics_QA_human_preferences_no_gen

Viewer • Updated Jul 29, 2023 • 6.17k • 32

Myashka/SO-Python_basics_QA-filtered-2023-tanh_score

Viewer • Updated Jul 25, 2023 • 30k • 11

Myashka/SO-Python_QA-filtered-2023-no_code-tanh_score

Viewer • Updated Jul 18, 2023 • 66.1k • 11 • 2

Myashka/SO-Python_QA-filtered-2023-tanh_score

Viewer • Updated Jul 14, 2023 • 69.5k • 36

Myashka/SO-Python_QA-filtered-2023-tanh_score-after_2023_02

Viewer • Updated Jul 13, 2023 • 1.06k • 14 • 1

View 11 datasets

Alexey Gorbatovski

AI & ML interests

Recent Activity

Organizations

Papers 4

models 37 Sort: Recently updated

datasets 11 Sort: Recently updated

models 37

datasets 11