Nguyễn Minh Phúc

DatPySci

AI & ML interests

Reinforcement learning, NLP

Recent Activity

updated a dataset 1 day ago

DatPySci/tldr_pythia-6.9b_pref

published a dataset 2 days ago

DatPySci/tldr_pythia-6.9b_pref

published a dataset 10 days ago

DatPySci/gpt2_dpo_tldr

Collections 1

Models 95

DatPySci/EleutherAI_pythia-1b-deduped__dpo_shift_beta_0.1_steps_72000__tldr

Updated Nov 18, 2024

DatPySci/EleutherAI_pythia-1b-deduped__dpo_shift_beta_0.05_steps_72000__tldr

Updated Nov 18, 2024

DatPySci/EleutherAI_pythia-1b-deduped__dpo_shift_beta_0.01_steps_72000__tldr

Updated Nov 18, 2024

DatPySci/EleutherAI_pythia-1b-deduped__dpo_shift_beta_0.1_steps_32400__tldr

Updated Nov 18, 2024

DatPySci/EleutherAI_pythia-1b-deduped__dpo_shift_beta_0.05_steps_32400__tldr

Updated Nov 18, 2024

DatPySci/EleutherAI_pythia-1b-deduped__dpo_shift_beta_0.01_steps_32400__tldr

Updated Nov 18, 2024

DatPySci/llama3-1b_reward_tldr

Text Classification • Updated Nov 11, 2024 • 107

DatPySci/EleutherAI_pythia-2.8b-deduped__ipo_pythia-2.8b_beta-0.1__tldr

Updated Nov 9, 2024

DatPySci/EleutherAI_pythia-2.8b-deduped__dpo_pythia-2.8b_beta-0.05__tldr

Updated Nov 8, 2024

DatPySci/EleutherAI_pythia-2.8b-deduped__length_IS_pythia-2.8b_beta-0.05__tldr

Updated Oct 26, 2024

Datasets 55

DatPySci/tldr_pythia-6.9b_pref

Viewer • Updated 1 day ago • 94.9k • 47

DatPySci/tldr_synthetic_llama3_3b_32

Viewer • Updated 14 days ago • 5.47k • 54

DatPySci/llama3_3b_sft_tldr_synthetic

Viewer • Updated 19 days ago • 5.47k • 103

DatPySci/weak_gpt2_large_dpo_hh

Viewer • Updated 29 days ago • 8k • 52

DatPySci/weak_gpt2_medium_dpo_hh

Viewer • Updated 29 days ago • 8k • 56

DatPySci/weak_gpt2_dpo_hh

Viewer • Updated 29 days ago • 8k • 58

DatPySci/Llama-3.2-3B_refine_gpt2-large_tldr

Viewer • Updated 30 days ago • 8k • 86

DatPySci/Llama-3.2-3B_refine_gpt2-medium_tldr

Viewer • Updated 30 days ago • 8k • 84

DatPySci/Llama-3.2-3B_refine_gpt2_tldr

Viewer • Updated 30 days ago • 8k • 79

DatPySci/Llama-3.2-1B_refine_gpt2-large_tldr

Viewer • Updated 30 days ago • 8k • 48