Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

Daniil Tsesarev's picture

3

Daniil Tsesarev

tsessk

21world's profile picture

·

AI & ML interests

transformers)

Organizations

None yet

tsessk 's collections 1

llm course @ HSE and vk llm A collection of SmolLM-135M models fine-tuned with DPO, PPO, and Reward Modeling to enhance human-like expressiveness

tsessk/llm-course-hw2-dpo

Text Generation • 0.1B • Updated Mar 8 • 5
tsessk/llm-course-hw2-reward-model

Text Classification • 0.1B • Updated Mar 8 • 3
tsessk/llm-course-hw2-ppo

Text Generation • 0.1B • Updated Mar 8 • 8

llm course @ HSE and vk llm A collection of SmolLM-135M models fine-tuned with DPO, PPO, and Reward Modeling to enhance human-like expressiveness

tsessk/llm-course-hw2-dpo

Text Generation • 0.1B • Updated Mar 8 • 5
tsessk/llm-course-hw2-reward-model

Text Classification • 0.1B • Updated Mar 8 • 3
tsessk/llm-course-hw2-ppo

Text Generation • 0.1B • Updated Mar 8 • 8

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs