Hugging Face
Saish Mendke
saishmendke10
AI & ML interests
None yet
Recent Activity
Updated a model 17 days ago: saishmendke10/news_llm_3.2_1b
Updated a model 18 days ago: saishmendke10/news_llm_3.2_1b_grpo
Organizations
None yet
Papers
1
arXiv:2412.13578
models
9
saishmendke10/news_llm_3.2_1b (updated 17 days ago) • 37
saishmendke10/news_llm_3.2_1b_grpo (updated 18 days ago)
saishmendke10/news_llm_3.2_3b_grpo (updated Jul 25)
saishmendke10/Llama-3.2-3B-Instruct-GRPO-test (updated Jul 20)
saishmendke10/Qwen2-0.5B-GRPO-test (updated Jul 7)
saishmendke10/news_llm_3-8b-Instruct-bnb-4bit-grpo (updated Jun 16)
saishmendke10/news_llm_3.2_3b (updated Jun 16) • 18
saishmendke10/news_llm_3-8b-Instruct-bnb-4bit (updated Jun 15) • 3
saishmendke10/Qwen2-0.5B-GRPO (updated Apr 3)
datasets
0
None public yet