Hugging Face
vijayendraprasad gadwla
vijay1369
0 followers · 10 following
AI & ML interests
None yet
Recent Activity
Reacted to codelion's post with 🔥 7 days ago
I recently added a recipe to ellora that improves the reasoning capabilities of Gemma-3-1B using self-supervised learning. The model now shows step-by-step thinking in <think> tags before answering. Logic puzzle accuracy: 61% → 84%, after 3 hours of training on a single GPU.
The recipe uses GRPO, where the model generates multiple responses and learns to prefer better reasoning. It works surprisingly well for making smaller models more transparent.
Colab: https://colab.research.google.com/github/codelion/ellora/blob/main/Ellora_Recipe_2_Reasoning_LoRA_with_Self-Rewarding_GRPO.ipynb
Model: https://huggingface.co/codelion/gemma-3-1b-it-reasoning-grpo-lora
Code: https://github.com/codelion/ellora
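As a rough illustration of the "generate multiple responses and prefer the better ones" idea, the sketch below computes group-relative advantages the way GRPO is commonly described: each response's reward is normalized against the mean and standard deviation of its own group. This is not taken from the ellora recipe; the function name `group_relative_advantages`, the `eps` parameter, and the reward values are hypothetical, and how the recipe actually scores responses is not shown here.

```python
from statistics import mean, pstdev


def group_relative_advantages(rewards, eps=1e-8):
    """Normalize each reward against its own group's mean and spread.

    Responses scoring above the group mean get a positive advantage,
    those below get a negative one. `eps` avoids division by zero
    when every response in the group receives the same reward.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)
    return [(r - mu) / (sigma + eps) for r in rewards]


# Hypothetical reward scores for four responses sampled for one prompt.
rewards = [0.2, 0.9, 0.5, 0.4]
advantages = group_relative_advantages(rewards)
```

In a full training loop, these advantages would weight the policy-gradient update for each sampled response, so the model is nudged toward the responses that scored better than their siblings.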
Organizations
None yet
Models (0)
None public yet
Datasets (0)
None public yet