Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
3
1
Balamurugan Balakreshnan
Balab2021
Follow
0 followers
·
6 following
https://balakreshnan.github.io/
balakreshnan
balamurugan-balakreshnan
AI & ML interests
AI & ML, Deep Learning, Large Language models, Large Vision Models, Large Action Models, Small Language models,
Recent Activity
new
activity
22 days ago
mcp-course/unit_3_quiz:
Unit 3 completed
updated
a model
about 1 month ago
Balab2021/gpt-oss-20b-multilingual-reasoner
published
a model
about 1 month ago
Balab2021/gpt-oss-20b-multilingual-reasoner
View all activity
Organizations
Balab2021
's models
31
Sort: Recently updated
Balab2021/gpt-oss-20b-multilingual-reasoner
Updated
about 1 month ago
Balab2021/sftqwen_finetuned_model_1-5BHS
Text Generation
•
0.9B
•
Updated
Jul 28
•
8
Balab2021/1B_finetuned_llama3.2_HS
Text Generation
•
0.8B
•
Updated
Jul 25
•
9
Balab2021/Qwen2-0.5B-GRPO-test
Updated
Jul 11
Balab2021/ppo-Huggy
Reinforcement Learning
•
Updated
Feb 17
•
14
Balab2021/Taxi-V3
Reinforcement Learning
•
Updated
Feb 17
Balab2021/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
Feb 17
Balab2021/poca-SoccerTwos
Reinforcement Learning
•
Updated
Feb 12
Balab2021/dqn-SpaceInvadersNoFrameskip-v4
Reinforcement Learning
•
Updated
Feb 6
•
4
Balab2021/Reinforce-model-4-2
Reinforcement Learning
•
Updated
Feb 6
Balab2021/Reinforce-model-4
Reinforcement Learning
•
Updated
Feb 6
Balab2021/rl_course_vizdoom_health_gathering_supreme
Reinforcement Learning
•
Updated
Feb 5
Balab2021/ppo-CartPole-v1
Reinforcement Learning
•
Updated
Feb 5
Balab2021/LunarLander-v2
Reinforcement Learning
•
Updated
Feb 5
Balab2021/a2c-PandaReachDense-v3
Reinforcement Learning
•
Updated
Feb 5
Balab2021/ppo-PyramidsTraining
Reinforcement Learning
•
Updated
Feb 4
•
8
Balab2021/ppo-SnowballTarget
Reinforcement Learning
•
Updated
Feb 4
•
6
Balab2021/Reinforce-unit4ex2
Reinforcement Learning
•
Updated
Feb 4
Balab2021/Reinforce-unit4
Reinforcement Learning
•
Updated
Feb 4
Balab2021/q-taxi-v3
Reinforcement Learning
•
Updated
Feb 3
Balab2021/ppo-LunarLander-v2
Reinforcement Learning
•
Updated
Feb 3
•
1
Balab2021/DeepSeek-R1-Distill-Llama-8B-Fine-tunedBespoke
Updated
Jan 29
Balab2021/Florence-2-FT-DocVQA
Image-Text-to-Text
•
0.3B
•
Updated
Sep 21, 2024
•
4
Balab2021/bbphi35ftv1
Text Generation
•
4B
•
Updated
Aug 22, 2024
•
4
Balab2021/phi-3-5-mini-LoRA
Updated
Aug 22, 2024
Balab2021/bbphi3ftv1
Text Generation
•
4B
•
Updated
Aug 12, 2024
•
3
Balab2021/phi-3-mini-LoRA
Updated
Aug 12, 2024
Balab2021/llama3
Text Generation
•
8B
•
Updated
Apr 20, 2024
•
4
Balab2021/phi2cricketipl
Text Generation
•
3B
•
Updated
Mar 30, 2024
•
4
Balab2021/llama2cricketipl
Text Generation
•
7B
•
Updated
Mar 18, 2024
•
4
Previous
1
2
Next