Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
1853.0
TFLOPS
2
6
3
Andrew Siah
andrewsiah
Follow
tanyarai's profile picture
LeroyDyer's profile picture
21world's profile picture
3 followers
·
6 following
theandrewsiah
andrewsiah
AI & ML interests
None yet
Organizations
andrewsiah
's models
10
Sort:Â Recently updated
andrewsiah/Qwen-2.5-1.5B-Instruct-Datamix
Text Generation
•
2B
•
Updated
Feb 16
•
5
andrewsiah/Qwen-2.5-7B-Simple-RL
Text Generation
•
8B
•
Updated
Feb 15
•
1
andrewsiah/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
•
2B
•
Updated
Feb 14
andrewsiah/Qwen2.5-1.5B-Open-R1-Distill
Updated
Feb 13
andrewsiah/Reinforce-1
Reinforcement Learning
•
Updated
Aug 1, 2023
andrewsiah/dqn-SpaceInvadersNoFrameskip-v4
Reinforcement Learning
•
Updated
Jun 19, 2023
•
6
andrewsiah/taxi-v3
Reinforcement Learning
•
Updated
Jun 19, 2023
andrewsiah/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
Jun 19, 2023
andrewsiah/ppo-Huggy
Reinforcement Learning
•
Updated
Jun 18, 2023
•
8
andrewsiah/ppo-LunarLander-v2
Reinforcement Learning
•
Updated
Jun 18, 2023
•
7