Derry Pratama
ibndias
AI & ML interests: None yet
Recent Activity
Updated a model about 11 hours ago: ibndias/Qwen2.5-1.5B-Open-R1-GRPO
Published a model 1 day ago: ibndias/Qwen2.5-1.5B-Open-R1-GRPO
Published a model 1 day ago: ibndias/Qwen-2.5-7B_Base_Math_smalllr
Organizations
Collections: 2
Papers: 2
Models: 13
ibndias/Qwen2.5-1.5B-Open-R1-GRPO • Text Generation • 35
ibndias/Qwen-2.5-7B_Base_Math_smalllr
ibndias/Qwen2.5-1.5B-Open-R1-GRPO1st • Text Generation
ibndias/Qwen2.5-1.5B-Open-R1-Distill • Text Generation
ibndias/taxi-v3 • Reinforcement Learning
ibndias/q-FrozenLake-v1-4x4-noSlippery • Reinforcement Learning
ibndias/ppo-LunarLander-v2 • Reinforcement Learning • 1
ibndias/Nous-Hermes-2-MoE-2x34B • Text Generation • 1.5k
ibndias/NeuralHermes-MoE-2x7B • Text Generation • 1.63k • 1
ibndias/mistral-7b-gtfobins-lora • Text Generation • 15