·
AI & ML interests
None yet
Recent Activity
Organizations
hendrydong/qwen7b-grpo-v2-step300
hendrydong/qwen7b-grpo-v2-step280
hendrydong/qwen7b-grpo-v2-step260
hendrydong/qwen7b-grpo-v2-step240
hendrydong/qwen7b-grpo-v2-step220
hendrydong/qwen7b-grpo-v2-step200
hendrydong/qwen7b-grpo-v2-step180
hendrydong/qwen7b-grpo-v2-step160
hendrydong/qwen7b-grpo-v2-step140
hendrydong/qwen7b-grpo-v2-step120
hendrydong/qwen7b-grpo-v2-step100
hendrydong/qwen7b-grpo-v2-step80
hendrydong/qwen7b-grpo-v2-step60
hendrydong/qwen7b-grpo-v2-step40
hendrydong/qwen7b-grpo-v2-step20
hendrydong/qwen-7b-reinforce-rej-step740
Text Generation
•
8B
•
Updated
•
1
hendrydong/qwen-7b-reinforce-rej-step720
Text Generation
•
8B
•
Updated
•
1
hendrydong/qwen-7b-reinforce-rej-step700
Text Generation
•
8B
•
Updated
•
1
hendrydong/qwen-7b-reinforce-rej-step680
Text Generation
•
8B
•
Updated
•
1
hendrydong/qwen-7b-reinforce-rej-step660
Text Generation
•
8B
•
Updated
•
1
hendrydong/qwen-7b-reinforce-rej-step640
Text Generation
•
8B
•
Updated
•
1
hendrydong/qwen-7b-reinforce-rej-step620
Text Generation
•
8B
•
Updated
•
1
hendrydong/qwen-7b-reinforce-rej-step600
Text Generation
•
8B
•
Updated
•
1
hendrydong/qwen-7b-reinforce-rej-step580
Text Generation
•
8B
•
Updated
•
1
hendrydong/qwen-7b-reinforce-rej-step560
Text Generation
•
8B
•
Updated
•
1
hendrydong/qwen-7b-reinforce-rej-step540
Text Generation
•
8B
•
Updated
•
1
hendrydong/qwen-7b-reinforce-rej-step520
Text Generation
•
8B
•
Updated
•
1
hendrydong/qwen-7b-reinforce-rej-step500
Text Generation
•
8B
•
Updated
•
1
hendrydong/qwen-7b-reinforce-rej-step480
Text Generation
•
8B
•
Updated
•
1
hendrydong/qwen-7b-reinforce-rej-step460
Text Generation
•
8B
•
Updated
•
1