qgallouedec
qgallouedec/Qwen-2.5-7B-Simple-RL • Text Generation • 8B • Updated • 1
qgallouedec/Qwen2.5-1.5B-Open-R1-GRPO • Text Generation • 2B • Updated • 3
qgallouedec/DeepSeek-R1-Distill-Qwen-7B-GRPO • Updated
qgallouedec/DeepSeek-R1-Distill-Qwen-1.5B-GRPO • Updated
qgallouedec/Qwen2.5-1.5B-Open-R1-Distill • Text Generation • 2B • Updated
qgallouedec/gemma-3-12b-it-codeforces-SFT-eager-packing • Image-Text-to-Text • 12B • Updated • 2
qgallouedec/gemma-3-12b-it-codeforces-SFT • Image-Text-to-Text • 12B • Updated • 3 • 5
qgallouedec/gemma-3-12b-it-codeforces-SFT-eager-no-packing • Image-Text-to-Text • 12B • Updated
qgallouedec/gemma-3-4b-it-codeforces-SFT • Image-Text-to-Text • 4B • Updated • 3 • 3
qgallouedec/gemma-3-27b-it-codeforces-SFT • Image-Text-to-Text • 27B • Updated • 4 • 5
qgallouedec/Qwen2.5-0.5B-codeforces-SFT • Text Generation • 0.5B • Updated • 1
qgallouedec/Qwen2.5-0.5B-GRPO-main • Text Generation • 0.5B • Updated • 3
qgallouedec/gemma-2-2B-it-thinking-function_calling • Updated
qgallouedec/Qwen2.5-0.5B-GRPO-2873 • Updated
qgallouedec/Qwen2.5-0.5B-GRPO-2776-next • Updated
qgallouedec/Qwen2.5-32B-Open-R1-GRPO
qgallouedec/Qwen2.5-14B-Open-R1-GRPO • Updated
qgallouedec/Qwen2.5-7B-Open-R1-GRPO • Updated
qgallouedec/Qwen2-0.5B-GRPO • Updated
qgallouedec/tiny-Qwen2ForSequenceClassification-2.5 • Text Classification • 1.22M • Updated
qgallouedec/tiny-Qwen2ForCausalLM-2.5 • Text Generation • 39M • Updated • 107
qgallouedec/Qwen2-0.5B-Reward-Math-Sheperd-KN-fix-cast • Token Classification • 0.5B • Updated
qgallouedec/Qwen2-0.5B-Reward-Math-Sheperd-KN • 0.5B • Updated
qgallouedec/Qwen2-0.5B-Reward-Math-Sheperd-wo-compute-acc • Updated
qgallouedec/Qwen2-0.5B-Reward-Math-Sheperd • Token Classification • 0.5B • Updated • 77 • 1
qgallouedec/Llama-3.1-8B-SQL • Updated
qgallouedec/tiny-Qwen2ForCausalLM-2.5-Coder • Text Generation • 2.43M • Updated • 1
qgallouedec/tiny-Qwen2ForCausalLM-Coder • Text Generation • 2.43M • Updated • 2
qgallouedec/my_dir2_checkpoint-3_merged • Text Generation • 1.03M • Updated • 1
1.03M • Updated