In a Training Loop 🔄
lewtun
·
AI & ML interests
LLMs, LLMs, LLMs
Organizations
lewtun/dolci-think-sft-6400
Viewer
• Updated • 6.4k • 28
lewtun/dolci-think-sft-3200
Viewer
• Updated • 3.2k • 24
lewtun/dolci-think-sft-1600
Viewer
• Updated • 1.6k • 27
lewtun/dolci-think-sft-800
Viewer
• Updated • 800 • 24
lewtun/dolci-think-sft-400
Viewer
• Updated • 400 • 28
lewtun/dolci-think-sft-200
Viewer
• Updated • 200 • 28
lewtun/s1K-1.1-dataforge-testing-20251219-213939
Viewer
• Updated • 1k • 40
lewtun/s1K-1.1-dataforge-testing-20251219-081400
Viewer
• Updated • 819 • 49
lewtun/s1K-1.1-dataforge-testing-20251218-204703
Viewer
• Updated • 920 • 141
lewtun/dataforge-testing-20251218-152114
Viewer
• Updated • 1k • 99
lewtun/s1K-1.1-dataforge-testing-20251216-142704
Viewer
• Updated • 10 • 18
lewtun/s1K-1.1-dataforge-testing-20251216-123019
Viewer
• Updated • 1k • 83
lewtun/Polaris-Dataset-53K
Viewer
• Updated • 53.3k • 48
lewtun/details_meta-llama__Llama-2-7b-chat-hf_private
Viewer
• Updated • 7.21k • 50
lewtun/OpenThoughts3-missing-think-sample
Viewer
• Updated • 100 • 9
lewtun/details_Qwen__Qwen2.5-Coder-3B-Instruct
Viewer
• Updated • 33 • 26
lewtun/details_deepseek-ai__DeepSeek-R1-Distill-Qwen-1.5B
Viewer
• Updated • 1k • 17
lewtun/details_open-thoughts__OpenThinker-7B
Viewer
• Updated • 597 • 32
lewtun/details_deepseek-ai__DeepSeek-R1-Distill-Qwen-7B
Viewer
• Updated • 597 • 33
lewtun/details_meta-llama__Llama-3.2-3B-Instruct
Viewer
• Updated • 1.74k • 55
lewtun/details_deepseek-ai__DeepSeek-R1-Distill-Llama-8B
Viewer
• Updated • 598 • 17
lewtun/details_meta-llama__Llama-3.1-8B-Instruct
Viewer
• Updated • 597 • 9
lewtun/details_Qwen__Qwen2.5-1.5B-Instruct
Viewer
• Updated • 2.25k • 18
lewtun/details_Qwen__Qwen2.5-0.5B-Instruct
Viewer
• Updated • 898 • 10
lewtun/details_meta-llama__Llama-3.2-1B-Instruct
Viewer
• Updated • 898 • 5
lewtun/details_Qwen__Qwen2.5-Math-1.5B-Instruct
Viewer
• Updated • 11k • 8
Viewer
• Updated • 1 • 5
lewtun/Llama-3.2-1B-Instruct-best_of_n-prm-completions
Viewer
• Updated • 10 • 5
Preview
• Updated • 113