rl-rag/qwen3-8b-base-combined-sft-training-data-v20250824_MiroSystemPrompt • Text Generation • 8B • Updated about 15 hours ago • 4
rl-rag/qwen3-8b-combined-sft-training-data-v20250824_MiroSystemPrompt • Text Generation • 8B • Updated about 15 hours ago • 8
rl-rag/qwen3-4b-it-combined-sft-training-data-v20250824_MiroSystemPrompt • Text Generation • 4B • Updated about 15 hours ago • 5
rl-rag/qwen2.5-7b-combined-sft-training-data-v20250824_MiroSystemPrompt • Text Generation • 8B • Updated about 15 hours ago • 4
rl-rag/rl_rag_sqa_searcharena_rubrics_web_augmented_rubrics_only_with_new_mcp_system_prompt • Viewer • Updated about 24 hours ago • 2.94k • 15
rl-rag/rl_rag_sqa_searcharena_rubrics_web_augmented_longform_averaged_outcome_with_system_prompt • Viewer • Updated about 24 hours ago • 2.94k • 41
rl-rag/rl_rag_sqa_searcharena_rubrics_web_augmented_outcome_with_new_mcp_system_prompt • Viewer • Updated about 24 hours ago • 2.94k • 9