mm-o1/mmo1-math-qwen2.5_vl_3b-sft_mmr1_sft_0503_v10_mathinstruct_onlygemini_ep5 Image-to-Text • 4B • Updated Jul 10 • 8.86k
mm-o1/qwen2_5_vl_7b_mmr1_mminstruct_coldstart_rlv6_wllm_response4k_rolloutn32_shuffle_0618 8B • Updated Jun 28 • 7
mm-o1/qwen2_5_vl_7b_mmr1_mminstruct_coldstart_rlv6_wllm_response4k_rolloutn32_learn08_bleu02_0618 8B • Updated Jun 28 • 7
mm-o1/qwen2_5_vl_7b_mmr1_mminstruct_coldstart_rlv5_wllm_response4k_rolloutn32_shuffle_0618 8B • Updated Jun 28 • 6
mm-o1/qwen2_5_vl_7b_mmr1_mminstruct_coldstart_rlv5_wllm_response4k_rolloutn32_learn08_bleu02_0618 8B • Updated Jun 28 • 7
mm-o1/qwen2_5_vl_7b_mmr1_coldstartv10_rlv9_wllm_response4k_rolloutn32_shuffle_0510 8B • Updated May 14 • 4
mm-o1/qwen2_5_vl_7b_mmr1_coldstartv10_rlv9_wllm_response4k_rolloutn16_learn10_0510 8B • Updated May 13 • 4
mm-o1/qwen2_5_vl_7b_mmr1_coldstartv10_rlv9_wllm_response4k_rolloutn16_learn08_bleu02_0510 8B • Updated May 12 • 7
mm-o1/qwen2_5_vl_7b_mmr1_coldstartv10_rlv9_wllm_response4k_rolloutn16_shuffle_0510 8B • Updated May 11 • 4