ZHLiu627/updated_qwen2.5_code_1.5b_grpo_iter0_full_data_miao_0212__self_correction_iter1_v1 Viewer • Updated 12 days ago • 29.3k • 65
ZHLiu627/dataset_qwen2.5_code_1.5b_grpo_iter0_full_data_miao_0212_2_global_step_70filtered_v1_v1 Viewer • Updated 12 days ago • 29.3k • 40
ZHLiu627/dataset_qwen2.5_code_1.5b_grpo_iter0_full_data_miao_0212_2_global_step_70filtered_v1 Viewer • Updated 18 days ago • 29.3k • 210
ZHLiu627/updated_qwen2.5_code_1.5b_grpo_iter0_full_data_miao_0212__self_correction_iter1_v2filtered Viewer • Updated 20 days ago • 28.9k • 55
ZHLiu627/updated_qwen2.5_code_1.5b_grpo_iter0_full_data_miao_0212__self_correction_iter1_v2 Viewer • Updated 20 days ago • 29.3k • 42
ZHLiu627/dataset_qwen2.5_code_1.5b_grpo_iter0_full_data_miao_0212_2_global_step_70filteredd Viewer • Updated 20 days ago • 29.3k • 48
ZHLiu627/updated_qwen2.5_code_1.5b_grpo_iter0_full_data_miao_0212__self_correction_iter1_v1filtered Viewer • Updated 20 days ago • 29.1k • 48
ZHLiu627/qwen2.5_code_1.5b_grpo_iter0_full_data_miao_0212__self_correction_iter1_v2 Viewer • Updated 21 days ago • 29.3k • 61