khuang2
/

qwen-2.5-3b-r1-countdown-train_query_and_policy_vdebug

Text Generation

Generated from Trainer

text-generation-inference

Inference Endpoints

Model card Files Files and versions Metrics Training metrics Community

qwen-2.5-3b-r1-countdown-train_query_and_policy_vdebug / runs

1 contributor

History: 1 commit

khuang2's picture

Training in progress, step 20

b7be965 verified about 1 month ago