TAROT: Test-driven and Capability-adaptive Curriculum Reinforcement Fine-tuning for Code Generation with Large Language Models
Paper
โข 2602.15449 โข Published
โข 6
Diffusion models, alignment
torch.compile2 ** search_round) and repeat 1 - 3.