Reparameterized Absorbing Discrete Diffusion (RADD) small model with t-dce loss trained for 400k iterations. | |
Code: https://github.com/ML-GSAI/RADD. | |
Paper: https://arxiv.org/abs/2406.03736. |
Reparameterized Absorbing Discrete Diffusion (RADD) small model with t-dce loss trained for 400k iterations. | |
Code: https://github.com/ML-GSAI/RADD. | |
Paper: https://arxiv.org/abs/2406.03736. |