radd-t-dce / README.md
Jingyang Ou
Update README.md
73ef94d verified
|
raw
history blame
193 Bytes

Reparameterized Absorbing Discrete Diffusion (RADD) small model with t-dce loss trained for 400k iterations.

Code: https://github.com/ML-GSAI/RADD.

Paper: https://arxiv.org/abs/2406.03736.