pgptlformer-tinystories / re-pqt-rmsXrmsx3-ATTNII_fast-af1c5037-28b9-4ce3-b351-c78046f90ee7.txt
SQCU
compiled models train faster so you can train more of them in a short experiment, to better convergence.
921107d verified
3.72 MB