Transformer no longer returns unnecessary attention weights. Fix: allow backward pass when training the ingredient decoder.
3ab629c
amaiasalvador
committed on