distily_test_attn_miles / benchmarks.shelve.dir
lapp0's picture
End of training
f56b68f verified
raw
history blame contribute delete
204 Bytes
'teacher', (0, 14412556)
'attn_layer_mapper=all, attn_loss_fn=raw_mse, attn_projector=miles', (14412800, 14412543)
'attn_layer_mapper=all, attn_loss_fn=logsum, attn_projector=miles', (28825600, 14412543)