Commit History
run eval on the first step to get a baseline (#617)
2844eb2
unverified
mypy wandb ignore (#572)
c6d870b
unverified
fix wandb so mypy doesn't complain (#562)
bf08044
unverified
Add training callback to send predictions to WandB table (#521)
5b67ea9
unverified
Early stopping metric (#537)
e30f1e3
unverified
No gather single gpu (#523)
09f1543
unverified
Changed Bench Eval to report metrics correctly by split. Added total accuracy and renamed previously used bench_accuracy to bench_average_accuracy. (#512)
42f9642
unverified
Alpay Ariyak
commited on