winglian's picture
support for explicit test_dataset definition for evals (#786)
cda52dc unverified