Commit History

fix: description text
b9777d9

gardarjuto committed

Add WikiQA
fa8bb65

gardarjuto committed

fix: show partial results even if some evaluations haven't finished
7fdb5f5

gardarjuto committed

fix: read request information even if eval is running
b61f534

gardarjuto committed

switch to flat inflection benchmark
8874217

gardarjuto committed

add submission instructions to about page
80793c6

gardarjuto committed

remove submit tab
117d89c

gardarjuto committed

fix: filtering support for models missing details
5e8e87c

gardarjuto committed

remove intro text and citation block
dcb54b6

gardarjuto committed

add benchmark descriptions and links to About page
67a665c

gardarjuto committed

add winogrande and arc-challenge
56926f2

gardarjuto committed

skip model detail validation for OAI/Anthropic models
4ec9008

gardarjuto committed

fix typo in metric name
b1416b0

gardarjuto committed

remove debug prints
9e6a3bf

gardarjuto committed

fix metric name
a0ee03a

gardarjuto committed

add debug prints
105e1f2

gardarjuto committed

revert to correct usage of ModelDetails (without api)
24c8d00

gardarjuto committed

debug print
ee4b341

gardarjuto committed

debug print
a5c094b
verified

gardari committed

debug print
decb818
verified

gardari committed

debug print
6a989eb
verified

gardari committed

debug print
427f12d
verified

gardari committed

debug print
ea10299
verified

gardari committed

Added empty default for api in ModelDetails
e8f05cc
verified

gardari committed

Added model API to submission screen
20fd601
verified

gardari committed

add Icelandic evals
9ef7f1a
verified

gardari committed

Change metric string
96f9cbe
verified

gardari committed

Comment out winogrande for debugging
ab6318a
verified

gardari committed

Change title
4d276e3
verified

gardari committed

Change title
2a3757e
verified

gardari committed

Make name for HF token explicit
bd503b0
verified

gardari committed

Fix repo names
c9a0e12
verified

gardari committed

Update src/envs.py
d7e7ffd
verified

gardari committed

doc
c1b8a96

Clémentine committed

simplified the template
24622c4

Clémentine committed

CPU, TOKEN, env variables (#4)
55cc480
verified

clefourrier (HF staff) and meg (HF staff) committed

Update src/submission/check_validity.py
6eb8bfd

clefourrier (HF staff) committed

made token a requirement
f982b8e

Clémentine committed

test
f0298e1

Clémentine committed

fix
c15e77e

Clémentine committed

removed quantization to simplify
b899767

Clémentine committed

now with a functionning backend
1ffc326

Clémentine committed

update read
943f952

Clémentine committed

fixs
314f91a

Clémentine committed

updated leaderboard
efeee6d

Clémentine committed

Simplified leaderboard v0
9833cdb

Clémentine committed

simplified some parts of the code + updated requirements
9d22eee

Clémentine committed

Added check on tokenizer to prevent submissions which won't run
7302987

Clémentine committed

Update benchmark count and fix typo (`inetuning->finetuning`) (#395)
7abc6a7

clefourrier (HF staff) and alvarobartt (HF staff) committed

fix order of request file vs request file list, to avoid resubmitting issues
976f398

Clémentine committed