MT-Bench

  • Score: 8.55

Arena Hard

image/png

Alpaca Bench 2

image/png