Update README.md
README.md
@@ -11,9 +11,11 @@ Hi! Welcome on the org page of the Evaluation team at HuggingFace.
 We want to support the community in building and sharing quality evaluations, for reproducible and fair model comparisions, to cut through the hype of releases and better understand actual model capabilities.
 
 We're behind the:
-- [Open LLM Leaderboard](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard#/) (over 11K models evaluated since 2023)
 - [lighteval](https://github.com/huggingface/lighteval) LLM evaluation suite, fast and filled with the SOTA benchmarks you might want
 - [evaluation guidebook](https://github.com/huggingface/evaluation-guidebook), your reference for LLM evals
 - [leaderboards on the hub](https://huggingface.co/blog?tag=leaderboard) initiative, to encourage people to build more leaderboards in the open for more reproducible evaluation. You'll find some doc [here](https://huggingface.co/docs/leaderboards/index) to build your own, and you can look for the best leaderboard for your use case [here](https://huggingface.co/spaces/OpenEvals/find-a-leaderboard)!
 
+Our archived projects:
+- [Open LLM Leaderboard](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard#/) (over 11K models evaluated since 2023)
+
 We're not behind the [evaluate metrics guide](https://huggingface.co/evaluate-metric) but if you want to understand metrics better we really recommend checking it out!