HEADER_TITLE = """
<div style="display: flex; flex-wrap: wrap; justify-content: space-around; align-items: center;">
<div>
<h1 align="center" id="space-title">Adam Mickiewicz University's<br>Center for Artificial Intelligence</h1>
<h2 align="center" id="space-subtitle">LLMzSzŁ: a comprehensive LLM benchmark for Polish</h2>
</div>
</div>
"""
MAIN_DESC = """## Overall scores
**LLMzSzŁ** (_LLM-y za Szkolną Ławą_, LLMs Behind the School Desk) is a comprehensive LLM benchmark for Polish,
based on a collection of Polish national exams, including both academic
and professional tests extracted from the archives of the Polish Central Examination Board.
"""
GIM_DESC = """### Middle School Exam
The **Middle School Exam** (*egzamin gimnazjalny*) was formerly a key component of Poland’s secondary education system, taken by students completing their lower secondary education.
#### Subjects Covered
This exam assessed students in core subjects, preparing them for higher levels of education:
- **Polish Language**
- **Mathematics**
- **Science**
- **Foreign Language**
#### Recent Changes
Following educational reforms in Poland:
- **Middle Schools Phased Out**: Middle schools have been removed from the education system.
- **Exam Discontinued**: With this change, the Middle School Exam has also been discontinued.
"""
MAT_DESC = """### High School Exam
The **High School Exam** (*matura*) is one of the most important exams in Poland. It is a prerequisite for higher education and a significant milestone in students' academic journeys.
#### Mandatory Subjects
The *matura* exam includes essential subjects:
- **Polish Language**
- **Mathematics**
- **Foreign Language**
#### Additional Subjects
Students may also choose additional subjects based on:
- **Areas of Interest**: Students select subjects aligned with their strengths or future studies.
- **University Requirements**: Specific subjects may be necessary for admission to certain university programs.
#### Minimum Points Required
To pass the High School Exam, students need to achieve a minimum score of **30%** in each mandatory subject. Additional subjects do not require a minimum score to pass, but their results strongly affect a candidate's score in university admissions.
"""
OSM_DESC = """### 8th-Grade Exam
The **8th-Grade Exam** (*egzamin ósmoklasisty*) is a standardized assessment taken by Polish students at the end of their primary education (8th grade).
#### Subjects Covered
This exam mainly includes tests in:
- **Polish Language**
- **Mathematics**
- **Foreign Language**
#### Purpose and Role
- **Secondary Education Foundation**: Results serve as a basis for secondary school placement, guiding students to the next stage of their academic journey.
- **Student Assessment**: Evaluates core competencies, ensuring readiness for high school-level studies.
"""
ZAW_DESC = """### Professional Exam
The **Professional Exam** (*egzamin zawodowy*) is tailored for students pursuing vocational education in Poland. This exam is typically taken after completing a vocational training program, such as:
- **Technical High Schools**
- **Post-secondary Vocational Courses**
#### Purpose and Importance
- **Certification**: A successful score on this exam certifies a student’s qualifications in a specific trade or profession.
- **Career Readiness**: Enables students to enter the workforce directly.
- **Further Education**: Offers a foundation for pursuing specialized training.
"""
ABOUT = """## About
LLMzSzŁ (_LLM-y za Szkolną Ławą_, LLMs Behind the School Desk) is a comprehensive LLM benchmark for Polish,
based on a collection of Polish national exams, including both academic
and professional tests extracted from the archives of the [Polish Central Examination Board](https://cke.gov.pl).
The benchmark was prepared by Adam Mickiewicz University's Center for Artificial Intelligence.
"""
DATASET_TABLE = """
$$
\\begin{array}{|c|c|c|c|}
\\hline
\\textbf{Exam} & \\textbf{Discipline} & \\textbf{Number of Questions} & \\textbf{Original Number of Questions} \\\\
\\hline
8th-Grade Exam & Polish Language & 15 & 101\\\\
8th-Grade Exam & Mathematics & 35 & 99\\\\
Middle School Exam & Mathematics & 110 & 304\\\\
Middle School Exam & Nature & 65 & 226\\\\
High School Exam & Biology & 21 & 169\\\\
High School Exam & Physics & 136 & 399\\\\
High School Exam & Mathematics & 220 & 492\\\\
Professional Exam & Arts & 1778 & 3480\\\\
Professional Exam & Mechanical, Mining and Metallurgical & 9097 & 30240\\\\
Professional Exam & Agriculture and Forestry & 7344 & 23200\\\\
\\hline
\\end{array}
$$
"""
CITATION_LABEL = "Copy the following snippet to cite these results"
CITATION_CONTENT = """
@misc{eval-harness,
title = {A framework for few-shot language model evaluation},
author = {Gao, Leo and Tow, Jonathan and Abbasi, Baber and Biderman, Stella and Black, Sid and DiPofi, Anthony and Foster, Charles and Golding, Laurence and Hsu, Jeffrey and Le Noac'h, Alain and Li, Haonan and McDonell, Kyle and Muennighoff, Niklas and Ociepa, Chris and Phang, Jason and Reynolds, Laria and Schoelkopf, Hailey and Skowron, Aviya and Sutawika, Lintang and Tang, Eric and Thite, Anish and Wang, Ben and Wang, Kevin and Zou, Andy},
year = 2023,
publisher = {Zenodo},
doi = {10.5281/zenodo.10256836},
url = {https://zenodo.org/records/10256836}
}
@inproceedings{gralinski-2019-geval,
title = "{GE}val: Tool for Debugging {NLP} Datasets and Models",
author = "Grali{\\'n}ski, Filip and
Wr{\\'o}blewska, Anna and
Stanis{\\l}awek, Tomasz and
Grabowski, Kamil and
G{\\'o}recki, Tomasz",
year = "2019",
doi = "10.18653/v1/W19-4826",
url = "https://aclanthology.org/W19-4826/",
}"""