VARCO_Arena / guide_mds /input_jsonls_kr.md
sonsus's picture
others
c2ba4d5
|
raw
history blame
2.19 kB
#### \[KR\] ์ง‘์–ด๋„ฃ์„ jsonl ํŒŒ์ผ ๊ฐ€์ด๋“œ
๋น„๊ตํ•  ๋ชจ๋ธ์ด ๋‹ค์„ฏ ๊ฐœ๋ผ๋ฉด ๋‹ค์„ฏ ๊ฐœ์˜ .jsonl ํŒŒ์ผ์„ ์—…๋กœ๋“œํ•˜์„ธ์š”.
* ๐Ÿ’ฅ๋ชจ๋“  jsonl ์€ ๊ฐ™์€ ์ˆ˜์˜ ํ–‰์„ ๊ฐ€์ ธ์•ผํ•ฉ๋‹ˆ๋‹ค.
* ๐Ÿ’ฅ`model_id` ํ•„๋“œ๋Š” ํŒŒ์ผ๋งˆ๋‹ค ๋‹ฌ๋ผ์•ผํ•˜๋ฉฐ ํŒŒ์ผ ๋‚ด์—์„œ๋Š” ์œ ์ผํ•ด์•ผํ•ฉ๋‹ˆ๋‹ค.
**jsonl ํ•„์ˆ˜ ํ•„๋“œ**
* ๊ฐœ๋ณ„
* `model_id`: ํ‰๊ฐ€๋ฐ›๋Š” ๋ชจ๋ธ์˜ ์ด๋ฆ„์ž…๋‹ˆ๋‹ค. (์งง๊ฒŒ ์“ฐ๋Š” ๊ฒƒ ์ถ”์ฒœ)
* `generated`: ๋ชจ๋ธ์ด testset instruction ์— ์ƒ์„ฑํ•œ ์‘๋‹ต์„ ๋„ฃ์œผ์„ธ์š”.
* ๋ฒˆ์—ญํ‰๊ฐ€ ํ”„๋กฌํ”„ํŠธ ์‚ฌ์šฉ์‹œ (`translation_pair`. `streamlit_app_local/user_submit/mt/llama5.jsonl` ์—์„œ ์˜ˆ์‹œ ๋ณผ ์ˆ˜ ์žˆ์Œ)
* `source_lang`: input language (e.g. Korean, KR, kor, ...)
* `target_lang`: output language (e.g. English, EN, ...)
* ๊ณตํ†ต ๋ถ€๋ถ„ (**๋ชจ๋“  ํŒŒ์ผ์— ๋Œ€ํ•ด ๊ฐ™์•„์•ผ ํ•จ**)
* `instruction`: ๋ชจ๋ธ์— ์ง‘์–ด๋„ฃ๋Š” `testset instruction` ํ˜น์€ `input`์— ํ•ด๋‹นํ•˜๋Š” ๋ฌด์–ธ๊ฐ€์ž…๋‹ˆ๋‹ค.
* `task`: ์ „์ฒด ๊ฒฐ๊ณผ๋ฅผ subset์œผ๋กœ ๊ทธ๋ฃน์ง€์–ด์„œ ๋ณด์—ฌ์ค„ ๋•Œ ์‚ฌ์šฉ๋ฉ๋‹ˆ๋‹ค. `evaluation prompt`๋ฅผ ํ–‰๋ณ„๋กœ ๋‹ค๋ฅด๊ฒŒ ์‚ฌ์šฉํ•˜๊ณ  ์‹ถ์„ ๋•Œ ํ™œ์šฉ๋  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
๊ฐ jsonl ํŒŒ์ผ์€ ์•„๋ž˜์ฒ˜๋Ÿผ ์ƒ๊ฒผ์Šต๋‹ˆ๋‹ค.
```python
# model1.jsonl
{"model_id": "๋ชจ๋ธ1", "task": "๊ธธ ๋ฌป๊ธฐ", "instruction": "์–ด๋””๋กœ ๊ฐ€์•ผํ•˜์˜ค", "generated": "์ €๊ธฐ๋กœ์š”"}
{"model_id": "๋ชจ๋ธ1", "task": "์‚ฐ์ˆ˜", "instruction": "1+1", "generated": "2"} # ๊ธธ ๋ฌป๊ธฐ์™€ ์‚ฐ์ˆ˜์˜ ๊ฒฝ์šฐ ๋‹ค๋ฅธ ํ‰๊ฐ€ ํ”„๋กฌํ”„ํŠธ๋ฅผ ์‚ฌ์šฉํ•˜๊ณ  ์‹ถ์„ ์ˆ˜ ์žˆ๊ฒ ์ฃ ?
# model2.jsonl -* model1.jsonl๊ณผ `instruction`์€ ๊ฐ™๊ณ  `generated`, `model_id` ๋Š” ๋‹ค๋ฆ…๋‹ˆ๋‹ค!
{"model_id": "๋ชจ๋ธ2", "task": "๊ธธ ๋ฌป๊ธฐ", "instruction": "์–ด๋””๋กœ ๊ฐ€์•ผํ•˜์˜ค", "generated": "ํ•˜์ด"}
{"model_id": "๋ชจ๋ธ2", "task": "์‚ฐ์ˆ˜", "instruction": "1+1", "generated": "3"}
...
..
```
์˜ˆ๋ฅผ ๋“ค์–ด, ํ•œ๊ฐ€์ง€ ๋ชจ๋ธ์— ๋Œ€ํ•ด ๋‹ค๋ฅธ ํ”„๋กฌํ”„ํŒ…์„ ์‹œ๋„ํ•˜์—ฌ ๋‹ค๋ฅธ ์ƒ์„ฑ๋ฌธ์„ ์–ป์—ˆ๊ณ  ์ด๋ฅผ ๋น„๊ตํ•˜๊ณ  ์‹ถ์€ ๊ฒฝ์šฐ๋ฅผ ์ƒ๊ฐํ•ด๋ด…์‹œ๋‹ค. ์ด ๋•Œ ํ‰๊ฐ€๋ฐ›์„ testset์€ ๊ฐ™์œผ๋ฏ€๋กœ `instruction`์€ ๋ชจ๋‘ ๊ฐ™๊ณ  ํ”„๋กฌํ”„ํŒ…์— ๋”ฐ๋ผ `generated`๋Š” ๋‹ฌ๋ผ์ง€๊ฒ ์ฃ ? `model_id` ๋Š” `"prompt1"`, `"prompt2"` ๋“ฑ ์ทจํ–ฅ์— ๋งž๊ฒŒ ์ ์–ด์ฃผ์‹œ๋ฉด ๋ฉ๋‹ˆ๋‹ค.