For fineweb-edu in korean
devngho PRO
devngho
AI & ML interests
Efficient Korean NLP, Fine Korean datasets
Recent Activity
updated
a dataset
6 minutes ago
geulgyeol/geulgyeol-by-nc-nd-links
updated
a dataset
7 minutes ago
geulgyeol/geulgyeol-by-nd-links
updated
a dataset
7 minutes ago
geulgyeol/geulgyeol-by-sa
Organizations
Collections
3
models
27
devngho/llama-ablation-large-korean-corpus
Text Generation
•
Updated
devngho/llama-ablation-large-korean-corpus_edu
Text Generation
•
Updated
•
31
devngho/llama-ablation-large-random
Text Generation
•
Updated
•
507
devngho/llama-ablation-korean-corpus_edu
Text Generation
•
Updated
•
3
devngho/llama-ablation-korean-textbooks
Text Generation
•
Updated
•
5
•
1
devngho/llama-ablation-korean-textbooks-jamo
Text Generation
•
Updated
•
12
devngho/llama-ablation-random
Text Generation
•
Updated
•
239
devngho/non-jamo-tokenizer-exp1
Updated
devngho/jamo-tokenizer-exp1
Updated
devngho/code_edu_classifier_v2_microsoft_codebert-base
Text Classification
•
Updated
•
2
datasets
15
devngho/the-stack-llm-annotations-v2
Viewer
•
Updated
•
1.89M
•
94
devngho/korean-webtext-edu
Viewer
•
Updated
•
1.98M
•
37
•
1
devngho/korean-textbooks-edu
Viewer
•
Updated
•
10.1M
•
57
•
1
devngho/korean-wikipedia-edu
Viewer
•
Updated
•
605k
•
79
•
1
devngho/the_stack_llm_annotations
Viewer
•
Updated
•
1.89M
•
33
•
3
devngho/ko_llm_annotations
Viewer
•
Updated
•
1.55M
•
30
•
1
devngho/the-stack-mini-nonshuffled
Viewer
•
Updated
•
6.22M
•
53
•
1
devngho/the-stack-mini
Viewer
•
Updated
•
6.22M
•
45
devngho/culturax-mini-nonshuffled
Viewer
•
Updated
•
71.8M
•
1.71k
devngho/korean_wikipedia
Viewer
•
Updated
•
1.02M
•
32