Update README.md
Browse files
README.md
CHANGED
|
@@ -14,34 +14,50 @@ datasets:
|
|
| 14 |
- greengerong/leetcode
|
| 15 |
---
|
| 16 |
|
| 17 |
-
|
| 18 |
|
| 19 |
-
|
| 20 |
|
| 21 |
-
|
| 22 |
|
| 23 |
-
|
| 24 |
|
| 25 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 26 |
|
| 27 |
-
|
|
|
|
| 28 |
|
| 29 |
-
|
| 30 |
-
|
| 31 |
-
|
| 32 |
-
|
| 33 |
-
torch_dtype='auto'
|
| 34 |
-
).eval()
|
| 35 |
|
| 36 |
-
|
| 37 |
-
|
| 38 |
-
|
| 39 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 40 |
|
| 41 |
-
|
| 42 |
-
output_ids = model.generate(input_ids.to('cuda'))
|
| 43 |
-
response = tokenizer.decode(output_ids[0][input_ids.shape[1]:], skip_special_tokens=True)
|
| 44 |
|
| 45 |
-
|
| 46 |
-
print(response)
|
| 47 |
-
```
|
|
|
|
| 14 |
- greengerong/leetcode
|
| 15 |
---
|
| 16 |
|
| 17 |
+
.png)
|
| 18 |
|
| 19 |
+
# BokantLM 0.1-0.5B
|
| 20 |
|
| 21 |
+
**BokantLM**λ *"Bokant"* (μν리칸μ€μ΄λ‘ "μμ" β "μ΅κ³ ")λΌλ μ΄λ¦μ²λΌ, **μμ§λ§ νΉμ μμ
μμ μ΅κ³ μ μ±λ₯μ λͺ©νλ‘ νλ LLM**μ
λλ€.
|
| 22 |
|
| 23 |
+
μ΄ νλ‘μ νΈμ ν΅μ¬ μ² νμ **"λͺ¨λ κ²μ μνλ λͺ¨λΈ"μ΄ μλλΌ, "νλμ μ§μ€ν΄ μνλ λͺ¨λΈ"**μ λ§λλ κ²μ
λλ€.
|
| 24 |
|
| 25 |
+
---
|
| 26 |
+
|
| 27 |
+
## β λͺ¨λΈ κ°μ
|
| 28 |
+
|
| 29 |
+
- **λͺ¨λΈλͺ
:** BokantLM 0.1-0.5B
|
| 30 |
+
- **κΈ°λ° λͺ¨λΈ:** [Qwen2.5-0.5B](https://huggingface.co/Qwen/Qwen2.5-0.5B)
|
| 31 |
+
- **νμΈνλ λ°μ΄ν°:** [`greengerong/leetcode`](https://huggingface.co/datasets/greengerong/leetcode)
|
| 32 |
+
|
| 33 |
+
---
|
| 34 |
+
|
| 35 |
+
## β μλμ μ² ν
|
| 36 |
|
| 37 |
+
λλΆλΆμ LLMμ λ²μ©μ±μ μν΄ λ€μν λΆμΌλ₯Ό νμ΅νμ§λ§,
|
| 38 |
+
BokantLMμ **νΉμ λΆμΌ νμ μΌλ‘ μ΅μμ ν¨μ¨κ³Ό μ±λ₯**μ λ
Έλ¦½λλ€.
|
| 39 |
|
| 40 |
+
μ΄λ² **0.1-0.5B λ²μ **μ μ½λ©/μκ³ λ¦¬μ¦ λΆμΌλ₯Ό μ§μ€ νμ΅νμμΌλ©°,
|
| 41 |
+
νΉν **LeetCode μ€νμΌμ λ¬Έμ ν΄κ²°**μ μ΅μ νλμ΄ μμ΅λλ€.
|
| 42 |
+
|
| 43 |
+
---
|
|
|
|
|
|
|
| 44 |
|
| 45 |
+
## β ν₯ν κ³ν
|
| 46 |
+
|
| 47 |
+
- β
**μ½λ© νΉν** λͺ¨λΈ μΆμ (νμ¬ λ²μ )
|
| 48 |
+
- π μν λ¬Έμ νμ΄ νΉν λ²μ
|
| 49 |
+
- π λ²λ₯ , μν, κ³Όν λ± **λΆμΌλ³ μ΄κ²½λ νΉν λͺ¨λΈ**
|
| 50 |
+
- π **λν LLM μ§μ μ¦λ₯(Distillation) μ μ© μλ**
|
| 51 |
+
|
| 52 |
+
---
|
| 53 |
+
|
| 54 |
+
## β λΌμ΄μ μ€
|
| 55 |
+
|
| 56 |
+
μ΄ μ μ₯μ λ° λͺ¨λΈμ μ°κ΅¬ λ° λΉμμ
μ μ©λλ‘λ§ μ¬μ© κ°λ₯ν©λλ€.
|
| 57 |
+
μμ
μ μ¬μ©μ λ³λ λ¬Έμ λ°λλλ€.
|
| 58 |
+
|
| 59 |
+
---
|
| 60 |
|
| 61 |
+
## BokantLM - "Small but Supreme in Its Domain"
|
|
|
|
|
|
|
| 62 |
|
| 63 |
+
BokantLMμ λͺ¨λ κ²μ μνλ λ²μ© λͺ¨λΈμ΄ μλλΌ, **ν λΆμΌμ μ§μ€ν΄ μ΅κ³ μ ν¨μ¨κ³Ό μ±λ₯μ λ΄λ κ²μ λͺ©νλ‘ νλ μ΄κ²½λ LLM λͺ¨λΈ**μ
λλ€.
|
|
|
|
|
|