Update README.md
README.md CHANGED
@@ -13,7 +13,43 @@ pipeline_tag: text-generation
## Model Description

### Model Details
- **Name**: Carrot Llama-3.2 Rabbit Ko 2412
- **Version**: 3B Instruct
- **Base Model**: CarrotAI/Llama-3.2-Rabbit-Ko-3B-Instruct-2412
- **Languages**: Korean, English
- **Model Type**: Large Language Model (Instruction-tuned)

### Training Process
The model went through the following main training stages:

1. **SFT (Supervised Fine-Tuning)**
   - Fine-tuned the base model on high-quality Korean and English datasets

2. **DPO (Direct Preference Optimization)**
   - Improved response quality by directly incorporating human preference data
   - Optimized the model for consistency, accuracy, and alignment with user preferences (a sketch of the objective follows this list)

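The card does not include the project's training code; the following is only a minimal sketch of the standard DPO objective described above, written in PyTorch. The function name, tensor arguments, and the `beta` value are assumptions made for illustration.

```python
import torch
import torch.nn.functional as F

def dpo_loss(policy_chosen_logps: torch.Tensor,
             policy_rejected_logps: torch.Tensor,
             ref_chosen_logps: torch.Tensor,
             ref_rejected_logps: torch.Tensor,
             beta: float = 0.1) -> torch.Tensor:
    """Standard DPO loss over a batch of preference pairs.

    Each argument holds the summed log-probability of the chosen or rejected
    response under the trainable policy or the frozen reference model.
    """
    # Implicit rewards: how far the policy has moved from the reference model.
    chosen_rewards = beta * (policy_chosen_logps - ref_chosen_logps)
    rejected_rewards = beta * (policy_rejected_logps - ref_rejected_logps)
    # Encourage the chosen response to be ranked above the rejected one.
    return -F.logsigmoid(chosen_rewards - rejected_rewards).mean()
```

In practice this objective is usually applied through a preference-optimization library such as `trl` rather than implemented by hand.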
### Limitations
- Limited performance on complex tasks due to the 3B-parameter scale
- Lacks deep expertise in specialized domains
- Potential for bias and hallucination

### Ethics Statement
Ethical considerations were taken into account as much as possible during model development, but users should always review the model's outputs critically.

### How to Use
```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model = AutoModelForCausalLM.from_pretrained("CarrotAI/Llama-3.2-Rabbit-Ko-3B-Instruct-2412")
tokenizer = AutoTokenizer.from_pretrained("CarrotAI/Llama-3.2-Rabbit-Ko-3B-Instruct-2412")
```

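The snippet above only loads the model and tokenizer. As a minimal usage sketch continuing from it (the prompt, greedy decoding, and `max_new_tokens` value are illustrative assumptions, not part of the original card):

```python
# Continue from the loading snippet above: build a chat prompt and generate.
messages = [
    # Illustrative Korean prompt: "What is the capital of South Korea?"
    {"role": "user", "content": "대한민국의 수도는 어디인가요?"},
]
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

output_ids = model.generate(input_ids, max_new_tokens=256, do_sample=False)
# Decode only the newly generated tokens, skipping the prompt.
response = tokenizer.decode(output_ids[0][input_ids.shape[-1]:], skip_special_tokens=True)
print(response)
```
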
## Score
### Performance Metrics
LogicKor

| Category | Single turn | Multi turn |
|---|---|---|