Update README.md
--- a/README.md
+++ b/README.md
@@ -17,7 +17,7 @@ This repository provides access to **DMaS-LLaMa-Lite-step-3.3k**, a 1.7-billion-
 - **Parameters**: 1.7B (36 layers, 32 attention heads, RMSNorm)
 - **Tokenizer**: GPT-2 tokenizer
 - **Training Data**: FineWeb-Edu subset (educational text)
-- **Training Steps**:
+- **Training Steps**: 3,300
 - **Optimizer**: AdamW with linear warmup and decay
 - **Hardware**: Trained on 1-2 RTX A6000 GPUs with PyTorch DDP
 - **Dataset Source**: [FineWeb-Edu Dataset](https://huggingface.co/datasets/HuggingFaceFW/fineweb-edu)
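
Review note: the README names the schedule ("linear warmup and decay") but does not give its hyperparameters. As a rough illustration of what that schedule shape usually means, here is a minimal pure-Python sketch. The function name `linear_warmup_decay_lr` and the values of `peak_lr`, `warmup_steps`, and `total_steps` are hypothetical and are not taken from this repository. In particular, the 3,300-step checkpoint may sit partway through a longer schedule.

```python
def linear_warmup_decay_lr(step, peak_lr, warmup_steps, total_steps):
    """Learning rate at `step`: linear ramp from 0 to peak_lr, then linear decay to 0.

    All arguments are illustrative assumptions; the README does not state them.
    """
    if warmup_steps <= 0 or total_steps <= warmup_steps:
        raise ValueError("need 0 < warmup_steps < total_steps")
    if step < warmup_steps:
        # Warmup phase: scale linearly from 0 up to peak_lr.
        return peak_lr * step / warmup_steps
    if step >= total_steps:
        # Past the end of the schedule the rate stays at zero.
        return 0.0
    # Decay phase: scale linearly from peak_lr down to 0 at total_steps.
    return peak_lr * (total_steps - step) / (total_steps - warmup_steps)


if __name__ == "__main__":
    # Hypothetical settings, chosen only to show the shape of the curve.
    for s in (0, 150, 300, 1800, 3300):
        lr = linear_warmup_decay_lr(s, peak_lr=3e-4, warmup_steps=300, total_steps=3300)
        print(s, lr)
```

If the actual warmup length and peak learning rate are known, listing them next to the **Optimizer** bullet would make the card easier to reproduce.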