Safetensors
llama
MilesQLi commited on
Commit
bb3947b
·
verified ·
1 Parent(s): 23e75e2

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -17,7 +17,7 @@ This repository provides access to **DMaS-LLaMa-Lite-step-3.3k**, a 1.7-billion-
17
  - **Parameters**: 1.7B (36 layers, 32 attention heads, RMSNorm)
18
  - **Tokenizer**: GPT-2 tokenizer
19
  - **Training Data**: FineWeb-Edu subset (educational text)
20
- - **Training Steps**: 2,700
21
  - **Optimizer**: AdamW with linear warmup and decay
22
  - **Hardware**: Trained on 1-2 RTX A6000 GPUs with PyTorch DDP
23
  - **Dataset Source**: [FineWeb-Edu Dataset](https://huggingface.co/datasets/HuggingFaceFW/fineweb-edu)
 
17
  - **Parameters**: 1.7B (36 layers, 32 attention heads, RMSNorm)
18
  - **Tokenizer**: GPT-2 tokenizer
19
  - **Training Data**: FineWeb-Edu subset (educational text)
20
+ - **Training Steps**: 3,300
21
  - **Optimizer**: AdamW with linear warmup and decay
22
  - **Hardware**: Trained on 1-2 RTX A6000 GPUs with PyTorch DDP
23
  - **Dataset Source**: [FineWeb-Edu Dataset](https://huggingface.co/datasets/HuggingFaceFW/fineweb-edu)