BUT-FIT
/

CSTinyLlama-1.2B

Text Generation

text-generation-inference

Model card Files Files and versions

mfajcik commited on Dec 16, 2024

Commit

8c8d368

·

verified ·

1 Parent(s): b3cba5a

Update README.md

Files changed (1) hide show

README.md +1 -0

README.md CHANGED Viewed

@@ -28,6 +28,7 @@ Below we
 ## Distance in y Between Fine-Tuning and Training from Scratch
 <img src="figures/tllama_test_distance.png" width="900"/>
 ## Training parameters
 Not mentioned parameters are the same as for [TinyLLama-2.5T](https://huggingface.co/TinyLlama/TinyLlama-1.1B-intermediate-step-1195k-token-2.5T).

 ## Distance in y Between Fine-Tuning and Training from Scratch
 <img src="figures/tllama_test_distance.png" width="900"/>
+The distance |x1-x2| with same function value f1(x1)=f2(x2) grows with more steps. On convergence, it starts to rapidly increase (perhaps exponentially).
 ## Training parameters
 Not mentioned parameters are the same as for [TinyLLama-2.5T](https://huggingface.co/TinyLlama/TinyLlama-1.1B-intermediate-step-1195k-token-2.5T).