EQUES/TinySwallow-Stratos-1.5B


stardust-eques committed on Feb 5

Commit 8ca6283 · verified · 1 Parent(s): d1d0465

Update README.md

Files changed (1)

README.md +9 -0


@@ -11,6 +11,14 @@ datasets:
 A lightweight LLM obtained by training [SakanaAI/TinySwallow-1.5B-Instruct](https://huggingface.co/SakanaAI/TinySwallow-1.5B-Instruct)
 on the [Stratos-35k](https://huggingface.co/datasets/bespokelabs/Bespoke-Stratos-35k) dataset.
+## What is TinySwallow
+A lightweight LLM developed by SakanaAI and the Swallow team. It uses the new distillation method [TAID](https://arxiv.org/abs/2501.16937).
+## What is Stratos-35k
+A reasoning dataset published by Bespoke Labs, consisting of questions, reasoning traces, and answers.
+It was synthesized using the much-discussed [DeepSeek-R1](https://github.com/deepseek-ai/DeepSeek-R1).
 | Model | Base model | Dataset | Size | Organization |
 | ---- | ---- | ---- | ---- | ---- |
@@ -136,4 +144,5 @@ Apache-2.0
 - SakanaAI & Swallow team : development and release of TinySwallow-1.5B
 - BespokeLabs : development and sharing of training code
 - NovaSkyAI : development and sharing of SkyThought
+- Authors of LlamaFactory