Update README.md
Browse files
README.md
CHANGED
@@ -13,6 +13,8 @@ base_model:
|
|
13 |
| [**Light-R1-7B-DS (ours)** 🤗](https://huggingface.co/qihoo360/Light-R1-7B-DS) |DeepSeek-R1-Distill-Qwen-7B|25.3.12|**59.1**|**44.3**|**49.4**|
|
14 |
| [Light-R1-32B (ours) 🤗](https://huggingface.co/qihoo360/Light-R1-32B) |Qwen2.5-32B-Instruct|25.3.4|76.6|64.6|61.8|
|
15 |
|
|
|
|
|
16 |
[GitHub page](https://github.com/Qihoo360/Light-R1)
|
17 |
|
18 |
|
@@ -21,7 +23,7 @@ Light-R1-7B-DS also performed well on GPQA *without* any specific training.
|
|
21 |
|
22 |
Originated from DeepSeek-R1-Distill-Qwen-7B, Light-R1-7B-DS is further trained with only [3K SFT data](https://huggingface.co/datasets/qihoo360/Light-R1-SFTData) as we've open-sourced, demonstrating the strong applicability of the released data.
|
23 |
|
24 |
-
We are excited to release this model along with the [technical report](https://
|
25 |
|
26 |
## Usage
|
27 |
Same as DeepSeek-R1-Distill-Qwen-7B.
|
|
|
13 |
| [**Light-R1-7B-DS (ours)** 🤗](https://huggingface.co/qihoo360/Light-R1-7B-DS) |DeepSeek-R1-Distill-Qwen-7B|25.3.12|**59.1**|**44.3**|**49.4**|
|
14 |
| [Light-R1-32B (ours) 🤗](https://huggingface.co/qihoo360/Light-R1-32B) |Qwen2.5-32B-Instruct|25.3.4|76.6|64.6|61.8|
|
15 |
|
16 |
+
[technical report](https://arxiv.org/abs/2503.10460)
|
17 |
+
|
18 |
[GitHub page](https://github.com/Qihoo360/Light-R1)
|
19 |
|
20 |
|
|
|
23 |
|
24 |
Originated from DeepSeek-R1-Distill-Qwen-7B, Light-R1-7B-DS is further trained with only [3K SFT data](https://huggingface.co/datasets/qihoo360/Light-R1-SFTData) as we've open-sourced, demonstrating the strong applicability of the released data.
|
25 |
|
26 |
+
We are excited to release this model along with the [technical report](https://arxiv.org/abs/2503.10460).
|
27 |
|
28 |
## Usage
|
29 |
Same as DeepSeek-R1-Distill-Qwen-7B.
|