zhs12 commited on
Commit
44e3134
·
verified ·
1 Parent(s): b5726e1

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -1
README.md CHANGED
@@ -13,6 +13,8 @@ base_model:
13
  | [**Light-R1-7B-DS (ours)** 🤗](https://huggingface.co/qihoo360/Light-R1-7B-DS) |DeepSeek-R1-Distill-Qwen-7B|25.3.12|**59.1**|**44.3**|**49.4**|
14
  | [Light-R1-32B (ours) 🤗](https://huggingface.co/qihoo360/Light-R1-32B) |Qwen2.5-32B-Instruct|25.3.4|76.6|64.6|61.8|
15
 
 
 
16
  [GitHub page](https://github.com/Qihoo360/Light-R1)
17
 
18
 
@@ -21,7 +23,7 @@ Light-R1-7B-DS also performed well on GPQA *without* any specific training.
21
 
22
  Originated from DeepSeek-R1-Distill-Qwen-7B, Light-R1-7B-DS is further trained with only [3K SFT data](https://huggingface.co/datasets/qihoo360/Light-R1-SFTData) as we've open-sourced, demonstrating the strong applicability of the released data.
23
 
24
- We are excited to release this model along with the [technical report](https://github.com/Qihoo360/Light-R1/blob/main/Light-R1.pdf).
25
 
26
  ## Usage
27
  Same as DeepSeek-R1-Distill-Qwen-7B.
 
13
  | [**Light-R1-7B-DS (ours)** 🤗](https://huggingface.co/qihoo360/Light-R1-7B-DS) |DeepSeek-R1-Distill-Qwen-7B|25.3.12|**59.1**|**44.3**|**49.4**|
14
  | [Light-R1-32B (ours) 🤗](https://huggingface.co/qihoo360/Light-R1-32B) |Qwen2.5-32B-Instruct|25.3.4|76.6|64.6|61.8|
15
 
16
+ [technical report](https://arxiv.org/abs/2503.10460)
17
+
18
  [GitHub page](https://github.com/Qihoo360/Light-R1)
19
 
20
 
 
23
 
24
  Originated from DeepSeek-R1-Distill-Qwen-7B, Light-R1-7B-DS is further trained with only [3K SFT data](https://huggingface.co/datasets/qihoo360/Light-R1-SFTData) as we've open-sourced, demonstrating the strong applicability of the released data.
25
 
26
+ We are excited to release this model along with the [technical report](https://arxiv.org/abs/2503.10460).
27
 
28
  ## Usage
29
  Same as DeepSeek-R1-Distill-Qwen-7B.