Update README.md
Browse files
README.md
CHANGED
@@ -147,4 +147,18 @@ train/train_longpo.py \
|
|
147 |
| Mistral-7B-LongPO-256K-EXP | 59.47 | 60.28 | 83.14 | 78.14 | 70.26 |
|
148 |
| Mistral-7B-LongPO-512K-EXP | 59.51 | 60.58 | 82.87 | 77.66 | 70.16 |
|
149 |
| Qwen2.5-7B-Instruct | 74.28 | 67.15 | 81.41 | 74.66 | 74.38 |
|
150 |
-
| Qwen2.5-7B-LongPO-128K | 73.64 | 65.70 | 80.82 | 74.98 | 73.79 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
147 |
| Mistral-7B-LongPO-256K-EXP | 59.47 | 60.28 | 83.14 | 78.14 | 70.26 |
|
148 |
| Mistral-7B-LongPO-512K-EXP | 59.51 | 60.58 | 82.87 | 77.66 | 70.16 |
|
149 |
| Qwen2.5-7B-Instruct | 74.28 | 67.15 | 81.41 | 74.66 | 74.38 |
|
150 |
+
| Qwen2.5-7B-LongPO-128K | 73.64 | 65.70 | 80.82 | 74.98 | 73.79 |
|
151 |
+
|
152 |
+
|
153 |
+
## Citation
|
154 |
+
If you find our project useful, hope you can star our repo and cite our paper as follows:
|
155 |
+
```
|
156 |
+
@inproceedings{
|
157 |
+
chen2025longpo,
|
158 |
+
title={Long{PO}: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization},
|
159 |
+
author={Guanzheng Chen and Xin Li and Michael Shieh and Lidong Bing},
|
160 |
+
booktitle={The Thirteenth International Conference on Learning Representations},
|
161 |
+
year={2025},
|
162 |
+
url={https://openreview.net/forum?id=qTrEq31Shm}
|
163 |
+
}
|
164 |
+
```
|