DAMO-NLP-SG
/

Mistral-7B-LongPO-512K-EXP

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Guanzheng commited on 5 days ago

Commit

63f1058

·

verified ·

1 Parent(s): e1b9d75

Update README.md

Files changed (1) hide show

README.md +15 -1

README.md CHANGED Viewed

@@ -147,4 +147,18 @@ train/train_longpo.py \
 | Mistral-7B-LongPO-256K-EXP | 59.47 | 60.28 | 83.14 | 78.14 | 70.26 |
 | Mistral-7B-LongPO-512K-EXP | 59.51 | 60.58 | 82.87 | 77.66 | 70.16 |
 | Qwen2.5-7B-Instruct | 74.28 | 67.15 | 81.41 | 74.66 | 74.38 |
-| Qwen2.5-7B-LongPO-128K | 73.64 | 65.70 | 80.82 | 74.98 | 73.79 |

 | Mistral-7B-LongPO-256K-EXP | 59.47 | 60.28 | 83.14 | 78.14 | 70.26 |
 | Mistral-7B-LongPO-512K-EXP | 59.51 | 60.58 | 82.87 | 77.66 | 70.16 |
 | Qwen2.5-7B-Instruct | 74.28 | 67.15 | 81.41 | 74.66 | 74.38 |
+| Qwen2.5-7B-LongPO-128K | 73.64 | 65.70 | 80.82 | 74.98 | 73.79 |
+## Citation
+If you find our project useful, hope you can star our repo and cite our paper as follows:
+```
+@inproceedings{
+    chen2025longpo,
+    title={Long{PO}: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization},
+    author={Guanzheng Chen and Xin Li and Michael Shieh and Lidong Bing},
+    booktitle={The Thirteenth International Conference on Learning Representations},
+    year={2025},
+    url={https://openreview.net/forum?id=qTrEq31Shm}
+}
+```