Update README.md
Browse files
README.md
CHANGED
@@ -28,7 +28,7 @@ This is the model card of a 🤗 transformers model that has been pushed on the
|
|
28 |
<!-- Provide the basic links for the model. -->
|
29 |
|
30 |
- **Repository:** https://github.com/RLHFlow/directional-preference-alignment
|
31 |
-
- **Paper [
|
32 |
|
33 |
## How to Get Started with the Model
|
34 |
|
@@ -83,13 +83,11 @@ print(generated_response)
|
|
83 |
**BibTeX:**
|
84 |
If you find this work useful to your research, please consider citing our paper
|
85 |
```
|
86 |
-
@
|
87 |
title={Arithmetic Control of LLMs for Diverse User Preferences: Directional Preference Alignment with Multi-Objective Rewards},
|
88 |
author={Haoxiang Wang and Yong Lin and Wei Xiong and Rui Yang and Shizhe Diao and Shuang Qiu and Han Zhao and Tong Zhang},
|
89 |
year={2024},
|
90 |
-
|
91 |
-
archivePrefix={arXiv},
|
92 |
-
primaryClass={cs.LG}
|
93 |
}
|
94 |
```
|
95 |
|
|
|
28 |
<!-- Provide the basic links for the model. -->
|
29 |
|
30 |
- **Repository:** https://github.com/RLHFlow/directional-preference-alignment
|
31 |
+
- **Paper [ACL 2024]:** [Arithmetic Control of LLMs for Diverse User Preferences: Directional Preference Alignment with Multi-Objective Rewards](https://arxiv.org/abs/2402.18571)
|
32 |
|
33 |
## How to Get Started with the Model
|
34 |
|
|
|
83 |
**BibTeX:**
|
84 |
If you find this work useful to your research, please consider citing our paper
|
85 |
```
|
86 |
+
@inproceedings{wang2024arithmetic,
|
87 |
title={Arithmetic Control of LLMs for Diverse User Preferences: Directional Preference Alignment with Multi-Objective Rewards},
|
88 |
author={Haoxiang Wang and Yong Lin and Wei Xiong and Rui Yang and Shizhe Diao and Shuang Qiu and Han Zhao and Tong Zhang},
|
89 |
year={2024},
|
90 |
+
booktitle={ACL},
|
|
|
|
|
91 |
}
|
92 |
```
|
93 |
|