RLHFlow
/

DPA-v1-Mistral-7B

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Haoxiang-Wang commited on May 23, 2024

Commit

88606fe

·

verified ·

1 Parent(s): 7c69a79

Update README.md

Files changed (1) hide show

README.md +3 -5

README.md CHANGED Viewed

@@ -28,7 +28,7 @@ This is the model card of a 🤗 transformers model that has been pushed on the
 <!-- Provide the basic links for the model. -->
 - **Repository:** https://github.com/RLHFlow/directional-preference-alignment
-- **Paper [optional]:** https://arxiv.org/abs/2402.18571
 ## How to Get Started with the Model
@@ -83,13 +83,11 @@ print(generated_response)
 **BibTeX:**
 If you find this work useful to your research, please consider citing our paper
 ```
-@article{wang2024arithmetic,
       title={Arithmetic Control of LLMs for Diverse User Preferences: Directional Preference Alignment with Multi-Objective Rewards},
       author={Haoxiang Wang and Yong Lin and Wei Xiong and Rui Yang and Shizhe Diao and Shuang Qiu and Han Zhao and Tong Zhang},
       year={2024},
-      eprint={2402.18571},
-      archivePrefix={arXiv},
-      primaryClass={cs.LG}
 }
 ```

 <!-- Provide the basic links for the model. -->
 - **Repository:** https://github.com/RLHFlow/directional-preference-alignment
+- **Paper [ACL 2024]:** [Arithmetic Control of LLMs for Diverse User Preferences: Directional Preference Alignment with Multi-Objective Rewards](https://arxiv.org/abs/2402.18571)
 ## How to Get Started with the Model
 **BibTeX:**
 If you find this work useful to your research, please consider citing our paper
 ```
+@inproceedings{wang2024arithmetic,
       title={Arithmetic Control of LLMs for Diverse User Preferences: Directional Preference Alignment with Multi-Objective Rewards},
       author={Haoxiang Wang and Yong Lin and Wei Xiong and Rui Yang and Shizhe Diao and Shuang Qiu and Han Zhao and Tong Zhang},
       year={2024},
+      booktitle={ACL},
 }
 ```