Zhangchen Xu committed · Commit b516ab5 (verified) · 1 Parent(s): 5ee112a

Update README.md

  1. README.md +7 -0
README.md CHANGED
@@ -278,6 +278,13 @@ If you find the model, data, or code useful, please cite:
   archivePrefix={arXiv},
   primaryClass={cs.CL}
   }
+
+ @article{xu2024stronger,
+ title={Stronger Models are NOT Stronger Teachers for Instruction Tuning},
+ author={Xu, Zhangchen and Jiang, Fengqing and Niu, Luyao and Lin, Bill Yuchen and Poovendran, Radha},
+ journal={arXiv preprint arXiv:2411.07133},
+ year={2024}
+ }
   ```
 
   **Contact**