openchat/openchat-3.5-1210

alpayariyak committed on Dec 12, 2023

Commit e23a160 · 1 Parent(s): daf77b7

Update README.md

Files changed (1)

README.md +2 -2

@@ -38,13 +38,13 @@ pipeline_tag: text-generation
 | Model                       | HumanEval+ |
 |-----------------------------|------------|
-| WizardCoder-Python-34B-V1.0 | 64.6       |
 | GPT-3.5 (December 2023)     | 64.6       |
 | **OpenChat 3.5 1210**       | **63.4**   |
+| GPT-3.5 (March 2023)     | 64.6       |
 | OpenHermes 2.5              | 41.5       |
   <div align="center" style="justify-content: center; align-items: center; "'>
-  <img src="https://github.com/alpayariyak/openchat/blob/master/assets/Untitled%20design-17.png?raw=true" style="width: 100%;  border-radius: 0.5em">
+  <img src="https://github.com/alpayariyak/openchat/blob/master/assets/3.5-benchmarks.png?raw=true" style="width: 100%;  border-radius: 0.5em">
   </div>
 OpenChat is an innovative library of open-source language models, fine-tuned with [C-RLFT](https://arxiv.org/pdf/2309.11235.pdf) - a strategy inspired by offline reinforcement learning. Our models learn from mixed-quality data without preference labels, delivering exceptional performance on par with ChatGPT, even with a 7B model. Despite our simple approach, we are committed to developing a high-performance, commercially viable, open-source large language model, and we continue to make significant strides toward this vision.