Commit 6e46ae4 · 1 Parent(s): c351234
Update README.md
README.md CHANGED
@@ -115,6 +115,8 @@ This prompt was designed to elicit assessments of answers in terms of helpfulnes
 
 Follows the results against GPT-3.5 and Falcon, one of the highest performing open-source models at the moment:
 
+* Automatic Evaluation **in Portuguese**:
+
 | | **Lose** | **Tie** | **Win** |
 |------------------------|----------|---------|---------|
 | QUOKKA vs. **GPT-3.5** | 63.8% | 10.1% | 26.1% |