Update README.md
Browse files
README.md
CHANGED
@@ -8,7 +8,7 @@ base_model:
|
|
8 |
- meta-llama/Llama-3.1-8B-Instruct
|
9 |
---
|
10 |
|
11 |
-
This model is presented in the paper [Preference Learning Unlocks LLMs' Psycho-Counseling Skills](https://hf.co/papers/2502.19731). It's a fine-tuned Llama
|
12 |
|
13 |
The model aims to improve the quality of responses in psycho-counseling sessions and achieves a win rate of 87% against GPT-4o.
|
14 |
|
|
|
8 |
- meta-llama/Llama-3.1-8B-Instruct
|
9 |
---
|
10 |
|
11 |
+
This model is presented in the paper [Preference Learning Unlocks LLMs' Psycho-Counseling Skills](https://hf.co/papers/2502.19731). It's a fine-tuned [meta-llama/Llama-3.1-8B-Instruct](https://huggingface.co/meta-llama/Llama-3.1-8B-Instruct) model trained using preference learning on the [PsychoCounsel-Preference](https://huggingface.co/datasets/Psychotherapy-LLM/PsychoCounsel-Preference) dataset. This dataset contains 36k high-quality preference comparison pairs aligned with the preferences of professional psychotherapists.
|
12 |
|
13 |
The model aims to improve the quality of responses in psycho-counseling sessions and achieves a win rate of 87% against GPT-4o.
|
14 |
|