Update README.md
README.md
CHANGED
@@ -26,7 +26,7 @@ pipeline_tag: text-generation
 
 #### EZO × PHI-4 × RL - Advancing LLM Training with Deepseek Knowledge
 ##### Overview
-This model is the result of combining
+This model is the result of combining Phi-4 with a reinforcement learning (RL) approach, incorporating insights from the latest research on Deepseek R1. By leveraging a novel training methodology, we successfully improved both Japanese and English capabilities while maintaining a high level of performance across key benchmarks.
 
 ##### Key Features & Improvements
 Enhanced Multilingual Performance: Unlike previous iterations, this model strengthens English capabilities without compromising Japanese proficiency.