AXCXEPT
/

phi-4-deepseek-R1K-RL-EZO

Text Generation

text-generation-inference

Model card Files Files and versions Community

AXCXEPT commited on Jan 30

Commit

94003ae

·

verified ·

1 Parent(s): 944bf35

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -26,7 +26,7 @@ pipeline_tag: text-generation
 #### EZO × PHI-4 × RL - Advancing LLM Training with Deepseek Knowledge
 ##### Overview
-This model is the result of combining OpenAI’s Phi-4 with a reinforcement learning (RL) approach, incorporating insights from the latest research on Deepseek R1. By leveraging a novel training methodology, we successfully improved both Japanese and English capabilities while maintaining a high level of performance across key benchmarks.
 ##### Key Features & Improvements
 Enhanced Multilingual Performance: Unlike previous iterations, this model strengthens English capabilities without compromising Japanese proficiency.

 #### EZO × PHI-4 × RL - Advancing LLM Training with Deepseek Knowledge
 ##### Overview
+This model is the result of combining Phi-4 with a reinforcement learning (RL) approach, incorporating insights from the latest research on Deepseek R1. By leveraging a novel training methodology, we successfully improved both Japanese and English capabilities while maintaining a high level of performance across key benchmarks.
 ##### Key Features & Improvements
 Enhanced Multilingual Performance: Unlike previous iterations, this model strengthens English capabilities without compromising Japanese proficiency.