yolay
/

RAIF-DeepSeek-Qwen-7B

Text Generation

text-generation-inference

Model card Files Files and versions

yolay commited on Jul 31

Commit

6f748c7

·

verified ·

1 Parent(s): 17999bb

Update README.md

Files changed (1) hide show

README.md +0 -9

README.md CHANGED Viewed

@@ -16,15 +16,6 @@ The model DeepSeek-Qwen-7B is our optimized model for its advanced instruction-f
 **Code**: [https://github.com/yuleiqin/RAIF](https://github.com/yuleiqin/RAIF)
-## Overview and Framework
-Our preliminary experiments confirm that the reasoning (e.g., triggered by CoT prompting) of fast-thinking LLMs (instructed models) are often shallow and superficial. Such reasoning only briefly repeats parts of the input requests and fails to extract key components from the complex instructions that are often composed of various sub-instructions, constraints, and rules. On the contrary, existing slow-thinking LLMs (reasoning models) demonstrate superior performance where their deep, organized reasoning truly help the analyses of complex instructions and provide the decomposed action steps to the final answer. Consequently, it is important to incentivize the authentic reasoning of LLMs to solve complex instructions.
-![The shallow, superficial reasoning of LLMs cannot improve but instead degrade performance under complex instructions.](./images/intro.png)
-In this project, we present a reinforcement learning-based method for cultivation of the deep reasoning of LLMs.
-![The proposed method is aimed at incentivizing the deep reasoning capabilities of instructed LLMs for solving complex instructions.](./images/framework.png)
 ## Usage


16
17	Code: [https://github.com/yuleiqin/RAIF](https://github.com/yuleiqin/RAIF)
18









19
20	## Usage
21