DeepSeek R1 full-power version occasionally ends without returning </think>.
#196, opened by yizhiezi
As the title says, this is a full-power R1 deployment on SGLang. In rare cases, the response looks like "<think> blah blah blah blah blah", with no closing "</think>". Shouldn't "<think>" and "</think>" always appear in pairs? Without the closing tag, how can the thinking part be distinguished from the actual answer?
This issue occurs whether or not the final "<think>" is included in the chat_template. The probability is roughly 1 in 1000.
Has anyone encountered this issue? Looking for possible causes and solutions.
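For reference, here is a minimal sketch of how the client side could at least detect these cases. It does not explain or fix the missing tag. It assumes the raw completion text is available as a plain string and that the opening "<think>" may or may not be present, depending on the chat_template. The names `split_reasoning` and `ParsedResponse` are hypothetical helpers, not part of SGLang or any DeepSeek tooling.

```python
from typing import NamedTuple, Optional

THINK_OPEN = "<think>"
THINK_CLOSE = "</think>"


class ParsedResponse(NamedTuple):
    reasoning: str
    answer: Optional[str]  # None when the closing tag never appeared
    closed: bool           # True only if "</think>" was found


def split_reasoning(text: str) -> ParsedResponse:
    """Split a raw completion into reasoning and answer (hypothetical helper)."""
    body = text.lstrip()
    # The opening tag is optional: some chat templates already emit it
    # as part of the prompt, so the completion starts mid-reasoning.
    if body.startswith(THINK_OPEN):
        body = body[len(THINK_OPEN):]
    # Split on the first closing tag only.
    head, sep, tail = body.partition(THINK_CLOSE)
    if not sep:
        # No closing tag: treat everything as unfinished reasoning.
        return ParsedResponse(body.strip(), None, False)
    return ParsedResponse(head.strip(), tail.strip(), True)
```

When `closed` is False, the caller can flag the response and decide what to do with it, for example retry the request or discard the output, instead of guessing where the answer starts.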