When answering questions in Chinese, the model frequently terminates prematurely (outputs the end token). Is this a common problem?

#40
by zhangw355 - opened

As described in the title: I downloaded the model and ran inference with both the Transformers library and the vLLM framework. When asking questions in Chinese, the answers often terminate prematurely (the model emits the end-of-sequence token too early). Is this a known issue? What are the likely causes, and how can it be fixed?
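For reference, one common workaround people suggest for early EOS is to forbid the end token until a minimum number of new tokens has been generated (this is what the `min_new_tokens` generation parameter does in Transformers). A minimal sketch of the idea, independent of any particular model, where `eos_id` and the logit values are placeholder assumptions:

```python
import math

def mask_eos(logits, eos_id, generated_len, min_new_tokens):
    """Force the EOS logit to -inf until min_new_tokens have been produced,
    so sampling cannot pick the end token prematurely."""
    if generated_len < min_new_tokens:
        logits = list(logits)          # copy so the caller's list is untouched
        logits[eos_id] = -math.inf     # EOS becomes unselectable
    return logits

# Hypothetical 4-token vocabulary with EOS at index 3:
early = mask_eos([1.0, 0.5, 0.2, 2.0], eos_id=3, generated_len=2, min_new_tokens=10)
late = mask_eos([1.0, 0.5, 0.2, 2.0], eos_id=3, generated_len=12, min_new_tokens=10)
```

In practice you would not implement this by hand: with Transformers you can pass `min_new_tokens` to `model.generate(...)`, which applies the equivalent logit mask internally.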
