high-low mode

#3
by LD-inform - opened

Hi, is there a way i can determine how much will model think through the problem? (how many tokens should he spend on thinking)
When i read thinking block of distilled 1.5B i have impression that it is going deeper with thinking about it. 7B thinking is less in depth from my observations

Sign up or log in to comment