DeepSeek-R1-Math-114k / generation_config.json
Upload model trained with Unsloth
fa95ac9 verified
231 Bytes
{
  "_from_model_config": true,
  "bos_token_id": 128000,
  "do_sample": true,
  "eos_token_id": 128001,
  "max_length": 131072,
  "pad_token_id": 128004,
  "temperature": 0.6,
  "top_p": 0.95,
  "transformers_version": "4.48.2"
}
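
The sampling fields above (`do_sample`, `temperature`, `top_p`) can be illustrated with a minimal sketch. It parses the same JSON with the standard library and applies temperature scaling followed by a simplified top-p (nucleus) filter to a toy logit vector. The helper `nucleus_probs` and the values in `toy_logits` are hypothetical and exist only for illustration; this is not the code path the `transformers` library uses internally, only an approximation of the idea.

```python
import json
import math

# The config body from this file, embedded so the sketch is self-contained.
CONFIG_TEXT = """
{
  "_from_model_config": true,
  "bos_token_id": 128000,
  "do_sample": true,
  "eos_token_id": 128001,
  "max_length": 131072,
  "pad_token_id": 128004,
  "temperature": 0.6,
  "top_p": 0.95,
  "transformers_version": "4.48.2"
}
"""

config = json.loads(CONFIG_TEXT)


def nucleus_probs(logits, temperature, top_p):
    """Hypothetical helper: temperature-scaled softmax, then top-p filtering.

    Returns a dict mapping kept token index -> renormalized probability.
    """
    # Temperature below 1.0 sharpens the distribution before the softmax.
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    probs = [e / total for e in exps]

    # Keep the smallest set of highest-probability tokens whose
    # cumulative mass reaches top_p.
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    kept, cumulative = [], 0.0
    for i in order:
        kept.append(i)
        cumulative += probs[i]
        if cumulative >= top_p:
            break

    kept_mass = sum(probs[i] for i in kept)
    return {i: probs[i] / kept_mass for i in kept}


# Toy logits for a four-token vocabulary (made-up values).
toy_logits = [2.0, 1.0, 0.0, -3.0]
filtered = nucleus_probs(toy_logits, config["temperature"], config["top_p"])
```

With `do_sample` set to `true`, a token would then be drawn at random from `filtered` rather than always taking the most likely one. If the `transformers` library is installed, `GenerationConfig.from_pretrained` should be able to load this file directly from the repository instead of parsing the JSON by hand.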