kenhktsui
/

Qwen2.5-3B-Instruct-GRPO-minp-sampling_temp_05

Text Generation

text-generation-inference

Model card Files Files and versions

Qwen2.5-3B-Instruct-GRPO-minp-sampling_temp_05

Ctrl+K

Ctrl+K

1 contributor

History: 5 commits

kenhktsui's picture

Upload model trained with Unsloth

f13df00 verified 7 months ago

.gitattributes

1.57 kB

Upload tokenizer 7 months ago
README.md

617 Bytes

Trained with Unsloth 7 months ago
added_tokens.json

605 Bytes

Upload tokenizer 7 months ago
config.json

808 Bytes

Trained with Unsloth 7 months ago
generation_config.json

139 Bytes

Trained with Unsloth 7 months ago
merges.txt

1.67 MB

Upload tokenizer 7 months ago
pytorch_model-00001-of-00002.bin
Detected Pickle imports (3)
- "collections.OrderedDict",
- "torch.HalfStorage",
- "torch._utils._rebuild_tensor_v2"
What is a pickle import?
4.96 GB
xet

Trained with Unsloth 7 months ago
pytorch_model-00002-of-00002.bin
Detected Pickle imports (3)
- "collections.OrderedDict",
- "torch.HalfStorage",
- "torch._utils._rebuild_tensor_v2"
What is a pickle import?
1.21 GB
xet

Trained with Unsloth 7 months ago
pytorch_model.bin.index.json

35.6 kB

Trained with Unsloth 7 months ago
special_tokens_map.json

614 Bytes

Upload tokenizer 7 months ago
tokenizer.json

11.4 MB
xet

Upload tokenizer 7 months ago
tokenizer_config.json

7.36 kB

Upload model trained with Unsloth 7 months ago
vocab.json

2.78 MB

Upload tokenizer 7 months ago