Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Cran-May
/
CohenQu-DeepSeek-R1-Distill-Qwen-1.5B-GRPO-duplicate-fixed-6140715-Q5_K_M-GGUF
like
2
Transformers
GGUF
hf-cmu-collab/DeepScaleR-1.5B-Preview_on-policy_GRPO
Generated from Trainer
trl
grpo
llama-cpp
gguf-my-repo
Inference Endpoints
imatrix
conversational
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
CohenQu-DeepSeek-R1-Distill-Qwen-1.5B-GRPO-duplicate-fixed-6140715-Q5_K_M-GGUF
1 contributor
History:
4 commits
Cran-May
Upload README.md with huggingface_hub
1bda063
verified
3 days ago
.gitattributes
1.69 kB
Upload imatrix.dat with huggingface_hub
3 days ago
README.md
2.59 kB
Upload README.md with huggingface_hub
3 days ago
cohenqu-deepseek-r1-distill-qwen-1.5b-grpo-duplicate-fixed-6140715-q5_k_m-imat.gguf
1.29 GB
LFS
Upload cohenqu-deepseek-r1-distill-qwen-1.5b-grpo-duplicate-fixed-6140715-q5_k_m-imat.gguf with huggingface_hub
3 days ago
imatrix.dat
2.04 MB
LFS
Upload imatrix.dat with huggingface_hub
3 days ago