Curation of resources used in the paper "Demystifying Long Chain-of-Thought Reasoning in LLMs"
demystify-long-cot
Collections (1)
Models (29)
demystify-long-cot/llama-3.1-8b-webit231k-qwq-n2-raw-sft-ppo
demystify-long-cot/llama-3.1-8b-webit231k-qwq-n1-raw-sft-ppo
demystify-long-cot/llama-3.1-8b-webit462k-qwq-n8-rft
demystify-long-cot/llama-3.1-8b-webit462k-qwq-n4-rft
demystify-long-cot/llama-3.1-8b-webit462k-qwq-n2-rft
demystify-long-cot/llama-3.1-8b-webit231k-qwq-n8-rft
demystify-long-cot/llama-3.1-8b-webit231k-qwq-n4-rft
demystify-long-cot/llama-3.1-8b-webit462k-qwq-n1-raw-sft
demystify-long-cot/llama-3.1-8b-webit231k-qwq-n4-raw-sft
demystify-long-cot/llama-3.1-8b-webit231k-qwq-n2-raw-sft
Datasets (11)
demystify-long-cot/math-train-action-n40 (217k rows)
demystify-long-cot/math-train-qwen-rs-n256 (1.53M rows)
demystify-long-cot/math-train-qwen-rs-n128 (766k rows)
demystify-long-cot/math-train-qwen-rs-n64 (383k rows)
demystify-long-cot/math-train-qwen-rs-n32 (192k rows)
demystify-long-cot/math-train-qwq-rs-n256 (1.14M rows)
demystify-long-cot/math-train-qwq-rs-n192 (854k rows)
demystify-long-cot/math-train-qwq-rs-n128 (854k rows)
demystify-long-cot/math-train-qwq-rs-n64 (428k rows)
demystify-long-cot/math-train-qwq-rs-n32