Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
demystify-long-cot
community
Activity Feed
Follow
4
AI & ML interests
None defined yet.
Recent Activity
yuexiang96
authored
a paper
11 days ago
Agent Data Protocol: Unifying Datasets for Diverse, Effective Fine-tuning of LLM Agents
yuexiang96
authored
a paper
11 days ago
The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution
yuexiang96
authored
a paper
11 days ago
Simulating Environments with Reasoning Models for Agent Training
View all activity
Team members
2
demystify-long-cot
's models
29
Sort: Recently updated
demystify-long-cot/llama-3.1-8b-webit231k-qwq-n2-raw-sft-ppo
8B
•
Updated
Jan 20
•
6
demystify-long-cot/llama-3.1-8b-webit231k-qwq-n1-raw-sft-ppo
8B
•
Updated
Jan 20
•
6
demystify-long-cot/llama-3.1-8b-webit462k-qwq-n8-rft
Updated
Jan 20
demystify-long-cot/llama-3.1-8b-webit462k-qwq-n4-rft
Updated
Jan 20
demystify-long-cot/llama-3.1-8b-webit462k-qwq-n2-rft
8B
•
Updated
Jan 20
•
6
demystify-long-cot/llama-3.1-8b-webit231k-qwq-n8-rft
8B
•
Updated
Jan 20
•
8
demystify-long-cot/llama-3.1-8b-webit231k-qwq-n4-rft
8B
•
Updated
Jan 20
•
11
demystify-long-cot/llama-3.1-8b-webit462k-qwq-n1-raw-sft
8B
•
Updated
Jan 20
•
6
demystify-long-cot/llama-3.1-8b-webit231k-qwq-n4-raw-sft
8B
•
Updated
Jan 20
•
9
demystify-long-cot/llama-3.1-8b-webit231k-qwq-n2-raw-sft
8B
•
Updated
Jan 20
•
7
demystify-long-cot/llama-3.1-8b-webit231k-qwq-n1-raw-sft
8B
•
Updated
Jan 20
•
5
demystify-long-cot/llama-3.1-8b-math-qwen-n256-rft
8B
•
Updated
Jan 20
•
4
demystify-long-cot/llama-3.1-8b-math-qwen-n32-rft-ppo
8B
•
Updated
Jan 20
•
5
demystify-long-cot/llama-3.1-8b-math-qwen-n32-rft
8B
•
Updated
Jan 20
•
7
demystify-long-cot/llama-3.1-8b-math-qwen-n64-rft-ppo
8B
•
Updated
Jan 20
•
5
demystify-long-cot/llama-3.1-8b-math-qwen-n64-rft
8B
•
Updated
Jan 20
•
7
demystify-long-cot/llama-3.1-8b-math-qwen-n128-rft-ppo
8B
•
Updated
Jan 20
•
6
demystify-long-cot/llama-3.1-8b-math-qwen-n128-rft
8B
•
Updated
Jan 20
•
7
demystify-long-cot/llama-3.1-8b-math-qwen-n16-rft
8B
•
Updated
Jan 20
•
7
demystify-long-cot/llama-3.1-8b-math-qwq-n192-rft-ppo
8B
•
Updated
Jan 20
•
7
demystify-long-cot/llama-3.1-8b-math-qwq-n192-rft
8B
•
Updated
Jan 20
•
8
demystify-long-cot/llama-3.1-8b-math-qwq-n16-rft
8B
•
Updated
Jan 20
•
6
demystify-long-cot/llama-3.1-8b-math-qwq-n64-rft-ppo
8B
•
Updated
Jan 20
•
6
demystify-long-cot/llama-3.1-8b-math-qwq-n64-rft
8B
•
Updated
Jan 20
•
5
demystify-long-cot/llama-3.1-8b-math-qwq-n32-rft-ppo
8B
•
Updated
Jan 20
•
4
demystify-long-cot/llama-3.1-8b-math-qwq-n32-rft
8B
•
Updated
Jan 20
•
10
demystify-long-cot/llama-3.1-8b-math-qwq-n256-rft
8B
•
Updated
Jan 20
•
8
demystify-long-cot/llama-3.1-8b-math-qwq-n128-rft-ppo
8B
•
Updated
Jan 20
•
6
demystify-long-cot/llama-3.1-8b-math-qwq-n128-rft
8B
•
Updated
Jan 20
•
4