Ryan Marten
ryanmarten
AI & ML interests
None yet
Recent Activity
liked a model, about 2 hours ago
deepseek-ai/DeepSeek-V3
new activity, about 2 hours ago
bespokelabs/Bespoke-Stratos-32B: What's your benchmark settings for DeepSeek-R1-Distill-Qwen-32B??
new activity, about 2 hours ago
open-thoughts/OpenThoughts-114k: The meaning of "distillation" - Does it require logit outputs from the teacher model?
ryanmarten's activity
What's your benchmark settings for DeepSeek-R1-Distill-Qwen-32B??
2
#2 opened about 22 hours ago by AaronFeng753
The meaning of "distillation" - Does it require logit outputs from the teacher model?
1
#6 opened about 9 hours ago by saleem2
32,390 wrong math answers?
1
#3 opened about 7 hours ago by mlabonne
Only math
1
#2 opened about 15 hours ago by ryanmarten
Just want to confirm, this is full R1 data?
5
#3 opened 2 days ago by teknium
fix table
1
#1 opened 8 days ago by nlpguy
fix table
1
#1 opened 8 days ago by nlpguy