Downtown-Case
AI & ML interests
None yet
Recent Activity
liked
a model
2 days ago
Genius-Society/piano_trans
new activity
4 days ago
DeusImperator/DeepSeek-R1-Distill-Qwen-32B_exl2_4.5bpw_L: Script used?
new activity
4 days ago
DeusImperator/Mistral-Small-24B-Instruct-2501_exl2_6.5bpw_L: Thanks.
Organizations
None yet
Downtown-Case's activity
Script used?
3
#1 opened 4 days ago
by
Downtown-Case
Thanks.
5
#1 opened 2 months ago
by
dinerburger

Are any of these the QAT releases of Gemma 3?
6
#15 opened 17 days ago
by
Downtown-Case
Where are the QAT releases for Gemma 3?
#28 opened 17 days ago
by
Downtown-Case
What are these GGUFs
#2 opened 17 days ago
by
Downtown-Case
Could you please finetune this on the base model, instead of instruct?
1
#1 opened 5 months ago
by
Downtown-Case
Awesome! 88K context on 24GB works.
#3 opened 5 months ago
by
Downtown-Case
This looks great
6
#1 opened 5 months ago
by
DazzlingXeno
Is this trained off the base or instruct model?
2
#1 opened 5 months ago
by
Downtown-Case
Clarifications on how to use YaRN
5
#5 opened 6 months ago
by
Downtown-Case
128K usage?
2
#1 opened 6 months ago
by
Downtown-Case
Questions, again.
5
#1 opened 7 months ago
by
DazzlingXeno
Base or chat?
3
#1 opened 8 months ago
by
DazzlingXeno
Why 12b? Who could run that locally?
47
#1 opened 8 months ago
by
kaidu88
Good at 128K!
1
#2 opened 8 months ago
by
Downtown-Case
Does this work for anyone?
4
#1 opened 9 months ago
by
Downtown-Case