ghostplant
AI & ML interests
None yet
Recent Activity
new activity about 14 hours ago
deepseek-ai/DeepSeek-R1: Deploying production ready service with Unsloth GGUF quants on your AWS account. (4 x L40S)
new activity about 15 hours ago
deepseek-ai/DeepSeek-R1: Does R1 support long context (> 4K)?
new activity 3 days ago
deepseek-ai/DeepSeek-R1: 90+ tokens per second for MI300x8 using batch_size = 1
Organizations
None yet
ghostplant's activity
Deploying production ready service with Unsloth GGUF quants on your AWS account. (4 x L40S)
8
#171 opened about 16 hours ago
by
samagra-tensorfuse
Does R1 support long context (> 4K)?
#172 opened about 15 hours ago
by
ghostplant
90+ tokens per second for MI300x8 using batch_size = 1
1
#166 opened 3 days ago
by
ghostplant
Which is better, Q2_K_XL or Q4?
1
#34 opened 6 days ago
by
jializou

So how much VRAM is needed to deploy a 671B model? Is there a baseline hardware configuration?
25
#118 opened 20 days ago
by
cena163

How much vram do you need?
8
#12 opened 22 days ago
by
hyun10
Is there a model removing non-shared MoE experts?
4
#17 opened 24 days ago
by
ghostplant
Please convert these models to GGUF format...
5
#12 opened about 1 month ago
by
Moodym