ghostplant's picture

24 4

ghostplant

ghostplant

·

AI & ML interests

None yet

Recent Activity

new activity about 1 month ago

deepseek-ai/DeepSeek-V3.2:DSA Question

new activity about 1 month ago

microsoft/VibeVoice-Realtime-0.5B:For those who need a simplified execution on NVIDIA GPU

updated a dataset 2 months ago

ghostplant/data-collections

View all activity

Organizations

None yet

New activity in deepseek-ai/DeepSeek-V3.2 about 1 month ago

DSA Question

#33 opened about 1 month ago by

New activity in microsoft/VibeVoice-Realtime-0.5B about 1 month ago

For those who need a simplified execution on NVIDIA GPU

#21 opened about 1 month ago by

New activity in deepseek-ai/DeepSeek-V3.2-Exp 4 months ago

Question about long-context evaluation in DeepSeek-V3.2-Exp

#15 opened 4 months ago by

New activity in openai/gpt-oss-120b 6 months ago

Can gpt-oss support local vllm deployment on a100 GPU?

#73 opened 6 months ago by

New activity in openai/gpt-oss-20b 6 months ago

Running gpt-oss Without FlashAttention 3 – Any Alternatives to Ollama?

#72 opened 6 months ago by

New activity in openai/gpt-oss-120b 6 months ago

Run GPT-OSS-120B with just Single A100 (80GB)

#80 opened 6 months ago by

New activity in Qwen/Qwen3-Coder-480B-A35B-Instruct 6 months ago

How is Qwen3's inv_freq computed from scratch?

#13 opened 6 months ago by

New activity in moonshotai/Kimi-K2-Instruct 7 months ago

Run 1T-param on A100/H100(80G)x8 using FP4

#9 opened 7 months ago by

New activity in nvidia/DeepSeek-R1-NVFP4 7 months ago

quantize deepseek-r1-0528 please

#14 opened 8 months ago by

New activity in deepseek-ai/DeepSeek-R1-0528 8 months ago

刚部署满血deepseek r1 0528版本，推理性能提升这么多嘛？不是架构没变嘛？

#75 opened 8 months ago by

How to run 0528version on GPU which don't support FP8

#64 opened 8 months ago by

这个问题大家的输出是什么？

#49 opened 8 months ago by

New activity in unsloth/DeepSeek-R1-GGUF 10 months ago

Share a mmlu test result,I use 2.51bit,and compare with ds api, baidu's ds,it seems 2.51bit is very smart at least in mmlu

#42 opened 11 months ago by

New activity in deepseek-ai/DeepSeek-R1 10 months ago

Does R1 support long context (> 4K)?

#172 opened 11 months ago by

New activity in nvidia/DeepSeek-R1-NVFP4 10 months ago

can this model run on Hopper GPU

#8 opened 11 months ago by

can this model run on A800 ?

#10 opened 11 months ago by

Why not use FP2 or IQ2 as kTransformers does?

#11 opened 11 months ago by

New activity in deepseek-ai/DeepSeek-R1 11 months ago

Deploying production ready service with Unsloth GGUF quants on your AWS account. (4 x L40S)

#171 opened 11 months ago by

samagra-tensorfuse

New activity in deepseek-ai/DeepSeek-R1 12 months ago

90+ tokens per second for MI300x8 using batch_size = 1

#166 opened 12 months ago by

New activity in unsloth/DeepSeek-R1-GGUF 12 months ago

Q2_K_XL 好还是 Q4好呢

#34 opened 12 months ago by