LagPixelLOL
v2ray
AI & ML interests
Looking for compute sponsors; please contact me by email at [email protected]!
Recent Activity
- updated a model about 11 hours ago: v2ray/nai-lora-iewa
- published a model about 13 hours ago: v2ray/nai-lora-iewa
- new activity 1 day ago in cognitivecomputations/DeepSeek-R1-AWQ: "Can't get 48 TPS on 8x H800"
v2ray's activity
- Can't get 48 TPS on 8x H800 (1) · #21 opened 1 day ago by Light4Bear

- gpt-4chan Neo-J (1) · #1 opened 2 days ago by gman402
- Pipeline Parallelism (1) · #20 opened 2 days ago by leo98xh
- 8*A100 out of memory (1) · #19 opened 2 days ago by Jaren
- Requests get stuck when sending long prompts (already solved, but still don't know why) (1) · #18 opened 2 days ago by uv0xab
- Significant Speed Drop with Increasing Input Length on H800 GPUs (2) · #17 opened 3 days ago by wangkkk956
- Docker start with vLLM failed (official vLLM Docker image 0.7.3) (1) · #7 opened 3 days ago by kuliev-vitaly
- When I use vLLM v0.7.2 to deploy R1 AWQ, I get empty content (13) · #10 opened 11 days ago by bupalinyu
- Why "MLA is not supported with awq_marlin quantization. Disabling MLA." with 32x 4090 (4 nodes, vLLM 0.7.2) (3) · #14 opened 4 days ago by FightLLM
- When I run the command, it does not work (via vLLM 0.7.3) (2) · #16 opened 3 days ago by xueshuai
- Skips the thinking process (11) · #5 opened 16 days ago by muzizon
- Can anyone run this model with the SGLang framework? (2) · #13 opened 4 days ago by muziyongshixin
- Any threshold recommendations for this model? (3) · #1 opened 9 days ago by narugo

- GPTQ Support (2) · #1 opened about 2 months ago by warlock-edward
- vLLM support for A100 (17) · #2 opened about 1 month ago by HuggingLianWang
- Code used to convert this / could you do V3 base? (1) · #3 opened 30 days ago by deltanym

- What calibration dataset do you use when applying AWQ? (2) · #5 opened 11 days ago by HandH1998
- Deployment framework (27) · #2 opened about 1 month ago by xro7
- MLA is not supported with moe_wna16 quantization. Disabling MLA. (5) · #7 opened 12 days ago by AMOSE