WHChoi's picture

6 30

WHChoi

AbdulaHassan

·

AI & ML interests

None yet

Recent Activity

liked a model 13 days ago

GSAI-ML/LLaDA-8B-Instruct

upvoted an article 15 days ago

What is test-time compute and how to scale it?

liked a model 22 days ago

perplexity-ai/r1-1776

View all activity

Organizations

AbdulaHassan's activity

upvoted an article 15 days ago

Article

What is test-time compute and how to scale it?

By

and 1 other •

Feb 6

• 54

upvoted a collection 6 months ago

Llama 3.1 GPTQ, AWQ, and BNB Quants

Optimised Quants for high-throughput deployments! Compatible with Transformers, TGI & VLLM 🤗 • 9 items • Updated Sep 26, 2024 • 56

upvoted 3 collections 7 months ago

FP8

20 items • Updated Nov 3, 2024 • 1

FP8 LLMs for vLLM

Accurate FP8 quantized models by Neural Magic, ready for use with vLLM! • 44 items • Updated Oct 17, 2024 • 67

INT8 LLMs for vLLM

Accurate INT8 quantized models by Neural Magic, ready for use with vLLM! • 50 items • Updated Sep 26, 2024 • 15