Hugging Face
Simon Mo
simon-mo
19 followers · 10 following
https://github.com/simon-mo
simon_mo_
simon-mo
AI & ML interests
System for ML
Recent Activity
New activity 27 days ago in openai/gpt-oss-120b:
[v1 engine][flash_attn backend] TypeError: flash_attn_varlen_func() got an unexpected keyword argument 's_aux' when running gpt-oss-120b on H200
New activity 27 days ago in openai/gpt-oss-120b:
VLLM - Flash-attn 3
Reacted to erikkaum's post with 🤗 about 2 months ago
We just released native support for @SGLang and @vllm-project in Inference Endpoints 🔥 Inference Endpoints is becoming the central place where you deploy high performance Inference Engines. And that provides the managed infra for it. Instead of spending weeks configuring infrastructure, managing servers, and debugging deployment issues, you can focus on what matters most: your AI model and your users
Organizations
simon-mo's activity
New activity in
openai/gpt-oss-120b
27 days ago
[v1 engine][flash_attn backend] TypeError: flash_attn_varlen_func() got an unexpected keyword argument 's_aux' when running gpt-oss-120b on H200
7
13
#41 opened 27 days ago by
RekklesAI
VLLM - Flash-attn 3
12
#23 opened 27 days ago by
chriswritescode
New activity in
Qwen/Qwen3-0.6B-FP8
4 months ago
Remove vLLM FP8 Limitation
#3 opened 4 months ago by
simon-mo
New activity in
Qwen/Qwen3-1.7B-FP8
4 months ago
Remove vLLM FP8 Limitation
#2 opened 4 months ago by
simon-mo
New activity in
Qwen/Qwen3-4B-FP8
4 months ago
Remove vLLM FP8 Limitation
#2 opened 4 months ago by
simon-mo
New activity in
Qwen/Qwen3-8B-FP8
4 months ago
Remove vLLM FP8 Limitation
#2 opened 4 months ago by
simon-mo
New activity in
Qwen/Qwen3-14B-FP8
4 months ago
Remove vLLM FP8 Limitation
#2 opened 4 months ago by
simon-mo
New activity in
Qwen/Qwen3-32B-FP8
4 months ago
Remove vLLM FP8 Limitation
🔥 1
#3 opened 4 months ago by
simon-mo
New activity in
Qwen/Qwen3-30B-A3B-FP8
4 months ago
Remove vLLM FP8 Limitation
10
#2 opened 4 months ago by
simon-mo
New activity in
Qwen/Qwen3-235B-A22B-FP8
4 months ago
Remove vLLM FP8 Limitation
#2 opened 4 months ago by
simon-mo
New activity in
meta-llama/Llama-3.1-405B-Instruct-FP8
about 1 year ago
Update README.md
#1 opened about 1 year ago by
simon-mo