3 7 10

Kamal Ali

Kamali-Lab

AI & ML interests

None yet

Recent Activity

new activity about 17 hours ago

cerebras/DeepSeek-V3.2-REAP-508B-A37B:FP8 versions of DeepSeek-V3.2 would awesome!

upvoted a collection about 1 month ago

LLaDA 2.0

liked a model about 1 month ago

inclusionAI/LLaDA2.0-flash

View all activity

Organizations

New activity in cerebras/DeepSeek-V3.2-REAP-508B-A37B about 17 hours ago

FP8 versions of DeepSeek-V3.2 would awesome!

#1 opened 12 days ago by

Fernanda24

upvoted a collection about 1 month ago

LLaDA 2.0

Collection

7 items • Updated 4 days ago • 39

liked a model about 1 month ago

inclusionAI/LLaDA2.0-flash

Text Generation • 103B • Updated 10 days ago • 361 • 58

upvoted 2 papers about 1 month ago

Souper-Model: How Simple Arithmetic Unlocks State-of-the-Art LLM Performance

Paper • 2511.13254 • Published Nov 17 • 136

TiDAR: Think in Diffusion, Talk in Autoregression

Paper • 2511.08923 • Published Nov 12 • 117

liked 2 datasets about 2 months ago

Open-Bee/Honey-Data-15M

Viewer • Updated Nov 5 • 14.8M • 37.3k • 103

sequelbox/Raiden-DeepSeek-R1

Viewer • Updated Mar 12 • 62.9k • 286 • 47

liked a Space about 2 months ago

The Smol Training Playbook

📚

2.71k

The secrets to building world-class LLMs

liked a model about 2 months ago

KaraKaraWitch/GoldDiamondGold-L33-70b

Text Generation • 71B • Updated Oct 20 • 90 • 4

New activity in 12bitmisfit/OpenAI_GPT-OSS-120B_Pruned_REAP_58B-GGUF about 2 months ago

Q8 Quant

#1 opened about 2 months ago by

Kamali-Lab

upvoted a collection 2 months ago

Cerebras REAP

Collection

Sparse MoE models compressed using REAP (Router-weighted Expert Activation Pruning) method • 19 items • Updated 9 days ago • 70

liked a model 2 months ago

cerebras/GLM-4.5-Air-REAP-82B-A12B

Text Generation • 82B • Updated Oct 21 • 9.22k • 103

New activity in cerebras/GLM-4.5-Air-REAP-82B-A12B 2 months ago

Fixed Incorrect Parameter Count in README.md

#2 opened 2 months ago by

Kamali-Lab

liked a model 4 months ago

LatitudeGames/Wayfarer-2-12B

Text Generation • 12B • Updated Sep 3 • 125 • 60

liked 3 models 5 months ago

upvoted an article 6 months ago

Article

All LLMs Will Be Sparse BitNet Hybrids

May 14

•

upvoted a paper 6 months ago

Locality-aware Parallel Decoding for Efficient Autoregressive Image Generation

Paper • 2507.01957 • Published Jul 2 • 21

upvoted an article 8 months ago

Article

Bamba-9B-v2 - Fast and powerful!

Apr 29

•

Kamal Ali

AI & ML interests

Recent Activity

Organizations

Kamali-Lab's activity

FP8 versions of DeepSeek-V3.2 would awesome!

The Smol Training Playbook

Q8 Quant

Fixed Incorrect Parameter Count in README.md

All LLMs Will Be Sparse BitNet Hybrids

Bamba-9B-v2 - Fast and powerful!