RachidAR

RachidARx

AI & ML interests

1.58 bit LLM

RachidAR's activity

liked a model 1 day ago

google/gemma-3-12b-it

Image-Text-to-Text • Updated 2 days ago • 7.94k • 152

upvoted a collection 1 day ago

Gemma 3 Release

Collection

9 items • Updated about 7 hours ago • 221

liked 2 models 1 day ago

google/gemma-3-27b-it

Image-Text-to-Text • Updated 2 days ago • 38.5k • 504

ggml-org/gemma-3-12b-it-GGUF

Updated 2 days ago • 4.07k • 11

liked a model 4 days ago

Wan-AI/Wan2.1-T2V-14B

Text-to-Video • Updated 2 days ago • 207k • 1.02k

liked a model 5 days ago

amd/Instella-3B-Instruct

Text Generation • Updated 7 days ago • 1.18k • 34

liked a model 15 days ago

microsoft/Phi-4-multimodal-instruct

Automatic Speech Recognition • Updated 1 day ago • 441k • 1.13k

liked a Space 19 days ago

Whisper WebGPU

🎤 Convert spoken words to text • 196

liked a model 23 days ago

perplexity-ai/r1-1776

Text Generation • Updated 15 days ago • 55k • 2.12k

updated a collection 24 days ago

Ternary LLMs & Knowledge distillation & SOTA

Collection

11 items • Updated 24 days ago

upvoted a paper 24 days ago

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

Paper • 2502.11089 • Published 26 days ago • 142

upvoted a paper 29 days ago

Titans: Learning to Memorize at Test Time

Paper • 2501.00663 • Published Dec 31, 2024 • 21

liked a model 29 days ago

Mozilla/TriLM-llamafile

Text Generation • Updated Aug 26, 2024 • 583 • 19

upvoted a paper 29 days ago

BitNet: Scaling 1-bit Transformers for Large Language Models

Paper • 2310.11453 • Published Oct 17, 2023 • 97

updated a collection 29 days ago

Ternary LLMs & Knowledge distillation & SOTA

Collection

11 items • Updated 24 days ago

upvoted 2 papers 29 days ago

BitNet a4.8: 4-bit Activations for 1-bit LLMs

Paper • 2411.04965 • Published Nov 7, 2024 • 66

Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling

Paper • 2502.06703 • Published Feb 10 • 142

liked 2 models 29 days ago

allenai/OLMoE-1B-7B-0125

Text Generation • Updated Jan 23 • 1.84k • 20

allenai/OLMoE-1B-7B-0125-Instruct

Text Generation • Updated Feb 4 • 9.71k • 41

liked a model about 1 month ago

bartowski/Virtuoso-Small-GGUF

Text Generation • Updated Dec 3, 2024 • 1.27k • 12