bartowski/TheDrummer_Fallen-Llama-3.3-R1-70B-v1-GGUF Text Generation • Updated 12 days ago • 8.62k • 4
mradermacher/DeepHermes-3-Llama-3-8B-Preview-Uncensored-DeLMAT-i1-GGUF Updated 22 days ago • 2.29k • 2
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • Updated 1 day ago • 441k • 1.12k
Running • 2.24k • The Ultra-Scale Playbook • The ultimate guide to training LLMs on large GPU clusters
Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention Paper • 2502.11089 • Published 26 days ago • 142