Matricardi Fabio
FM-1976
AI & ML interests
Control system engineering, AI, and LLMs with Python. ThePoorGPUguy on Substack.
Recent Activity
- liked a model about 4 hours ago: dalle-mini/dalle-mini
- liked a Space about 4 hours ago: dalle-mini/dalle-mini
- liked a model about 5 hours ago: calcuis/lumina-gguf
Organizations
None yet
PAPERS
- Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference
  Paper • 2412.13663 • Published • 154
- A Survey of Small Language Models
  Paper • 2410.20011 • Published • 45
- No More Adam: Learning Rate Scaling at Initialization is All You Need
  Paper • 2412.11768 • Published • 44
- Chain of Draft: Thinking Faster by Writing Less
  Paper • 2502.18600 • Published • 50
SMALL-TINY
A Collection of Small native Models
- vicgalle/gpt2-alpaca-gpt4
  Text Generation • 0.1B • Updated • 1.73k • 23
- andreaskoepf/pythia-1.4b-gpt4all-pretrain
  Text Generation • Updated • 20 • 7
- EleutherAI/pythia-1b
  Text Generation • 1B • Updated • 22.2k • 41
- EleutherAI/pythia-410m-deduped
  Text Generation • 0.5B • Updated • 16k • 20
Image Creation
Good and working HF spaces to create images with Diffusion models
- Stable Diffusion 3.5 Large
  Running on Zero • 1.9k
  Generate images with SD3.5
- FLUX.1 [dev]
  Running on Zero • 9.04k
  Generate images from text prompts
- FLUX.1 [Schnell]
  Running on Zero • 4.96k
  Generate images from text prompts
- DALLE 3 XL v2
  Running on Zero • 1.78k
  Generate images from text prompts
Playgrounds
GRADIO examples
- Whisper Realtime Transcription (Gradio UI)
  Runtime error • 4
  Transcribe audio in realtime - Gradio UI version
- DeepSeek R1 Distill Qwen 1.5B Demo Q8
  Running • 7
  DeepSeek R1 Distill Qwen 1.5B Demo GGUF(Q8) Fully in CPU
- Chain of Draft: Thinking Faster by Writing Less
  Paper • 2502.18600 • Published • 50
- Llama-4-Maverick-17B Research
  Running • 88
  Llama-4-Maverick-17B + Real Time Deep Research