Small LMs
- š
MonadGPT
š¬Mistral-7B
š»Voice Chat With Mistral 7B
šŖQwen VL
ā”ChatGLM 6B
šKoboldcpp Tiefighter
š¶Tinyllama Chat
šStable LM 2 Zephyr 1.6b
ā”MoE LLaVA
šChat with DeepSeek Coder 7B
š¬Llama 2 13b Chat
š¦LLaVA
š„Video LLaVA
šLlava
š¢LLaVA 1.6
šGradio Notebook Local Model
šBlind Chat
šWeb-LLM: Mistral 7B OpenOrca
š7B text-generation model running directly from the browser
[NSFW] C0ffee's Erotic Story Generator 2
šWhisper Chess
šPlay chess using voice commands
LLaMA Board
š¦Fine-tuning large language model with Gradio UI
Ratchet + Phi Locally
šRun Phi-3 in Browser
Ratchet + Whisper Locally
š£Run Whisper in Browser
- 4
Noosphere Webui on Cpu
š®Clone and set up stable-diffusion-webui and extensions
- 16
epicPhotoGASM Webui on Cpu
šSet up a stable-diffusion-webui with extensions and models
Experimental Phi3 Webgpu
šNeverSleep/Llama-3-Lumimaid-8B-v0.1
Text Generation ā¢ Updated ā¢ 1.52k ā¢ 81gradientai/Llama-3-8B-Instruct-Gradient-4194k
Text Generation ā¢ Updated ā¢ 265 ā¢ 70tiiuae/falcon-11B
Text Generation ā¢ Updated ā¢ 34.1k ā¢ 212- 14
Text-Streaming
štext streaming space using Gemma-7B
GemmaOnDevice
šGenerate responses using LLM on-device
- 4.34k
OpenGPT 4o
š„GPT 4o like bot.
PaliGemma Demo
š¤²Phi-3 WebGPU
šA private and powerful AI that runs locally in your browser
Mistral-7B-v0.3 Fast Chat
šFast chatting with Mistral v0.3
YOLOv10 Web
šDetect objects in uploaded images
WebGPU Nomic Embed
šClassify images in real-time using zero-shot classification
WebGPU Chat Qwen2
šGenerate text using Qwen2 model
- 1
GLiNER HandyLab
ā” Kosmos 2
š»- 7
Text Gen Playground
š«Chat with any model on the Hub
Gemini Nano (Chrome Built-in)
šRun Gemini Nano locally in your browser with Transformers.js
- 1
LLaVA WebGPU
šA private and powerful multimodal AI chatbot that runs local
Candle T5 Generation Wasm
šÆGenerate text using various T5 models
- 61
MInference
šGenerate text responses to user queries
SmolLM 360M Instruct WebGPU
šA blazingly fast and powerful AI chatbot that runs locally.
- 6
SmolLM 135M Instruct WebGPU
šA blazingly fast and powerful AI chatbot that runs locally.
- 78
Chameleon 30b
š„Generate descriptions for images using text prompts
- 5
Nymbot Lite
āØVision Chatbot with ImgGen & Web Search - Runs on CPU
- 3
Llama-3.1-8B-Instruct
š¦The best 8B model with 128K context
ollama-Chat
šChat with Ollama
- 4
Llama CSV Agent
š¤Need to analyze data? Let a Llama-3.1 agent do it for you!
- 1
MagicPrompt Stable Diffusion
š» WebLLM JSON Playground
šGenerate JSON output from prompts using LLMs
Webllm Simple Chat
š¬Chat with an AI assistant directly in your browser
- 79
Gemma 2 2B IT
š»Chatbot
- 1
Cohere Command R+ inference
āØc4ai-command-r-plus (hub inference, not API)
Phi-3-Mini-4k-Instruct
šPhi-3-Mini on hub inference
- 1
Yi-1.5-34B-Chat
š¼Yi-1.5-34B on hub inference
- 1
Mistral-7B-Instruct-v0.3
āØSOTA Small Model by Mistral AI
- 65
Falcon Mamba Playground
šGenerate chat responses using FalconMamba-7b model
MiniCPM-V-2 6
š¬Instant SmolLM
š¤Run SmolLM-360M-Instruct in realtime with MLC WebLLM
- 160
LongWriter
š¬LLM for long context
- 15
Phi-3.5-Mini-Instruct
šNew SOTA small model from Microsoft, and multilingual!
- 5
Inference Playground
š¤One-stop-shop for frequently used models
- 232
HF's Missing Inference Widget
š»Generate text responses using different models
1-Shot LLM Playground
š»Single-shot inference for rapid model testing
- 1
Phi-3.5-Mini WebLLM
ā”Engage in fast, local chat using WebLLM
- 214
Phi 3.5 Vision
š„Generate text from an image and question
Qwen2-VL-2B
š¤©Multilingual, Multimodal, Mighty 2B
Kotaemon
šDataset Rewriter
š- 6
Reflection 70B llama.cpp
š¢Reflection-70B by Matt Schumer
- 3
Joy Caption Alpha One
ā” Llama-3.2-3B-Instruct
š¦New SOTA small model from Meta
- 4
Llama-3.2-1B-Instruct
š¦the new tiny king
- 5
HTML To Markdown
šConvert HTML to Markdown with readerlm-1.5B
- 384
Llama-Vision-11B
šChat about images by uploading them and typing questions
Qwen-2.5 WebLLM
ā”Chat with a local language model in your browser
- 2
Llama-3.2 WebLLM
š¦Chat with a language model directly in your browser
- 106
Molmo 7B D 0924
š Emu3
šLlama 3.2 WebGPU
š¦A powerful AI chatbot that runs locally in your browser
- 3
WebLLM Playground
š - 9
Nemotron-Mini
šNemoAligner Synthetic SFT with function calling
Zamba2 7B
šMiniSearch
šMinimalist web-searching app with browser-based AI assistant
Janus Space Clone Me First
šGenerate images from text prompts
Qwen 2.5 Code Interpreter
šExecute code snippets and get results
- 236
Aya Expanse
šInteract with Aya Expanse to chat, speak, and generate images in 23 languages
Wllama
š¦Run GGUF directly on your browser!
- 14
SmolLM2-1.7B-Instruct Serverless
š¤New SOTA smol king by Hugging Face
BitNet.cpp
š»- 197
JanusFlow 1.3B
šHuggingface space for JanusFlow-1.3B
JanusFlow 1.3B
šText Gen | Vision | Image Gen | One 1.3b model
- 2
Ai Scraper
šScrape and summarize web content using AI
SmolVLM
šJanus 1.3B WebGPU
šIn-browser unified multimodal understanding and generation.
Omnivlm Dpo Demo
šGithub Issue Generator
š§Generate structured GitHub issues
- 207
ShowUI
š»Generate clickable coordinates on a screenshot
Text-to-Speech WebGPU
š£WebGPU text-to-Speech powered by OuteTTS and Transformers.js
- 8
Falcon3 Mamba 7b Instruct Playground
šChat with Falcon3-Mamba-7B-Instruct AI assistant
- 32
Falcon3 Demo
š¦F3-DEMO
SmallThinker Demo
š¬Llama 3.2 Reasoning WebGPU
š§Small and powerful reasoning LLM that runs in your browser
DeepSeek-R1 WebGPU
š§Next-generation reasoning model that runs locally in-browser
SmolVLM 500M Instruct WebGPU
š»Find text in images quickly
- 1
SmolVLM 256M Instruct WebGPU
šØUpload images to generate image captions
- 3
SmolVLM
šGenerate text descriptions from images and queries
Markdown Studio
ā”Convert HTML to Markdown/JSON, Markdown Live Preview
- 1.69k
Chat With Janus-Pro-7B
šA unified multimodal understanding and generation model.