view post Post 2527 OpenAI is now open again! Check out OpenAI’s brand new gpt‑oss‑20b model hosted on ZeroGPU 🤗 merterbak/gpt-oss-20b-demo See translation
view post Post 2837 Qwen 3 technical report released🚀Report: https://github.com/QwenLM/Qwen3/blob/main/Qwen3_Technical_Report.pdf See translation
Qwen 3 Alibaba's Qwen 3 models Qwen/Qwen3-0.6B Text Generation • 0.8B • Updated Jul 26 • 3.84M • • 582 Qwen/Qwen3-1.7B Text Generation • 2B • Updated Jul 26 • 1.1M • • 242 Qwen/Qwen3-4B Text Generation • 4B • Updated Jul 26 • 1.39M • • 373 Qwen/Qwen3-8B Text Generation • 8B • Updated Jul 26 • 4.28M • • 575
LLM's google/gemma-7b Text Generation • 9B • Updated Jun 27, 2024 • 38.5k • 3.19k meta-llama/Llama-2-7b-chat-hf Text Generation • 7B • Updated Apr 17, 2024 • 985k • 4.56k mistralai/Mixtral-8x7B-Instruct-v0.1 47B • Updated Jul 24 • 311k • 4.54k tiiuae/falcon-7b Text Generation • 7B • Updated Oct 12, 2024 • 71.9k • 1.09k
Qwen 3 Alibaba's Qwen 3 models Qwen/Qwen3-0.6B Text Generation • 0.8B • Updated Jul 26 • 3.84M • • 582 Qwen/Qwen3-1.7B Text Generation • 2B • Updated Jul 26 • 1.1M • • 242 Qwen/Qwen3-4B Text Generation • 4B • Updated Jul 26 • 1.39M • • 373 Qwen/Qwen3-8B Text Generation • 8B • Updated Jul 26 • 4.28M • • 575
LLM's google/gemma-7b Text Generation • 9B • Updated Jun 27, 2024 • 38.5k • 3.19k meta-llama/Llama-2-7b-chat-hf Text Generation • 7B • Updated Apr 17, 2024 • 985k • 4.56k mistralai/Mixtral-8x7B-Instruct-v0.1 47B • Updated Jul 24 • 311k • 4.54k tiiuae/falcon-7b Text Generation • 7B • Updated Oct 12, 2024 • 71.9k • 1.09k
Running on Zero 5 Seed Coder 8B Instruct 🚀 ByteDance Seed's coding focused Seed-Coder-8B-Instruct model