PRIMA.CPP: Speeding Up 70B-Scale LLM Inference on Low-Resource Everyday Home Clusters Paper • 2504.08791 • Published Apr 7, 2025 • 137
view post Post 2789 I like training LoRAshttps://huggingface.co/blog/nroggendorff/create-diffusers-dataset 🔥 6 6 👍 5 5 😔 3 3 + Reply
Running on Zero Featured 143 Gemma 2 llama.cpp 2B/9B/27B 😻 143 Chat with a language model using text input
legraphista/dolphin-2.9.1-llama-3-70b-IMat-GGUF Text Generation • 71B • Updated May 27, 2024 • 765 • 2