
AmpereComputing/llama-4-scout-16e-17b-instruct-gguf
108B
•
Updated
•
39
AI inference, AI in the cloud, AI on edge, software acceleration of AI workloads on hardware, efficient AI deployments, GPU-Free AI inference, AI model optimization.