TheStageAI/thewhisper-large-v3-turbo Automatic Speech Recognition • 0.8B • Updated about 4 hours ago • 8.64k • 21
TheStageAI/Elastic-whisper-large-v3-turbo Automatic Speech Recognition • Updated about 4 hours ago • 314 • 2
TheStageAI/Elastic-whisper-large-v3 Automatic Speech Recognition • Updated about 4 hours ago • 235 • 2
Post: We thought it would be easier, but we have finally integrated cuDNN Paged Attention into our models!
Read the article here: https://app.thestage.ai/blog/Integrating-cuDNN-Paged-Attention-to-TheStage-AI-Inference?id=8
Llama-8B with cuDNN paged attention, including B200 support: TheStageAI/Elastic-Llama-3.1-8B-Instruct
Mistral-Small-24B with cuDNN paged attention, including B200 support: TheStageAI/Elastic-Mistral-Small-3.1-24B-Instruct-2503
TheStageAI/Elastic-Mistral-Small-3.1-24B-Instruct-2503 Text Generation • Updated Jan 15 • 10 • 3