326 394 626

Yatharth Sharma

YaTharThShaRma999

AI & ML interests

None yet

Recent Activity

liked a model 5 days ago

YatharthS/LinaCodec

updated a model 8 days ago

YaTharThShaRma999/ncodec

updated a model 18 days ago

YaTharThShaRma999/miratts_finetune

View all activity

Organizations

liked a model 5 days ago

YatharthS/LinaCodec

Audio-to-Audio • Updated 5 days ago • 134 • 17

updated a model 8 days ago

YaTharThShaRma999/ncodec

Updated 8 days ago

updated a model 18 days ago

YaTharThShaRma999/miratts_finetune

Text Generation • 0.5B • Updated 18 days ago • 12

published a model 18 days ago

YaTharThShaRma999/miratts_finetune

Text Generation • 0.5B • Updated 18 days ago • 12

reacted to YatharthS's post with 🚀🔥 20 days ago

Post

3601

🤯 🤯 Released a high quality finetuned LLM based TTS model that can generate realistic and clear 48khz audio at over 100x realtime speed! 🤯 🤯

Github link: https://github.com/ysharma3501/MiraTTS

Model link: https://github.com/ysharma3501/MiraTTS

Blog explaining llm tts models: https://huggingface.co/blog/YatharthS/llm-tts-models

4 replies

upvoted an article 21 days ago

Article

LLM based Audio models

21 days ago

•

liked a model 21 days ago

YatharthS/MiraTTS

Text-to-Speech • 0.5B • Updated 15 days ago • 6.64k • 175

updated a model 22 days ago

YaTharThShaRma999/vocos_lib2

Updated 22 days ago

reacted to YatharthS's post with 🚀🔥 25 days ago

Post

2829

I just released LayaCodec, a highly efficient neural audio tokenizer/codec for TTS models, far better than most previous audio tokenizers.

🤯 Next-gen TTS models that use this could achieve several 100s of times real-time speed while producing clearer audio!! 🤯

GitHub repo: https://github.com/ysharma3501/LayaCodec
Model: YatharthS/LayaCodec

New activity in YatharthS/LayaCodec 26 days ago

Update README.md

#1 opened 26 days ago by

YaTharThShaRma999

liked a model 26 days ago

YatharthS/LayaCodec

Audio-to-Audio • Updated 25 days ago • 196 • 12

liked a Space 28 days ago

GLM ASR Nano

🌍

A space to transcribe audio files with the new sota GLM-ASR

reacted to Jofthomas's post with 🔥 about 1 month ago

Post

3563

The new Mistral 3 models are here !

Today, we announce Mistral 3, the next generation of Mistral models. Mistral 3 includes three state-of-the-art small, dense models (14B, 8B, and 3B) and Mistral Large 3 – our most capable model to date – a sparse mixture-of-experts trained with 41B active and 675B total parameters.

All models are released under the Apache 2.0 license.

Ministrals :
https://huggingface.co/collections/mistralai/ministral-3

Mistral Large 3:
https://huggingface.co/collections/mistralai/mistral-large-3

2 replies

liked a model about 1 month ago

Tongyi-MAI/Z-Image-Turbo

Text-to-Image • Updated 1 day ago • 363k • • 3.65k

New activity in Tongyi-MAI/Z-Image-Turbo about 1 month ago

Issue is that ZImagePipeline is not in the standard diffusers package

#2 opened about 1 month ago by

PierrunoYT

reacted to YatharthS's post with 🔥 about 2 months ago

Post

1683

Just uploaded a detailed blog about my findings in optimizing NeuTTS to generate 200 seconds of audio in a single second. Also went in depth in NeuTTS’s architecture. Will be happy to answer any questions.

https://huggingface.co/blog/YatharthS/making-neutts-200x-realtime

upvoted an article about 2 months ago

Article

How to make NeuTTS-air generate over 200 seconds of audio in a single second.

Nov 21, 2025

•

reacted to YatharthS's post with 🔥 about 2 months ago

Post

1404

Just released a heavily optimized library for NeuTTS. It's over 200x realtime meaning it can generate over 200 seconds of audio in a single second using batching and supports voice cloning!!🤯🤯

Link: https://github.com/ysharma3501/FastNeuTTS

Yatharth Sharma

AI & ML interests

Recent Activity

Organizations

YaTharThShaRma999's activity

LLM based Audio models

Update README.md

GLM ASR Nano

Issue is that ZImagePipeline is not in the standard diffusers package

How to make NeuTTS-air generate over 200 seconds of audio in a single second.