Anurag's picture

Anurag

edwixx

·

https://anuragkanade.com/

AI & ML interests

TTS , ASR

Recent Activity

reacted to AdinaY's post with 🔥 about 18 hours ago

Dolphin 🐬 an open ASR model released by DataOceanAI, one of the biggest AI data provider in China 🔥 ✨ Supports 40 Eastern languages & 22 Chinese dialects ✨ Apache2.0 ✨ With 21.2M hours of data (7.4M open data) Model: https://huggingface.co/DataoceanAI/dolphin-base https://huggingface.co/DataoceanAI/dolphin-small Paper: https://huggingface.co/papers/2503.20212

liked a model 16 days ago

Skywork/Skywork-R1V-38B

liked a model 16 days ago

ASLP-lab/DiffRhythm-full

View all activity

Organizations

edwixx's activity

upvoted a paper 22 days ago

YuE: Scaling Open Foundation Models for Long-Form Music Generation

Paper • 2503.08638 • Published 23 days ago • 60

upvoted a paper about 1 month ago

SpargeAttn: Accurate Sparse Attention Accelerating Any Model Inference

Paper • 2502.18137 • Published Feb 25 • 54

upvoted an article about 2 months ago

Article

Build awesome datasets for video generation

Feb 12

• 30

upvoted a paper 3 months ago

Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis

Paper • 2412.15322 • Published Dec 19, 2024 • 18

upvoted 2 collections 5 months ago

🎨 Image models

10 items • Updated 6 days ago • 2

BhasaAnuvaad

A Speech Translation Dataset for 13 Indian Languages • 11 items • Updated Jan 16 • 16

upvoted a paper 5 months ago

SongCreator: Lyrics-based Universal Song Generation

Paper • 2409.06029 • Published Sep 9, 2024 • 22

upvoted an article 8 months ago

Article

Introduction to ggml

Aug 13, 2024

• 176