How do I get the timestamps at word level for my generated audio ?
also curious about this issue π
Β· Sign up or log in to comment