AudioX: Diffusion Transformer for Anything-to-Audio Generation Paper • 2503.10522 • Published 10 days ago • 17
RewardSDS: Aligning Score Distillation via Reward-Weighted Sampling Paper • 2503.09601 • Published 11 days ago • 14
Slamming: Training a Speech Language Model on One GPU in a Day Paper • 2502.15814 • Published Feb 19 • 66
Slam Collection All resources for SpeechLMs from "Slamming: Training a Speech Language Model on One GPU in a Day". We provide tokeniser, lm, and datasets • 6 items • Updated 27 days ago • 13
Slamming: Training a Speech Language Model on One GPU in a Day Paper • 2502.15814 • Published Feb 19 • 66
Can this Model Also Recognize Dogs? Zero-Shot Model Search from Weights Paper • 2502.09619 • Published Feb 13 • 31
Optimizing Large Language Model Training Using FP4 Quantization Paper • 2501.17116 • Published Jan 28 • 36
Unsupervised Speech Segmentation: A General Approach Using Speech Language Models Paper • 2501.03711 • Published Jan 7 • 1
Unsupervised Speech Segmentation: A General Approach Using Speech Language Models Paper • 2501.03711 • Published Jan 7 • 1