Literate Goggles

literate-goggles

AI & ML interests

None yet

Organizations

None yet

literate-goggles's activity

upvoted a paper 2 days ago

Scaling Rich Style-Prompted Text-to-Speech Datasets

Paper • 2503.04713 • Published 16 days ago • 1

upvoted 2 papers 6 days ago

Transformers without Normalization

Paper • 2503.10622 • Published 9 days ago • 129

WildIFEval: Instruction Following in the Wild

Paper • 2503.06573 • Published 13 days ago • 11

upvoted a paper 12 days ago

Unified Reward Model for Multimodal Understanding and Generation

Paper • 2503.05236 • Published 16 days ago • 107

upvoted a paper 16 days ago

OmniAlign-V: Towards Enhanced Alignment of MLLMs with Human Preference

Paper • 2502.18411 • Published 25 days ago • 69

upvoted an article 29 days ago

SigLIP 2: A better multilingual vision language encoder

Article • Published 30 days ago • 142

upvoted 2 papers 29 days ago

MLGym: A New Framework and Benchmark for Advancing AI Research Agents

Paper • 2502.14499 • Published about 1 month ago • 183

Meta Audiobox Aesthetics: Unified Automatic Quality Assessment for Speech, Music, and Sound

Paper • 2502.05139 • Published Feb 7 • 1

upvoted 11 papers about 1 month ago

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

Paper • 2502.11089 • Published Feb 16 • 145

Region-Adaptive Sampling for Diffusion Transformers

Paper • 2502.10389 • Published Feb 14 • 52

Language Models Use Trigonometry to Do Addition

Paper • 2502.00873 • Published Feb 2 • 1

InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU

Paper • 2502.08910 • Published Feb 13 • 143

Logical Reasoning in Large Language Models: A Survey

Paper • 2502.09100 • Published Feb 13 • 22

XTTS: a Massively Multilingual Zero-Shot Text-to-Speech Model

Paper • 2406.04904 • Published Jun 7, 2024 • 9

IndexTTS: An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

Paper • 2502.05512 • Published Feb 8 • 2

Llasa: Scaling Train-Time and Inference-Time Compute for Llama-based Speech Synthesis

Paper • 2502.04128 • Published Feb 6 • 25

upvoted an article about 1 month ago

Open-source DeepResearch – Freeing our search agents

Article • Published Feb 4 • 1.18k