Submitted by akhaliq 20 SQL-PaLM: Improved Large Language ModelAdaptation for Text-to-SQL · 7 authors 3
Submitted by akhaliq 14 SnapFusion: Text-to-Image Diffusion Model on Mobile Devices within Two Seconds · 9 authors 13
Submitted by akhaliq 6 Bytes Are All You Need: Transformers Operating Directly On File Bytes · 4 authors
Submitted by akhaliq 5 Make-Your-Video: Customized Video Generation Using Textual and Structural Guidance · 12 authors 1
Submitted by akhaliq 4 StableRep: Synthetic Images from Text-to-Image Models Make Strong Visual Representation Learners · 5 authors 1
Submitted by akhaliq 4 ViCo: Detail-Preserving Visual Condition for Personalized Text-to-Image Generation · 4 authors
Submitted by akhaliq 4 MERT: Acoustic Music Understanding Model with Large-Scale Self-supervised Training · 18 authors
Submitted by akhaliq 2 CodeTF: One-stop Transformer Library for State-of-the-art Code LLM · 6 authors
Submitted by akhaliq 1 The ObjectFolder Benchmark: Multisensory Learning with Neural and Real Objects · 8 authors
Submitted by akhaliq 1 Cocktail: Mixing Multi-Modality Controls for Text-Conditional Image Generation · 7 authors