-
VILA^2: VILA Augmented VILA
Paper • 2407.17453 • Published • 40 -
Octopus v4: Graph of language models
Paper • 2404.19296 • Published • 117 -
Octo-planner: On-device Language Model for Planner-Action Agents
Paper • 2406.18082 • Published • 48 -
Dolphin: Long Context as a New Modality for Energy-Efficient On-Device Language Models
Paper • 2408.15518 • Published • 43
Collections
Discover the best community collections!
Collections including paper arxiv:2405.15682
-
Replacing Judges with Juries: Evaluating LLM Generations with a Panel of Diverse Models
Paper • 2404.18796 • Published • 69 -
KAN: Kolmogorov-Arnold Networks
Paper • 2404.19756 • Published • 109 -
The Road Less Scheduled
Paper • 2405.15682 • Published • 23 -
Your Transformer is Secretly Linear
Paper • 2405.12250 • Published • 151
-
Rethinking Optimization and Architecture for Tiny Language Models
Paper • 2402.02791 • Published • 13 -
More Agents Is All You Need
Paper • 2402.05120 • Published • 53 -
Scaling Laws for Forgetting When Fine-Tuning Large Language Models
Paper • 2401.05605 • Published -
Aligning Large Language Models with Counterfactual DPO
Paper • 2401.09566 • Published • 2
-
Learning Vision from Models Rivals Learning Vision from Data
Paper • 2312.17742 • Published • 16 -
Unsupervised Universal Image Segmentation
Paper • 2312.17243 • Published • 20 -
Perspectives on the State and Future of Deep Learning -- 2023
Paper • 2312.09323 • Published • 6 -
Vision-Language Models as a Source of Rewards
Paper • 2312.09187 • Published • 12