-
Meta-Learning a Dynamical Language Model
Paper • 1803.10631 • Published
-
TLDR: Token Loss Dynamic Reweighting for Reducing Repetitive Utterance Generation
Paper • 2003.11963 • Published
-
BigScience: A Case Study in the Social Construction of a Multilingual Large Language Model
Paper • 2212.04960 • Published • 1
-
Continuous Learning in a Hierarchical Multiscale Neural Network
Paper • 1805.05758 • Published • 1
Collections
Collections including paper arxiv:2409.02813
-
3
Multimodal Clembench
Explore multimodal models: leaderboards, plots, trends, and versions
-
81
SEED-Bench Leaderboard
-
MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI
Paper • 2311.16502 • Published • 35
-
MMMU-Pro: A More Robust Multi-discipline Multimodal Understanding Benchmark
Paper • 2409.02813 • Published • 29
-
Multimodal Self-Instruct: Synthetic Abstract Image and Visual Reasoning Instruction Using Language Model
Paper • 2407.07053 • Published • 43
-
LMMs-Eval: Reality Check on the Evaluation of Large Multimodal Models
Paper • 2407.12772 • Published • 34
-
VLMEvalKit: An Open-Source Toolkit for Evaluating Large Multi-Modality Models
Paper • 2407.11691 • Published • 14
-
MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models
Paper • 2408.02718 • Published • 61
-
RLHF Workflow: From Reward Modeling to Online RLHF
Paper • 2405.07863 • Published • 67
-
Chameleon: Mixed-Modal Early-Fusion Foundation Models
Paper • 2405.09818 • Published • 130
-
Meteor: Mamba-based Traversal of Rationale for Large Language and Vision Models
Paper • 2405.15574 • Published • 53
-
An Introduction to Vision-Language Modeling
Paper • 2405.17247 • Published • 87
-
iVideoGPT: Interactive VideoGPTs are Scalable World Models
Paper • 2405.15223 • Published • 13
-
Meteor: Mamba-based Traversal of Rationale for Large Language and Vision Models
Paper • 2405.15574 • Published • 53
-
An Introduction to Vision-Language Modeling
Paper • 2405.17247 • Published • 87
-
Matryoshka Multimodal Models
Paper • 2405.17430 • Published • 31
-
MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training
Paper • 2403.09611 • Published • 126
-
Evolutionary Optimization of Model Merging Recipes
Paper • 2403.13187 • Published • 51
-
MobileVLM V2: Faster and Stronger Baseline for Vision Language Model
Paper • 2402.03766 • Published • 14
-
LLM Agent Operating System
Paper • 2403.16971 • Published • 65