-
Video Creation by Demonstration
Paper β’ 2412.09551 β’ Published β’ 9 -
DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation
Paper β’ 2412.07589 β’ Published β’ 47 -
Unraveling the Complexity of Memory in RL Agents: an Approach for Classification and Evaluation
Paper β’ 2412.06531 β’ Published β’ 71 -
APOLLO: SGD-like Memory, AdamW-level Performance
Paper β’ 2412.05270 β’ Published β’ 38
Collections
Discover the best community collections!
Collections including paper arxiv:2403.05185
-
User-LLM: Efficient LLM Contextualization with User Embeddings
Paper β’ 2402.13598 β’ Published β’ 20 -
Personalized Audiobook Recommendations at Spotify Through Graph Neural Networks
Paper β’ 2403.05185 β’ Published β’ 25 -
SPAR: Personalized Content-Based Recommendation via Long Engagement Attention
Paper β’ 2402.10555 β’ Published β’ 35
-
VideoSwap: Customized Video Subject Swapping with Interactive Semantic Point Correspondence
Paper β’ 2312.02087 β’ Published β’ 23 -
FaceStudio: Put Your Face Everywhere in Seconds
Paper β’ 2312.02663 β’ Published β’ 33 -
Orthogonal Adaptation for Modular Customization of Diffusion Models
Paper β’ 2312.02432 β’ Published β’ 15 -
ReconFusion: 3D Reconstruction with Diffusion Priors
Paper β’ 2312.02981 β’ Published β’ 11
-
NExT-GPT: Any-to-Any Multimodal LLM
Paper β’ 2309.05519 β’ Published β’ 78 -
Large Language Model for Science: A Study on P vs. NP
Paper β’ 2309.05689 β’ Published β’ 21 -
AstroLLaMA: Towards Specialized Foundation Models in Astronomy
Paper β’ 2309.06126 β’ Published β’ 17 -
Large Language Models for Compiler Optimization
Paper β’ 2309.07062 β’ Published β’ 23