-
SaulLM-7B: A pioneering Large Language Model for Law
Paper • 2403.03883 • Published • 78 -
Character-LLM: A Trainable Agent for Role-Playing
Paper • 2310.10158 • Published • 1 -
LLM Agent Operating System
Paper • 2403.16971 • Published • 65 -
RakutenAI-7B: Extending Large Language Models for Japanese
Paper • 2403.15484 • Published • 13
Collections
Discover the best community collections!
Collections including paper arxiv:2501.03575
-
Video as the New Language for Real-World Decision Making
Paper • 2402.17139 • Published • 19 -
VideoCrafter1: Open Diffusion Models for High-Quality Video Generation
Paper • 2310.19512 • Published • 16 -
VideoMamba: State Space Model for Efficient Video Understanding
Paper • 2403.06977 • Published • 27 -
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
Paper • 2401.09047 • Published • 14
-
Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping
Paper • 2402.14083 • Published • 47 -
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits
Paper • 2402.17764 • Published • 608 -
Genie: Generative Interactive Environments
Paper • 2402.15391 • Published • 71 -
Humanoid Locomotion as Next Token Prediction
Paper • 2402.19469 • Published • 27
-
EVA-CLIP-18B: Scaling CLIP to 18 Billion Parameters
Paper • 2402.04252 • Published • 26 -
Vision Superalignment: Weak-to-Strong Generalization for Vision Foundation Models
Paper • 2402.03749 • Published • 13 -
ScreenAI: A Vision-Language Model for UI and Infographics Understanding
Paper • 2402.04615 • Published • 41 -
EfficientViT-SAM: Accelerated Segment Anything Model Without Performance Loss
Paper • 2402.05008 • Published • 22