-
A Comprehensive Study of GPT-4V's Multimodal Capabilities in Medical Imaging
Paper • 2310.20381 • Published • 2 -
Multimodal ChatGPT for Medical Applications: an Experimental Study of GPT-4V
Paper • 2310.19061 • Published • 8 -
EHRXQA: A Multi-Modal Question Answering Dataset for Electronic Health Records with Chest X-ray Images
Paper • 2310.18652 • Published • 1 -
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits
Paper • 2402.17764 • Published • 610
Collections including paper arxiv:2402.17764
-
PIA: Your Personalized Image Animator via Plug-and-Play Modules in Text-to-Image Models
Paper • 2312.13964 • Published • 20 -
LLM in a flash: Efficient Large Language Model Inference with Limited Memory
Paper • 2312.11514 • Published • 259 -
StreamDiffusion: A Pipeline-level Solution for Real-time Interactive Generation
Paper • 2312.12491 • Published • 70 -
LLaVA-φ: Efficient Multi-Modal Assistant with Small Language Model
Paper • 2401.02330 • Published • 17
-
Self-Rewarding Language Models
Paper • 2401.10020 • Published • 147 -
BitNet: Scaling 1-bit Transformers for Large Language Models
Paper • 2310.11453 • Published • 97 -
ReFT: Representation Finetuning for Language Models
Paper • 2404.03592 • Published • 94 -
LLM in a flash: Efficient Large Language Model Inference with Limited Memory
Paper • 2312.11514 • Published • 259
-
Pearl: A Production-ready Reinforcement Learning Agent
Paper • 2312.03814 • Published • 15 -
Beyond Surface: Probing LLaMA Across Scales and Layers
Paper • 2312.04333 • Published • 20 -
LEGO: Learning EGOcentric Action Frame Generation via Visual Instruction Tuning
Paper • 2312.03849 • Published • 7 -
wikimedia/wikipedia
Viewer • Updated • 61.6M • 98.6k • 759
-
Mamba: Linear-Time Sequence Modeling with Selective State Spaces
Paper • 2312.00752 • Published • 143 -
SparQ Attention: Bandwidth-Efficient LLM Inference
Paper • 2312.04985 • Published • 39 -
Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research
Paper • 2402.00159 • Published • 62 -
Neural Network Diffusion
Paper • 2402.13144 • Published • 95
-
Visual In-Context Prompting
Paper • 2311.13601 • Published • 19 -
AutoGen: Enabling Next-Gen LLM Applications via Multi-Agent Conversation Framework
Paper • 2308.08155 • Published • 7 -
LIDA: A Tool for Automatic Generation of Grammar-Agnostic Visualizations and Infographics using Large Language Models
Paper • 2303.02927 • Published • 3 -
The Impact of Large Language Models on Scientific Discovery: a Preliminary Study using GPT-4
Paper • 2311.07361 • Published • 14
-
MART: Improving LLM Safety with Multi-round Automatic Red-Teaming
Paper • 2311.07689 • Published • 8 -
DiLoCo: Distributed Low-Communication Training of Language Models
Paper • 2311.08105 • Published • 15 -
SparQ Attention: Bandwidth-Efficient LLM Inference
Paper • 2312.04985 • Published • 39 -
Aligning Large Language Models with Counterfactual DPO
Paper • 2401.09566 • Published • 2