Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2304.15004

Unified model that generate Text, Image, Video

TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation

Paper • 2412.03069 • Published Dec 4, 2024 • 30
Are Emergent Abilities of Large Language Models a Mirage?

Paper • 2304.15004 • Published Apr 28, 2023 • 6
Scaling Image Tokenizers with Grouped Spherical Quantization

Paper • 2412.02632 • Published Dec 3, 2024 • 10
Janus: Decoupling Visual Encoding for Unified Multimodal Understanding and Generation

Paper • 2410.13848 • Published Oct 17, 2024 • 33

Papers - Emergent Properties - Image

Are Emergent Abilities of Large Language Models a Mirage?

Paper • 2304.15004 • Published Apr 28, 2023 • 6

Papers - Emergent Properties - Exact String Match

Are Emergent Abilities of Large Language Models a Mirage?

Paper • 2304.15004 • Published Apr 28, 2023 • 6

Papers - Emergent Properties - Multiple Choice Grade

Are Emergent Abilities of Large Language Models a Mirage?

Paper • 2304.15004 • Published Apr 28, 2023 • 6

Papers - Emergent Properties

Are Emergent Abilities of Large Language Models a Mirage?

Paper • 2304.15004 • Published Apr 28, 2023 • 6
Emergence of Hidden Capabilities: Exploring Learning Dynamics in Concept Space

Paper • 2406.19370 • Published Jun 27, 2024 • 1

Papers - University - Stanford University

BioMedLM: A 2.7B Parameter Language Model Trained On Biomedical Text

Paper • 2403.18421 • Published Mar 27, 2024 • 23
Long-form factuality in large language models

Paper • 2403.18802 • Published Mar 27, 2024 • 25
stanford-crfm/BioMedLM

Text Generation • Updated Mar 28, 2024 • 3.18k • 410
Direct Preference Optimization: Your Language Model is Secretly a Reward Model

Paper • 2305.18290 • Published May 29, 2023 • 53

Must Reads On Transformers and Diffusers

Explore the cutting-edge of AI with our curated list of must reads on Transformers & Diffusers, driving innovation in generative-AI and beyond.

Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer

Paper • 1701.06538 • Published Jan 23, 2017 • 5
Attention Is All You Need

Paper • 1706.03762 • Published Jun 12, 2017 • 50
Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks

Paper • 2005.11401 • Published May 22, 2020 • 11
Language Model Evaluation Beyond Perplexity

Paper • 2106.00085 • Published May 31, 2021

🚀 Spinning Up in LLMs

Lost in the Middle: How Language Models Use Long Contexts

Paper • 2307.03172 • Published Jul 6, 2023 • 38
Efficient Estimation of Word Representations in Vector Space

Paper • 1301.3781 • Published Jan 16, 2013 • 6
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Paper • 1810.04805 • Published Oct 11, 2018 • 16
Attention Is All You Need

Paper • 1706.03762 • Published Jun 12, 2017 • 50

Why think step by step? Reasoning emerges from the locality of experience

Paper • 2304.03843 • Published Apr 7, 2023
Are Emergent Abilities of Large Language Models a Mirage?

Paper • 2304.15004 • Published Apr 28, 2023 • 6
Knowledge Mechanisms in Large Language Models: A Survey and Perspective

Paper • 2407.15017 • Published Jul 22, 2024 • 34

Language vs. Thought vs. Hype

OpenAI, Anthropic, and the whole lot of "AGI", TESCREAL, etc., can go get stuffed, thank you very much.

Dissociating language and thought in large language models: a cognitive perspective

Paper • 2301.06627 • Published Jan 16, 2023 • 1
A Latent Space Theory for Emergent Abilities in Large Language Models

Paper • 2304.09960 • Published Apr 19, 2023 • 3
Are Emergent Abilities of Large Language Models a Mirage?

Paper • 2304.15004 • Published Apr 28, 2023 • 6
Do LLMs Really Adapt to Domains? An Ontology Learning Perspective

Paper • 2407.19998 • Published Jul 29, 2024 • 1

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs