Collections

Collections including paper arxiv:2305.11206

A collection of papers useful for studying LLMs.

Attention Is All You Need

Paper • 1706.03762 • Published Jun 12, 2017 • 55
LoRA: Low-Rank Adaptation of Large Language Models

Paper • 2106.09685 • Published Jun 17, 2021 • 35
Direct Preference Optimization: Your Language Model is Secretly a Reward Model

Paper • 2305.18290 • Published May 29, 2023 • 53
Lost in the Middle: How Language Models Use Long Contexts

Paper • 2307.03172 • Published Jul 6, 2023 • 40

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Paper • 2211.05100 • Published Nov 9, 2022 • 29
CsFEVER and CTKFacts: Acquiring Czech data for fact verification

Paper • 2201.11115 • Published Jan 26, 2022
Training language models to follow instructions with human feedback

Paper • 2203.02155 • Published Mar 4, 2022 • 17
FinGPT: Large Generative Models for a Small Language

Paper • 2311.05640 • Published Nov 3, 2023 • 32

LIMA: Less Is More for Alignment

Paper • 2305.11206 • Published May 18, 2023 • 23

LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models

Paper • 2309.12307 • Published Sep 21, 2023 • 88
LIMA: Less Is More for Alignment

Paper • 2305.11206 • Published May 18, 2023 • 23
LMSYS-Chat-1M: A Large-Scale Real-World LLM Conversation Dataset

Paper • 2309.11998 • Published Sep 21, 2023 • 25
Identifying Mislabeled Data using the Area Under the Margin Ranking

Paper • 2001.10528 • Published Jan 28, 2020

Large Language Models as Optimizers

Paper • 2309.03409 • Published Sep 7, 2023 • 76
When Less is More: Investigating Data Pruning for Pretraining LLMs at Scale

Paper • 2309.04564 • Published Sep 8, 2023 • 16
LIMA: Less Is More for Alignment

Paper • 2305.11206 • Published May 18, 2023 • 23
Describe, Explain, Plan and Select: Interactive Planning with Large Language Models Enables Open-World Multi-Task Agents

Paper • 2302.01560 • Published Feb 3, 2023 • 1
