Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2310.16944

Evaluate & Evaluation on the Hub: Better Best Practices for Data and Model Measurements

Paper • 2210.01970 • Published Sep 30, 2022 • 11
Zephyr: Direct Distillation of LM Alignment

Paper • 2310.16944 • Published Oct 25, 2023 • 122
Datasets: A Community Library for Natural Language Processing

Paper • 2109.02846 • Published Sep 7, 2021 • 12
HuggingFace's Transformers: State-of-the-art Natural Language Processing

Paper • 1910.03771 • Published Oct 9, 2019 • 16

Zephyr: Direct Distillation of LM Alignment

Paper • 2310.16944 • Published Oct 25, 2023 • 122

Zephyr: Direct Distillation of LM Alignment

Paper • 2310.16944 • Published Oct 25, 2023 • 122

Zephyr: Direct Distillation of LM Alignment

Paper • 2310.16944 • Published Oct 25, 2023 • 122

sources mentioned by hf.co/thomwolf tweet: x.com/Thom_Wolf/status/1720503998518640703

HuggingFaceH4/zephyr-7b-beta

Text Generation • Updated Oct 16, 2024 • 284k • • 1.65k
mistralai/Mistral-7B-v0.1

Text Generation • Updated Jul 24, 2024 • 947k • 3.56k
stingning/ultrachat

Viewer • Updated Feb 22, 2024 • 774k • 1.57k • 429
openbmb/UltraFeedback

Viewer • Updated Dec 29, 2023 • 64k • 2.23k • 347

Zephyr: Direct Distillation of LM Alignment

Paper • 2310.16944 • Published Oct 25, 2023 • 122
Running on Zero

14

14

Image Captioning with GIT

⚡

Caption images with descriptions
tiiuae/falcon-180B-chat

Text Generation • Updated Nov 7, 2023 • 116k • 543
Running on Zero

376

376

AICoverGen

🚀

Run image generation application

Zephyr: Direct Distillation of LM Alignment

Paper • 2310.16944 • Published Oct 25, 2023 • 122

Training & Architectures

Attention Is All You Need

Paper • 1706.03762 • Published Jun 12, 2017 • 49
FlashAttention-2: Faster Attention with Better Parallelism and Work Partitioning

Paper • 2307.08691 • Published Jul 17, 2023 • 8
Mixtral of Experts

Paper • 2401.04088 • Published Jan 8, 2024 • 157
Mistral 7B

Paper • 2310.06825 • Published Oct 10, 2023 • 46

Zephyr: Direct Distillation of LM Alignment

Paper • 2310.16944 • Published Oct 25, 2023 • 122
Language Models can be Logical Solvers

Paper • 2311.06158 • Published Nov 10, 2023 • 19

Detecting Pretraining Data from Large Language Models

Paper • 2310.16789 • Published Oct 25, 2023 • 11
Let's Synthesize Step by Step: Iterative Dataset Synthesis with Large Language Models by Extrapolating Errors from Small Models

Paper • 2310.13671 • Published Oct 20, 2023 • 19
AutoMix: Automatically Mixing Language Models

Paper • 2310.12963 • Published Oct 19, 2023 • 14
An Emulator for Fine-Tuning Large Language Models using Small Language Models

Paper • 2310.12962 • Published Oct 19, 2023 • 13

Previous
1
2
3
4
Next

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs