- Partially Rewriting a Transformer in Natural Language
  Paper • 2501.18838 • Published • 1
- AxBench: Steering LLMs? Even Simple Baselines Outperform Sparse Autoencoders
  Paper • 2501.17148 • Published • 1
- Sparse Autoencoders Trained on the Same Data Learn Different Features
  Paper • 2501.16615 • Published • 1
- Open Problems in Mechanistic Interpretability
  Paper • 2501.16496 • Published • 16
Collections including paper arxiv:2405.12250
- DeepSeek LLM: Scaling Open-Source Language Models with Longtermism
  Paper • 2401.02954 • Published • 43
- Qwen Technical Report
  Paper • 2309.16609 • Published • 35
- GPT-4 Technical Report
  Paper • 2303.08774 • Published • 5
- Gemini: A Family of Highly Capable Multimodal Models
  Paper • 2312.11805 • Published • 44
- Language Modeling Is Compression
  Paper • 2309.10668 • Published • 83
- Small-scale proxies for large-scale Transformer training instabilities
  Paper • 2309.14322 • Published • 20
- Evaluating Cognitive Maps and Planning in Large Language Models with CogEval
  Paper • 2309.15129 • Published • 6
- Vision Transformers Need Registers
  Paper • 2309.16588 • Published • 78