14 49

Matt Barr

mattbarr

marr75

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 months ago

Byte Latent Transformer: Patches Scale Better Than Tokens

commented on a paper 4 months ago

Top-$nσ$: Not All Logits Are You Need

commented on a paper 4 months ago

Drowning in Documents: Consequences of Scaling Reranker Inference

View all activity

Organizations

mattbarr's activity

upvoted a paper 3 months ago

Byte Latent Transformer: Patches Scale Better Than Tokens

Paper • 2412.09871 • Published Dec 13, 2024 • 93

commented 2 papers 4 months ago

Top-$nσ$: Not All Logits Are You Need

Paper • 2411.07641 • Published Nov 12, 2024 • 20 •

Drowning in Documents: Consequences of Scaling Reranker Inference

Paper • 2411.11767 • Published Nov 18, 2024 • 17 •

upvoted 2 papers 4 months ago

Top-nσ: Not All Logits Are You Need

Paper • 2411.07641 • Published Nov 12, 2024 • 20

Drowning in Documents: Consequences of Scaling Reranker Inference

Paper • 2411.11767 • Published Nov 18, 2024 • 17

upvoted a paper 5 months ago

Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss

Paper • 2410.17243 • Published Oct 22, 2024 • 90

upvoted a paper 6 months ago

DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos

Paper • 2409.02095 • Published Sep 3, 2024 • 36

commented a paper 8 months ago

Vision language models are blind

Paper • 2407.06581 • Published Jul 9, 2024 • 83 •

upvoted a paper 9 months ago

GUI Odyssey: A Comprehensive Dataset for Cross-App GUI Navigation on Mobile Devices

Paper • 2406.08451 • Published Jun 12, 2024 • 25

upvoted an article 10 months ago

Article

A Complete Guide to Audio Datasets

Dec 15, 2022

• 24

commented a paper 10 months ago

Not All Language Model Features Are Linear

Paper • 2405.14860 • Published May 23, 2024 • 41 •

upvoted a paper 10 months ago

Not All Language Model Features Are Linear

Paper • 2405.14860 • Published May 23, 2024 • 41

commented a paper 10 months ago

Your Transformer is Secretly Linear

Paper • 2405.12250 • Published May 19, 2024 • 153 •

upvoted a paper 10 months ago

Your Transformer is Secretly Linear

Paper • 2405.12250 • Published May 19, 2024 • 153

upvoted 2 papers 11 months ago

Multi-Head Mixture-of-Experts

Paper • 2404.15045 • Published Apr 23, 2024 • 60

Training LLMs over Neurally Compressed Text

Paper • 2404.03626 • Published Apr 4, 2024 • 24

commented a paper 11 months ago

Long-context LLMs Struggle with Long In-context Learning

Paper • 2404.02060 • Published Apr 2, 2024 • 37 •

upvoted a paper 12 months ago

Jamba: A Hybrid Transformer-Mamba Language Model

Paper • 2403.19887 • Published Mar 28, 2024 • 108

upvoted 2 papers about 1 year ago

The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

Paper • 2402.17764 • Published Feb 27, 2024 • 610

Sora Generates Videos with Stunning Geometrical Consistency

Paper • 2402.17403 • Published Feb 27, 2024 • 18