Nikita

PQlet

AI & ML interests

None yet

Recent Activity

upvoted a paper 5 days ago

A Primer on the Inner Workings of Transformer-based Language Models

upvoted a paper 18 days ago

LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers

liked a Space 22 days ago

nanotron/ultrascale-playbook

View all activity

Organizations

None yet

PQlet's activity

upvoted a paper 5 days ago

A Primer on the Inner Workings of Transformer-based Language Models

Paper • 2405.00208 • Published Apr 30, 2024 • 10

upvoted a paper 18 days ago

LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers

Paper • 2502.15007 • Published 21 days ago • 162

upvoted a paper 30 days ago

SynthDetoxM: Modern LLMs are Few-Shot Parallel Detoxification Data Annotators

Paper • 2502.06394 • Published Feb 10 • 86

upvoted an article about 2 months ago

Article

Finally, a Replacement for BERT: Introducing ModernBERT

Dec 19, 2024

• 572

upvoted a paper 4 months ago

CLEAR: Character Unlearning in Textual and Visual Modalities

Paper • 2410.18057 • Published Oct 23, 2024 • 203

upvoted an article 4 months ago

Article

Understanding InstaFlow/Rectified Flow

•

Oct 6, 2023

• 27

upvoted a paper 5 months ago

Mechanistic Permutability: Match Features Across Layers

Paper • 2410.07656 • Published Oct 10, 2024 • 18

upvoted a collection 5 months ago

🔍 Interpretability & Analysis of LMs

Collection

Outstanding research in LM interpretability and evaluation, summarized • 105 items • Updated about 7 hours ago • 97

upvoted an article 6 months ago

Article

PaliGemma – Google's Cutting-Edge Open Vision Language Model

May 14, 2024

• 243

upvoted a paper 7 months ago

Layerwise Recurrent Router for Mixture-of-Experts

Paper • 2408.06793 • Published Aug 13, 2024 • 32

upvoted a paper 8 months ago

Linear Transformers with Learnable Kernel Functions are Better In-Context Models

Paper • 2402.10644 • Published Feb 16, 2024 • 81

upvoted 3 papers 9 months ago

The Devil is in the Details: StyleFeatureEditor for Detail-Rich StyleGAN Inversion and High Quality Image Editing

Paper • 2406.10601 • Published Jun 15, 2024 • 66

LM Transparency Tool: Interactive Tool for Analyzing Transformer Language Models

Paper • 2404.07004 • Published Apr 10, 2024 • 6

XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning

Paper • 2406.08973 • Published Jun 13, 2024 • 87

upvoted a paper 10 months ago

Your Transformer is Secretly Linear

Paper • 2405.12250 • Published May 19, 2024 • 153

upvoted a paper 11 months ago

Learn Your Reference Model for Real Good Alignment

Paper • 2404.09656 • Published Apr 15, 2024 • 84