1 19 54

Nils Feldhus

nfel

https://nfelnlp.github.io

AI & ML interests

Interpretability, Explainability, Natural Language Generation

Recent Activity

authored a paper 5 days ago

Persona Prompting as a Lens on LLM Social Reasoning

submitted a paper 5 days ago

Persona Prompting as a Lens on LLM Social Reasoning

upvoted a paper 5 days ago

Persona Prompting as a Lens on LLM Social Reasoning

View all activity

Organizations

upvoted a paper 5 days ago

Persona Prompting as a Lens on LLM Social Reasoning

Paper • 2601.20757 • Published 6 days ago • 3

upvoted an article 12 days ago

Article

🪄 Interpreto: A Unified Toolkit for Interpretability of Transformer Models

15 days ago

•

upvoted a collection 2 months ago

👤 Implicit Personalization in Language Models

Collection

Works on detecting, attributing and controlling implicit personalization in language models • 22 items • Updated 6 days ago • 2

upvoted a paper 4 months ago

Interpreting Language Models Through Concept Descriptions: A Survey

Paper • 2510.01048 • Published Oct 1, 2025 • 2

upvoted a paper 5 months ago

RelP: Faithful and Efficient Circuit Discovery via Relevance Patching

Paper • 2508.21258 • Published Aug 28, 2025 • 4

upvoted 2 papers 7 months ago

Infherno: End-to-end Agent-based FHIR Resource Synthesis from Free-form Clinical Notes

Paper • 2507.12261 • Published Jul 16, 2025 • 1

Table Understanding and (Multimodal) LLMs: A Cross-Domain Case Study on Scientific vs. Non-Scientific Data

Paper • 2507.00152 • Published Jun 30, 2025 • 1

upvoted 2 papers 8 months ago

Capturing Polysemanticity with PRISM: A Multi-Concept Feature Description Framework

Paper • 2506.15538 • Published Jun 18, 2025 • 1

GeistBERT: Breathing Life into German NLP

Paper • 2506.11903 • Published Jun 13, 2025 • 4

upvoted a collection 8 months ago

ELI-Why

Collection

🧠 ELI-Why: Evaluating the Pedagogical Utility of Language Model Explanations ACL Findings 2025 • 4 items • Updated Jun 11, 2025 • 3

upvoted 2 papers 8 months ago

Through a Compressed Lens: Investigating the Impact of Quantization on LLM Explainability and Interpretability

Paper • 2505.13963 • Published May 20, 2025 • 1

Truth or Twist? Optimal Model Selection for Reliable Label Flipping Evaluation in LLM-based Counterfactuals

Paper • 2505.13972 • Published May 20, 2025 • 1

upvoted 2 papers 9 months ago

Gender Bias in Explainability: Investigating Performance Disparity in Post-hoc Methods

Paper • 2505.01198 • Published May 2, 2025 • 2

Do Large Language Models Latently Perform Multi-Hop Reasoning?

Paper • 2402.16837 • Published Feb 26, 2024 • 28

upvoted a paper 10 months ago

Enhancing Automated Interpretability with Output-Centric Feature Descriptions

Paper • 2501.08319 • Published Jan 14, 2025 • 11

upvoted a paper 11 months ago

QE4PE: Word-level Quality Estimation for Human Post-Editing

Paper • 2503.03044 • Published Mar 4, 2025 • 6

upvoted an article about 1 year ago

Article

What We Learned About LLM/VLMs in Healthcare AI Evaluation:

Nov 8, 2024

•

upvoted a paper over 1 year ago

A Primer on the Inner Workings of Transformer-based Language Models

Paper • 2405.00208 • Published Apr 30, 2024 • 12

upvoted a collection about 2 years ago

🔍 Interpretability & Analysis of LMs

Collection

Outstanding research in LM interpretability and evaluation, summarized • 135 items • Updated Dec 18, 2025 • 118

Nils Feldhus

AI & ML interests

Recent Activity

Organizations

nfel's activity

🪄 Interpreto: A Unified Toolkit for Interpretability of Transformer Models

What We Learned About LLM/VLMs in Healthcare AI Evaluation: