Nils Feldhus (@nfelnlp) Twitter Tweets • TwiCopy

Nils Feldhus

@nfelnlp

Post-doctoral Researcher at BIFOLD / TU Berlin interested in interpretability and analysis of language models. Guest researcher at DFKI Berlin.

ID: 876490374642106374

Link: https://nfelnlp.github.io/ · Joined: 18-06-2017 17:22:44

88 Tweets

235 Followers

387 Following

Inseq

@inseqlib

2 years ago

Inseq v0.5 is finally out! 🐛 New tutorial, distributed and 4-bit quantized models, easier & better contrastive attribution, and more! 🎉 Thanks to Daniel Scalena, Giuseppe Attanasio, and all other contributors! Find out more in the release notes 👀 github.com/inseq-team/ins…

Likes: 12 · Replies: 0 · Retweets: 3

Inseq

@inseqlib

2 years ago

Value Zeroing, a faithful approach for analyzing context mixing in Transformers, is now available on Inseq main branch for all Hugging Face text generation models! 🔀 🔍Paper introducing VZ: aclanthology.org/2023.eacl-main… 🐛VZ in Inseq: tinyurl.com/inseq-vz

Likes: 17 · Replies: 1 · Retweets: 3

Abhilasha Ravichander

@lasha_nlp

2 years ago

Looking for potential emergency reviewers for submissions in Interpretability and Model Analysis/NLP Applications! Topics include: LLM Hallucination, Alignment, Privacy. Please reach out if you have the bandwidth to help!🙏 #NLProc #ACL2024

Likes: 26 · Replies: 4 · Retweets: 11

Nils Feldhus

@nfelnlp

2 years ago

Thanks a lot to all emergency reviewers who helped fill in the gaps for the #ARR February 2024 cycle! 🫶 We're good to go for the author response period. x.com/nfelnlp/status…

Likes: 0 · Replies: 0 · Retweets: 0

BIFOLD

@bifoldberlin

2 years ago

New open #phd position: Contribute to the "FakeXplain - Development of transparent and meaningful explanations in the disinformation detection context" project. Research Assistant - salary grade E 13 TV-L Berliner Hochschulen jobs.tu-berlin.de/en/job-posting…

Likes: 4 · Replies: 0 · Retweets: 5

Inseq

@inseqlib

2 years ago

Inseq v0.6 is out now on PyPI! 🔥 New CLI command for context attribution (Gabriele Sarti), new perturbation-based methods by Hosein Mohebbi & Cass Zhixue and optimizations incl. multi-gpu support! ⚡️ Huge shoutout to our contributors! ❤️ Release notes ⬇️ github.com/inseq-team/ins…

Likes: 11 · Replies: 0 · Retweets: 2

ACLRollingReview

@reviewacl

a year ago

If you haven't been invited to review for ARR 2024 June but are interested in helping us, please fill out this form by June 19: forms.office.com/pages/response…

Likes: 40 · Replies: 3 · Retweets: 36

BlackboxNLP

@blackboxnlp

a year ago

The submission deadline (15 aug) for BlackboxNLP is slowly approaching! We're very excited to see your approaches to open up the black box 🤩 The submission portal has now been opened on OpenReview: openreview.net/group?id=EMNLP…

Likes: 14 · Replies: 0 · Retweets: 7

Nils Feldhus

@nfelnlp

a year ago

Presenting my poster at INLG 2025 today on political bias evaluation assessing sycophancy in (German-language) LLMs: ACL Anthology: aclanthology.org/2024.inlg-main… This paper resulted from the great Bachelor thesis of Maximilian Bleick co-supervised with Aljoscha Burchardt and Sebastian Möller.

Likes: 2 · Replies: 0 · Retweets: 0

NAACL HLT 2025

@naaclmeeting

a year ago

📢 NAACL needs Reviewers & Area Chairs! 📝 If you haven't received an invite for ARR Oct 2024 & want to contribute, sign up by Oct 22nd! ➡️AC form: forms.office.com/r/8j6jXLfASt ➡️Reviewer form: forms.office.com/r/cjPNtL9gPE Please RT 🔁 and help spread the word! 🗣️ #NLProc ACLRollingReview

Likes: 41 · Replies: 1 · Retweets: 26

Laura Kopf

@lkopf_ml

6 months ago

🔍 When do neurons encode multiple concepts? We introduce PRISM, a framework for extracting multi-concept feature descriptions to better understand polysemanticity. 📄 Capturing Polysemanticity with PRISM: A Multi-Concept Feature Description Framework arxiv.org/abs/2506.15538 🧵

Likes: 13 · Replies: 1 · Retweets: 4

Laura Kopf

@lkopf_ml

3 months ago

Happy to share that our PRISM paper has been accepted at #NeurIPS2025 🎉 In this work, we introduce a multi-concept feature description framework that can identify and score polysemantic features. 📄 Paper: arxiv.org/abs/2506.15538 #NeurIPS #MechInterp #XAI

Likes: 9 · Replies: 1 · Retweets: 5