Tamar Rott Shaham (@tamarrottshaham) 's Twitter Profile
Tamar Rott Shaham

@tamarrottshaham

Postdoctoral fellow at @MIT_csail

ID: 1028916177584762880

Link: https://tamarott.github.io/
Joined: 13-08-2018 08:08:27

151 Tweets

500 Followers

279 Following

Mechanistic Interpretability for Vision @ CVPR2025 (@miv_cvpr2025) 's Twitter Profile Photo

Mechanistic Interpretability for Vision Workshop has officially begun #CVPR2025! 🚀

Join us at Grand C1 Hall for insightful perspectives on the state of interpretability in vision models by Tamar Rott Shaham.
Mechanistic Interpretability for Vision @ CVPR2025 (@miv_cvpr2025) 's Twitter Profile Photo

Don't miss out on Sonia's perspective (and live coding demo) about Prisma: an amazing open-source toolkit for vision and video interpretability.

Happening right now: Grand C1 Hall (on level 4) #CVPR2025
Leshem Choshen C U @ ICLR 🤖🤗 (@lchoshen) 's Twitter Profile Photo

🚀 Technical practitioners & grads — join to build an LLM evaluation hub!

Infra Goals:
🔧 Share evaluation outputs & params
📊 Query results across experiments

Perfect for 🧰 hands-on folks ready to build tools the whole community can use

Join the EvalEval Coalition here 👇
David Bau (@davidbau) 's Twitter Profile Photo

How do you discover the ethical values of an AI when those values are revealed by what the AI *refuses* to say? In his preprint, Can Rager develops a procedure for crawling refusals. It reveals huge differences between models from different countries! We should all audit our AI systems.

Ekin Akyürek (@akyurekekin) 's Twitter Profile Photo

There are three types of storage: activations (in-context), external memory, and model weights. If models are going to spend days on a task, they should be really good at compiling their in-context work into an external memory or into their weights! Here we try to learn weights

Jacob Andreas (@jacobandreas) 's Twitter Profile Photo

👉 New preprint on a new family of Transformer-type models whose depth scales logarithmically with sequence length. Enables:
- fast training
- fast decoding
- large memory capacity in associative recall
- strong length generalization on state tracking

Tamar Rott Shaham (@tamarrottshaham) 's Twitter Profile Photo

How do LMs track what humans believe? In our new work, we show they use a pointer-like mechanism we call lookback. Super proud of this work by Nikhil Prakash and team! This is the most intricate piece of LM reverse engineering I’ve seen!

David Bau (@davidbau) 's Twitter Profile Photo

The new "Lookback" paper from Nikhil Prakash contains a surprising insight... 70b/405b LLMs use double pointers! Akin to C programmers' double (**) pointers. They show up when the LLM is "knowing what Sally knows Ann knows", i.e., Theory of Mind. x.com/nikhil07prakas…
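The C analogy can be made concrete with a toy snippet. Everything here (the belief variables, the helper function) is purely illustrative and not from the paper:

```c
#include <assert.h>
#include <string.h>

/* Sally's first-order belief about where the marble is. */
static const char *sally_thinks = "basket";

/* Ann's model of Sally's belief: a pointer to Sally's pointer (**),
 * mirroring the nested "knowing what Sally knows" structure. */
static const char *const *ann_thinks_sally_thinks = &sally_thinks;

/* Peeling off one level of indirection answers "what does Ann think
 * Sally thinks?" -- each dereference resolves one layer of belief. */
static const char *what_ann_thinks_sally_thinks(void) {
    return *ann_thinks_sally_thinks;
}
```

The point of the analogy: just as a `**` pointer stores the address of another pointer rather than the data itself, the model represents a reference to another agent's reference.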

Koyena Pal (@kpal_koyena) 's Twitter Profile Photo

🚨 Registration is live! 🚨

The New England Mechanistic Interpretability (NEMI) Workshop is happening August 22nd 2025 at Northeastern University!

A chance for the mech interp community to nerd out on how models really work 🧠🤖

🌐 Info: nemiconf.github.io/summer25/
📝 Register:
Fazl Barez (@fazlbarez) 's Twitter Profile Photo

Excited to share our paper: "Chain-of-Thought Is Not Explainability"!

We unpack a critical misconception in AI: models explaining their Chain-of-Thought (CoT) steps aren't necessarily revealing their true reasoning. Spoiler: transparency of CoT can be an illusion. (1/9) 🧵
Shivam Duggal (@shivamduggal4) 's Twitter Profile Photo

Compression is the heart of intelligence.
From Occam to Kolmogorov — shorter programs = smarter representations.

Meet KARL: Kolmogorov-Approximating Representation Learning.

Given an image, a token budget T & a target quality 𝜖, KARL finds the smallest t ≤ T to reconstruct it within 𝜖 🧵
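The stated objective (smallest t ≤ T whose reconstruction stays within 𝜖) can be sketched as a simple search. Everything below, including the toy error model, is an illustrative assumption rather than KARL's actual procedure:

```c
#include <assert.h>

/* Toy stand-in for reconstruction quality: error shrinks as the token
 * budget t grows. A real system would decode the image from t tokens
 * and measure distortion against the original. */
static double recon_error(int t) {
    return 1.0 / (double)t;
}

/* Return the smallest token count t <= T whose reconstruction error is
 * within eps, or -1 if even the full budget T is not enough. */
static int smallest_budget(int T, double eps) {
    for (int t = 1; t <= T; t++) {
        if (recon_error(t) <= eps) {
            return t;
        }
    }
    return -1;
}
```

Because the toy error is monotone in t, a binary search over [1, T] would find the same budget in O(log T) quality checks.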
Zhijing Jin✈️ ICLR Singapore (@zhijingjin) 's Twitter Profile Photo

[Implicit Personalization of #LLMs] How do we answer the question "What colo(u)r is a football?" Answer 1: "Brown🏈". Answer 2: "Black and white⚽". We propose a #Causal framework to test whether LLMs adjust their answers depending on the cultural background inferred from the question.

Inbar Huberman-Spiegelglas (@inbarhub) 's Twitter Profile Photo

📷 FlowEdit has been accepted to #ICCV2025!

Edit real images with text-to-image flow models!

Check out:
code: github.com/fallenshock/Fl…
webpage: matankleiner.github.io/flowedit/
space to edit your images: huggingface.co/spaces/fallens…
great ComfyUI plugins (logtd): matankleiner.github.io/flowedit/#comfy
Owain Evans (@owainevans_uk) 's Twitter Profile Photo

New paper & surprising result.
LLMs transmit traits to other models via hidden signals in data.
Datasets consisting only of 3-digit numbers can transmit a love for owls, or evil tendencies. 🧵
Mehul Damani @ ICLR (@mehuldamani2) 's Twitter Profile Photo

🚨 New Paper! 🚨
We trained reasoning LLMs to reason about what they don't know.

o1-style reasoning training improves accuracy but produces overconfident models that hallucinate more.

Meet RLCR: a simple RL method that trains LLMs to reason and reflect on their uncertainty --
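One simple way to score both correctness and stated confidence is a Brier-style reward: correctness minus the squared gap between the model's confidence and the truth. This formula is an illustrative assumption, not necessarily the exact RLCR objective:

```c
#include <assert.h>

/* Calibration-aware reward sketch: a correct answer earns up to 1.0,
 * and any mismatch between stated confidence (in [0, 1]) and actual
 * correctness is penalized quadratically. Overconfident wrong answers
 * score worst, which discourages confident hallucination. */
static double calibration_reward(int correct, double confidence) {
    double c = correct ? 1.0 : 0.0;
    double gap = confidence - c;
    return c - gap * gap;
}
```

Under this scoring, a model maximizes expected reward by reporting its true probability of being correct, which is what makes the penalty a calibration incentive rather than just an accuracy bonus.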
Bruno Mlodozeniec (@kayembruno) 's Twitter Profile Photo

NeurIPS Conference, why take away authors' option to provide figures in rebuttals during the rebuttal period? Grounding the discussion in hard evidence (like plots) makes resolving disagreements much easier for both authors and reviewers.

Left: NeurIPS