Itamar Zimerman (@itamarzimerman)'s Twitter Profile
Itamar Zimerman

@itamarzimerman

PhD candidate @ Tel Aviv University.
AI Research scientist @ IBM Research.
Interested in deep learning and algorithms.

ID: 825803747649544195

Link: https://itamarzimm.github.io/ · Joined: 29-01-2017 20:32:11

107 Tweets

406 Followers

436 Following

Itamar Zimerman (@itamarzimerman):

Assaf's analysis of recurrent LLMs such as Mamba and RWKV is important. While these models are designed to be efficient for long-context tasks, their effectiveness remains limited due to memory overflows, even at large scale. More details about memory overflows in the paper 📜🧵

Yoni Slutzky (@yonislutzky):

How do information flow patterns in Mamba compare to those in Transformers? 🚨 Our new #ACL2025 paper pits Mamba-1 and Mamba-2 against other Transformer-based models and uncovers both universal and architecture-specific information flow patterns. arxiv.org/abs/2505.24244 🧵
๐š๐”ช๐Ÿพ๐šก๐šก๐Ÿพ (@gm8xx8) 's Twitter Profile Photo

Revisiting LRP: Positional Attribution as the Missing Ingredient for Transformer Explainability. Simple fixes: Problem — all LRP-based XAI tools ignore PE, so relevance gets lost. Fix — model each input as a (token, position) pair and add PE-aware LRP rules.
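The "Fix" described above treats each input as a (token, position) pair so positional encodings get an explicit relevance share. As a toy illustration only (not the paper's actual PE-aware rules), one could split the relevance of an embedding x = tok_emb + pos_emb between its two addends with LRP's epsilon-stabilized proportional rule for sums:

```python
import numpy as np

def split_relevance(R, tok_emb, pos_emb, eps=1e-6):
    """Distribute relevance R over the addends of x = tok_emb + pos_emb,
    proportionally to each addend's contribution (epsilon-stabilized).
    Toy sketch; the paper's actual PE-aware LRP rules may differ."""
    x = tok_emb + pos_emb
    denom = x + eps * np.where(x >= 0, 1.0, -1.0)  # avoid division by zero
    R_tok = R * tok_emb / denom
    R_pos = R * pos_emb / denom
    return R_tok, R_pos

# Relevance is conserved (R_tok + R_pos ≈ R), and the positional part now
# receives an explicit share instead of being silently dropped.
tok = np.array([1.0, 2.0, -0.5])
pos = np.array([0.5, -1.0, 0.25])
R = np.array([3.0, 0.5, 1.0])
R_tok, R_pos = split_relevance(R, tok, pos)
assert np.allclose(R_tok + R_pos, R, atol=1e-4)
```

The proportional-split choice follows the standard LRP treatment of sum nodes; conservation of total relevance is the property that plain token-only attribution loses when PE is ignored.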
Rohan Paul (@rohanpaul_ai):

Reasoning depth of LLMs can now match task size, not a fixed budget. Reasoning models waste compute and often slip because their hidden chain of thought keeps running after the answer is clear. The authors learn an internal progress meter and nudge it so the model stops as
Boaz Lavon (@boazlavon):

LLMs Don't Think Like Developers - Until Now. Together with Shahar Katz and Lior Wolf, we made LLMs execute their code while generating it, just like a human developer. Meet EG-CFG: a new inference-time method that injects real-time execution feedback into the generation loop.
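The core primitive behind "execution feedback in the generation loop" can be sketched generically: run the candidate code, capture its output or traceback, and feed that text back into the model's context. This is a minimal illustration of the idea, not EG-CFG's actual guidance mechanism:

```python
import subprocess
import sys
import tempfile

def execution_feedback(code: str, timeout: float = 5.0) -> str:
    """Run a candidate Python snippet in a subprocess and return its stdout
    on success or its stderr on failure. A generation loop could append this
    string to the prompt before the next decoding step.
    (Generic sketch; not EG-CFG's actual algorithm.)"""
    with tempfile.NamedTemporaryFile("w", suffix=".py", delete=False) as f:
        f.write(code)
        path = f.name
    try:
        proc = subprocess.run(
            [sys.executable, path],
            capture_output=True, text=True, timeout=timeout,
        )
        return proc.stdout if proc.returncode == 0 else proc.stderr
    except subprocess.TimeoutExpired:
        return "TimeoutExpired"

ok = execution_feedback("print(2 + 2)")
err = execution_feedback("1 / 0")
```

Running untrusted model-generated code this way should of course be sandboxed in practice; the subprocess-plus-timeout pattern here only bounds runtime, not side effects.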
Hila Chefer (@hila_chefer):

Exciting news from #ICML2025 & #ICCV2025 🥳
- 🥇 VideoJAM accepted as *oral* at #ICML2025 (top 1%)
- Two talks at #ICCV2025: ☝️ interpretability in the generative era ✌️ video customization
- Organizing two #ICCV2025 workshops: ☝️ structural priors for vision ✌️ long video gen
🧵👇