Yonatan Belinkov (@boknilev)'s Twitter Profile
Yonatan Belinkov

@boknilev

Assistant professor of computer science @TechnionLive. #NLProc

ID: 554869994

Website: http://www.cs.technion.ac.il/~belinkov · Joined: 16-04-2012 04:54:07

2.2K Tweets

4.4K Followers

1.1K Following

XLLM-Reason-Plan (@xllmreasonplan)

⏰ Only 9 days away!
Join us at the Conference on Language Modeling on October 10 for the first workshop on the application of LLM explainability to reasoning and planning.
Featuring:
📑 20 poster presentations
🎤 9 distinguished speakers
View our schedule at tinyurl.com/xllm-workshop.
Itay Itzhak (@itay_itzhak_)

Happening tomorrow! CoLM 2025 spotlight oral at 10:00 + poster at 11:00 🎤🧠 We’ll dive into cognitive biases in LLMs and what finetuning hides. The talk’s good, promise 🙂 See ya tomorrow morning! #CoLM2025

Aaron Mueller (@amuuueller)

I'll be in Montréal this Friday to speak at #COLM2025's INTERPLAY workshop! As a Québecophile, I have many recommendations for those of you in town the whole week: 🧵

AI21 Labs (@ai21labs)

1/5 Releasing Jamba Reasoning 3B under Apache 2.0: a hybrid SSM-Transformer architecture that tops accuracy & speed across record context lengths, e.g. 3-5X faster than Llama 3.2 3B and Qwen3 4B at 32K tokens.
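The tweet names the architecture class ("hybrid SSM-Transformer") without showing it. As a rough illustrative sketch only, here is the general pattern of interleaving attention with a linear state-space recurrence in PyTorch; the layer names, the toy diagonal non-selective scan, and the 3:1 SSM-to-attention ratio below are assumptions, not Jamba's actual design.

```python
# Rough sketch of a hybrid SSM-Transformer block (illustrative only;
# NOT Jamba's actual architecture -- names and ratios are assumptions).
import torch
import torch.nn as nn

class ToySSM(nn.Module):
    """Toy diagonal linear state-space layer: h_t = a * h_{t-1} + b * x_t."""
    def __init__(self, dim):
        super().__init__()
        self.log_a = nn.Parameter(torch.zeros(dim))   # per-channel decay
        self.b = nn.Parameter(torch.ones(dim))
        self.out = nn.Linear(dim, dim)

    def forward(self, x):                              # x: (batch, seq, dim)
        a = torch.sigmoid(self.log_a)                  # keep decay in (0, 1)
        h = torch.zeros_like(x[:, 0])                  # O(1) state per step
        ys = []
        for t in range(x.size(1)):                     # sequential scan
            h = a * h + self.b * x[:, t]
            ys.append(h)
        return self.out(torch.stack(ys, dim=1))

class HybridBlock(nn.Module):
    """One attention layer followed by a few SSM layers, with residuals."""
    def __init__(self, dim, n_heads=4, ssm_per_attn=3):
        super().__init__()
        self.attn = nn.MultiheadAttention(dim, n_heads, batch_first=True)
        self.norm = nn.LayerNorm(dim)
        self.ssms = nn.ModuleList(ToySSM(dim) for _ in range(ssm_per_attn))

    def forward(self, x):
        attn_out, _ = self.attn(x, x, x, need_weights=False)
        x = self.norm(x + attn_out)                    # residual around attention
        for ssm in self.ssms:
            x = self.norm(x + ssm(x))                  # residual around each SSM
        return x

x = torch.randn(2, 16, 64)                             # (batch, seq, dim)
print(HybridBlock(64)(x).shape)                        # torch.Size([2, 16, 64])
```

A real hybrid would replace the Python loop with a parallel scan and use input-dependent (selective) state updates; the sketch only shows how SSM layers can dominate the stack while attention appears sparsely, which is what keeps long-context inference fast.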
Adi Simhi (@adisimhi)

🤔What happens when LLM agents choose between achieving their goals and avoiding harm to humans in realistic management scenarios? Are LLMs pragmatic, or do they prefer to avoid harming humans?
🚀 New paper out: ManagerBench: Evaluating the Safety-Pragmatism Trade-off in Autonomous LLMs 🚀🧵
AI21 Labs (@ai21labs)

🧠 Jamba Reasoning 3B leads tiny reasoning models (Artificial Analysis).
🥇 #1 on #IFBench (52%) for instruction following
📈 21 on the Artificial Analysis Intelligence Index

👉 Charts by Artificial Analysis: artificialanalysis.ai/models/open-so…
NDIF (@ndif_team)

Ever wished you could explore what's happening inside a 405B parameter model without writing any code? Workbench, our AI interpretability interface, is now live for public beta at workbench.ndif.us!

David Alvarez Melis (@elmelis)

📄 New preprint alert: We study 🪃Boomerang Distillation🪃, a surprising phenomenon that allows generating a family of pre-trained LLMs of intermediate sizes from a single teacher–student pair — 𝐧𝐨 𝐞𝐱𝐭𝐫𝐚 𝐭𝐫𝐚𝐢𝐧𝐢𝐧𝐠 𝐫𝐞𝐪𝐮𝐢𝐫𝐞𝐝! 🧵👇
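The tweet doesn't spell out the mechanism, but the claim (a family of intermediate sizes from one teacher-student pair, no training) suggests assembling checkpoints by mixing layers from the two models. The sketch below is a hypothetical reading of that idea; the function, its `layer_map` bookkeeping, and the assumption that each student layer corresponds to a contiguous span of teacher layers are mine, not the paper's.

```python
# Hypothetical sketch: build an intermediate-size model by swapping some
# student layers back to their original teacher layers. A guessed reading
# of the headline claim, not the paper's verified procedure.
import copy

def intermediate_model(teacher_layers, student_layers, layer_map, k):
    """
    teacher_layers, student_layers: lists of transformer blocks.
    layer_map[i]: span of teacher layer indices that student layer i was
                  distilled from (assumed bookkeeping kept from training).
    k: number of student layers (from the bottom) to re-expand.
    """
    blocks = []
    for i, layer in enumerate(student_layers):
        if i < k:
            # Re-insert the original teacher blocks for this position,
            # growing the model back toward the teacher's depth.
            blocks.extend(copy.deepcopy(teacher_layers[j]) for j in layer_map[i])
        else:
            blocks.append(copy.deepcopy(layer))
    return blocks
```

Sweeping k from 0 to the student's depth would then trace out a family of models between the student's size and (roughly) the teacher's, with no gradient updates.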

Niels Rogge (@nielsrogge)

For people thinking that DeepSeek-OCR is the first model to render text as images: the University of Copenhagen already did this in 2023.

The paper is called "Language Modelling with Pixels". They trained a Masked AutoEncoder (MAE) by rendering text as images and masking patches.
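The recipe named here (render text to pixels, split into patches, mask a subset, reconstruct) is straightforward to sketch. Below is a minimal, illustrative version of the input side in Python with Pillow and PyTorch; the renderer, the 16×16 patch size, and the 25% mask ratio are placeholder choices, not necessarily the paper's exact settings.

```python
# Illustrative sketch of a PIXEL-style input pipeline: render text as an
# image, cut it into fixed-size patches, and mask a random subset of them
# (MAE-style). Font, patch size, and mask ratio are placeholder assumptions.
import torch
from PIL import Image, ImageDraw

def render_text(text, width=256, height=16):
    img = Image.new("L", (width, height), color=255)     # white strip
    ImageDraw.Draw(img).text((0, 2), text, fill=0)       # default bitmap font
    pixels = torch.tensor(list(img.getdata()), dtype=torch.float32)
    return pixels.view(height, width) / 255.0

def patchify(img, patch=16):
    h, w = img.shape
    return (img.view(h // patch, patch, w // patch, patch)
               .permute(0, 2, 1, 3)
               .reshape(-1, patch * patch))              # (num_patches, patch^2)

def mask_patches(patches, ratio=0.25):
    n = patches.size(0)
    hidden = torch.randperm(n)[: int(n * ratio)]         # indices to reconstruct
    corrupted = patches.clone()
    corrupted[hidden] = 0.0                              # zero out masked patches
    return corrupted, hidden

patches = patchify(render_text("Language Modelling with Pixels"))
corrupted, hidden = mask_patches(patches)
print(patches.shape)                                     # torch.Size([16, 256])
```

An MAE-style encoder would then see only the visible patches and be trained to reconstruct the hidden ones from their pixel values.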
Peter Hase (@peterbhase)

I would encourage technical AI types to consider working in grantmaking! Schmidt Sciences is hiring for a unique position where you get to continue your own research at the same time. Link: jobs.lever.co/schmidt-entiti…

Johnny Tian-Zheng Wei (@johntzwei)

Announcing 🔭✨Hubble, a suite of open-source LLMs to advance the study of memorization!

Pretrained models up to 8B params, with controlled insertion of texts (e.g., book passages, biographies, test sets, and more!) designed to emulate key memorization risks 🧵
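The "controlled insertion" idea (planting known texts into the pretraining corpus so that later recall can be measured against known exposure) can be sketched in a few lines. Below is a hypothetical version; the suite's real insertion schedule, formats, and bookkeeping may differ.

```python
# Hypothetical sketch of controlled insertion for memorization studies:
# plant each probe text into the pretraining corpus a fixed number of times
# at random positions, recording ground-truth exposure for later testing.
# (Illustrative only; not Hubble's actual pipeline.)
import random

def insert_probes(corpus_docs, probes, duplication=4, seed=0):
    rng = random.Random(seed)
    docs = list(corpus_docs)                  # don't mutate the caller's list
    for text in probes:
        for _ in range(duplication):
            docs.insert(rng.randrange(len(docs) + 1), text)
    exposure = {text: duplication for text in probes}
    return docs, exposure

docs, exposure = insert_probes(["doc one", "doc two"], ["secret passage"])
print(len(docs), exposure)                    # 6 {'secret passage': 4}
```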
Mor Ventura (@mor_ventura95)

Wrapping up:
✅ DeLeaker: inference-time semantic leakage mitigation method
✅ SLIM: first dedicated dataset for semantic leakage
✅ Eval Framework: comparative dedicated evaluation framework
📄 Paper: arxiv.org/abs/2510.15015
🌐 Project Page: venturamor.github.io/DeLeaker/

Yoav Artzi (@yoavartzi)

Cornell University is recruiting for multiple postdoctoral positions in AI as part of two programs: Empire AI Fellows and Foundational AI Fellows. Positions are available in NYC and Ithaca.

Deadline for full consideration is Nov 20, 2025!
academicjobsonline.org/ajo/jobs/30971
Adi Simhi (@adisimhi)

LLMs can hallucinate for different reasons:
❌ They don't know (lack of knowledge)
❌ They "know" but are uncertain
❌ They "know" and are certain
A new extended version of our paper, combining our understanding of hallucination along the knowledge and certainty axes, is out 🧵
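The three cases suggest a two-axis triage: whether the model knows the fact at all, and how certain it is when answering. As a toy illustration (not the paper's method; the probes and the threshold are placeholders), the split could look like this:

```python
# Toy two-axis triage of hallucination types (illustrative; not the paper's
# actual method). `knows` would come from a knowledge probe (e.g., can the
# model ever produce the gold answer under sampling?) and `certainty` from
# a confidence signal (e.g., the top answer's probability). Both are assumed.

def classify_hallucination(knows: bool, certainty: float, threshold: float = 0.7) -> str:
    if not knows:
        return "lack of knowledge"       # the fact isn't in the model at all
    if certainty < threshold:
        return "knows but uncertain"     # knowledge present, confidence low
    return "knows and certain"           # hallucinates despite knowledge and certainty

print(classify_hallucination(knows=True, certainty=0.4))   # knows but uncertain
```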
Yonatan Belinkov (@boknilev)

Q: which of these can be checked by an LLM as well as by an overloaded human reviewer?
Appropriateness
Formatting
Length
Anonymity
Limitations
Responsible Checklist
Potential Violation Justification
Need Ethics Review
Ethics Review Justification
aclrollingreview.org/reviewerguidel…