Kerem Zaman (@keremzaman3) Twitter Tweets • TwiCopy

Kerem Zaman

@keremzaman3

PhD student @uncnlp | prev. BSc @UniBogazici | kal '18

ID: 1289141085588029445

Link: https://keremzaman.com

Joined: 31-07-2020 10:10:01

326 Tweets

369 Followers

1.1K Following

Usman Anwar

@usmananwar391

2 months ago

✨New AI Safety paper on CoT Monitorability✨ We use information theory to answer when Chain-of-Thought monitoring works, and how to make it better.

164 Likes

2 Replies

25 Reposts

Niloofar (on faculty job market!)

@niloofar_mire

2 months ago

Privacy in LLMs is not just Memorization! We reviewed 1322 papers (2016–25) across ML, NLP & SEC: 92% fixate on memorization/chat leaks. We map 5 urgent problems + a roadmap, to prevent surveillance, inference, aggregation and other negative outcomes.

170 Likes

2 Replies

24 Reposts

Kerem Zaman

@keremzaman3

2 months ago

you don’t need to travel the world to learn how to make the best baklava tbh

15 Likes

1 Reply

0 Reposts

Kerem Zaman

@keremzaman3

2 months ago

shoutout to Ai2 Asta!! it’s incredibly good at surfacing the exact papers I’m looking for. the results are super precise and it deserves more attention!

5 Likes

1 Reply

0 Reposts

Emre Can Acikgoz

@emrecanacikgoz

2 months ago

Consider an LLM Agent that could train itself while testing. What if it could also sense its own weaknesses and use them at test-time training? 🚨New paper!🚨 We investigate a new test-time self-improvement (TT-SI) algorithm that enables agents to self-improve using only one

138 Likes

5 Replies

28 Reposts

Nil Gurel

@nilgurelphd

2 months ago

Excited for tomorrow! 🎙 Honored to join the AI x Flexible Biosensors panel at #SFTechWeek by a16z. Join us! 📅 Saturday, Oct 11 | 11:00 AM PT 🔗 RSVP: partiful.com/e/zRAxQyASrlwt… Tech Week

4 Likes

0 Replies

1 Repost

Kerem Zaman

@keremzaman3

2 months ago

while Kumru is getting this much attention, let's shed some light on the hallucination issue. two of the scenarios in which LLM hallucinations occur are: - giving a wrong answer about information it has never encountered before - giving a wrong answer even though it knows the correct one

46 Likes

3 Replies

8 Reposts

Niloofar (on faculty job market!)

@niloofar_mire

2 months ago

LLMs are solving IMO problems, but can they grade them? In our new paper, we find they catch errors well but *fumble partial credit*. Our solution: agentic workflows that auto-generate rubrics and grade step-by-step, matching human consistency.

113 Likes

6 Replies

9 Reposts

Kerem Zaman

@keremzaman3

2 months ago

did anyone get affected by the invitation verification letter issue for #EMNLP2025?

0 Likes

0 Replies

0 Reposts

Jaap Jumelet

@jumeletj

2 months ago

🌍Introducing BabyBabelLM: A Multilingual Benchmark of Developmentally Plausible Training Data! LLMs learn from vastly more data than humans ever experience. BabyLM challenges this paradigm by focusing on developmentally plausible data. We extend this effort to 45 new languages!

32 Likes

1 Reply

15 Reposts

Michael Saxon

@m2saxon

2 months ago

The viral new "Definition of AGI" paper has fake citations which do not exist. And it specifically TELLS you to read them! Proof: different articles present at the specified journal/volume/page number, and their titles exist nowhere on any searchable repository.

1.1K Likes

102 Replies

214 Reposts

Niloofar (on faculty job market!)

@niloofar_mire

2 months ago

I'm recruiting students for fall 2026 thru Language Technologies Institute | @CarnegieMellon & CMU Engineering & Public Policy, in: 1. Privacy & security of LLMs, coding, long horizon & embodied agents (robotics) 2. Tiny local llms 3. AI for scientific reasoning, esp. chemistry 4. Latent reasoning 5. anything YOU are passionate about!

782 Likes

20 Replies

140 Reposts