Siddharth Suresh (@siddsuresh97)'s Twitter Profile
Siddharth Suresh

@siddsuresh97

PhD student @UWMadison | Applied Scientist Intern @AmazonScience AGI Foundations | Human-AI Alignment | Prev Intern @BrownCLPS

ID: 789065132122992640

Link: https://scholar.google.com/citations?user=xsyrntwAAAAJ&hl=en | Joined: 20-10-2016 11:26:02

133 Tweets

359 Followers

1.1K Following

Computational Auditory Perception Group (@compaudition)'s Twitter Profile Photo

We are recruiting postdocs! Want to grow your own social networks to study creativity, cultural evolution & decision-making? We are hiring a funded postdoc at Cornell in collaboration with UC Davis, CUNY, & Princeton. Apply here: academicjobsonline.org/ajo/jobs/28959

Andrew Lampinen (@andrewlampinen)'s Twitter Profile Photo

Had fun talking at the Spurious Correlation & Shortcut Learning Workshop at ICLR! One example I brought up, which I think provides an uncommon perspective: a case where spurious shortcuts can improve generalization... even to out-of-distribution sets where the spurious feature doesn't generalize! Thread:

Dimitris Papailiopoulos (@dimitrispapail)'s Twitter Profile Photo

Do you want to do RL for coding and agentic workflows? Do you want to do science and figure out when RL kicks in? What is the right algorithm (it's not GRPO)? How much reasoning do you need in your base model (you definitely need some, but is it a lot or A LOT)? Do you want to figure out how

Andrew Lampinen (@andrewlampinen)'s Twitter Profile Photo

How do language models generalize from information they learn in-context vs. via finetuning? We show that in-context learning can generalize more flexibly, illustrating key differences in the inductive biases of these modes of learning — and ways to improve finetuning. Thread: 1/

Qihong Lu | 吕其鸿 (@qihong_lu)'s Twitter Profile Photo

I’m thrilled to announce that I will start as a presidential assistant professor in Neuroscience at the City U of Hong Kong in Jan 2026! I have RA, PhD, and postdoc positions available! Come work with me on neural network models/experiments on human memory! RT appreciated! (1/5)

Fenil Doshi (@fenildoshi009)'s Twitter Profile Photo

🧵 What if two images have the same local parts but represent different global shapes purely through part arrangement? Humans can spot the difference instantly! The question is: can vision models do the same? 1/15

Jifan Zhang (@jifan_zhang)'s Twitter Profile Photo

Releasing HumorBench today. Grok 4 is 🥇 on this uncontaminated, non-STEM humor reasoning benchmark. 🫡🫡 xAI Here are a couple of things I find surprising 👇 1. This benchmark yields an almost perfect rank correlation with ARC-AGI. Yet the task of reasoning about New Yorker style
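
"Rank correlation" here means the two leaderboards order models almost identically, even though the raw scores differ. As an illustration only, with made-up model names and scores rather than the paper's actual numbers, Spearman's ρ between two benchmarks can be computed like this:

```python
from scipy.stats import spearmanr

# Hypothetical leaderboard scores (not the real HumorBench / ARC-AGI numbers)
humorbench = {"model_a": 62.0, "model_b": 55.5, "model_c": 48.0, "model_d": 31.0}
arc_agi    = {"model_a": 15.9, "model_b": 9.5,  "model_c": 8.6,  "model_d": 3.0}

models = sorted(humorbench)
rho, pval = spearmanr([humorbench[m] for m in models],
                      [arc_agi[m] for m in models])
print(f"Spearman rank correlation: {rho:.2f}")  # 1.0 here: both benchmarks rank the models identically
```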

Thomas Fel (@napoolar)'s Twitter Profile Photo

🧠 Submit to CogInterp @ NeurIPS 2025! Bridging AI & cognitive science to understand how models think, reason & represent. CFP + details 👉 coginterp.github.io/neurips2025/

Nicholas Roberts (@nick11roberts)'s Twitter Profile Photo

🎉 Excited to share that our paper "Pretrained Hybrids with MAD Skills" was accepted to the Conference on Language Modeling 2025! We introduce Manticore, a framework for automatically creating hybrid LMs from pretrained models without training from scratch. 🧵[1/n]

lalit (@stochasticlalit)'s Twitter Profile Photo

It was amazing to be part of this effort. Huge shout out to the team, and all the incredible pre-training and post-training efforts that ensure Gemini is the leading frontier model! deepmind.google/discover/blog/…

Cameron R. Wolfe, Ph.D. (@cwolferesearch)'s Twitter Profile Photo

Direct Preference Optimization (DPO) is simple to implement but complex to understand, which creates misconceptions about how it actually works…

LLM Training Stages: LLMs are typically trained in four stages:

1. Pretraining
2. Supervised Finetuning (SFT)
3. Reinforcement
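
For context on why DPO is "simple to implement": the objective reduces to a single logistic loss on the reward margin between a chosen and a rejected response, measured relative to a frozen reference model. The sketch below is an illustrative PyTorch version of that standard objective, not code from this thread; the `dpo_loss` name, the β value, and the toy log-probabilities are placeholder assumptions.

```python
import torch
import torch.nn.functional as F

def dpo_loss(policy_chosen_logps, policy_rejected_logps,
             ref_chosen_logps, ref_rejected_logps, beta=0.1):
    """Minimal DPO loss: push the policy to prefer the chosen response
    over the rejected one, relative to a frozen reference model."""
    # Implicit rewards: log-ratio of policy vs. reference for each response
    chosen_rewards = beta * (policy_chosen_logps - ref_chosen_logps)
    rejected_rewards = beta * (policy_rejected_logps - ref_rejected_logps)
    # Logistic (Bradley-Terry style) loss on the reward margin
    return -F.logsigmoid(chosen_rewards - rejected_rewards).mean()

# Toy usage with made-up summed log-probabilities for a batch of 2 preference pairs
policy_chosen = torch.tensor([-12.3, -9.8])
policy_rejected = torch.tensor([-14.1, -9.5])
ref_chosen = torch.tensor([-12.9, -10.2])
ref_rejected = torch.tensor([-13.8, -9.9])
print(dpo_loss(policy_chosen, policy_rejected, ref_chosen, ref_rejected))
```

The complexity the tweet alludes to is less in this loss than in how the log-probabilities are obtained and what the implicit reward actually means.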

Apurva Ratan Murty (@apurvaratan)'s Twitter Profile Photo

Excited for this workshop at #CCN2025! Come listen to me talk about TopoNets: Topographic models across vision, language and audition. Looking forward to seeing old friends and making new ones!