Anirudh Khatry (@anirudhkhatry) Twitter Tweets • TwiCopy

good girl

@goodgirlxsz

5 hours ago

🔥Telegram İfşa

thumb_up_off_alt34

chat_bubble_outline39

repeat6

shareShare

Elias Stengel-Eskin (on the faculty job market)

@eliaseskin

4 months ago

🇦🇹 I’m on my way to #ACL2025 to help present two papers (🧵s below) ➡️ MAT-Steer (07/30 at 11am), our method for steering LLMs w/ multiple attributes (e.g. truthfulness, bias reduction, and toxicity mitigation) simultaneously. ➡️ LAQuer (07/28 at 11am), a new task/framework for

thumb_up_off_alt61

chat_bubble_outline2

repeat15

shareShare

Greg Durrett

@gregd_nlp

4 months ago

📢I'm joining NYU (Courant CS + Center for Data Science) starting this fall! I’m excited to connect with new NYU colleagues and keep working on LLM reasoning, reliability, coding, creativity, and more! I’m also looking to build connections in the NYC area more broadly. Please

thumb_up_off_alt755

chat_bubble_outline91

repeat45

shareShare

Veridise

@veridiseinc

4 months ago

We’re excited to launch LLZK, an open-source intermediate representation (IR) for zero-knowledge circuits. Think LLVM, but for ZK. Built by Veridise and supported with a grant from the Ethereum Foundation, LLZK is now live on GitHub. Thread 🧵 1/4

thumb_up_off_alt94

chat_bubble_outline10

repeat19

shareShare

Ofir Press

@ofirpress

3 months ago

i'm tired of answering emails we gotta make swemail-bench plz

thumb_up_off_alt32

chat_bubble_outline5

repeat3

shareShare

Akari Asai

@akariasai

3 months ago

Grad school season reminder: many CS departments run student-led pre-application mentorship programs for prospective PhD applicants (due Oct. You can get feedback from current PhD students! Eg - UW’s CSE PAMS: cs.washington.edu/academics/grad… - MIT EECS GAAP: eecs-gaap.mit.edu 🧵

thumb_up_off_alt265

chat_bubble_outline10

repeat42

shareShare

Charles 🎉 Frye

@charles_irl

3 months ago

The ICLR 2026 deadline is ten days away. But you just found a bug in your evals, so now you need to re-run all your ablations. That's hundreds of experiments, and you need them done ASAP. Modal's got you. Introducing our ICLR 2026 compute grant program.

thumb_up_off_alt436

chat_bubble_outline14

repeat27

shareShare

Saikat Dutta

@saikatdutta2012

3 months ago

📢 The Software Engineering group at Cornell Bowers Computing and Information Science is growing fast -- we're now 8 PhD students strong! I’m recruiting PhD students for Fall 2026! If you are interested in the intersection of SE and AI, apply to Cornell CS and reach out! Ddl: Dec 15, 2025. RT!

thumb_up_off_alt27

chat_bubble_outline0

repeat8

shareShare

Elias Stengel-Eskin (on the faculty job market)

@eliaseskin

3 months ago

🚨 We study new privacy risks that emerge as systems become increasingly multi-agent: seemingly innocuous queries can be composed across distributed/siloed agents to infer private info. One solution: reason about the adversary's motivations/state and communicate across the org

thumb_up_off_alt22

chat_bubble_outline0

repeat12

shareShare

Jessy Li

@jessyjli

3 months ago

To appear #NeurIPS2025: Can AI aid scientists amidst their own workflows, when they do not know step-by-step workflows and may not know, in advance, the kinds of scientific utility a visualization would bring? The CosmicAI presents ✨AstroVisBench:

thumb_up_off_alt36

chat_bubble_outline0

repeat10

shareShare

Liyan Tang

@liyantang4

3 months ago

Our paper "ChartMuseum 🖼️" is now accepted to #NeurIPS2025 Datasets and Benchmarks Track! Even the latest models, such as GPT-5 and Gemini-2.5-Pro, still cannot do well on challenging 📉chart understanding questions , especially on those that involve visual reasoning 👀!

thumb_up_off_alt35

chat_bubble_outline1

repeat19

shareShare

Veridise

@veridiseinc

2 months ago

Checking in from the House of ZK Verifying Intelligence event in Singapore. Our raccoon T-shirts are disappearing fast 🦝🦝🦝

Checking in from the <a href="/HouseofZK/">House of ZK</a> Verifying Intelligence event in Singapore. Our raccoon T-shirts are disappearing fast 🦝🦝🦝

thumb_up_off_alt50

chat_bubble_outline1

repeat3

shareShare

Sarah Jabbour

@sarahjabbour_

2 months ago

Every year during PhD app season I try to help students with writing statements. I don’t have tons of time now, so I posted my own SOP from 6 (yikes) years ago so prospective students can see what a statement might look like! (link in my reply)

thumb_up_off_alt760

chat_bubble_outline9

repeat50

shareShare

Elias Stengel-Eskin (on the faculty job market)

@eliaseskin

2 months ago

🚨 TRAAC uses an online difficulty-adaptive, attention-based compression method to address a core problem in long thinking: an inability to adapt to problem difficulty! Leads to underthinking on hard problems, overthinking on easy ones, reducing accuracy and efficiency. TRAAC

thumb_up_off_alt39

chat_bubble_outline0

repeat11

shareShare

Atula Tejaswi

@atu_tej

2 months ago

Come check out our poster at #COLM2025 in Montreal! Poster Session 2, Tue Oct 7th, 4:30-6:30 PM.

thumb_up_off_alt12

chat_bubble_outline0

repeat2

shareShare

Anirudh Khatry

@anirudhkhatry

2 months ago

CRUST-bench will be presented as a #spotlight paper at #COLM2025 this Thursday (10/9) in session 5! Come check out our poster after the talk to know more about challenges LLMs face in the C to Rust transpilation task.

thumb_up_off_alt9

chat_bubble_outline0

repeat4

shareShare

Greg Durrett

@gregd_nlp

2 months ago

Find my students and collaborators at COLM this week! Tuesday morning: Juan Diego Rodríguez (he/him) and Ramya Namuduri 's papers (find them if you missed it!) Wednesday pm: Manya Wadhwa 's EvalAgent Thursday am: Anirudh Khatry 's CRUST-Bench oral spotlight + poster

Find my students and collaborators at COLM this week!

Tuesday morning: <a href="/juand_r_nlp/">Juan Diego Rodríguez (he/him)</a> and <a href="/RamyaNamuduri/">Ramya Namuduri</a> 's papers (find them if you missed it!)

Wednesday pm: <a href="/ManyaWadhwa1/">Manya Wadhwa</a> 's EvalAgent

Thursday am: <a href="/AnirudhKhatry/">Anirudh Khatry</a> 's CRUST-Bench oral spotlight + poster

thumb_up_off_alt48

chat_bubble_outline0

repeat6

shareShare

rishi

@rishibommasani

2 months ago

Given that LMs are critical to research across many disciplines (perhaps even more so than compute/hardware access), what is known about how the top LM providers subsidize researcher access? Much more aggressive subsidization could accelerate the benefits of AI to science.

thumb_up_off_alt6

chat_bubble_outline1

repeat2

shareShare

Graham Neubig

@gneubig

2 months ago

I'm co-organizing a new 🏙️ Workshop on Agentic AI Benchmarks and Applications for Enterprise Tasks 🏙️ together with collaborators at Fujitsu! As agents move towards the real world, we need more rigorous research work on this topic.

thumb_up_off_alt130

chat_bubble_outline3

repeat8

shareShare

Isil Dillig

@isildillig

2 months ago

UTCS is hiring in all areas, including PL! Please DM me if you are on the job market this year and interested in joining our wonderful department :)

thumb_up_off_alt79

chat_bubble_outline1

repeat27

shareShare

CosmicAI

@cosmicai_inst

2 months ago

CosmicAI Ph.D Student Sebastian Joseph is representing us at SAIL! He discusses his research on benchmarking large language models for astro-specific computing and visualization tasks. U.S. National Science Foundation Simons Foundation UT Austin

thumb_up_off_alt4

chat_bubble_outline0

repeat2

shareShare