Preetum Nakkiran (@preetumnakkiran) 's Twitter Profile
Preetum Nakkiran

@preetumnakkiran

ML research @ Apple

ID: 1069244826

http://preetum.nakkiran.org · Joined 07-01-2013 21:46:28

4.4K Tweets

11.11K Followers

2.2K Following

Eugene Vinitsky 🍒🦋 (@eugenevinitsky) 's Twitter Profile Photo

We now know RL agents can zero-shot crush driving benchmarks. Can we put them on a car and replace the planning stack? We're hiring a postdoc at NYU to find out! Email me if interested and please help us get the word out.

Miguel Angel Bautista (@itsbautistam) 's Twitter Profile Photo

We have an open position at Apple MLR to work on scalable and efficient generative models that perform across diverse data domains—including images, 3D, video, graphs, etc. We care deeply about simplifying modeling pipelines and developing powerful, scalable training recipes.

Mehrdad Farajtabar (@mfarajtabar) 's Twitter Profile Photo

We have a full-time position for a research scientist on our team at #Apple. The topic is understanding and improving the #reasoning abilities of #LLMs. We're also interested in developing new and efficient architectures based on transformers for language modeling, again reasoning

Arwen Bradley (@arwenbradley) 's Twitter Profile Photo

I’m at #ICML2025 this week! Will be presenting Mechanism of Projective Composition of Diffusion Models tomorrow afternoon. Stop by poster E-3105 to see oil paintings of Preetum Nakkiran’s dog!

Shuangfei Zhai (@zhaisf) 's Twitter Profile Photo

There is a more general version of this question: why not scale up the parameters of the attention operation and make it more expressive? (you can do it as suggested below, or simply increase the dimension of QKV) The empirical answer is that it’s not nearly as effective as

Chhavi Yadav (@chhaviyadav_) 's Twitter Profile Photo

Had an amazing time at NewInML @ ICML 2025 ICML Conference giving a talk on "What I Wish I knew before starting a PhD (but learnt the hard way)"! Loved the post-talk discussions and the heartwarming messages :) Sharing slides since some people asked, link in the tweet below 👇

Christopher D. Long 🇺🇦 (@octonion) 's Twitter Profile Photo

Terence Tao has written a thread on Mathstodon about the damage being caused by the grant freezes at UCLA. mathstodon.xyz/@tao/114956840…

Dylan Foster 🐢 (@canondetortugas) 's Twitter Profile Photo

Announcing the first workshop on Foundations of Language Model Reasoning (FoRLM) at NeurIPS 2025! 📝Soliciting abstracts that advance foundational understanding of reasoning in language models, from theoretical analyses to rigorous empirical studies. 📆 Deadline: Sept 3, 2025

Patrick McKenzie (@patio11) 's Twitter Profile Photo

One reason not to spend overly much time lawyering the meaning of words to minimize LLMs' capabilities is that you should not want to redefine thinking such that many humans have never thought.

Shreya Shankar (@sh_reya) 's Twitter Profile Photo

On my way to VLDB! 🇬🇧 I am on the job market this year, seeking tenure-track CS faculty positions. I will be giving a talk on DocETL and on a panel titled “Where Does Academic Database Research Go From Here?” I would love to meet folks; please reach out if you’re also attending!

Behnam Neyshabur (@bneyshabur) 's Twitter Profile Photo

OK, Sara Wiltberger and I are experimenting with a small, project-based mentorship program designed for the age of AI. We’re looking for resourceful self-starters—from early high school to early-career professionals—who want to prove their abilities through hard work. You don’t

Charles 🎉 Frye (@charles_irl) 's Twitter Profile Photo

The ICLR 2026 deadline is ten days away. But you just found a bug in your evals, so now you need to re-run all your ablations. That's hundreds of experiments, and you need them done ASAP. Modal's got you. Introducing our ICLR 2026 compute grant program.
