Imtiaz Humayun (@imtiazprio) 's Twitter Profile
Imtiaz Humayun

@imtiazprio

PhD Student @RiceUniversity with @rbaraniuk. Prev. @GoogleAI. Co-founded @bengali_ai. Like thinking about DL theory, interpretability and generative models 🇧🇩

ID: 1033593215130120192

Link: http://imtiazhumayun.github.io

Joined: 26-08-2018 05:53:20

210 Tweets

482 Followers

351 Following

Jaron Mink (@jaronmink) 's Twitter Profile Photo

The Happy Lab is hiring multiple PhD students for Fall 2025! RT appreciated :-) We discover how human factors can 1) mitigate ML-enabled abuse (deepfakes) and 2) be harnessed with ML-powered security to create safe, human-centric systems! Find us at: happyresearchlab.com :)

Mihaela van der Schaar (@mihaelavds) 's Twitter Profile Photo

Interested in a #PhD that lets you transform what is possible with #AI & #MachineLearning? That gives you the chance to publish at the biggest conferences and offers unrivalled career prospects? Let’s talk! Join our Open Day and meet me and my students: vanderschaar-lab.com/join-the-van-d…

Micah Goldblum (@micahgoldblum) 's Twitter Profile Photo

📢I’ll be admitting multiple PhD students this winter to Columbia University 🏙️ in the most exciting city in the world! If you are interested in dissecting modern deep learning systems to probe how they work, advancing AI safety, or automating data science, apply to my group.

Pablo Samuel Castro (@pcastr) 's Twitter Profile Photo

We've been rehearsing for over two months and we're ready to rock opening night tomorrow. If you're in Ottawa, don't miss out! You'll get to see *two* Castros for the price of one! 😅

Shayne Longpre (@shayneredford) 's Twitter Profile Photo

Interested in how LLMs are really used? We are starting a research project to find out! In collaboration w/ Sara Hooker Anka Reuel | @ankareuel.bsky.social Ahmet Üstün Niloofar (✈️ ICML) and others. We are looking for two junior researchers to join us. Apply by Dec 15th! forms.gle/H2o3cNCPdG8eDk…

Yuke Wang (@yukewang1) 's Twitter Profile Photo

I will be hiring 2~3 PhD students at Rice CS in the Fall of 2025 in systems for deep learning. Feel free to reach out and apply by Jan. 1, 2025. No application fee! csweb.rice.edu/academics/grad…

François Chollet (@fchollet) 's Twitter Profile Photo

I kinda miss when "AI Twitter" was folks doing AI research or app development, posting under their own identities. Slightly more intellectual depth than anime avatars whose main AI-related qualification is paying for a ChatGPT subscription.

Michael Eisen (@mbeisen) 's Twitter Profile Photo

Here's the thing. We have **NO IDEA** how to pick good graduate students. I served on admission committees for 10+ years, and chaired a few, and what I learned is that all the spreadsheets of grades and test scores and recommendations and essays and publications and interview

Behnam Neyshabur (@bneyshabur) 's Twitter Profile Photo

In 2021, when I decided to let go of deep learning theory and focus on improving math and reasoning in LLMs, I was inspired by the idea that one day LLMs could develop a solid theory explaining why deep learning works—and explain it to me in a way I can understand. Not too far

Imtiaz Humayun (@imtiazprio) 's Twitter Profile Photo

Check out buddy Randall Balestriero discussing our work and more, why thinking of NNs as splines can provide strong intuitions of how they learn, generalize and grok! Also features a small cameo from yours truly!!

Imtiaz Humayun (@imtiazprio) 's Twitter Profile Photo

Indeed, it is that simple! The wiggliness induced by each layer allows NNs to approximate non-linear functions. More layers -> more possible wiggle -> more non-linearity. A nice way of thinking about this is imagining NNs doing origami on an elastic piece of paper!

Andrew Gordon Wilson (@andrewgwils) 's Twitter Profile Photo

My new paper "Deep Learning is Not So Mysterious or Different": arxiv.org/abs/2503.02113. Generalization behaviours in deep learning can be intuitively understood through a notion of soft inductive biases, and formally characterized with countable hypothesis bounds! 1/12

Tian Jin @ ICLR (@tjingrant) 's Twitter Profile Photo

📣 The Journey Matters: Our #ICLR2025 paper shows how to pretrain sparse LLMs with half the size of dense LLMs while maintaining quality. We found that the average parameter count during sparse pre-training predicts quality, not final size. An MIT/Rice/Google/ISTA collab 🧵 1/N

Google (@google) 's Twitter Profile Photo

Say goodbye to the silent era of video generation: Introducing Veo 3 — with native audio generation. 🗣️ Quality is up from Veo 2, and now you can add dialogue between characters, sound effects and background noise. Veo 3 is available now in the Google Gemini App for Google AI Ultra

Thomas Walker (@thomas_m_walker) 's Twitter Profile Photo

GrokAlign: A method to accelerate grokking motivated by a geometric characterisation of the phenomenon. - The grokked state of a deep network is Jacobian aligned. - Centroids alignment provides an efficient and intuitive metric to monitor deep network training dynamics.

Agrim Gupta (@agrimgupta92) 's Twitter Profile Photo

Introducing Genie 3, our state-of-the-art world model that generates interactive worlds from text, enabling real-time interaction at 24 fps with minutes-long consistency at 720p. 🧵👇