Imtiaz Humayun (@imtiazprio) Twitter Tweets • TwiCopy

Jaron Mink

a year ago

The Happy Lab is hiring multiple PhD students for Fall 2025! RT appreciated :-) We discover how human factors can 1) mitigate ML-enabled abuse (deepfakes) and 2) be harnessed with ML-powered security to create safe, human-centric systems! Find us at: happyresearchlab.com :)

Mihaela van der Schaar

@mihaelavds

a year ago

Interested in a #PhD that lets you transform what is possible with #AI & #MachineLearning? That gives you the chance to publish at the biggest conferences and offers unrivalled career prospects? Let’s talk! Join our Open Day and meet me and my students: vanderschaar-lab.com/join-the-van-d…

Micah Goldblum

@micahgoldblum

a year ago

📢I’ll be admitting multiple PhD students this winter to Columbia University 🏙️ in the most exciting city in the world! If you are interested in dissecting modern deep learning systems to probe how they work, advancing AI safety, or automating data science, apply to my group.

Jay Cummings

@longformmath

a year ago

Math 🤝 Halloween

Imtiaz Humayun

@imtiazprio

a year ago

We have already achieved self-improvement in diffusion models, via their own synthetic data! Definitely not that far away!

Pablo Samuel Castro

@pcastr

a year ago

We've been rehearsing for over two months and we're ready to rock opening night tomorrow. If you're in Ottawa, don't miss out! You'll get to see *two* Castros for the price of one! 😅

Negar Rostamzadeh

@negar_rz

a year ago

Excited to be back to WiML at #NeurIPS2024 🥳

Shayne Longpre

@shayneredford

a year ago

Interested in how LLMs are really used? We are starting a research project to find out! In collaboration w/ Sara Hooker, Anka Reuel | @ankareuel.bsky.social, Ahmet Üstün, Niloofar (✈️ ICML), and others. We are looking for two junior researchers to join us. Apply by Dec 15th! forms.gle/H2o3cNCPdG8eDk…

Yuke Wang

@yukewang1

a year ago

I will be hiring 2-3 PhD students at Rice CS in Fall 2025 to work on systems for deep learning. Feel free to reach out and apply by Jan. 1, 2025. No application fee! csweb.rice.edu/academics/grad…

jack morris

@jxmnop

a year ago

no AI here, just the coolest paper i've seen in a while

François Chollet

@fchollet

a year ago

I kinda miss when "AI Twitter" was folks doing AI research or app development, posting under their own identities. Slightly more intellectual depth than anime avatars whose main AI-related qualification is paying for a ChatGPT subscription.

Michael Eisen

@mbeisen

a year ago

Here's the thing. We have **NO IDEA** how to pick good graduate students. I served on admission committees for 10+ years, and chaired a few, and what I learned is that all the spreadsheets of grades and test scores and recommendations and essays and publications and interview

Behnam Neyshabur

@bneyshabur

10 months ago

In 2021, when I decided to let go of deep learning theory and focus on improving math and reasoning in LLMs, I was inspired by the idea that one day LLMs could develop a solid theory explaining why deep learning works—and explain it to me in a way I can understand. Not too far

Imtiaz Humayun

@imtiazprio

10 months ago

Check out my buddy Randall Balestriero discussing our work and more, including why thinking of NNs as splines can provide strong intuitions about how they learn, generalize, and grok! Also features a small cameo from yours truly!!

Imtiaz Humayun

@imtiazprio

10 months ago

Indeed, it is that simple! The wiggliness induced by each layer allows NNs to approximate non-linear functions. More layers -> more possible wiggle -> more non-linearity. A nice way of thinking about this is imagining NNs doing origami on an elastic piece of paper!
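
The "more layers -> more possible wiggle" intuition can be illustrated with a small sketch. It is not taken from the tweet or the authors' work: it assumes a hand-built toy network in which every layer is the same two-unit ReLU "tent" map on [0, 1], and the names `tent_layer`, `network`, and `count_linear_pieces` are hypothetical. Each extra layer folds the interval once more, so the number of linear pieces doubles.

```python
def relu(z):
    """Rectified linear unit on a scalar."""
    return max(z, 0.0)


def tent_layer(x):
    # One hidden layer with two ReLU units and a linear readout.
    # On [0, 1] it equals 2x for x <= 0.5 and 2 - 2x for x >= 0.5,
    # i.e. it folds the interval in half (one "origami" fold).
    return 2.0 * relu(x) - 4.0 * relu(x - 0.5)


def network(x, depth):
    # Stack the same layer `depth` times.
    for _ in range(depth):
        x = tent_layer(x)
    return x


def count_linear_pieces(depth, grid=1024):
    # Evaluate on a dyadic grid so every breakpoint lands on a grid point
    # (valid while 2**depth <= grid), then count how often the slope changes.
    xs = [i / grid for i in range(grid + 1)]
    ys = [network(x, depth) for x in xs]
    slopes = [(ys[i + 1] - ys[i]) * grid for i in range(grid)]
    changes = sum(1 for a, b in zip(slopes, slopes[1:]) if a != b)
    return changes + 1


if __name__ == "__main__":
    for depth in (1, 2, 3, 4):
        print(depth, count_linear_pieces(depth))
```

With this particular construction, depths 1 through 4 give 2, 4, 8, and 16 linear pieces. Trained networks will not follow this exact pattern; the toy only shows how composing piecewise-linear layers can multiply the number of bends.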

Andrew Gordon Wilson

@andrewgwils

10 months ago

My new paper "Deep Learning is Not So Mysterious or Different": arxiv.org/abs/2503.02113. Generalization behaviours in deep learning can be intuitively understood through a notion of soft inductive biases, and formally characterized with countable hypothesis bounds! 1/12

Tian Jin @ ICLR

@tjingrant

8 months ago

📣 The Journey Matters: Our #ICLR2025 paper shows how to pretrain sparse LLMs with half the size of dense LLMs while maintaining quality. We found that the average parameter count during sparse pre-training predicts quality, not final size. An MIT/Rice/Google/ISTA collab 🧵 1/N

Google

@google

7 months ago

Say goodbye to the silent era of video generation: Introducing Veo 3 — with native audio generation. 🗣️ Quality is up from Veo 2, and now you can add dialogue between characters, sound effects and background noise. Veo 3 is available now in the Google Gemini App for Google AI Ultra

Thomas Walker

@thomas_m_walker

6 months ago

GrokAlign: A method to accelerate grokking motivated by a geometric characterisation of the phenomenon. - The grokked state of a deep network is Jacobian aligned. - Centroids alignment provides an efficient and intuitive metric to monitor deep network training dynamics.

Agrim Gupta

@agrimgupta92

4 months ago

Introducing Genie 3, our state-of-the-art world model that generates interactive worlds from text, enabling real-time interaction at 24 fps with minutes-long consistency at 720p. 🧵👇
