Kanchana Ranasinghe (@kahnchana) 's Twitter Profile
Kanchana Ranasinghe

@kahnchana

PhD student | Working on ML & CV | Former Intern at Apple MLR, Meta AI Research, Google Research | Dancer in free time 🤖 🕺🏻

ID: 953677148

linkhttp://kahnchana.github.io calendar_today17-11-2012 14:25:55

393 Tweet

242 Takipçi

357 Takip Edilen

Yann LeCun (@ylecun) 's Twitter Profile Photo

Erik Brynjolfsson Too often, we think a task is easy because some animal can do it. But the reality is that the task is fiendishly complex and the animal is much smarter than we think. Conversely, we think tasks like playing chess, calculating an integral, or producing grammatically correct text

Richard Ngo (@richardmcngo) 's Twitter Profile Photo

Magnus Carlsen claims that one or two signals per match from a chess AI indicating when he should think hardest would make him “almost invincible”. IMO you could boost researchers similarly with just one or two signals a year saying “think hard about the paper you just read”.

Daniel Geng (@dangengdg) 's Twitter Profile Photo

Hello! If you like pretty images and videos and want a rec for CVPR oral session, you should def go to Image/Video Gen, Friday at 9am: I'll be presenting "Motion Prompting" Ryan Burgert will be presenting "Go with the Flow" and Pascal CHANG will be presenting "LookingGlass"

Federico Baldassarre (@baldassarrefe) 's Twitter Profile Photo

DINOv2 meets text at #CVPR 2025! Why choose between high-quality DINO features and CLIP-style vision-language alignment? Pick both with dino.txt 🦖📖 We align frozen DINOv2 features with text captions, obtaining both image-level and patch-level alignment at a minimal cost. [1/N]

DINOv2 meets text at #CVPR 2025! Why choose between high-quality DINO features and CLIP-style vision-language alignment? Pick both with dino.txt 🦖📖

We align frozen DINOv2 features with text captions, obtaining both image-level and patch-level alignment at a minimal cost. [1/N]
Xun Huang (@xunhuang1995) 's Twitter Profile Photo

NVIDIA wants to sell you NVL72 rack ($3M) so you can do real-time video generation 😅 Good thing: you don't need it. Self Forcing does the job with one 4090, and with better quality 😊 self-forcing.github.io

NVIDIA wants to sell you NVL72 rack ($3M) so you can do real-time video generation 😅

Good thing: you don't need it. Self Forcing does the job with one 4090, and with better quality 😊

self-forcing.github.io
nico (@nicochristie) 's Twitter Profile Photo

Introducing Shortcut — the first superhuman Excel agent. Shortcut one-shots most knowledge work tasks on Excel. It even scores >80% on Excel World Championship Cases in ~10 minutes. That's 10x faster than humans. Our early preview is live. Just comment for an invite code.

Lucas Beyer (bl16) (@giffmana) 's Twitter Profile Photo

Oh wow, did you guys know that torch.compile can compile numpy code? And even run it on GPU? This is pretty neat for all kinds of "surrounding" code besides the model (like evals and fancy metrics) that I used to do with numba/numexpr (cuz CPU-XLA was pretty meh). Poll below

Oh wow, did you guys know that torch.compile can compile numpy code? And even run it on GPU?

This is pretty neat for all kinds of "surrounding" code besides the model (like evals and fancy metrics) that I used to do with numba/numexpr (cuz CPU-XLA was pretty meh).

Poll below
Ahmad Beirami @ ICLR 2025 (@abeirami) 's Twitter Profile Photo

reciprocal reviewing is a terrible idea that unfortunately more conferences are adopting, as if we didn't already have enough problems with the review quality from people who are willing to review.

Micah Goldblum (@micahgoldblum) 's Twitter Profile Photo

🚨 Did you know that small-batch vanilla SGD without momentum (i.e. the first optimizer you learn about in intro ML) is virtually as fast as AdamW for LLM pretraining on a per-FLOP basis? 📜 1/n

🚨 Did you know that small-batch vanilla SGD without momentum (i.e. the first optimizer you learn about in intro ML) is virtually as fast as AdamW for LLM pretraining on a per-FLOP basis? 📜 1/n
Seohong Park (@seohong_park) 's Twitter Profile Photo

Just like tokenization is a necessary evil in LLMs (at least for now), time discretization is a necessary evil in robotics/RL. I think there must be a better way to handle continuous time than via naive discretization...

Sander Dieleman (@sedielem) 's Twitter Profile Photo

Transformers haven't changed much since 2017, but there have been some innovations over the years. This is an excellent summary of architectural differences in recent LLMs. Nice diagrams too! 👏 It would be great to see something like this for diffusion Transformers as well 🤔

Salesforce AI Research (@sfresearch) 's Twitter Profile Photo

🌟 Happy National Intern Day! Today we celebrate the brilliant minds and diverse perspectives that our interns bring to Salesforce AI Research. Our interns contribute to groundbreaking AI research from day one, bringing fresh ideas that drive innovation and solve complex problems for

🌟 Happy National Intern Day!

Today we celebrate the brilliant minds and diverse perspectives that our interns bring to <a href="/SFResearch/">Salesforce AI Research</a>.

Our interns contribute to groundbreaking AI research from day one, bringing fresh ideas that drive innovation and solve complex problems for
Agrim Gupta (@agrimgupta92) 's Twitter Profile Photo

Introducing Genie 3, our state-of-the-art world model that generates interactive worlds from text, enabling real-time interaction at 24 fps with minutes-long consistency at 720p. 🧵👇