Shrikar (@shrikarhaha) 's Twitter Profile
Shrikar

@shrikarhaha

Currently: Research @iiscbangalore
Building: Explainable AI systems in Healthcare
Interests: Startups, TechForGood, Product
Loves: Physics, Poems, Football & Food

ID: 1435618253313806341

Joined: 08-09-2021 14:57:26

508 Tweets

184 Followers

2.2K Following

Rishubh Parihar (@rishubhparihar) 's Twitter Profile Photo

“Make it red.” “No! More red!” “Ughh… slightly less red.” “Perfect!” ♥️ 🎚️Kontinuous Kontext adds slider-based control over edit strength to instruction-based image editing, enabling smooth, continuous transformations!
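The slider idea above can be illustrated with a toy sketch. This is not the actual Kontinuous Kontext mechanism (which presumably learns a strength-conditioned edit); plain latent interpolation is just the simplest way to see what a continuous edit-strength control means, and `edit_with_strength` is a hypothetical helper:

```python
import numpy as np

def edit_with_strength(original_latent, edited_latent, strength):
    """Toy continuous-strength edit: linearly interpolate between the
    original and fully-edited latents. `strength` in [0, 1] acts as the
    slider; 0 returns the original, 1 the full edit."""
    s = float(np.clip(strength, 0.0, 1.0))
    return (1.0 - s) * original_latent + s * edited_latent

# Sweeping the slider produces a smooth transformation path.
orig = np.zeros(4)
edited = np.ones(4)
path = [edit_with_strength(orig, edited, s) for s in (0.0, 0.5, 1.0)]
```

A learned approach would instead condition the editing model on the scalar `s`, but the external interface (one knob, smooth output change) is the same.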

Willis (Nanye) Ma (@ma_nanye) 's Twitter Profile Photo

Excited to introduce DiffuseNNX, a comprehensive JAX/Flax NNX-based library for diffusion and flow matching! It supports multiple diffusion / flow-matching frameworks, Autoencoders, DiT variants, and sampling algorithms. Repo: github.com/willisma/diffu… Delve into details below!
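The linear-path flow-matching objective that such a library implements can be written down in a few lines. This sketch uses plain NumPy rather than DiffuseNNX's actual API (not shown in the tweet), and the zero "model" is a stand-in for a learned velocity network:

```python
import numpy as np

rng = np.random.default_rng(0)

def flow_matching_pair(x0, x1, t):
    """Linear-path flow matching: the interpolant x_t = (1-t)*x0 + t*x1
    and its regression target, the constant velocity v = x1 - x0."""
    x_t = (1.0 - t) * x0 + t * x1
    v_target = x1 - x0
    return x_t, v_target

# One training "step": sample noise x0, data x1, and a random time t,
# then measure how far a velocity model is from the target. The zero
# prediction here stands in for a learned v_theta(x_t, t).
x0 = rng.standard_normal(8)   # noise sample
x1 = rng.standard_normal(8)   # data sample
t = rng.uniform()
x_t, v = flow_matching_pair(x0, x1, t)
loss = np.mean((np.zeros_like(v) - v) ** 2)
```

Diffusion frameworks differ mainly in the choice of interpolant path and target; the library reportedly supports several of these variants.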

Saining Xie (@sainingxie) 's Twitter Profile Photo

I used to think that semantic encoders primarily captured high-level, abstract representations and discarded fine-grained visual details, but I was wrong. we employ pretrained representation encoders (such as DINO, SigLIP, and MAE, all based on standardized ViTs) combined with

Saining Xie (@sainingxie) 's Twitter Profile Photo

as always, we’re releasing everything: the paper, the model, and the PyTorch code. this project has been led by three amazing students: Boyang Zheng (1st year phd), Willis (Nanye) Ma (2nd year phd), and Peter Tong (3rd year phd). we’ve been working on this for

Rishubh Parihar (@rishubhparihar) 's Twitter Profile Photo

✨ I’ll be presenting our work on depth-aware image editing at #ICCV2025 in Hawaii 🌴 next week! 📅 Oct 22 | 📍 Exhibit Hall I | 🧩 Poster #82 🌍 Project: rishubhpar.github.io/DAEdit/ 🤝 Working on image generation or editing? I’d love to chat at ICCV! Vision and AI Lab, IISc

Aniket Didolkar (@aniket_d98) 's Twitter Profile Photo

Can we scale thinking without scaling context budget!? In our latest work, we propose a test-time scaling strategy which combines parallel drafts in a sequential self-improvement loop to further boost reasoning capabilities of frontier LLMs such as O3 and Gemini-2.5-flash

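The strategy described above can be sketched generically. `generate` and `score` below are hypothetical stand-ins for an LLM call and a verifier, not the paper's actual interfaces; the point is that each round carries forward only the best draft rather than the full context:

```python
def scale_thinking(prompt, generate, score, n_drafts=4, n_rounds=3):
    """Sketch of a parallel-drafts + sequential-refinement loop.
    Each round produces several independent drafts in parallel, keeps
    only the strongest one, and seeds the next round with it, so the
    context budget stays roughly constant as thinking scales."""
    best = None
    for _ in range(n_rounds):
        # Parallel stage: independent drafts, conditioned on the
        # current best answer if one exists.
        seed = prompt if best is None else f"{prompt}\nImprove on: {best}"
        drafts = [generate(seed) for _ in range(n_drafts)]
        # Sequential stage: select the strongest draft for the next round.
        best = max(drafts, key=score)
    return best

# Toy usage with deterministic stand-ins (longer answer = better score).
answers = iter(["a", "bb", "ccc", "d",
                "ee", "fff", "g", "hh",
                "iii", "j", "kk", "llll"])
result = scale_thinking("q", generate=lambda p: next(answers), score=len)
```

With 4 drafts per round and 3 rounds, this spends 12 generations but never feeds more than one prior answer back into the prompt.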
Songyou Peng (@songyoupeng) 's Twitter Profile Photo

Our bigger group at Google DeepMind is hiring interns for next summer! If you are interested in working with us, apply through the link below and also email us. step 1: google.com/about/careers/… (US-based) step 2: send an email to [email protected]

sarah guo // conviction (@saranormous) 's Twitter Profile Photo

ok, I know I’m the biggest stan — but amongst many lessons I have learned from Pat Grady is that cynicism/selfishness is for losers, and openheartedness is for winners. believing in the talented people around you, assuming good intent, playing for your team - it is often enough!

Theoretically Media (@theomediaai) 's Twitter Profile Photo

Veo 3.1 now has a Camera Adjustment feature, allowing you to change the angle and movement of a previously generated video. Taking it out for a test spin, here's our "Test" video; in the thread we'll check out how the feature does!

Mahan Fathi (@mahanfathi) 's Twitter Profile Photo

We're looking for Summer Interns to join the Post-Training Team at @NVIDIA! DM me with your updated resume and three concise bullets detailing your most relevant experience — e.g. publications, repos, blogs, etc. RT please to help us find top talent.

joao carreira (@joaocarreira) 's Twitter Profile Photo

I'm looking for a student researcher to work with me at Google DeepMind in London, preferably starting early next year -- topics will be around novel video model architectures / learning from a single video stream / representation learning.

Saining Xie (@sainingxie) 's Twitter Profile Photo

Introducing Cambrian-S: it’s a position, a dataset, a benchmark, and a model, but above all, it represents our first steps toward exploring spatial supersensing in video. 🧶

Ellis Brown (@_ellisbrown) 's Twitter Profile Photo

MLLMs are great at understanding videos, but struggle with spatial reasoning—like estimating distances or tracking objects across time. the bottleneck? getting precise 3D spatial annotations on real videos is expensive and error-prone. introducing SIMS-V 🤖 [1/n]

Shrikar (@shrikarhaha) 's Twitter Profile Photo

Wow! Sometimes you're lucky to wake up and see something fundamentally different to address the challenge of modeling how humans can stream continuous frames of visual information and store infinite context! Exciting times researching video and spatial supersensing! ✨🫡

Jehan Godrej (@jehangodrej) 's Twitter Profile Photo

I wish that everyone gets to experience running the NYC Marathon once in their lives, every human deserves to experience something like it, and it made all the training, discipline, and suffering along the way this year worth it.

Jiageng Mao (@pointscoder) 's Twitter Profile Photo

🎥 Video Generation Enables Zero-Shot Robotic Manipulation 🤖 Introducing PhysWorld, a framework that bridges video generation and robot learning through (generated) real-to-sim world modeling. 🌐 Project: pointscoder.github.io/PhysWorld_Web/ 📄 Paper: arxiv.org/abs/2511.07416 💻 Code: