Jason Y. Zhang (@jasonyzhang2) Twitter Tweets • TwiCopy

Keenan Crane

8 months ago

Here's a nice "proof without words": The sum of the squares of several positive values can never be bigger than the square of their sum. This picture helps make sense of how ℓ₁ and ℓ₂ norms sparsify and regularize solutions (resp.). [1/n]
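
The inequality in the tweet above can be checked numerically. What follows is a minimal sketch, not anything from the original thread: the function names (`l1_norm`, `l2_norm`, `cross_terms`) and the sample values are made up for illustration. It relies on the expansion (x₁ + ... + xₙ)² = Σ xᵢ² + 2 Σ_{i<j} xᵢxⱼ, where the cross terms are nonnegative whenever every xᵢ is nonnegative.

```python
import math

def l1_norm(xs):
    """Sum of absolute values (hypothetical helper for this sketch)."""
    return sum(abs(x) for x in xs)

def l2_norm(xs):
    """Square root of the sum of squares (hypothetical helper for this sketch)."""
    return math.sqrt(sum(x * x for x in xs))

def cross_terms(xs):
    """2 * sum of x_i * x_j over all pairs i < j: the gap between
    the square of the sum and the sum of the squares."""
    n = len(xs)
    return 2 * sum(xs[i] * xs[j] for i in range(n) for j in range(i + 1, n))

# Illustrative nonnegative values (an assumption, not data from the tweet).
values = [3.0, 4.0]
sum_of_squares = sum(x * x for x in values)   # 9 + 16
square_of_sum = sum(values) ** 2              # (3 + 4)^2

# The gap is exactly the cross terms, which cannot be negative here.
gap = square_of_sum - sum_of_squares

# A vector with a single nonzero entry has no cross terms,
# so its l1 and l2 norms coincide.
sparse = [5.0, 0.0]
```

Taking square roots of both sides gives ‖x‖₂ ≤ ‖x‖₁, with equality only when at most one entry is nonzero. This is one common way to read the picture: for a fixed ℓ₂ norm, the ℓ₁ norm is smallest on sparse vectors, which is why an ℓ₁ penalty tends to favor sparse solutions.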

Google DeepMind

@googledeepmind

7 months ago

Video, meet audio. 🎥🤝🔊 With Veo 3, our new state-of-the-art generative video model, you can add soundtracks to clips you make. Create talking characters, include sound effects, and more while developing videos in a range of cinematic styles. 🧵

Oliver Wang

@oliver_wang2

7 months ago

Veo3 is out! deepmind.google/models/veo/ This model is awesome! It now generates audio as well as video. I'm really impressed by the background audio and music, and the synchronization of sound effects to the video. Try it out using Flow! labs.google/flow/about

Philipp Henzler

@philipphenzler

7 months ago

Reference-powered Veo lets you go for walks in the Himalayas with your dog!

Ricardo Martin-Brualla

@rmbrualla

7 months ago

image ⇒ video ⇒ 3D/4D I'm super excited to build the next generation of models that understand and can imagine the world like we do at SpAItial with amazing people. Sounds fun? We are hiring! spaitial.ai

Stan Szymanowicz

@stanszymanowicz

6 months ago

Bolt3D is accepted to #ICCV2025 🥳 see you in Hawaii!

Ruilong Li

@ruilong_li

5 months ago

For everyone interested in precise 📷camera control 📷 in transformers [e.g., video / world model etc] Stop settling for Plücker raymaps -- use camera-aware relative PE in your attention layers, like RoPE (for LLMs) but for cameras! Paper & code: liruilong.cn/prope/

Skild AI

@skildai

4 months ago

Modern AI is confined to the digital world. At Skild AI, we are building towards AGI for the real world, unconstrained by robot type or task — a single, omni-bodied brain. Today, we are sharing our journey, starting with early milestones, with more to come in the weeks ahead.

Jason Y. Zhang

@jasonyzhang2

4 months ago

Last year, my ring bearer was a Skild robot. Excited to see how far they've come!!

Alireza Fathi

@alirezafathi

4 months ago

Our team at Google DeepMind Foundational Research has an opening for a full-time Research Scientist! Areas of Interest are Multimodal, 3D and Spatial Reasoning, Self-improving Agents. Looking for candidates with strong publications at top ML and CV conferences. Email:

Shlomi Fruchter

@shlomifruchter

4 months ago

Excited to introduce Genie 3, our general purpose world model that creates interactive, playable environments from any text prompt. It can generate dynamic worlds at 720p and 24 FPS, with each frame created in response to user actions in *real-time*.

Jon Barron

@jon_barron

4 months ago

The generative 3D/video corner of Google DeepMind that I run in is now hiring research scientists. If you're on the market for full-time roles in that space, email us! [email protected]
