Serkan Cabi (@serkancabi) 's Twitter Profile
Serkan Cabi

@serkancabi

ID: 76233998

Joined: 22-09-2009 03:03:56

2.2K Tweets

845 Followers

397 Following

Google DeepMind (@googledeepmind) 's Twitter Profile Photo

Introducing Veo: our most capable generative video model. 🎥 It can create high-quality, 1080p clips that can go beyond 60 seconds. From photorealism to surrealism and animation, it can tackle a range of cinematic styles. 🧵 #GoogleIO

Demis Hassabis (@demishassabis) 's Twitter Profile Photo

For a long time, we’ve been working towards a universal AI agent that can be truly helpful in everyday life. Today at #GoogleIO we showed off our latest progress towards this: Project Astra. Here’s a video of our prototype, captured in real time.

Google DeepMind (@googledeepmind) 's Twitter Profile Photo

Introducing 2️⃣ new AI systems for robotics: 🤖 ALOHA Unleashed to perform two-armed manipulation tasks 🦾 DemoStart to control a multi-fingered robotic hand They learned to tackle a range of actions requiring dexterity. Here's how. 🧵 dpmd.ai/3MI2mkT

Serkan Cabi (@serkancabi) 's Twitter Profile Photo

LLMs can now touch the world. Finally, we are sharing what we have been working on for a while. There is so much to explore about what Gemini Robotics can do. Check out the videos and the tech report. deepmind.google/discover/blog/…

Convergent Research (@convergent_fros) 's Twitter Profile Photo

we made a map! gap-map.org is a tool we built to help you explore the landscape of R&D gaps holding back science - and the bridge-scale fundamental development efforts that might allow humanity to solve them, across almost two dozen fields

Oriol Vinyals (@oriolvinyalsml) 's Twitter Profile Photo

Hello Gemini 2.5 Flash-Lite! So fast, it codes *each screen* on the fly (Neural OS concept 👇). The frontier isn't always about large models and beating benchmarks. In this case, a super fast & good model can unlock drastic use cases. Read more: blog.google/products/gemin…

Google DeepMind (@googledeepmind) 's Twitter Profile Photo

We’re bringing powerful AI directly onto robots with Gemini Robotics On-Device. 🤖 It’s our first vision-language-action model to help make robots faster, highly efficient, and adaptable to new tasks and environments - without needing a constant internet connection. 🧵

Dhruv Shah (@shahdhruv_) 's Twitter Profile Photo

Yesterday, we live demoed a “generalist” VLA for (I think) the first time ever to a broad audience at Robotics: Science and Systems. Bring any object. Ask anything. New environment, new instructions, no fine-tuning. Just impeccable vibes! ✨

Serkan Cabi (@serkancabi) 's Twitter Profile Photo

We helped make gravitational wave detection more sensitive. It was such a pleasure to be part of this project where physics meets machine learning. Google DeepMind is a wonderful and rare place where projects like this become possible.

Serkan Cabi (@serkancabi) 's Twitter Profile Photo

We just released our new generation of Gemini Robotics. It can control multiple robots, plan and execute more steps and even transfer skills from one robot to another.

Konstantinos Bousmalis (@bousmalis) 's Twitter Profile Photo

One of the coolest features in our newest release for Gemini Robotics 1.5 was using the wrist camera of one of the arms of our bi-arm Franka robot for active vision! Without this capability the model would not have enough visibility to solve some of our most dexterous tasks.

Ksenia Konyushkova (@ks_konyushkova) 's Twitter Profile Photo

Meet Gemini Robotics 1.5! Our ER model brings embodied reasoning to the real world. It understands high-level goals like "pack ingredients for mushroom risotto" with planning & success detection. Also, check out the cool "active vision" behavior – observing the action up close!🔍

Ksenia Konyushkova (@ks_konyushkova) 's Twitter Profile Photo

Interacting with robots just got easier with the agentic capabilities of Gemini Robotics 1.5. Talk to the robot or show it things! See how the ER model reads a handwritten list on paper and packs the tools for the job.

Konstantinos Bousmalis (@bousmalis) 's Twitter Profile Photo

You have to watch this! For years now, I've been looking for signs of nontrivial zero-shot transfer across seen embodiments. When I saw the Alohas unhang tools from a wall used only on our Frankas I knew we had it! Gemini Robotics 1.5 is the first VLA to achieve such transfer!!

Norman Di Palo (@normandipalo) 's Twitter Profile Photo

This is a remarkable result, let me explain. These tasks were only shown on a single robot embodiment. With the advancements of Gemini Robotics 1.5, *robots can learn from data from other robots* You taught a humanoid to pack gifts? Now a bi-arm Franka knows how to do it too

Konstantinos Bousmalis (@bousmalis) 's Twitter Profile Photo

Another agentic example and one of the demos we showed at the Conference on Robot Learning! Loved demonstrating the capabilities of Gemini Robotics 1.5 all week, and it was fun to see how excited people were to interact with it in their own language with their own writing, drawings, and objects!

Caden Lu (@jyluxx) 's Twitter Profile Photo

Interacting with Gemini Robotics 1.5 is so fun! Our Embodied Reasoning model planned the multi-step task and orchestrated our Vision Language Action model for precise execution!

Google DeepMind (@googledeepmind) 's Twitter Profile Photo

We’re scaling up robotics in Europe. 🤖 Our Robotics Accelerator is tailored for startups and designed to bridge the gap between technology and business, powering the next generation of physical agents.