Honglin Chen (@honglin_c) Twitter Tweets • TwiCopy

Shuchao Bi

@shuchaobi

6 months ago

Great work from the team. Please give it a try and let us what you think.

thumb_up_off_alt197

chat_bubble_outline15

repeat7

shareShare

Schmidt

@andrewschmidtfc

6 months ago

Terribly excited for y’all to get to try this one! It took a ton of clever, patient work — and I think the result is something special.

thumb_up_off_alt100

chat_bubble_outline18

repeat1

shareShare

A 🤫 ChatGPT launch late last week that people are starting to notice: * We updated our voice model for paid users to make it much more natural and easy to talk to * We made ChatGPT better at language translation. If you tell it to act as a translator between one language and

thumb_up_off_alt536

chat_bubble_outline41

repeat23

shareShare

Andrew Wilkinson

@awilkinson

6 months ago

The new version of ChatGPT Advanced Voice totally shocked me. I felt like I’d accidentally called a very smart woman — I actually felt a bit self conscious talking about personal stuff because the voice sounded so human.

thumb_up_off_alt434

chat_bubble_outline77

repeat13

shareShare

Chengxu Zhuang

@chengxuzhuang

6 months ago

Excited about this update from our team, esp Pingchuan Ma, Honglin Chen, and Damian Mrowca! Sounds more natural and human like, with better translation support. Enjoy the chats while we further improve the model, and let us know what could be even better!

thumb_up_off_alt70

chat_bubble_outline9

repeat5

shareShare

Aran Nayebi

@aran_nayebi

6 months ago

Love it! Most of that team at OpenAI came from Daniel Yamins' Stanford Stanford NeuroAI Lab -- it's like grad school all over again 🙂

thumb_up_off_alt21

chat_bubble_outline0

repeat1

shareShare

Boris Power

@borismpower

6 months ago

The rate of advancements is so high that most people don’t realize how far we advanced from Siri in voice interfaces. Give it a try!

thumb_up_off_alt254

chat_bubble_outline14

repeat9

shareShare

Elliott / Shangzhe Wu

@elliottszwu

6 months ago

Join us for the 4D Vision Workshop #CVPR2025 on June 11 starting at 9:20am! We'll have an incredible lineup of speakers discussing the frontier of 3D computer vision techniques for dynamic world modeling across spatial AI, robotics, astrophysics, and more. 4dvisionworkshop.github.io

Join us for the 4D Vision Workshop <a href="/CVPR/">#CVPR2025</a> on June 11 starting at 9:20am!

We'll have an incredible lineup of speakers discussing the frontier of 3D computer vision techniques for dynamic world modeling across spatial AI, robotics, astrophysics, and more.

4dvisionworkshop.github.io

thumb_up_off_alt98

chat_bubble_outline0

repeat20

shareShare

OpenAI

@openai

6 months ago

Haven’t tried the updated Advanced Voice that was recently launched to all paid users in ChatGPT? Then take a listen below. Prompt: Wish me an awkward happy birthday.

thumb_up_off_alt5,5K

chat_bubble_outline608

repeat417

shareShare

OpenAI

@openai

6 months ago

The updated Advanced Voice is great for translating conversations between people speaking different languages.

thumb_up_off_alt1,1K

chat_bubble_outline73

repeat145

shareShare

Shuchao Bi

@shuchaobi

6 months ago

Honglin Chen Pingchuan Ma and Damian made translation infinitely more reliable

thumb_up_off_alt14

chat_bubble_outline1

repeat2

shareShare

Peter Bakkum

@pbbakkum

6 months ago

I can’t overemphasize how good the new realtime speech2speech model is at function calling. It is fast and accurate with native audio input. It exceeded expectations from myself and posttraining researchers. This one — gpt-4o-realtime-preview-2025-06-03

thumb_up_off_alt338

chat_bubble_outline13

repeat17

shareShare

Hong-Xing "Koven" Yu

@koven_yu

6 months ago

#ICCV2025 🤩3D world generation is cool, but it is cooler to play with the worlds using 3D actions 👆💨, and see what happens! — Introducing *WonderPlay*: Now you can create dynamic 3D scenes that respond to your 3D actions from a single image! Web: kyleleey.github.io/WonderPlay/ 🧵1/7

thumb_up_off_alt175

chat_bubble_outline5

repeat37

shareShare

Seungwoo (Simon) Kim

@sekim1112

5 months ago

We prompt a generative video model to extract state-of-the-art optical flow, using zero labels and no fine-tuning. Our method, KL-tracing, achieves SOTA results on TAP-Vid & generalizes to challenging YouTube clips. Khai Loong Aw Klemen Kotar Cristóbal Eyzaguirre Ercilla Wanhee Lee

thumb_up_off_alt27

chat_bubble_outline1

repeat7

shareShare

Klemen Kotar

@klemenkotar

5 months ago

📷 New Preprint: SOTA optical flow extraction from pre-trained generative video models! While it seems intuitive that video models grasp optical flow, extracting that understanding has proven surprisingly elusive.

thumb_up_off_alt38

chat_bubble_outline1

repeat8

shareShare

Daniel Yamins

@dyamins

5 months ago

Over the past 18 months my lab has been developing a new approach to visual world modeling. There will be a magnum opus that ties it all together out in the next couple of weeks. But for now there are some individual application papers that have poked out.

thumb_up_off_alt71

chat_bubble_outline1

repeat13

shareShare

OpenAI

@openai

5 months ago

ChatGPT can now do work for you using its own computer. Introducing ChatGPT agent—a unified agentic system combining Operator’s action-taking remote browser, deep research’s web synthesis, and ChatGPT’s conversational strengths.

thumb_up_off_alt13,13K

chat_bubble_outline650

repeat2,2K

shareShare

Rahul Venkatesh

@rahul_venkatesh

5 months ago

AI models segment scenes based on how things appear, but babies segment based on what moves together. We utilize a visual world model that our lab has been developing, to capture this concept — and what's cool is that it beats SOTA models on zero-shot segmentation and physical

thumb_up_off_alt52

chat_bubble_outline6

repeat13

shareShare

Martin Schrimpf @ICLR2025

@martin_schrimpf

4 months ago

Great work by Yingtian Tang with Abdulkadir Gokce, Khaled Jedoui, and Daniel Yamins (and me). Check out the full thread for more details x.com/yingtian80536/… and of course the paper biorxiv.org/content/10.110… #NeuroAI #Vision #Neuroscience #AI

thumb_up_off_alt8

chat_bubble_outline1

repeat2

shareShare

Honglin Chen

Shuchao Bi

Schmidt

Kevin Weil 🇺🇸

Andrew Wilkinson

Chengxu Zhuang

Aran Nayebi

Boris Power

Elliott / Shangzhe Wu

OpenAI

OpenAI

Shuchao Bi

Peter Bakkum

Hong-Xing "Koven" Yu

Seungwoo (Simon) Kim

Klemen Kotar

Daniel Yamins

OpenAI

Rahul Venkatesh

Martin Schrimpf @ICLR2025