Honglin Chen (@honglin_c) 's Twitter Profile
Honglin Chen

@honglin_c

Research @OpenAI. Previously CS PhD @Stanford @NeuroAILab @StanfordAILab.

ID: 1189312930665287680

calendar_today29-10-2019 22:48:01

67 Tweet

651 Takipçi

1,1K Takip Edilen

Schmidt (@andrewschmidtfc) 's Twitter Profile Photo

Terribly excited for y’all to get to try this one! It took a ton of clever, patient work — and I think the result is something special.

Kevin Weil 🇺🇸 (@kevinweil) 's Twitter Profile Photo

A 🤫 ChatGPT launch late last week that people are starting to notice: * We updated our voice model for paid users to make it much more natural and easy to talk to * We made ChatGPT better at language translation. If you tell it to act as a translator between one language and

Andrew Wilkinson (@awilkinson) 's Twitter Profile Photo

The new version of ChatGPT Advanced Voice totally shocked me. I felt like I’d accidentally called a very smart woman — I actually felt a bit self conscious talking about personal stuff because the voice sounded so human.

Chengxu Zhuang (@chengxuzhuang) 's Twitter Profile Photo

Excited about this update from our team, esp Pingchuan Ma, Honglin Chen, and Damian Mrowca! Sounds more natural and human like, with better translation support. Enjoy the chats while we further improve the model, and let us know what could be even better!

Boris Power (@borismpower) 's Twitter Profile Photo

The rate of advancements is so high that most people don’t realize how far we advanced from Siri in voice interfaces. Give it a try!

Elliott / Shangzhe Wu (@elliottszwu) 's Twitter Profile Photo

Join us for the 4D Vision Workshop #CVPR2025 on June 11 starting at 9:20am! We'll have an incredible lineup of speakers discussing the frontier of 3D computer vision techniques for dynamic world modeling across spatial AI, robotics, astrophysics, and more. 4dvisionworkshop.github.io

Join us for the 4D Vision Workshop <a href="/CVPR/">#CVPR2025</a> on June 11 starting at 9:20am!

We'll have an incredible lineup of speakers discussing the frontier of 3D computer vision techniques for dynamic world modeling across spatial AI, robotics, astrophysics, and more.

4dvisionworkshop.github.io
OpenAI (@openai) 's Twitter Profile Photo

Haven’t tried the updated Advanced Voice that was recently launched to all paid users in ChatGPT? Then take a listen below. Prompt: Wish me an awkward happy birthday.

Peter Bakkum (@pbbakkum) 's Twitter Profile Photo

I can’t overemphasize how good the new realtime speech2speech model is at function calling. It is fast and accurate with native audio input. It exceeded expectations from myself and posttraining researchers. This one — gpt-4o-realtime-preview-2025-06-03

Hong-Xing "Koven" Yu (@koven_yu) 's Twitter Profile Photo

#ICCV2025 🤩3D world generation is cool, but it is cooler to play with the worlds using 3D actions 👆💨, and see what happens! — Introducing *WonderPlay*: Now you can create dynamic 3D scenes that respond to your 3D actions from a single image! Web: kyleleey.github.io/WonderPlay/ 🧵1/7

Seungwoo (Simon) Kim (@sekim1112) 's Twitter Profile Photo

We prompt a generative video model to extract state-of-the-art optical flow, using zero labels and no fine-tuning. Our method, KL-tracing, achieves SOTA results on TAP-Vid & generalizes to challenging YouTube clips. Khai Loong Aw Klemen Kotar Cristóbal Eyzaguirre Ercilla Wanhee Lee

Klemen Kotar (@klemenkotar) 's Twitter Profile Photo

📷 New Preprint: SOTA optical flow extraction from pre-trained generative video models! While it seems intuitive that video models grasp optical flow, extracting that understanding has proven surprisingly elusive.

Daniel Yamins (@dyamins) 's Twitter Profile Photo

Over the past 18 months my lab has been developing a new approach to visual world modeling. There will be a magnum opus that ties it all together out in the next couple of weeks. But for now there are some individual application papers that have poked out.

OpenAI (@openai) 's Twitter Profile Photo

ChatGPT can now do work for you using its own computer. Introducing ChatGPT agent—a unified agentic system combining Operator’s action-taking remote browser, deep research’s web synthesis, and ChatGPT’s conversational strengths.

Rahul Venkatesh (@rahul_venkatesh) 's Twitter Profile Photo

AI models segment scenes based on how things appear, but babies segment based on what moves together. We utilize a visual world model that our lab has been developing, to capture this concept — and what's cool is that it beats SOTA models on zero-shot segmentation and physical

Martin Schrimpf @ICLR2025 (@martin_schrimpf) 's Twitter Profile Photo

Great work by Yingtian Tang with Abdulkadir Gokce, Khaled Jedoui, and Daniel Yamins (and me). Check out the full thread for more details x.com/yingtian80536/… and of course the paper biorxiv.org/content/10.110… #NeuroAI #Vision #Neuroscience #AI