Tsung-Yi Lin (@tsungyilincv) 's Twitter Profile
Tsung-Yi Lin

@tsungyilincv

Principal Research Scientist @Nvidia | Ex-@Google Brain Team | Computer Vision & Machine Learning

ID: 1058105899607252992

calendar_today01-11-2018 21:17:59

112 Tweet

2,2K Takipçi

345 Takip Edilen

Tsung-Yi Lin (@tsungyilincv) 's Twitter Profile Photo

Future frames light the path to smarter actions! 🚀🤖 CoT-VLA leverages visual chain-of-thought reasoning to unlock large-scale video data and guide goal-driven robotics. #CVPR2025 #AI #Robotics

Hermann (@kumbonghermann) 's Twitter Profile Photo

Excited to be presenting our new work–HMAR: Efficient Hierarchical Masked Auto-Regressive Image Generation– at #CVPR2025 this week. VAR (Visual Autoregressive Modelling) introduced a very nice way to formulate autoregressive image generation as a next-scale prediction task (from

Excited to be presenting our new work–HMAR: Efficient Hierarchical Masked Auto-Regressive Image Generation– at #CVPR2025 this week.

VAR (Visual Autoregressive Modelling) introduced a very nice way to formulate autoregressive image generation as a next-scale prediction task (from
Max Zhaoshuo Li 李赵硕 (@mli0603) 's Twitter Profile Photo

Cosmos-Reason1 has exciting updates 💡 Now it understands physical reality — judging videos as real or fake! Check out the resources👇 Paper: arxiv.org/abs/2503.15558 Huggingface: huggingface.co/nvidia/Cosmos-… Code: github.com/nvidia-cosmos/… Project page: research.nvidia.com/labs/dir/cosmo… (1/n)

Fangyin Wei (@fangyinwei) 's Twitter Profile Photo

Join us on the 1st workshop on Vision Meets Physics: Synergizing Physical Simulation and Computer Vision at #CVPR2025 tomorrow! Thought-provoking talks and expert insights from leading researchers that YOU CANNOT MISS! 📍104A ⏰ 8:45am June 12th visionmeetphysics.github.io

Qinsheng Zhang (@qsh_zh) 's Twitter Profile Photo

🚀 Introducing Cosmos-Predict2! Our most powerful open video foundation model for Physical AI. Cosmos-Predict2 significantly improves upon Predict1 in visual quality, prompt alignment, and motion dynamics—outperforming popular open-source video foundation models. It’s openly

Tsung-Yi Lin (@tsungyilincv) 's Twitter Profile Photo

Generating 3D models with parts is a key step toward scalable, interactive simulation environments. Check out our work — PartPacker — and the concurrent project, PartCrafter!" PartPacker: github.com/NVlabs/PartPac… PartCrafter: wgsxm.github.io/projects/partc…

Victor M (@victormustar) 's Twitter Profile Photo

Nvidia cooked with PartPacker 3D Generation A new method to create 3D objects from a single image, with each part separate and easy to edit 🔥 ⬇️ Demo available on Hugging Face

Satya Mallick (@learnopencv) 's Twitter Profile Photo

NVIDIA’s Cosmos Reason1 is a family of Vision Language Models trained to understand the physical world and make decisions for embodied reasoning. What makes Cosmos Reason1, as a promising contender for video understanding and embodied reasoning is mainly attributed to its dataset

Hanzi Mao (@hanna_mao) 's Twitter Profile Photo

We build Cosmos-Predict2 as a world foundation model for Physical AI builders — fully open and adaptable. Post-train it for specialized tasks or different output types. Available in multiple sizes, resolutions, and frame rates. 📷 Watch the repo walkthrough

NVIDIA Robotics (@nvidiarobotics) 's Twitter Profile Photo

Facing data bottlenecks in your robotics workflows? Explore how #NVIDIACosmos world foundation models from #NVIDIAResearch can be post trained for specific #PhysicalAI applications: 🔮 Cosmos Predict to simulate future scenarios. 🎨 Cosmos Transfer to create diverse synthetic

Tsung-Yi Lin (@tsungyilincv) 's Twitter Profile Photo

🚀Earlier this year we launched Cosmos-Reason1 — and it just climbed to #1 on the new Physical Reasoning Leaderboard, released alongside V-JEPA 2! 🤗Try it out: huggingface.co/nvidia/Cosmos-…

Tsung-Yi Lin (@tsungyilincv) 's Twitter Profile Photo

Training Physical AI agents depends on rich environments. Simulating diverse worlds is key to speeding up progress—excited to see @moonlake pushing this forward!

NVIDIA AI Developer (@nvidiaaidev) 's Twitter Profile Photo

NVIDIA Cosmos open models made major progress.✨ ✅ Cosmos Predict 2.5 unifies text, image, and video world generation into one model that creates longer and more coherent simulations with improved grounding and efficiency. ✅ Cosmos Transfer 2.5 introduces precise, spatially

Marco Pavone (@drmapavone) 's Twitter Profile Photo

Excited to unveil NVIDIA's latest work on #Reasoning Vision–Language–Action (#VLA) models — Alpamayo-R1! Alpamayo-R1 is a new #reasoning VLA architecture featuring a diffusion-based action expert built on top of the #Cosmos-#Reason backbone. It represents one of the core

Max Zhaoshuo Li 李赵硕 (@mli0603) 's Twitter Profile Photo

This is a really smart setup for evaluating forward and inverse world modeling with VLMs💡— congrats on the paper! I also really appreciate the deep dive into Cosmos-Reason1. Lots of insightful details to learn from 📖

This is a really smart setup for evaluating forward and inverse world modeling with VLMs💡— congrats on the paper! 
I also really appreciate the deep dive into Cosmos-Reason1. Lots of insightful details to learn from 📖