Brian Zheyuan Zhang (@zheyuanzhang99)'s Twitter Profile
Brian Zheyuan Zhang

@zheyuanzhang99

Incoming CS PhD Student @JohnsHopkins; Prev: @UMich ‘24, @UMassAmherst ‘22 | Working on Embodied AI, Multimodality, and Language.

ID: 1536577486892544000

Link: https://cozheyuanzhangde.github.io · Joined: 14-06-2022 05:13:41

33 Tweets

87 Followers

364 Following

MichiganAI (@michigan_ai) 's Twitter Profile Photo

What would it take to train a VLM to perform in-context learning (ICL) over egocentric videos? At #EMNLP2024, get the details on EILEV by Michigan SLED Lab's Peter Yu, Brian Zheyuan Zhang, @Hu_FY_, Shane Storks, and Joyce Chai. 📰 arxiv.org/abs/2311.17041

Shane Storks, PhD (@shanestorks) 's Twitter Profile Photo

How well can VLMs detect and explain humans' procedural mistakes, like in cooking or assembly? 🧑‍🍳🧑‍🔧 My new pre-print with Itamar Bar-Yossef, Yayuan Li, Brian Zheyuan Zhang, Jason Corso, and Joyce Chai (Michigan SLED Lab, MichiganAI, Computer Science and Engineering at Michigan) dives into this! arxiv.org/pdf/2412.11927

Chuang Gan (@gan_chuang) 's Twitter Profile Photo

I've decided to take a final screenshot of this empty project page. This is the most exciting and ambitious project of my career, and it's been in the works for two years. You can probably guess what this tweet signifies! Building a 3D world simulator is central to my deep

Zhou Xian (@zhou_xian_) 's Twitter Profile Photo

Everything you love about generative models — now powered by real physics! Announcing the Genesis project — after a 24-month large-scale research collaboration involving over 20 research labs — a generative physics engine able to generate 4D dynamical worlds powered by a physics

Sakana AI (@sakanaailabs) 's Twitter Profile Photo

Introducing ASAL: Automating the Search for Artificial Life with Foundation Models sakana.ai/asal/ Artificial Life (ALife) research holds key insights that can transform and accelerate progress in AI. By speeding up ALife discovery with AI, we accelerate our

Quentin Garrido (@garridoq_) 's Twitter Profile Photo

The last paper of my PhD is finally out! Introducing "Intuitive physics understanding emerges from self-supervised pretraining on natural videos". We show that without any prior, V-JEPA -- a self-supervised video model -- develops an understanding of intuitive physics!

UMass Amherst (@umassamherst) 's Twitter Profile Photo

Andrew G. Barto and Richard S. Sutton have been awarded the prestigious 2024 ACM A.M. #TuringAward for developing a branch of artificial intelligence known as reinforcement learning. University of Alberta · Manning College of Information & Computer Sciences · #ManningCICS #ArtificialIntelligence #UMass · bit.ly/3F6Poww

Anthropic (@anthropicai) 's Twitter Profile Photo

New Anthropic research: Tracing the thoughts of a large language model. We built a "microscope" to inspect what happens inside AI models and use it to understand Claude’s (often complex and surprising) internal mechanisms.

Richard Sutton (@richardssutton) 's Twitter Profile Photo

Rich's slogans for AI research (revised 2006):
1. Approximate the solution, not the problem (no special cases)
2. Drive from the problem
3. Take the agent's point of view
4. Don't ask the agent to achieve what it can't measure
5. Don't ask the agent to know what it can't verify

Isadora White (@isadorcw) 's Twitter Profile Photo

The real world is an embodied multi-agent system with natural language communication. What if we had a benchmark and platform to study those challenges? ⛏️Introducing MINDcraft and MineCollab, the 1st platform and benchmark for studying embodied multi-agent LLM collaboration!

Martin Ziqiao Ma (@ziqiao_ma) 's Twitter Profile Photo

Can we scale 4D pretraining to learn general space-time representations that reconstruct an object from a few views at any time to any view at any other time? Introducing 4D-LRM: a Large Space-Time Reconstruction Model that ... 🔹 Predicts 4D Gaussian primitives directly from

Martin Ziqiao Ma (@ziqiao_ma) 's Twitter Profile Photo

📣 Excited to announce SpaVLE: #NeurIPS2025 Workshop on Space in Vision, Language, and Embodied AI! 👉 …vision-language-embodied-ai.github.io 🦾Co-organized with an incredible team → Freda Shi · Jiayuan Mao · Jiafei Duan · Manling Li · David Hsu · Parisa Kordjamshidi 🌌 Why Space & SpaVLE? We

AK (@_akhaliq) 's Twitter Profile Photo

You can install anycoder as a Progressive Web App on your device. Visit huggingface.co/spaces/akhaliq…, click Settings in the footer, follow the instructions, then click the install button in your browser's URL address bar.

Brian Zheyuan Zhang (@zheyuanzhang99) 's Twitter Profile Photo

Thanks AK for sharing our work! MindJourney enables: 🧠 Flexible perspective taking, powered by a controllable world model 📐 Robust spatial reasoning via test-time scaling 🔌 Plug & play performance boost — no fine-tuning needed!

Yuncong Yang (@yuncongyy) 's Twitter Profile Photo

Test-time scaling nailed code & math—next stop: the real 3D world. 🌍 MindJourney pairs any VLM with a video-diffusion World Model, letting it explore an imagined scene before answering. One frame becomes a tour—and the tour leads to new SOTA in spatial reasoning. 🚀 🧵1/
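
The two MindJourney tweets above only describe the idea at a high level: a frozen VLM proposes camera moves, a video-diffusion world model imagines the resulting views, and the VLM answers from the imagined tour. The sketch below is a rough Python illustration of that test-time loop under those assumptions; the `vlm` and `world_model` objects and their methods (`propose_move`, `imagine`, `is_helpful`, `answer`) are hypothetical stand-ins, not the authors' actual API.

```python
# Minimal sketch of a MindJourney-style test-time exploration loop,
# written from the tweets' description only. All object interfaces here
# are hypothetical placeholders for a frozen VLM and a controllable
# video-diffusion world model.

from dataclasses import dataclass, field
from typing import List


@dataclass
class Tour:
    frames: List[object] = field(default_factory=list)   # imagined views of the scene
    actions: List[str] = field(default_factory=list)     # camera moves that produced them


def answer_with_imagined_tour(vlm, world_model, image, question, budget=8):
    """Explore an imagined 3D scene before answering a spatial question.

    The VLM stays frozen (plug & play); the extra compute is spent at
    test time rolling out the world model and scoring candidate views.
    """
    tour = Tour(frames=[image])
    for _ in range(budget):
        # 1. Ask the frozen VLM for a camera move (e.g. "turn left",
        #    "move forward") that could help answer the question.
        action = vlm.propose_move(tour.frames, question)
        if action == "stop":
            break
        # 2. The controllable world model imagines the view after that move.
        new_frame = world_model.imagine(tour.frames[-1], action)
        # 3. Keep the imagined view only if the VLM judges it useful.
        if vlm.is_helpful(new_frame, question):
            tour.frames.append(new_frame)
            tour.actions.append(action)
    # 4. Answer from the whole imagined tour instead of the single input frame.
    return vlm.answer(tour.frames, question)
```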