Prior @ AI2 (@ai2prior)'s Twitter Profile
Prior @ AI2

@ai2prior

Tackling the boldest computer vision problems @allen_ai

ID: 1855339453436583936

Link: https://prior.allenai.org/ | Joined: 09-11-2024 20:00:04

27 Tweets

173 Followers

23 Following

Jiafei Duan (@djiafei):

👉 Pointing is our first “language”—babies master it before words. Precise spatial grounding powers robotics, assistive tech, HCI, and vision-language interfaces. 🤔 But can today's MLLMs point with pixel-level accuracy and truly ground visual reasoning? 📷 We introduce PointArena,
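
As a rough illustration of what pixel-level pointing accuracy means in practice, the snippet below checks whether a model's predicted (x, y) point lands inside the target object's segmentation mask. This is a generic sketch, not PointArena's actual metric or evaluation code.

```python
# Illustrative pointing check: does the predicted pixel land on the target?
# Generic sketch only; not PointArena's evaluation code.
import numpy as np


def point_hits_target(point_xy: tuple[int, int], target_mask: np.ndarray) -> bool:
    """point_xy is (x, y) in pixel coordinates; target_mask is a boolean HxW array."""
    x, y = point_xy
    h, w = target_mask.shape
    return 0 <= x < w and 0 <= y < h and bool(target_mask[y, x])


mask = np.zeros((480, 640), dtype=bool)
mask[100:200, 300:400] = True               # hypothetical target region
print(point_hits_target((350, 150), mask))  # True
print(point_hits_target((10, 10), mask))    # False
```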

Jiafei Duan (@djiafei):

Excited to be at #CVPR2025 in Nashville! 🎉 I’m presenting a demo paper with real-world robot demos and co-organizing two workshops: Robo 3D VLM and Generalization for Robotic Manipulation. Let’s connect if you’re into 🔥 Robotics + AI — and don’t miss our stacked speaker

Prior @ AI2 (@ai2prior):

Building on our work with Molmo, we’re excited to introduce GraspMolmo — a vision-language model that predicts semantically meaningful grasps conditioned on natural language. A fantastic effort led by our PYI, Abhay Deshpande!

Jiafei Duan (@djiafei):

Following up on our work on Molmo: Molmo points, but how can those points power real-world robotics? Introducing GraspMolmo, a VLM that plugs seamlessly into robotic systems to generate semantically meaningful grasp poses from natural language commands. 👉 abhaybd.github.io/GraspMolmo/
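
The tweet above pitches GraspMolmo as a drop-in module for robot stacks. Below is a minimal sketch of what such an integration could look like; the `GraspPose` format, the `predict_grasp` method, and the `robot` interface are all illustrative assumptions, not GraspMolmo's actual API (see abhaybd.github.io/GraspMolmo/ for the real interface).

```python
# Hypothetical sketch of wiring a language-conditioned grasp predictor
# into a robot control loop. None of these names are GraspMolmo's real API.
from dataclasses import dataclass
import numpy as np


@dataclass
class GraspPose:
    position: np.ndarray      # (3,) grasp center in the camera frame, meters
    orientation: np.ndarray   # (4,) quaternion (x, y, z, w)
    width: float              # gripper opening in meters


class LanguageConditionedGrasper:
    """Placeholder for a VLM-based grasp model such as GraspMolmo."""

    def predict_grasp(self, rgb: np.ndarray, depth: np.ndarray,
                      instruction: str) -> GraspPose:
        # A real model would run the VLM here; this stub returns a dummy pose.
        return GraspPose(position=np.zeros(3),
                         orientation=np.array([0.0, 0.0, 0.0, 1.0]),
                         width=0.05)


def pick(robot, grasper: LanguageConditionedGrasper,
         rgb: np.ndarray, depth: np.ndarray, instruction: str) -> None:
    """Query the model once, then hand the pose to the robot's motion stack."""
    grasp = grasper.predict_grasp(rgb, depth, instruction)
    robot.move_to(grasp.position, grasp.orientation)  # assumed robot API
    robot.close_gripper(grasp.width)                  # assumed robot API
```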

Ranjay Krishna (@ranjaykrishna):

I am doing something silly by testing whether I can remember and deliver multiple talks on the same day on different slices of my group’s research. If you are at #CVPR2025 on June 11th, come to one or all of them :D 9:05am: Behaviors & bodies: how they shape one another

Ani Kembhavi (@anikembhavi):

Our Molmo work won a Best Paper Honorable Mention at #CVPR2025! This large project was one of my best experiences with a fantastic team!

Jason Ren (@renzhongzheng):

🥳 Excited to share that I’ll be joining the CS Department at UNC-Chapel Hill (UNC Computer Science, UNC AI) as an Assistant Professor starting Fall 2026! Before that, I’ll be working at Ai2 (Prior @ AI2) and UW (Allen School) on multimodal understanding and generation.

Yi Ru (Helen) Wang (@yiruhelenwang):

🚨Tired of binary pass/fail metrics that miss the bigger picture? 🤖Introducing #RoboEval — an open benchmark that shows *how* robot manipulation policies behave and *why* they fail, not just *if* they succeed. 🧵1/n 🔗 robo-eval.github.io 📄 robo-eval.github.io/media/RoboEval…
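
As a rough illustration of the idea of reporting how and why a policy fails rather than a single pass/fail bit, the snippet below aggregates per-stage outcomes across rollouts and reports where each failed rollout first broke down. This is a hypothetical scoring sketch, not RoboEval's actual schema or code.

```python
# Hypothetical per-stage scoring for manipulation rollouts, illustrating the
# "how and why it fails" idea rather than RoboEval's real implementation.
from collections import Counter
from typing import Iterable, Mapping


def stage_failure_breakdown(rollouts: Iterable[Mapping[str, bool]]) -> dict:
    """For each rollout, record the first stage that failed (if any).

    Each rollout maps ordered stage names (e.g. "reach", "grasp", "place")
    to a boolean success flag.
    """
    first_failures = Counter()
    successes = 0
    for stages in rollouts:
        failed = next((name for name, ok in stages.items() if not ok), None)
        if failed is None:
            successes += 1
        else:
            first_failures[failed] += 1
    total = successes + sum(first_failures.values())
    return {
        "success_rate": successes / total if total else 0.0,
        "first_failure_counts": dict(first_failures),
    }


rollouts = [
    {"reach": True, "grasp": True, "place": True},
    {"reach": True, "grasp": False, "place": False},
    {"reach": True, "grasp": True, "place": False},
]
print(stage_failure_breakdown(rollouts))
# {'success_rate': 0.333..., 'first_failure_counts': {'grasp': 1, 'place': 1}}
```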

Jiafei Duan (@djiafei):

It’s incredible to have both your advisors at the same company! With Dieter Fox building the Robotics team and Ranjay Krishna leading PRIOR, Ai2 is set to become a powerhouse in robotics, computer vision, and embodied AI for open science research. Excited to be part

Jiafei Duan (@djiafei):

Excited to present our work at #ICML next week! Always happy to chat about all things 🔥 in Robotics and AI. I’ll also be on the academic job market this coming year — would love to connect about any potential opportunities!

Jiafei Duan (@djiafei):

We’re presenting SAM2Act at #ICML! Come check out the many amazing projects from AI2, and stop by to chat with us and learn more about our work.

Ai2 (@allen_ai):

🤖✨ What if models that take action in the physical world could think through your instructions? Meet MolmoAct, our new fully open Action Reasoning Model (ARM) that does just that. 🧵

Jiafei Duan (@djiafei):

Reasoning is central to purposeful action. Today we introduce MolmoAct — a fully open Action Reasoning Model (ARM) for robotics. Grounded in large-scale pre-training with action reasoning data, every predicted action is interpretable and user-steerable via visual trace. We are

Haoquan Fang (@hq_fang):

We are launching MolmoAct🤖✨ A fully open Action Reasoning Model (ARM) that can reason in space: it perceives → it plans → it acts. 🧵👇
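
The perceive → plan → act framing above maps naturally onto a simple control loop. Here is a minimal sketch of such a loop; the `model.reason` interface, `plan.next_action`, and the `env` API are placeholders for illustration, not MolmoAct's real inputs and outputs.

```python
# Minimal perceive -> plan -> act loop. The `model` and `env` interfaces are
# placeholders, not MolmoAct's real API.
def run_episode(model, env, instruction: str, max_steps: int = 200) -> bool:
    obs = env.reset()
    for _ in range(max_steps):
        # Perceive: encode the current camera observation and the instruction.
        # Plan: the model emits an interpretable intermediate (e.g. a visual
        # trace / waypoints) before committing to low-level actions.
        plan = model.reason(observation=obs, instruction=instruction)
        # Act: execute the next action chunk and observe the result.
        obs, done = env.step(plan.next_action)
        if done:
            return True
    return False
```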

Chris Paxton (@chris_j_paxton):

This to me really feels like how robot foundation models "should" work. I like that it can autoregressively predict depth tokens, lift to 2.5D, and use this for reasoning - it feels like a true robotics analogue of modern reasoning LLMs. Really exciting work.
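
The "lift to 2.5D" step mentioned above is, in general, the standard unprojection of a predicted depth map into a camera-frame point cloud using the camera intrinsics. A generic NumPy version of that operation is sketched below; it is not code from MolmoAct itself.

```python
# Generic depth-to-point-cloud unprojection (the usual "lift to 2.5D" step),
# not code from MolmoAct itself.
import numpy as np


def lift_depth_to_points(depth: np.ndarray, fx: float, fy: float,
                         cx: float, cy: float) -> np.ndarray:
    """Convert an HxW metric depth map into an (H*W, 3) camera-frame cloud."""
    h, w = depth.shape
    u, v = np.meshgrid(np.arange(w), np.arange(h))  # per-pixel (u, v) coords
    z = depth
    x = (u - cx) * z / fx
    y = (v - cy) * z / fy
    return np.stack([x, y, z], axis=-1).reshape(-1, 3)
```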

Ilir Aliu - eu/acc (@iliraliu_):

First fully open Action Reasoning Model (ARM); can ‘think’ in 3D & turn your instructions into real-world actions: [📍 Bookmark for later] A model that reasons in space, time, and motion. It breaks down your command into three steps: ✅ Grounds the scene with depth-aware

Ranjay Krishna (@ranjaykrishna):

Most AI models still think in words. People, without even noticing, think with their bodies, planning how to move, grasp, and use the things around them. MolmoAct brings that to robotics: reasoning in space before acting. This is how we will get to the GPT moment for robotics.

Jiafei Duan (@djiafei):

We’re humbled by the overwhelming interest in MolmoAct! 🚀 At AI2, we’re committed to full open-source — data, training, evaluation, real-world robot system setup code, and more. We have a super-detailed technical report covering what worked (and what didn’t). Soon, you’ll have