Kangrui Wang (@james_kkw) Twitter Tweets • TwiCopy

Jing-Jing Li

@drjingjing2026

a year ago

1/3 Today, an anecdote shared by an invited speaker at #NeurIPS2024 left many Chinese scholars, myself included, feeling uncomfortable. As a community, I believe we should take a moment to reflect on why such remarks in public discourse can be offensive and harmful.

Likes: 3.3K · Replies: 191 · Reposts: 581

Manling Li

@manlingli_

a year ago

[Long Tweet Ahead] Faculty Interview Tips & Common Questions: 🧘‍♀️0. Firstly, do not be nervous - Almost everything can be prepared in advance:) - Be grateful for everyone's time. - Think of it as an opportunity to share your research with others -- exciting, right? - Technical

Likes: 493 · Replies: 14 · Reposts: 78

Zihan Wang - on RAGEN

@wzihanw

a year ago

🚀 Some great news for you: Reward is boosting! The reward curve is AI's ultimate language!!!

Likes: 39 · Replies: 3 · Reposts: 2

Zihan Wang - on RAGEN

@wzihanw

a year ago

🚀 Introducing RAGEN—the world’s first reproduction of DeepSeek-R1(-Zero) methods for training agentic AI models! We’re betting big on the future of RL + LLM + Agents 🤖✨. This release is a minimally viable leap toward that vision. Code and more intro 🔗:

Likes: 1.1K · Replies: 76 · Reposts: 241

Kangrui Wang

@james_kkw

a year ago

I'm so excited that my lab mates were able to produce such groundbreaking work in such a short time. A big salute to Zihan Wang - on RAGEN 🫡, who literally didn't sleep during the weekend.

Likes: 28 · Replies: 2 · Reposts: 3

Zihan Wang - on RAGEN

@wzihanw

9 months ago

Surprise finding: Our simplified AICO version actually outperforms TRICO on Sokoban, likely because of the game's sparse rewards only for a successful run🤯 But TRICO shows superior exploration skills, cracking the toughest puzzles while AICO stably handles simpler ones 🧩 (7/n)

Likes: 8 · Replies: 1 · Reposts: 2

Kangrui Wang

@james_kkw

9 months ago

Super excited to introduce VAGEN!! We trained a 3B VLM agent in Sokoban, and it can sometimes solve a 6-step game! Honored to be part of the team!

Likes: 12 · Replies: 0 · Reposts: 2

Zihan Wang - on RAGEN

@wzihanw

9 months ago

No visual model can survive the challenge, but... our VAGEN can. We are making small progress, but visual agents still have much more to do.

Likes: 27 · Replies: 4 · Reposts: 6

Zihan Wang - on RAGEN

@wzihanw

9 months ago

And our tiny VAGEN:

Likes: 7 · Replies: 1 · Reposts: 2

Zihan Wang - on RAGEN

@wzihanw

9 months ago

We are embarrassed to say that VAGEN is the No. 1 visual agent framework, but... it's true X Post: x.com/wzihanw/status… Blog: mll-lab.notion.site/vagen Code: github.com/RAGEN-AI/VAGEN

Likes: 30 · Replies: 1 · Reposts: 9

Zihan Wang - on RAGEN

@wzihanw

8 months ago

🚀 Introducing T* and LV-Haystack — our latest leap forward in VLMs for long video understanding! 🧩 Lightweight plugin: T* boosting LLaVA-OV-72B (56→62%) and GPT-4o (50→53%)! ⚡ Fast inference: 34.9s → 10.4s latency, 691 → 170 TFLOPs v.s. SOTA. 📚 Large-scale dataset: 400

Likes: 185 · Replies: 13 · Reposts: 76

Manling Li

@manlingli_

8 months ago

Introducing T* and LV-Haystack -- targeting needle-in-the-haystack for long videos! 🤗 LV-Haystack annotated 400+ hours of videos and 15,000+ samples. 🧩 Lightweight plugin for any proprietary and open-source VLMs: T* boosting LLaVA-OV-72B [56→62%] and GPT-4o [50→53%] within

Likes: 89 · Replies: 4 · Reposts: 17

Zihan Wang - on RAGEN

@wzihanw

8 months ago

Why does your RL training always collapse? In our new paper of RAGEN, we explore what breaks when you train LLM *Agents* with multi-turn reinforcement learning—and possibly how to fix it. 📄 github.com/RAGEN-AI/RAGEN… 🌐 ragen-ai.github.io 1/🧵👇

Likes: 415 · Replies: 6 · Reposts: 83

Manling Li

@manlingli_

8 months ago

We are very excited to announce our MLL lab! We are looking for collaborators on RAGEN, VAGEN, Chain-of-experts, T*, LongVideoHaystack, foundation models for embodied agents, etc. mll-lab-nu.github.io

Likes: 322 · Replies: 4 · Reposts: 48

Zihan Wang - on RAGEN

@wzihanw

6 months ago

Finally available on arxiv! arxiv.org/abs/2506.18945

Likes: 107 · Replies: 3 · Reposts: 19

Manling Li

@manlingli_

6 months ago

Can VLMs build Spatial Mental Models like humans? Reasoning from limited views? Reasoning from partial observations? Reasoning about unseen objects behind furniture / beyond current view? Check out MindCube! 🌐mll-lab-nu.github.io/mind-cube/ 📰arxiv.org/pdf/2506.21458

Likes: 280 · Replies: 5 · Reposts: 56