Qineng Wang (@qineng_wang) 's Twitter Profile
Qineng Wang

@qineng_wang

First-year PhD student at Northwestern University, advised by Prof. Manling Li. Previous at Zhejiang University for undergraduate study.

ID: 1722653114543828992

linkhttps://qinengwang-aiden.github.io/ calendar_today09-11-2023 16:31:39

29 Tweet

215 Takipçi

84 Takip Edilen

Zihan Wang - on RAGEN (@wzihanw) 's Twitter Profile Photo

🚀 Introducing RAGEN—the world’s first reproduction of DeepSeek-R1(-Zero) methods for training agentic AI models! We’re betting big on the future of RL + LLM + Agents 🤖✨. This release is a minimally viable leap toward that vision. Code and more intro 🔗:

🚀 Introducing RAGEN—the world’s first reproduction of DeepSeek-R1(-Zero) methods for training agentic AI models!

We’re betting big on the future of RL + LLM + Agents 🤖✨. This release is a minimally viable leap toward that vision.

Code and more intro 🔗:
Rui Yang (@ruiyang70669025) 's Twitter Profile Photo

🚀 New model results on EmbodiedBench! 🚀 🔹 Qwen2.5 VL surpasses Qwen2 VL as embodied agents! 🔹 InternVL2_5 MPO leads as the best-performing open-source model! Check out the latest results: embodiedbench.github.io Explore the evaluation code: github.com/EmbodiedBench/…

🚀 New model results on EmbodiedBench! 🚀
🔹 Qwen2.5 VL surpasses Qwen2 VL as embodied agents!
🔹 InternVL2_5 MPO leads as the best-performing open-source model!

Check out the latest results: embodiedbench.github.io
Explore the evaluation code: github.com/EmbodiedBench/…
Manling Li (@manlingli_) 's Twitter Profile Photo

Very happy to see the search agent Search-R1 and our RAGEN codebase has been able to support it😄 We put a lot of efforts to make the RAGEN codebase easy to reuse/follow. Welcome to play with RAGEN for agent framework using simple RL recipe like DeepSeek R1

Very happy to see the search agent Search-R1 and our RAGEN codebase has been able to support it😄

We put a lot of efforts to make the RAGEN codebase easy to reuse/follow.  

Welcome to play with RAGEN for agent framework using simple RL recipe like DeepSeek R1
Zihan Wang - on RAGEN (@wzihanw) 's Twitter Profile Photo

🚀 Introducing Chain-of-Experts (CoE), A Free-lunch optimization method for DeepSeek-like MoE models! within $200, we explore to train MoEs that enables 17.6-42% efficiency boost in memory! Code: github.com/ZihanWang314/c… Blog: notion.so/Chain-of-Exper…

🚀 Introducing Chain-of-Experts (CoE), A Free-lunch optimization method for DeepSeek-like MoE models!

within $200, we explore to train MoEs that enables 17.6-42% efficiency boost in memory!

Code: github.com/ZihanWang314/c…
Blog: notion.so/Chain-of-Exper…
Zihan Wang - on RAGEN (@wzihanw) 's Twitter Profile Photo

In the last two months, RAGEN has powered Agent RL training frameworks for over 300,000 people. Now, we’re introducing VAGEN—the first open-source framework that trains *Visual* Agents using multi-turn Reinforcement Learning! 🚀(1/n)

In the last two months, RAGEN has powered Agent RL training frameworks for over 300,000 people.
Now, we’re introducing VAGEN—the first open-source framework that trains *Visual* Agents using multi-turn Reinforcement Learning! 🚀(1/n)
Qineng Wang (@qineng_wang) 's Twitter Profile Photo

Training agentic settings with LLM +RL is truly a fascinating topic, and now we make a solid step towards *Multimodal* scenarios! With visual inputs, we now see how agents think and interact into an environment when information sources are diverse 😆

Zihan Wang - on RAGEN (@wzihanw) 's Twitter Profile Photo

No visual models one can survive the challenge, but... ... ... ... ... Our VAGEN can we are doing small progress but visual agent has yet even more to do

Zihan Wang - on RAGEN (@wzihanw) 's Twitter Profile Photo

We are embarrassed to say that VAGEN is the No. 1 visual agent framework, but... it's true X Post: x.com/wzihanw/status… Blog: mll-lab.notion.site/vagen Code: github.com/RAGEN-AI/VAGEN

Manling Li (@manlingli_) 's Twitter Profile Photo

We are very excited announcing our MLL lab! We are looking for collaborators on RAGEN, VAGEN, Chain-of-experts, T*, LongVideoHaystack, foundation models for embodied agents, etc mll-lab-nu.github.io

We are very excited announcing our MLL lab!

We are looking for collaborators on RAGEN, VAGEN, Chain-of-experts, T*, LongVideoHaystack, foundation models for embodied agents, etc

mll-lab-nu.github.io
Qineng Wang (@qineng_wang) 's Twitter Profile Photo

Big news! Our CVPR 2025 Workshop on Foundation Models + Embodied Agents has extended the non-archival deadline to May 17! If you’re exploring the intersection of foundation models + embodiment, we’d love to see your work. Catch us (and our tutorial) in Music City!

Manling Li (@manlingli_) 's Twitter Profile Photo

Today is the day! Welcome to join #CVPR2025 workshop on Foundation Models meet Embodied Agents! 🗓️Jun 11 📍Room 214 🌐…models-meet-embodied-agents.github.io/cvpr2025/ Looking forward to learning insights from wonderful speakers Jitendra MALIK Ranjay Krishna Katerina Fragkiadaki Shuang Li Yilun Du

Today is the day! Welcome to join <a href="/CVPR/">#CVPR2025</a> workshop on Foundation Models meet Embodied Agents!

🗓️Jun 11
📍Room 214
🌐…models-meet-embodied-agents.github.io/cvpr2025/

Looking forward to learning insights from wonderful speakers <a href="/JitendraMalikCV/">Jitendra MALIK</a> <a href="/RanjayKrishna/">Ranjay Krishna</a> <a href="/KaterinaFragiad/">Katerina Fragkiadaki</a> <a href="/ShuangL13799063/">Shuang Li</a> <a href="/du_yilun/">Yilun Du</a>
Qineng Wang (@qineng_wang) 's Twitter Profile Photo

Excited to be leading this project! Humans naturally build spatial mental models to understand object permanence and reason about space. We find that VLMs can effectively approximate this by first forming an overall scene understanding, then reasoning over it. More to explore!

Qineng Wang (@qineng_wang) 's Twitter Profile Photo

Please check our awesome project team here led by Prof. Manling Li ! If you have any thought or idea about spatial intelligence, feel free to contact us😆

Manling Li (@manlingli_) 's Twitter Profile Photo

Excited to do a talk at Agentic AI Summit! Session 4: Foundations of Agents 📍 Frontier Stage 📅 2:45pm PT Will talk about "RAGEN: Training Agents via Reinforcing Reasoning", including: - How to monitor RL training to converge for multi-turn LLM Agents? RAGEN

Excited to do a talk at Agentic AI Summit!

Session 4: Foundations of Agents
📍 Frontier Stage
📅 2:45pm PT

Will talk about "RAGEN: Training Agents via Reinforcing Reasoning", including:
- How to monitor RL training to converge for multi-turn LLM Agents? RAGEN