Junyu Zhang (@jyzhang1208) Twitter Tweets • TwiCopy

Junyu Zhang

@jyzhang1208

+ Follow

MSCS @IllinoisCS, Undergrad @HuazhongUST.

ID: 1684232293177774080

linkhttps://jyzhang1208.github.io calendar_today26-07-2023 16:01:28

6 Tweet

29 Takipçi

78 Takip Edilen

Junyu Zhang

@jyzhang1208

10 months ago

When we successfully built a framework that enables MLLM-based agents to plan for low-level manipulation tasks (a key component of EmbodiedBench), I was super excited! Could this be a step toward MLLM-based agents becoming so versatile that we no longer need dedicated VLA models?

thumb_up_off_alt14

chat_bubble_outline1

repeat0

shareShare

Rui Yang

@ruiyang70669025

9 months ago

🚀 New model results on EmbodiedBench! 🚀 🔹 Qwen2.5 VL surpasses Qwen2 VL as embodied agents! 🔹 InternVL2_5 MPO leads as the best-performing open-source model! Check out the latest results: embodiedbench.github.io Explore the evaluation code: github.com/EmbodiedBench/…

thumb_up_off_alt15

chat_bubble_outline1

repeat4

shareShare

elvis

@omarsar0

6 months ago

Reasoning Models Thinking Slow and Fast at Test Time Another super cool work on improving reasoning efficiency in LLMs. They show that slow-then-fast reasoning outperforms other strategies. Here are my notes:

thumb_up_off_alt258

chat_bubble_outline9

repeat56

shareShare

Junyu Zhang

@jyzhang1208

6 months ago

Huge thanks for sharing our work AK! AlphaOne deep dive & code release coming soon 🚀

thumb_up_off_alt33

chat_bubble_outline2

repeat6

shareShare

Junyu Zhang

@jyzhang1208

6 months ago

EmbodiedBench got an ICML 2025 Oral! Time to challenge your MLLMs on embodied tasks!🤖

thumb_up_off_alt6

chat_bubble_outline0

repeat0

shareShare

Chongyi Zheng

@chongyiz1

6 months ago

1/ How should RL agents prepare to solve new tasks? While prior methods often learn a model that predicts the immediate next observation, we build a model that predicts many steps into the future, conditioning on different user intentions: chongyi-zheng.github.io/infom.

thumb_up_off_alt92

chat_bubble_outline1

repeat15

shareShare

César de la Fuente

@delafuentelab

4 months ago

For years I have dreamt of a tool that could neutralize pathogens the moment they emerge. Today we unveil ApexOracle—an AI that, from a pathogen’s genome and phenotypic knowledge alone, predicts which antibiotics will work and invents new molecules for threats it has never seen.

thumb_up_off_alt222

chat_bubble_outline6

repeat38

shareShare

Chongyi Zheng

@chongyiz1

2 months ago

1/ How can we model the future rewards (returns) for RL agents? While prior methods round the returns into discrete bins or predict a finite number of quantiles, we use flexible models to predict the fine-grained structure of the full return distribution: pd-perry.github.io/value-flows.

thumb_up_off_alt41

chat_bubble_outline2

repeat6

shareShare