Qiyue Gao (@qiyuegao123)'s Twitter Profile
Qiyue Gao

@qiyuegao123

PhD student @UCSanDiego; Prev intern @allen_ai #AI #ML #NLP

ID: 1939706598521520128

Joined: 30-06-2025 15:24:34

15 Tweets

80 Followers

15 Following

Tanishq Mathew Abraham, Ph.D. (@iscienceluvr)'s Twitter Profile Photo

Do Vision-Language Models Have Internal World Models? Towards an Atomic Evaluation

"we introduce WM-ABench, a large-scale benchmark comprising 23 fine-grained evaluation dimensions across 6 diverse simulated environments with controlled counterfactual simulations. Through 660 …
Qiyue Gao (@qiyuegao123)'s Twitter Profile Photo

Thank you for sharing our work! 🚀 Vision-Language Models are advancing rapidly, and it's exciting to track their progress. We'll continuously update our leaderboard and datasets as new VLMs emerge. Stay tuned for more insights and results! Our website: wm-abench.maitrix.org

Eric Xing (@ericxing)'s Twitter Profile Photo

I have long argued that a world model is NOT about generating videos, but IS about simulating all possibilities of the world to serve as a sandbox for general-purpose reasoning via thought experiments. This paper proposes an architecture toward that: arxiv.org/abs/2507.05169

Zhiting Hu (@zhitinghu)'s Twitter Profile Photo

Some critical reviews and clarifications on different perspectives of world models. 🔥🌶️ Stay tuned for more on PAN: its position on the roadmap toward next-level intelligence, strong results, and open-source releases❗️🧠

Zeming Chen (@eric_zemingchen)'s Twitter Profile Photo

🗒️Can we meta-learn test-time learning to solve long-context reasoning?

Our latest work, PERK, learns to encode long contexts through gradient updates to a memory scratchpad at test time, achieving long-context reasoning robust to complexity and length extrapolation while …
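
The PERK tweet describes a concrete mechanism: with the base model frozen, a long context is encoded at test time by running gradient updates on a small trainable memory scratchpad. Below is a minimal, hypothetical PyTorch sketch of that test-time inner loop only. It is not the PERK authors' code, and it omits the meta-training outer loop that "meta-learn" refers to; the gpt2 stand-in model, the scratchpad_len size, and the chunked language-modeling loss are all illustrative assumptions.

```python
# Hypothetical sketch of test-time learning with a memory scratchpad.
# Not the PERK authors' code: model choice, scratchpad shape, and the
# inner-loop loss are illustrative assumptions.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "gpt2"  # small stand-in base model for the sketch
tok = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)
model.eval()
for p in model.parameters():
    p.requires_grad_(False)  # base weights stay frozen at test time

embed_dim = model.config.n_embd
scratchpad_len = 16  # hypothetical number of trainable memory slots
scratchpad = torch.nn.Parameter(0.02 * torch.randn(1, scratchpad_len, embed_dim))
opt = torch.optim.Adam([scratchpad], lr=1e-2)

def encode_context(long_context: str, steps: int = 8, chunk_tokens: int = 256):
    """Test-time inner loop: gradient updates compress the context into the scratchpad."""
    ids = tok(long_context, return_tensors="pt").input_ids
    for _ in range(steps):
        for chunk in ids.split(chunk_tokens, dim=1):
            chunk_emb = model.get_input_embeddings()(chunk)
            inputs = torch.cat([scratchpad, chunk_emb], dim=1)
            # LM loss on the chunk, conditioned on the scratchpad slots
            # (label -100 masks the scratchpad positions out of the loss).
            labels = torch.cat([torch.full((1, scratchpad_len), -100), chunk], dim=1)
            loss = model(inputs_embeds=inputs, labels=labels).loss
            opt.zero_grad()
            loss.backward()
            opt.step()

def answer(question: str, max_new_tokens: int = 32) -> str:
    """Generate conditioned on the adapted scratchpad instead of the raw long context."""
    q_emb = model.get_input_embeddings()(tok(question, return_tensors="pt").input_ids)
    inputs = torch.cat([scratchpad.detach(), q_emb], dim=1)
    out = model.generate(inputs_embeds=inputs, max_new_tokens=max_new_tokens)
    return tok.decode(out[0], skip_special_tokens=True)
```

In the actual method, "meta-learn test-time learning" presumably means an outer loop trains the model (or the scratchpad initialization and inner learning rate) so that this adaptation works well; the sketch hard-codes those pieces for brevity.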