Yu Su @#ICLR2025 (@ysu_nlp) 's Twitter Profile
Yu Su @#ICLR2025

@ysu_nlp

Prof.@OhioState, co-director @osunlp. author of Mind2Web, SeeAct, MMMU, HippoRAG, BioCLIP, UGround. manifesting my thinking of intelligence into language agents

ID: 1240355312

linkhttp://ysu1989.github.io calendar_today04-03-2013 02:58:16

1,1K Tweet

9,9K Followers

926 Following

Yu Su @#ICLR2025 (@ysu_nlp) 's Twitter Profile Photo

🔎Agentic search like Deep Research is fundamentally changing web search, but it also brings an evaluation crisis⚠️ Introducing Mind2Web 2: Evaluating Agentic Search with Agents-as-a-Judge - 130 tasks (each requiring avg. 100+ webpages) from 1,000+ hours of expert labor -

🔎Agentic search like Deep Research is fundamentally changing web search, but it also brings an evaluation crisis⚠️

Introducing Mind2Web 2: Evaluating Agentic Search with Agents-as-a-Judge
- 130 tasks (each requiring avg. 100+ webpages) from 1,000+ hours of expert labor
-