Rui-Jie (Ridger) Zhu
@ridgerzhu
Ph.D. student at UC Santa Cruz, Intern at Bytedance Seed Team, working on scalable simple idea for #LLM.
ID: 1575365180971962368
29-09-2022 06:01:50
34 Tweet
196 Followers
85 Following
Is text-only information enough for LLM/VLM Web Agents? π€ Clearly not. π ββοΈ The modern web is a rich tapestry of text, images πΌοΈ, and videos π₯. To truly assist us, agents need to understand it all. That's why we built MM-BrowseComp. π We're introducing MM-BrowseComp π, a new