
Qian Liu
@sivil_taram
Researcher @ TikTok ๐ธ๐ฌ
๐ Sailor / StarCoder / OpenCoder
๐ผ Past: Research Scientist @SeaAIL; PhD @MSFTResearch
๐ง Contribution: @XlangNLP @BigCodeProject
ID: 1465140087193161734
http://siviltaram.github.io/ 29-11-2021 02:06:42
1,1K Tweet
3,3K Takipรงi
674 Takip Edilen

Is text-only information enough for LLM/VLM Web Agents? ๐ค Clearly not. ๐ โโ๏ธ The modern web is a rich tapestry of text, images ๐ผ๏ธ, and videos ๐ฅ. To truly assist us, agents need to understand it all. That's why we built MM-BrowseComp. ๐ We're introducing MM-BrowseComp ๐, a new


Introducing Mirage 2 โ a real-time, general-domain generative world engine you can play online Upload any imageโphotos, concept art, classic paintings, kids' drawingsโand step into it as a live, interactive world. Prompt your worlds with text to create any surreal scenes and





๐จ FINAL CALL: Only 2 days left to submit to the ๐ป๐๐๐ก ๐๐๐๐ฃ๐๐๐๐ ๐๐ ๐ฃ โ๐ ๐๐ ๐๐ ๐ฅ๐๐ ๐ธ๐๐๐๐ฅ๐๐ ๐ผ๐ฃ๐ (DL4C) workshop at NeurIPS2025 ! ๐Deadline: Aug 27th, 11:59PM UTC-12 Amazing speaker lineup including experts from CMU, UC Berkeley, Replit, poolside,













