Yu Gu @ICLR 2025 (@yugu_nlp) 's Twitter Profile
Yu Gu @ICLR 2025

@yugu_nlp

Agents/AI researcher, not LLM researcher.
Ph.D. from @osunlp. ex-Research Intern @MSFTResearch.

ID: 1259941035087724547

Link: http://entslscheia.github.io
Joined: 11-05-2020 20:18:52

471 Tweets

1.1K Followers

650 Following

Qian Liu (@sivil_taram) 's Twitter Profile Photo

πŸŽ‰ Announcing the first Open Science for Foundation Models (SCI-FM) Workshop at #ICLR2025! Join us in advancing transparency and reproducibility in AI through open foundation models.

🀝 Looking to contribute? Join our Program Committee: bit.ly/4acBBjF

πŸ” Learn more at:
Yu Gu @ICLR 2025 (@yugu_nlp) 's Twitter Profile Photo

IMO this is another good work showing the paradigm of β€œlearning = inference + long-term memory.” (The previous one was HippoRAG, led by Bernal JimΓ©nez, on semantic knowledge.) Here, long-term memory is procedural knowledge organized as Python APIs. Such APIs allow reliable

Yu Gu @ICLR 2025 (@yugu_nlp) 's Twitter Profile Photo

Will be at ICLR from April 24-28. Can't wait to see my old/new friends! Also, please reach out if you wanna discuss anything about research in agents! #ICLR2025

Yu Gu @ICLR 2025 (@yugu_nlp) 's Twitter Profile Photo

Question for RL people: why does the reward in RL have to be numerical? Is that inherent to the design, or mainly an expedient to simplify the model? Eager to hear your opinions.
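One practical answer to the question above: the standard RL formalism builds everything from arithmetic on rewards, so the reward must be a number. A minimal sketch (hypothetical illustration, not from any tweet in this thread) showing two places where scalar rewards are load-bearing, the discounted return and a tabular Q-learning update:

```python
# Sketch: why numeric rewards are convenient in RL.
# Discounted returns and value updates both require arithmetic
# (sums, products, subtraction) on the reward signal.

def discounted_return(rewards, gamma=0.9):
    """Return sum of gamma**t * r_t; only well-defined for numeric r_t."""
    return sum(gamma**t * r for t, r in enumerate(rewards))

def q_update(q, reward, next_q, alpha=0.1, gamma=0.9):
    """One tabular Q-learning step: q <- q + alpha * (r + gamma*next_q - q)."""
    return q + alpha * (reward + gamma * next_q - q)

print(discounted_return([1, 0, 1]))      # 1 + 0 + 0.81 = 1.81
print(q_update(0.0, 1.0, 0.5))           # 0.1 * (1 + 0.45) = 0.145
```

A non-numerical reward (e.g. a preference or a verbal judgment) would first have to be mapped to a scalar before either computation could run, which is arguably why the convention holds.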

Cognition (@cognition_labs) 's Twitter Profile Photo

Project DeepWiki: up-to-date documentation you can talk to, for every repo in the world. Think Deep Research for GitHub, powered by Devin. It’s free for open-source, no sign-up! Visit deepwiki.com or just swap github β†’ deepwiki on any repo URL:
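The URL swap the tweet describes can be sketched in one line; the repo URL below is a hypothetical example, not taken from the tweet:

```python
# Sketch of the "swap github -> deepwiki" trick from the tweet:
# replace the first occurrence of the host in any GitHub repo URL.
def to_deepwiki(repo_url: str) -> str:
    return repo_url.replace("github.com", "deepwiki.com", 1)

print(to_deepwiki("https://github.com/octocat/Hello-World"))
# -> https://deepwiki.com/octocat/Hello-World
```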

FranΓ§ois Chollet (@fchollet) 's Twitter Profile Photo

Good scientists have a deep psychological need for crisp definitions and self-consistent models of the world. Most people are comfortable holding worldviews that are fragmented, inconsistent, and fluid, where one's degree of belief in various things fluctuates based on context

Kai Zhang (@drogokhal4) 's Twitter Profile Photo

Tired of editing methods that require training, handcrafted subjects, or external memory?
πŸš€ #UltraEdit β€” Training-, subject-, and memory-free, for Lifelong Model Editing

Compare to the prior best
βœ…New SOTA on 4 datasets and 6 models
🏎️7Γ— faster – 20K samples within 5 mins on a
Vardaan Pahuja (@vardaanpahuja) 's Twitter Profile Photo

πŸš€ Thrilled to unveil the most exciting project of my PhD:
Explorer β€” Scaling Exploration-driven Web Trajectory Synthesis for Multimodal Web Agents
TL;DR: A scalable multi-agent pipeline that leverages exploration for diverse web agent trajectory synthesis.

πŸ“„ Paper:
Zeyi Liao (@liaozeyi) 's Twitter Profile Photo

⁉️Can you really trust Computer-Use Agents (CUAs) to control your computer⁉️ Not yet: Anthropic's Opus 4 shows an alarming 48% Attack Success Rate against realistic internet injection❗️ Introducing RedTeamCUA: realistic, interactive, and controlled sandbox environments for

Yu Gu @ICLR 2025 (@yugu_nlp) 's Twitter Profile Photo

Zihao has been aware of this for three months. There's a more general claim underlying recent assertions about RL (in the context of using Qwen for math), such as β€œyou can do RL with one example / internal rewards / spurious rewards”: for Qwen on math, you just don't need RL at all!