Yu Feng (@anniefeng6)'s Twitter Profile
Yu Feng

@anniefeng6

CS PhD Student @Penn | NLP & ML @cogcomp @upennnlp @duke_nlp @ RUC | 🧗🏻‍♀️🎨🩰🎹

ID: 1027717560136093696

Website: https://annieyufeng.github.io/
Joined: 10-08-2018 00:45:35

15 Tweets

200 Followers

208 Following

AK (@_akhaliq)

BLINK

Multimodal Large Language Models Can See but Not Perceive

We introduce Blink, a new benchmark for multimodal language models (LLMs) that focuses on core visual perception abilities not found in other evaluations. Most of the Blink tasks can be solved by humans
Xingyu Fu (@xingyufu2)

Can GPT-4V and Gemini-Pro perceive the world the way humans do? 🤔

Can they solve the vision tasks that humans can in the blink of an eye? 😉

tldr; NO, they are far worse than us 💁🏻‍♀️

Introducing BLINK👁 zeyofu.github.io/blink/, a novel benchmark that studies visual perception
Junlin Wang (@junlinwang3)

Excited to share work from my Together AI internship: a deep dive into inference-time scaling methods 🧠

We rigorously evaluated verifier-free inference-time scaling methods across both reasoning and non-reasoning LLMs. Some key findings:

🔑 Even with huge rollout budgets,
Cognitive Computation Group (@cogcomp)

Excited to share our papers at #ICLR2025 in Singapore! Check out the summaries on our blog (ccgblog.seas.upenn.edu/2025/04/ccg-pa…), and then check out the papers at oral session 1B (BIRD) and poster session 2 (for all three)!
Yu Feng, Xingyu Fu, Ben Zhou, 🌴Muhao Chen🌴, Dan Roth
Yu Feng (@anniefeng6)

🚨COLM 2025 Workshop on AI Agents: Capabilities and Safety Conference on Language Modeling

This workshop explores AI agents’ capabilities (including reasoning and planning, interaction and embodiment, and real-world applications) as well as critical safety challenges related to reliability, ethics,
Jeffrey (Young-Min) Cho (@jeffrey_ch0)

🤖💬 Herding instincts… in AIs? Yes, even LLMs can follow the crowd!

• 📉 Conformity ↑ when agents lack confidence but trust peers
• 🧠 Presentation format shapes peer influence
• 🎯 Controlled herding can boost collaboration outcomes

👉 Read more: arxiv.org/abs/2505.21588
Yu Feng (@anniefeng6)

👥 We’re looking for reviewers for the COLM 2025 Workshop on AI Agents: Capabilities & Safety Conference on Language Modeling! 🔗 Sign up: forms.gle/5vHzyGxjUgSMNK… Help shape exciting research on AI agents, their capabilities, and the safety challenges they raise. 🧠 #AI #AIagents #COLM2025