Cognitive Computation Group (@cogcomp) 's Twitter Profile
Cognitive Computation Group

@cogcomp

Dan Roth's Cognitive Computation Group at the University of Pennsylvania.

(Not the account of the Cognitive Computing Lab at Georgia Tech.)

ID: 38444828

linkhttp://cogcomp.org/ calendar_today07-05-2009 14:51:51

155 Tweet

633 Takipçi

64 Takip Edilen

Cognitive Computation Group (@cogcomp) 's Twitter Profile Photo

New on the blog: An interview with former postdoc Wenpeng_Yin, who talks with us about NLP, gardening, and his work at Penn State! ccgblog.seas.upenn.edu/2024/04/interv…

Xingyu Fu (@xingyufu2) 's Twitter Profile Photo

Can GPT-4V and Gemini-Pro perceive the world the way humans do? 🤔 Can they solve the vision tasks that humans can in the blink of an eye? 😉 tldr; NO, they are far worse than us 💁🏻‍♀️ Introducing BLINK👁 zeyofu.github.io/blink/, a novel benchmark that studies visual perception

Can GPT-4V and Gemini-Pro perceive the world the way humans do? 🤔

Can they solve the vision tasks that humans can in the blink of an eye? 😉

tldr; NO, they are far worse than us 💁🏻‍♀️

Introducing BLINK👁 zeyofu.github.io/blink/, a novel benchmark that studies visual perception
Wenpeng_Yin (@wenpeng_yin) 's Twitter Profile Photo

Our workshop, AI4Research (ai4research.github.io), is looking for reviewers who have expertise in areas such as AI4Material, AI4Medical, AI for Single-Cell, and AI4Education. Please let us know if you are interested.

Our workshop, AI4Research (ai4research.github.io), is looking for reviewers who have expertise in areas such as AI4Material, AI4Medical, AI for Single-Cell, and AI4Education. Please let us know if you are interested.
Chaitanya Malaviya (@cmalaviya11) 's Twitter Profile Photo

Excited to share new work done Google DeepMind: 🏔️ DOLOMITES: Domain-Specific Long-Form Methodical Tasks, a new long-form generation benchmark for evaluating language models on **realistic** domain-specific tasks. Website: dolomites-benchmark.github.io Paper: arxiv.org/abs/2405.05938

Excited to share new work done <a href="/GoogleDeepMind/">Google DeepMind</a>: 🏔️ DOLOMITES: Domain-Specific Long-Form Methodical Tasks, a new long-form generation benchmark for evaluating language models on **realistic** domain-specific tasks.

Website: dolomites-benchmark.github.io
Paper: arxiv.org/abs/2405.05938
Cognitive Computation Group (@cogcomp) 's Twitter Profile Photo

New on the blog: An interview with former CCG student researcher Celine Lee, who talks with us about code, creativity, and making connections in the NLP community! ccgblog.seas.upenn.edu/2024/06/interv…

New on the blog: An interview with former CCG student researcher Celine Lee, who talks with us about code, creativity, and making connections in the NLP community!
ccgblog.seas.upenn.edu/2024/06/interv…
Vivek Gupta (@keviv9) 's Twitter Profile Photo

I can’t make it to #ACL2024 in person this year, but I’ll be there virtually! 🎉💻 Thrilled to present our three papers on Complex Data Reasoning—Visual Flowcharts QA, Chart Fact-Checking, and Robustness in Finance QA. 🚀📊📚🔍 Stay tuned for details! Cognitive Computation Group UPenn NLP ASU School of Computing and Augmented Intelligence

Cognitive Computation Group (@cogcomp) 's Twitter Profile Photo

We’re excited to share our #ACL2024 conference and findings papers in our newest blog post: ccgblog.seas.upenn.edu/2024/08/ccg-pa…. Check out these papers from today’s sessions and next week’s virtual Findings presentations! Congrats to Vivek Gupta Yi Zhang Dan Roth and their co-authors!

We’re excited to share our #ACL2024 conference and findings papers in our newest blog post: ccgblog.seas.upenn.edu/2024/08/ccg-pa….  Check out these papers from  today’s sessions and next week’s virtual Findings presentations!  Congrats to
<a href="/keviv9/">Vivek Gupta</a>
<a href="/Wado_Will/">Yi Zhang</a>
<a href="/DanRothNLP/">Dan Roth</a>
and their co-authors!
Manling Li (@manlingli_) 's Twitter Profile Photo

Congrats to Peter Baile Chen, Yi Zhang, Dan Roth for the Outstanding Paper Award at #ACL2024 Workshop on Knowledgeable LMs! Is Table Retrieval a Solved Problem? Exploring Join-Aware Multi-Table Retrieval Peter Baile Chen, Yi Zhang, Dan Roth arxiv.org/pdf/2404.09889

Congrats to Peter Baile Chen, Yi Zhang, <a href="/DanRothNLP/">Dan Roth</a> for the Outstanding Paper Award at #ACL2024 Workshop on Knowledgeable LMs!

Is Table Retrieval a Solved Problem? Exploring Join-Aware Multi-Table Retrieval
Peter Baile Chen, Yi Zhang, Dan Roth
arxiv.org/pdf/2404.09889
Cognitive Computation Group (@cogcomp) 's Twitter Profile Photo

With special congratulations to Peter Chen, Yi Zhang, and Dan Roth for their Outstanding Paper Award at the #ACL2024 Workshop on Knowledgeable LMs! "Is Table Retrieval a Solved Problem? Exploring Join-Aware Multi-Table Retrieval" Peter Baile Chen, Yi Zhang, Dan Roth

Cognitive Computation Group (@cogcomp) 's Twitter Profile Photo

Before launching into our new semester, we asked our six summer interns to tell us about their experiences working with us this summer. Take a look! Special thanks to Vivek Gupta and Sihao Chen for their excellent mentoring! ccgblog.seas.upenn.edu/2024/08/summer…

Before launching into our new semester, we asked our six summer interns to tell us about their experiences working with us this summer.  Take a look!  Special thanks to <a href="/keviv9/">Vivek Gupta</a> and <a href="/soshsihao/">Sihao Chen</a> for their excellent mentoring!
ccgblog.seas.upenn.edu/2024/08/summer…
Chaitanya Malaviya (@cmalaviya11) 's Twitter Profile Photo

Excited to share ✨ Contextualized Evaluations ✨! Benchmarks like Chatbot Arena contain underspecified queries, which can lead to arbitrary eval judgments. What happens if we provide evaluators with context (e.g who's the user, what's their intent) when judging LM outputs? 🧵↓

Excited to share ✨ Contextualized Evaluations ✨!

Benchmarks like Chatbot Arena contain underspecified queries, which can lead to arbitrary eval judgments. What happens if we provide evaluators with context (e.g who's the user, what's their intent) when judging LM outputs? 🧵↓
Cognitive Computation Group (@cogcomp) 's Twitter Profile Photo

New interview with 🌴Muhao Chen🌴, former CCG postdoc, who talks with us about cats and hamsters, LLM safety, and far-flung national parks! ccgblog.seas.upenn.edu/2025/01/interv…

New interview with <a href="/muhao_chen/">🌴Muhao Chen🌴</a>, former CCG postdoc, who talks with us about cats and hamsters, LLM safety, and far-flung national parks! 
ccgblog.seas.upenn.edu/2025/01/interv…
Cognitive Computation Group (@cogcomp) 's Twitter Profile Photo

We’re excited to share our #NAACL2025 conference and findings papers on our blog: ccgblog.seas.upenn.edu/2025/03/ccg-pa…! Congratulations to Sihao Chen Chaitanya Malaviya Siyi Liu Yi Zhang Vivek Gupta Dan Roth and their co-authors! (image Mukhopadhyay et al., 2025)

We’re excited to share our #NAACL2025 conference and findings papers on our blog: ccgblog.seas.upenn.edu/2025/03/ccg-pa…!  Congratulations to
<a href="/soshsihao/">Sihao Chen</a>
<a href="/cmalaviya11/">Chaitanya Malaviya</a>
<a href="/liusiyi64198/">Siyi Liu</a>
<a href="/Wado_Will/">Yi Zhang</a>
<a href="/keviv9/">Vivek Gupta</a>
<a href="/DanRothNLP/">Dan Roth</a>
and their co-authors!

(image Mukhopadhyay et al., 2025)
Yu Feng (@anniefeng6) 's Twitter Profile Photo

#ICLR2025 Oral LLMs often struggle with reliable and consistent decisions under uncertainty 😵‍💫 — largely because they can't reliably estimate the probability of each choice. We propose BIRD 🐦, a framework that significantly enhances LLM decision making under uncertainty. BIRD

#ICLR2025 Oral

LLMs often struggle with reliable and consistent decisions under uncertainty 😵‍💫 — largely because they can't reliably estimate the probability of each choice.

We propose BIRD 🐦, a framework that significantly enhances LLM decision making under uncertainty.

BIRD
Cognitive Computation Group (@cogcomp) 's Twitter Profile Photo

Excited to share our papers at #ICLR2025 in Singapore! Check out the summaries on our blog (ccgblog.seas.upenn.edu/2025/04/ccg-pa…), and then check out the papers at oral session 1B (BIRD) and poster session 2 (for all three)! Yu Feng, Xingyu Fu, Ben Zhou, 🌴Muhao Chen🌴, Dan Roth

Excited to share our papers at #ICLR2025 in Singapore!  Check out the summaries on our blog (ccgblog.seas.upenn.edu/2025/04/ccg-pa…), and then check out the papers at oral session 1B (BIRD) and poster session 2 (for all three)!
<a href="/AnnieFeng6/">Yu Feng</a>, <a href="/XingyuFu2/">Xingyu Fu</a>, <a href="/BenZhou96/">Ben Zhou</a>, <a href="/muhao_chen/">🌴Muhao Chen🌴</a>, <a href="/DanRothNLP/">Dan Roth</a>
Cognitive Computation Group (@cogcomp) 's Twitter Profile Photo

Pleased to share our #NAACL2025 papers in Albuquerque! We’ve updated our blog post with the various session times – come check them out! ccgblog.seas.upenn.edu/2025/03/ccg-pa… Sihao Chen, Chaitanya Malaviya, Siyi Liu, Yi Zhang, Vivek Gupta, Dan Roth

Pleased to share our #NAACL2025 papers in Albuquerque! We’ve updated our blog post with the various session times – come check them out! ccgblog.seas.upenn.edu/2025/03/ccg-pa…
<a href="/soshsihao/">Sihao Chen</a>, <a href="/cmalaviya11/">Chaitanya Malaviya</a>, <a href="/liusiyi64198/">Siyi Liu</a>, <a href="/Wado_Will/">Yi Zhang</a>, <a href="/keviv9/">Vivek Gupta</a>, <a href="/DanRothNLP/">Dan Roth</a>
Xingyu Fu (@xingyufu2) 's Twitter Profile Photo

I will be in #ICML2025 next week and present #ReFocus on Tuesday afternoon. 📍 West Exhibition Hall B2-B3 #W-202 ⏱️ Tue 15 Jul 4:30 p.m. PDT - 7 p.m. PDT Happy to chat and connect! Feel free to DM 😁 ReFocus link: huggingface.co/datasets/ReFoc…

Cognitive Computation Group (@cogcomp) 's Twitter Profile Photo

Check out our papers at #ICML2025 in Vancouver! Summaries, links, and poster sessions on our blog: ccgblog.seas.upenn.edu/2025/07/ccg-pa…. Congrats to Xingyu Fu, Dan Roth, and their co-authors!

Check out our papers at #ICML2025 in Vancouver!  Summaries, links, and poster sessions on our blog: ccgblog.seas.upenn.edu/2025/07/ccg-pa….  Congrats to <a href="/XingyuFu2/">Xingyu Fu</a>, <a href="/DanRothNLP/">Dan Roth</a>, and their co-authors!