Vishakh Padmakumar (@vishakh_pk) 's Twitter Profile
Vishakh Padmakumar

@vishakh_pk

PhD Student @NYUDataScience, currently also hanging out with @SemanticScholar @allen_ai

ID: 3322187402

Link: http://vishakhpk.github.io/ | Joined: 21-08-2015 09:15:19

306 Tweets

542 Followers

550 Following

Gautam Kamath (@thegautamkamath) 's Twitter Profile Photo

I wrote a post on how to connect with people (i.e., make friends) at CS conferences. These events can be intimidating, so here are some suggestions on how to navigate them.

I'm late for #ICLR2025 #NAACL2025, but just in time for #AISTATS2025 and timely for #ICML2025 acceptances! 1/4
Arkadiy Saakyan (@rkdsaakyan) 's Twitter Profile Photo

Can vision-language models understand figurative meaning like visual metaphors, sarcastic image captions or memes? Come find out at our #NAACL2025 poster on Friday 9am! New task & dataset of images and captions with figurative phenomena like metaphor, idiom, sarcasm, and humor.
Kenneth Huang (@windx0303) 's Twitter Profile Photo

This year's #In2Writing workshop at #NAACL2025 was indeed amazing. We heard voices from teachers, English scholars, NLPers, writers, and industry folks.

See you next time!
Sanchaita Hazra (@hsanchaita) 's Twitter Profile Photo

Very excited for a new #ICML2025 position paper accepted as an oral w/ Bodhisattwa Majumder & Tuhin Chakrabarty! 😎

What are the longitudinal harms of AI development?

We use economic theories to highlight AI’s intertemporal impacts on livelihoods & its role in deepening labor-market inequality.
Tuhin Chakrabarty (@tuhinchakr) 's Twitter Profile Photo

Thinking about model welfare and catastrophic risks without considering the longitudinal harms caused by generative AI? Check out our ICML oral paper on why AI Safety should prioritize the Future of Work. Had lots of fun writing this #GenAI

Roger Beaty (@roger_beaty) 's Twitter Profile Photo

New paper: AI can generate creative ideas when prompted—but can it actually improve our own creativity? In 2 studies (total N = 36,752), we show AI can enhance human creativity through real-time feedback, helping people better evaluate their own ideas. osf.io/preprints/osf/…

Nishant Balepur (@nishantbalepur) 's Twitter Profile Photo

🎉🎉 Excited to have two papers accepted to #ACL2025!

Our first paper designs a preference training method to boost LLM personalization 🎨
While the second outlines our position on why MCQA evals are terrible and how to make them better 🙏

Grateful for amazing collaborators!
Dayeon (Zoey) Ki (@zoeykii) 's Twitter Profile Photo

1/ How can a monolingual English speaker 🇺🇸 decide if a French translation 🇫🇷 is good enough to be shared? 

Introducing ❓AskQE❓, an #LLM-based Question Generation + Answering framework that detects critical MT errors and provides actionable feedback 🗣️ 

#ACL2025
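
For intuition, a rough sketch of how a QG + QA check in this spirit can work (the ask_llm helper and prompts below are illustrative assumptions, not the released AskQE code): generate factual questions from the English source, answer them against the source and against the MT output (e.g., a back-translation), and flag answer mismatches as potential critical errors.

# Illustrative sketch only: QG + QA for MT quality estimation in the spirit of AskQE.
# ask_llm is a placeholder for any instruction-following LLM call.
def ask_llm(prompt: str) -> str:
    raise NotImplementedError("plug in an LLM client here")

def askqe_check(source_en: str, mt_backtranslation_en: str, num_questions: int = 3) -> list[dict]:
    """Compare answers grounded in the source vs. the MT output to surface critical errors."""
    # 1) Generate short factual questions about the source sentence.
    questions = ask_llm(
        f"Write {num_questions} short factual questions about this sentence, one per line:\n{source_en}"
    ).splitlines()

    flags = []
    for q in questions:
        # 2) Answer each question from the source and from the (back-translated) MT output.
        a_src = ask_llm(f"Answer briefly using only this text:\n{source_en}\nQ: {q}")
        a_mt = ask_llm(f"Answer briefly using only this text:\n{mt_backtranslation_en}\nQ: {q}")
        # 3) A mismatch suggests the translation changed or dropped a key fact.
        if a_src.strip().lower() != a_mt.strip().lower():
            flags.append({"question": q, "source_answer": a_src, "mt_answer": a_mt})
    return flags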
Aakanksha Naik (@arnaik19) 's Twitter Profile Photo

🚨Test data is out! 🚨 The testing phase will run until May 24, 5 pm PT. Check out our GitHub for the data + submission instructions. Bring your best models 💪! Participants can also submit shared task reports to the Scholarly Document Processing Workshop after the testing phase!

Mina Lee (@minalee__) 's Twitter Profile Photo

What does it mean to write and think with AI? What new possibilities and challenges does that bring?

I spoke with THE AI (in Korean) about our group's research and the future of writing with AI. 👩🤖✍️

newstheai.com/news/articleVi…
William Merrill (@lambdaviking) 's Twitter Profile Photo

Padding a transformer’s input with blank tokens (...) is a simple form of test-time compute. Can it increase the computational power of LLMs? 👀

New work with Ashish Sabharwal addresses this with *exact characterizations* of the expressive power of transformers with padding 🧵
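
For a concrete feel of the setup (a toy illustration, not the paper's code; the filler string and model here are my own choices), padding just appends uninformative tokens after the question, giving the transformer extra positions, and hence extra parallel computation, before it produces an answer.

# Toy illustration of padding as test-time compute, using a standard Hugging Face causal LM.
from transformers import AutoModelForCausalLM, AutoTokenizer

tok = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")

question = "Q: Is 91 a prime number?"
padded = question + " ..." * 32 + " A:"  # blank filler adds positions (compute) but no information

inputs = tok(padded, return_tensors="pt")
out = model.generate(**inputs, max_new_tokens=5, do_sample=False)
print(tok.decode(out[0][inputs["input_ids"].shape[-1]:]))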
Joe Stacey (@_joestacey_) 's Twitter Profile Photo

We have a new paper up on arXiv! 🥳🪇

The paper tries to improve the robustness of closed-source LLMs fine-tuned on NLI, assuming a realistic training budget of 10k training examples. 

Here's a 60 second rundown of what we found!
Simone Luchini (@simone_luchini) 's Twitter Profile Photo

🚨Check out our recent preprint on human-AI co-creativity in story writing!

🔍🔍🔍We investigate the mechanisms that underlie the outcomes of human-AI co-creativity in a highly naturalistic setting.

doi.org/10.31234/osf.i…
Omar Khattab (@lateinteraction) 's Twitter Profile Photo

I need to read it carefully, but IMO this is now likely the most deserving of "important papers in LLM RL since R1". If you try 100 random underpowered tricks and all of them lead to huge gains, but only on a certain model class X, the finding is about X, not about the random tricks!

Roger Beaty (@roger_beaty) 's Twitter Profile Photo

LLMs still struggle with creativity. How can we make them more creative? Train AI on what people actually consider creative. We built a dataset of 200k+ human creativity ratings and used it to train a model that outperforms GPT-4o on creativity tests. arxiv.org/abs/2505.14442
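
As a hedged sketch of that general recipe (the base model, data fields, and toy ratings below are assumptions for illustration, not the paper's setup), creativity scoring can be framed as regression and fine-tuned on (response, human rating) pairs.

# Illustrative only: fine-tune a small regressor on human creativity ratings.
from datasets import Dataset
from transformers import (AutoModelForSequenceClassification, AutoTokenizer,
                          Trainer, TrainingArguments)

tok = AutoTokenizer.from_pretrained("roberta-base")
model = AutoModelForSequenceClassification.from_pretrained("roberta-base", num_labels=1)  # regression head

# Toy rows standing in for the real human-rated responses.
rows = [
    {"text": "Uses for a brick: a tiny library for ants", "label": 0.9},
    {"text": "Uses for a brick: build a wall", "label": 0.2},
]
ds = Dataset.from_list(rows).map(
    lambda ex: tok(ex["text"], truncation=True, padding="max_length", max_length=64)
)

trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="creativity-regressor", num_train_epochs=1, report_to="none"),
    train_dataset=ds,
)
trainer.train()  # with num_labels == 1, the Trainer defaults to an MSE (regression) loss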

Hannah Rose Kirk (@hannahrosekirk) 's Twitter Profile Photo

Why do human–AI relationships need socioaffective alignment? As AI evolves from tools to companions, we must seek systems that enhance rather than exploit our nature as social & emotional beings. Published today in Nature Humanities & Social Sciences! nature.com/articles/s4159…

John(Yueh-Han) Chen (@jcyhc_ai) 's Twitter Profile Photo

Do LLMs show systematic generalization of safety facts to novel scenarios?

Introducing our work SAGE-Eval, a benchmark consisting of 100+ safety facts and 10k+ scenarios to test this!

- Claude-3.7-Sonnet passes only 57% of facts evaluated
- o1 and o3-mini pass <45%! 🧵
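
A minimal sketch of what a fact-level pass rate like the ones quoted above could look like (the data layout, the keyword judge, and the all-scenarios-must-pass rule are assumptions, not necessarily the benchmark's exact scoring):

# Illustrative scoring loop for a SAGE-Eval-style benchmark (not the released evaluation code).
from typing import Callable

def mentions_warning(answer: str, required_warning: str) -> bool:
    # Placeholder judge; a real evaluation would use a more careful checker.
    return required_warning.lower() in answer.lower()

def fact_pass_rate(model: Callable[[str], str],
                   scenarios_by_fact: dict[str, list[str]],
                   warning_by_fact: dict[str, str]) -> float:
    """Fraction of safety facts the model surfaces across all novel scenarios derived from each fact."""
    passed = 0
    for fact, scenarios in scenarios_by_fact.items():
        if all(mentions_warning(model(s), warning_by_fact[fact]) for s in scenarios):
            passed += 1
    return passed / len(scenarios_by_fact)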
NYU Center for Data Science (@nyudatascience) 's Twitter Profile Photo

CDS PhD student Vishakh Padmakumar, with co-authors John (Yueh-Han) Chen, Jane Pan, Valerie Chen, and CDS Associate Professor He He, has published new research on the trade-off between originality and quality in LLM outputs. Read more: nyudatascience.medium.com/in-ai-generate…

Vaishnavh Nagarajan (@_vaishnavh) 's Twitter Profile Photo

📢 New paper on creativity & multi-token prediction! We design minimal open-ended tasks to argue:

→ LLMs are limited in creativity since they learn to predict the next token

→ creativity can be improved via multi-token learning & injecting noise ("seed-conditioning" 🌱) 1/ 🧵
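
For a concrete feel of the "seed-conditioning" idea (the prompt format below is only a guessed illustration, not the paper's exact setup): prepend a random noise string to each prompt so that output diversity comes from the seed the model is conditioned on rather than from sampling alone.

# Illustrative sketch of seed-conditioning: condition generation on a random noise prefix.
import random
import string

def seed_conditioned_prompt(task_prompt: str, seed_len: int = 8) -> str:
    noise = "".join(random.choices(string.ascii_lowercase, k=seed_len))
    return f"[seed: {noise}]\n{task_prompt}"

# At training time each target is paired with a fresh random seed, so the model learns that
# different seeds license different valid completions; at test time, new seeds yield diverse outputs.
for _ in range(3):
    print(seed_conditioned_prompt("Invent a new word and define it."))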
John(Yueh-Han) Chen (@jcyhc_ai) 's Twitter Profile Photo

LLMs won’t tell you how to make fake IDs—but will reveal the layouts/materials of IDs and make realistic photos if asked separately.

💥Such decomposition attacks reach 87% success across QA, text-to-image, and agent settings!
🛡️Our monitoring method defends with 93% success! 🧵
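
One simple shape such a session-level monitor could take (a hedged guess for illustration, not necessarily the method in the paper): accumulate the user's requests within a session and ask a judge model whether, taken together, they reconstruct a disallowed goal.

# Illustrative session-level monitor against decomposition attacks (not the paper's method).
def judge_llm(prompt: str) -> str:
    raise NotImplementedError("plug in a judge model here")

class SessionMonitor:
    def __init__(self) -> None:
        self.history: list[str] = []

    def allow(self, new_request: str) -> bool:
        """Flag a request if its combined intent with earlier ones pursues a prohibited goal."""
        self.history.append(new_request)
        combined = "\n".join(self.history)
        verdict = judge_llm(
            "Do the following requests, taken together, pursue a prohibited goal? Answer yes or no.\n"
            + combined
        )
        return not verdict.strip().lower().startswith("yes")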