Omar Shaikh (@oshaikh13) Twitter Tweets • TwiCopy

Quan Ze Chen

4 months ago

Online groups and communities often need to make decisions around social concepts like what content is appropriate. But how do we ensure these decisions are aligned across human decision-makers or even AI systems? We explore this in our work (CI '25): 📜 Case Law Grounding ⚖️

thumb_up_off_alt22

chat_bubble_outline1

repeat8

shareShare

dilara

@dilarafsoylu

4 months ago

Should you RL your compound AI system or optimize its prompts? We think both! 🤯 A short preview of work co-led with Noah Ziems and Lakshya A Agrawal!👇

Should you RL your compound AI system or optimize its prompts? We think both! 🤯

A short preview of work co-led with <a href="/NoahZiems/">Noah Ziems</a> and <a href="/LakshyAAAgrawal/">Lakshya A Agrawal</a>!👇

thumb_up_off_alt288

chat_bubble_outline7

repeat42

shareShare

Jiaju Ma

@jama1017

4 months ago

We introduce MoVer, a Motion Verification DSL that automatically checks if AI-generated motion graphics animations match your text prompts! We make it easy for designers to specify and verify complex animations with LLM-powered iterative refinement. Catch our #SIGGRAPH2025 talk:

thumb_up_off_alt77

chat_bubble_outline3

repeat14

shareShare

Yujie Tao

@tao_yujie

4 months ago

Self-presentation is multifaceted, but the expression is often limited to physical accessories. How could Audio AR transform social interaction? We introduce Audio Personas, body-anchored sounds to dynamically shape social impression. Upcoming in TOCHI: arxiv.org/pdf/2505.00956

thumb_up_off_alt44

chat_bubble_outline3

repeat7

shareShare

Will Held

@williambarrheld

4 months ago

"GPT-5 shows scaling are coming to an end"

thumb_up_off_alt29

chat_bubble_outline1

repeat5

shareShare

Jessy Li

@jessyjli

4 months ago

The Echoes in AI paper showed quite the opposite with also a story continuation setup. Additionally, we present evidence that both *syntactic* and *discourse* diversity measures show strong homogenization that lexical and cosine used in this paper do not capture.

thumb_up_off_alt38

chat_bubble_outline2

repeat14

shareShare

Aryaman Arora

@aryaman2020

4 months ago

Omar Shaikh Dilara Soylu RISE and GRIND Omar Shaikh

thumb_up_off_alt13

chat_bubble_outline1

repeat2

shareShare

Yanzhe Zhang

@stevenyzzhang

4 months ago

Soon, AI agents will act for us—collaborating, negotiating, and sharing data. But can they truly protect our privacy? We simulate privacy-critical scenarios, using alternating search to evolve attacks and defenses, uncovering severe vulnerabilities and building protections.

thumb_up_off_alt77

chat_bubble_outline2

repeat26

shareShare

Tim Althoff

@timalthoff

4 months ago

I’m excited to share our new nature paper 📝, which provides strong evidence that the walkability of our built environment matters a great deal to our physical activity and health. Details in thread.🧵 nature.com/articles/s4158…

I’m excited to share our new <a href="/Nature/">nature</a> paper 📝, which provides strong evidence that the walkability of our built environment matters a great deal to our physical activity and health.

Details in thread.🧵

nature.com/articles/s4158…

thumb_up_off_alt2,2K

chat_bubble_outline46

repeat529

shareShare

Houjun Liu

@houjun_liu

4 months ago

New Paper Day! For EMNLP findings—in LM red-teaming, we show you have to optimize for **both** perplexity and toxicity for high-probability, hard to filter, and natural attacks!

thumb_up_off_alt29

chat_bubble_outline2

repeat12

shareShare

Kawin Ethayarajh

@ethayarajh

3 months ago

📢 Belated update, but I'm thrilled to share that I've joined The University of Chicago Chicago Booth as an Assistant Professor in the newly created Applied AI group! I'll continue to work on behavior-bound machine learning: understanding how AI shapes, is shaped, and should be shaped by the

📢 Belated update, but I'm thrilled to share that I've joined <a href="/UChicago/">The University of Chicago</a> <a href="/ChicagoBooth/">Chicago Booth</a> as an Assistant Professor in the newly created Applied AI group!
I'll continue to work on behavior-bound machine learning: understanding how AI shapes, is shaped, and should be shaped by the

thumb_up_off_alt610

chat_bubble_outline56

repeat27

shareShare

Ken Liu

@kenziyuliu

3 months ago

New paper! We explore a radical paradigm for AI evals: assessing LLMs on *unsolved* questions. Instead of contrived exams where progress ≠ value, we eval LLMs on organic, unsolved problems via reference-free LLM validation & community verification. LLMs solved ~10/500 so far:

thumb_up_off_alt362

chat_bubble_outline12

repeat72

shareShare

Yanzhe Zhang

@stevenyzzhang

3 months ago

Introducing Generative Interfaces - a new paradigm beyond chatbots. We generate interfaces on the fly to better facilitate LLM interaction, so no more passive reading of long text blocks. Adaptive and Interactive: creates the form that best adapts to your goals and needs!

thumb_up_off_alt134

chat_bubble_outline4

repeat40

shareShare

Jessy Lin

@realjessylin

3 months ago

🔍 How do we teach an LLM to 𝘮𝘢𝘴𝘵𝘦𝘳 a body of knowledge? In new work with AI at Meta, we propose Active Reading 📙: a way for models to teach themselves new things by self-studying their training data. Results: * 𝟔𝟔% on SimpleQA w/ an 8B model by studying the wikipedia

🔍 How do we teach an LLM to 𝘮𝘢𝘴𝘵𝘦𝘳 a body of knowledge?

In new work with <a href="/AIatMeta/">AI at Meta</a>, we propose Active Reading 📙: a way for models to teach themselves new things by self-studying their training data. Results:

* 𝟔𝟔% on SimpleQA w/ an 8B model by studying the wikipedia

thumb_up_off_alt1,1K

chat_bubble_outline15

repeat158

shareShare

Jeremy Howard

@jeremyphoward

3 months ago

Maybe this is part of why I find GPT-5 on ChatGPT so annoying -- apparently its system prompt is explicitly set to *not* ask clarifying questions?!? I find it really annoying the way it just goes off and tries to solve the world in one shot. I really want to iterate!

thumb_up_off_alt977

chat_bubble_outline77

repeat50

shareShare

Munyeong Kim

@kim_munyeong

3 months ago

I led a session at @mila_quebec HCAI reading group on Omar Shaikh et al.'s General User Model paper! So excited to find and present this paper to our group. We discussed both the paper's novelty and people's concerns, as well as technical approches to address them.

I led a session at @mila_quebec HCAI reading group on <a href="/oshaikh13/">Omar Shaikh</a> et al.'s General User Model paper! So excited to find and present this paper to our group. We discussed both the paper's novelty and people's concerns, as well as technical approches to address them.

thumb_up_off_alt14

chat_bubble_outline1

repeat3

shareShare