Ahmed Ahmed (@ahmedsqrd) 's Twitter Profile
Ahmed Ahmed

@ahmedsqrd

CS PhD @Stanford - Funding @KnightHennessy @NSF - 🇸🇩 - tweets include history & politics

ID: 1225896347242336256

Joined: 07-02-2020 21:39:07

499 Tweets

608 Followers

956 Following

Nathan Lambert (@natolambert) 's Twitter Profile Photo

Agents research with LMs right now feels like Deep RL in the late 2010s. Tons of new algorithms on narrow domains, so I expect most of these results to not transfer at all. The thing is, it's still early, so it's going to get worse still before it gets better.

Anikait Singh (@anikait_singh_) 's Twitter Profile Photo

Personalization in LLMs is crucial for meeting diverse user needs, yet collecting real-world preferences at scale remains a significant challenge. Introducing FSPO, a simple framework leveraging synthetic preference data to adapt new users with meta-learning for open-ended QA! 🧵

Chenchen Gu (@chenchenygu) 's Twitter Profile Photo

Prompt caching lowers inference costs but can leak private information from timing differences. Our audits found 7 API providers with potential leakage of user data. Caching can even leak architecture info—OpenAI's embedding model is likely a decoder-only Transformer! 🧵1/9

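The audit idea above boils down to comparing response latencies for a prompt prefix that may already be cached against one that cannot be. Below is a minimal Python sketch of that comparison, not the authors' actual procedure: `send_request` is a hypothetical stand-in for whichever API client is being audited, and `n_trials`/`gap_threshold` are arbitrary illustrative values rather than numbers from the paper.

```python
import secrets
import statistics
import time

def timed_call(send_request, prompt):
    """Wall-clock latency of a single request (ideally time to first token,
    e.g. by requesting max_tokens=1). `send_request` is a hypothetical client."""
    start = time.perf_counter()
    send_request(prompt)
    return time.perf_counter() - start

def audit_prompt_caching(send_request, target_prefix, suffix,
                         n_trials=25, gap_threshold=0.05):
    """Rough check for prefix caching: a prefix the provider has already
    processed should come back consistently faster than a never-seen control
    prefix of similar length. If the cache is shared across users, that same
    gap reveals whether someone else recently sent the prefix -- the timing
    side channel described in the tweet."""
    cached_times, fresh_times = [], []
    for _ in range(n_trials):
        # Control: a random prefix of comparable length, guaranteed unseen.
        control_prefix = secrets.token_hex(max(1, len(target_prefix) // 2))
        fresh_times.append(timed_call(send_request, control_prefix + suffix))
        cached_times.append(timed_call(send_request, target_prefix + suffix))
    gap = statistics.median(fresh_times) - statistics.median(cached_times)
    return gap > gap_threshold, gap
```
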
Kiran Garimella (@gvrkiran) 's Twitter Profile Photo

As AI systems increasingly simulate human behavior, we must ask: How do we ensure they don’t amplify bias, deceive, or manipulate? This paper lays out a much-needed framework for responsible AI design. It's REALLY good. arxiv.org/abs/2503.02250

Trajan Hammonds (@trajan317) 's Twitter Profile Photo

people love and dream about movies like The Martian and Interstellar but then celebrate defunding basic science research to save a few dollars in taxes per year

Krishnamurthy (Dj) Dvijotham (@djdvij) 's Twitter Profile Photo

(1/n) Fine tuning APIs create significant security vulnerabilities, breaking alignment in frontier models for under $100! Introducing NOICE, a fine-tuning attack that requires just 1000 training examples to remove model safeguards. The strangest part: we use ONLY harmless data.

Nathan Lambert (@natolambert) 's Twitter Profile Photo

Google, Anthropic, xAI etc should have a Model Spec. Would help them with all of these if done right:
Developers: Know what future models will become
Internal: Focus to define and to deliver your goals
Regulators: Transparency into wtf frontier labs care about

Ken Liu (@kenziyuliu) 's Twitter Profile Photo

An LLM generates an article verbatim—did it “train on” the article? It’s complicated: under n-gram definitions of train-set inclusion, LLMs can complete “unseen” texts—both after data deletion and adding “gibberish” data. Our results impact unlearning, MIAs & data transparency🧵

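To make the tweet's "n-gram definition of train-set inclusion" concrete, here is a minimal Python sketch of one such definition: the fraction of a text's n-grams that appear anywhere in the corpus. The function names and the `n`/`threshold` values are illustrative assumptions, not the paper's; the thread's point is that a model can still complete texts that fail this kind of test.

```python
def ngrams(tokens, n):
    """Set of all contiguous n-grams in a token sequence."""
    return {tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)}

def ngram_inclusion(corpus_tokens, text_tokens, n=8, threshold=1.0):
    """One way to operationalize 'the training set contains this text':
    the fraction of the text's n-grams found anywhere in the corpus.
    threshold=1.0 demands every n-gram appear; lower values give a looser
    notion of inclusion. `n` and `threshold` are illustrative, not the
    paper's settings."""
    corpus_set = ngrams(corpus_tokens, n)
    text_set = ngrams(text_tokens, n)
    if not text_set:
        return False, 0.0
    coverage = sum(g in corpus_set for g in text_set) / len(text_set)
    return coverage >= threshold, coverage
```
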
Ahmed Ahmed (@ahmedsqrd) 's Twitter Profile Photo

Incredibly relevant work— suggesting the real bottleneck for effective misinformation isn’t technical detection (watermarking) but purely capabilities (how persuasive and fluent models become)… a chilling shift for AI safety efforts

Ahmed Ahmed (@ahmedsqrd) 's Twitter Profile Photo

Thoughtful retrospective on where RL folks went wrong (the focus on algorithms over priors should have been flipped) and where to go next. Domains w/ dense rewards seem ~solved (verified reasoning, RLHF), so more open-ended evals are next (chatbot arena)

Dylan HadfieldMenell (@dhadfieldmenell) 's Twitter Profile Photo

Let's open-source GPT-4. If the non-profit genuinely controlled OpenAI, it's hard to see why they wouldn't release the model. Tons of science that would be unlocked with this. Tons of papers that become reproducible.