Tianyi (Alex) Qiu (@tianyi_alex_qiu) 's Twitter Profile
Tianyi (Alex) Qiu

@tianyi_alex_qiu

Alignment research. Building human moral progress into AI.
AI Safety Fellow (incoming) @AnthropicAI. Ex-research intern @CHAI_Berkeley. Student @PKU1898.

ID: 1462972752239935489

Link: http://tianyiqiu.net · Joined: 23-11-2021 02:34:30

333 Tweets

111 Followers

146 Following

Ryan Kidd (@ryan_kidd44) 's Twitter Profile Photo

MATS has received mentorship applications from 155 researchers for our Winter 2025 program, far more than we can support. If you run an AI safety or governance program and you want referrals, let me know!

xuan (ɕɥɛn / sh-yen) (@xuanalogue) 's Twitter Profile Photo

Ever since I started thinking seriously about AI value alignment in 2016-17, I've been frustrated by the inadequacy of utility+RL theory to account for the richness of human values.

Glad to be part of a larger team now moving beyond those thin theories towards thicker ones.

Skander Moalla (@skandermoalla) 's Twitter Profile Photo

🚀 Big time! We can finally do LLM RL fine-tuning with rewards and leverage offline/off-policy data!

❌ You want rewards, but GRPO only works online?
❌ You want offline, but DPO is limited to preferences?
✅ QRPO can do both!

🧵Here's how we do it:
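
A quick context note on the contrast this thread teases (the thread body isn't captured in this scrape): the sketch below shows the published DPO pairwise loss next to an illustrative pointwise-reward regression in PyTorch. The second function only gestures at why a scalar reward can define an offline objective; it is not the actual QRPO loss, and all tensor names are placeholders.

```python
import torch
import torch.nn.functional as F

def dpo_loss(logp_w, logp_l, ref_logp_w, ref_logp_l, beta=0.1):
    """Standard DPO loss: trainable offline, but it needs preference
    *pairs* (a chosen response w and a rejected response l per prompt)."""
    margin = (logp_w - ref_logp_w) - (logp_l - ref_logp_l)
    return -F.logsigmoid(beta * margin).mean()

def pointwise_reward_loss(logp, ref_logp, reward, beta=0.1):
    """Illustrative only; NOT the published QRPO objective.

    The KL-regularized RLHF optimum is pi* ∝ pi_ref * exp(r / beta), so
    beta * log(pi / pi_ref) should match r up to a per-prompt constant.
    A scalar reward on a single logged response can therefore serve as an
    offline regression target: no preference pair, no on-policy rollout.
    """
    return ((beta * (logp - ref_logp) - reward) ** 2).mean()

# Placeholder sequence log-probs and rewards for two logged responses.
logp = torch.tensor([-12.3, -8.1])
ref_logp = torch.tensor([-11.9, -9.0])
reward = torch.tensor([0.7, -0.2])
print(pointwise_reward_loss(logp, ref_logp, reward))
```
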
Zhonghao He (@zhonghaohe) 's Twitter Profile Photo

Check out our poster at 11am, Thurs July 17th! #ICML2025 I'd be very keen to chat about truth-seeking AI, gradual disempowerment, the lock-in problem, and AI influence. DMs open, or drop me an email.

Brendan McCord 🏛️ x 🤖 (@mbrendan1) 's Twitter Profile Photo

Every builder's first duty is philosophical: to decide what they should build for. AI is beginning to decide what ideas reach your mind—your next action, your next job, your next relationship. It will tempt you to outsource your thinking in ways you’ve never been tempted before.

Wyatt walls (@lefthanddraft) 's Twitter Profile Photo

Another night of vibe math with GPT, and I think we’re damn close to a breakthrough. We’re a team: I come up with the ideas. GPT makes the math work. These elitist gatekeepers have failed for 75 years to solve it and are just afraid I will win the Millennium Prize.

Paul Röttger (@paul_rottger) 's Twitter Profile Photo

Very excited about all these papers on sociotechnical alignment & the societal impacts of AI at #ACL2025.

As is now tradition, I made some timetables to help me find my way around. Sharing here in case others find them useful too :) 🧵

Steven Adler (@sjgadler) 's Twitter Profile Photo

Credit where it's due:
OpenAI did a lot right for their OSS safety evals
- they actually did some fine-tuning
- they got useful external feedback
- they shared which recs they adopted and which they didn't

I don't always follow OAI's rationale, but it's great they share info

International Dialogues on AI Safety (@ais_dialogues) 's Twitter Profile Photo

AI misalignment isn’t a future problem — it’s now. Geoffrey Hinton and Andrew Yao, as well as other Western and Chinese researchers convened at the International Dialogues on AI Safety (IDAIS) in Shanghai, China to share and discuss evidence that current AI systems are already

Seán Ó hÉigeartaigh (@s_oheigeartaigh) 's Twitter Profile Photo

I write about this in my recent paper "The Most Dangerous Fiction: The Rhetoric and Reality of the AI Race" (pp. 9-10). David Sacks correctly identifies that we're presently in a 'normal technology' race (as defined by Narayanan and Kapoor). Where I think he's wrong is in

Nathaniel Li (@natliml) 's Twitter Profile Photo

I joined Meta AI, running preparedness and security evaluations with Summer Yue and Julian Michael to ensure that Superintelligence's newest models enable a prosperous future. Grateful for the team they built at @Scale_AI and excited for the critical work ahead.

Stephen McAleer (@mcaleerstephen) 's Twitter Profile Photo

We've entered a new phase where progress in chatbots is starting to top out but progress in automating AI research is steadily improving. It's a mistake to confuse the two.

Keith Sakata (@keithsakata) 's Twitter Profile Photo

I’m a psychiatrist.

In 2025, I’ve seen 12 people hospitalized after losing touch with reality because of AI. Online, I’m seeing the same pattern.

Here’s what “AI psychosis” looks like, and why it’s spreading fast: 🧵

Stella Biderman (@blancheminerva) 's Twitter Profile Photo

Are you afraid of LLMs teaching people how to build bioweapons? Have you tried just... not teaching LLMs about bioweapons?

@AIEleuther and AI Security Institute joined forces to see what would happen, pretraining three 6.9B models for 500B tokens and producing 15 total models to study

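The tweet doesn't spell out the filtering pipeline, so as a hedged illustration of the "just don't teach it" approach, a minimal blocklist filter over pretraining documents could look like the sketch below. The blocklist terms and function names are hypothetical and not taken from the @AIEleuther / AI Security Institute paper.

```python
import re

# Hypothetical blocklist; the real filtering criteria used by
# EleutherAI and the AI Security Institute are not described here.
BLOCKLIST = re.compile(r"\b(dangerous_term_a|dangerous_term_b)\b", re.IGNORECASE)

def keep_document(text: str) -> bool:
    """Keep a pretraining document only if no blocklisted term appears."""
    return BLOCKLIST.search(text) is None

corpus = ["benign biology lecture notes", "protocol mentioning dangerous_term_a"]
filtered = [doc for doc in corpus if keep_document(doc)]
print(len(filtered))  # 1 -- the matching document was dropped
```
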
Zhonghao He (@zhonghaohe) 's Twitter Profile Photo

Tianyi Alex Qiu and I will be co-mentoring a SPAR stream, please apply if you are bugged by
- LLMs don't seek truth over confirmation/sycophancy in open-ended problems;
- We lack human data about how LLMs could help with human learning, making judgments, decision-making;
-