Sam Bowman (@sleepinyourhat)'s Twitter Profile
Sam Bowman

@sleepinyourhat

AI alignment + LLMs at NYU & Anthropic. Views not employers'. No relation to @s8mb. I think you should join @givingwhatwecan.

ID: 338526004

https://cims.nyu.edu/~sbowman/ · Joined 19-07-2011 18:19:52

2.2K Tweets

35.2K Followers

3.1K Following

Sam Bowman (@sleepinyourhat)

I made a bet internally that we wouldn't have a million people engage with tweets about Claude being a bridge, but I'm pretty happy to be on track to lose that bet.

Human-aligned AI Summer School (@humanalignedai)

Join us in Prague on July 17-20, 2024 for the 4th Human-aligned AI Summer School! We'll bring together researchers, students, and practitioners for four intensive days focused on the latest approaches to aligning AI systems with human values. You can apply now at humanaligned.ai!

Sasha de Marigny (@sashadem)

The first ever detailed look inside a modern, production-grade large language model (in this case, Claude 3 Sonnet).

METR (@METR_Evals)

Over the last few months, we’ve increased our focus on developing evaluations for automated AI research and development, because we think this capability could be extraordinarily destabilizing if realized.

We are looking for ML engineers and researchers to help drive AI R&D

METR (@METR_Evals)

We were very excited to see the publication of the Frontier Safety Framework from Google DeepMind! More companies sharing their concrete proposals for preparing for transformative capabilities from AI systems is great: it increases the concreteness of the options available to the

andy jones (@andy_l_jones)

this is extremely cool
* activations & activation steering
* multi-tenant to keep costs for users down
* pip install-able
* actually taking a swing at public engineering infra 😍😍😍

Tomek Korbak (@tomekkorbak)

If you're at the conference, come see our poster (#129) tomorrow (Tuesday) at 10:45am to learn about the role human preferences play in making LLMs more sycophantic!

Jason Wei (@_jasonwei)

Enjoyed this paper that plots emergent abilities with pretraining loss on the x-axis, which is actually a suggestion that Oriol Vinyals also made a few years back: arxiv.org/abs/2403.15796

The paper uses intermediate checkpoints to plot a variety of pretraining losses. For some

Ajeya Cotra (@ajeya_cotra)

We just clarified eligibility criteria for our agent benchmarks RFP (openphilanthropy.org/rfp-llm-benchm…) and included a link to METR's task development resources (metr.github.io/autonomy-evals…) which many applicants may find helpful

Sam Bowman (@sleepinyourhat)

This result is pretty clearly specific to the style of backdoor we're working with, and doesn't support broad claims like 'interpretability solves misalignment', but it's still surprisingly strong. Worth a look!

david rein (@idavidrein)

I very distinctly remember, while I was in the thick of making GPQA, telling Robert Long that “I knew the project was going to be ambitious/hard, but I didn’t appreciate what that actually meant”

In retrospect I probably still would’ve done it, but we basically had to restart the

Owain Evans (@OwainEvans_UK)

Full lecture slides and reading list for Roger Grosse's class on AI Alignment are up:
alignment-w2024.notion.site

David Krueger (@DavidSKrueger)

I’m super excited to release our 100+ page collaborative agenda - led by Usman Anwar - on “Foundational Challenges In Assuring Alignment and Safety of LLMs” alongside 35+ co-authors from NLP, ML, and AI Safety communities!

Some highlights below...

Sasha Rush (@srush_nlp)

I like to think of myself as a researcher, but almost certainly the most valuable use of my time is writing US Visa letters.

Cem Anil (@cem__anil)

One of our most crisp findings was that in-context learning usually follows simple power laws as a function of number of demonstrations.

We were surprised we didn’t find this stated explicitly in the literature.

Soliciting pointers: have we missed anything?
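A minimal sketch of the relationship the tweet describes, assuming the power-law form loss(n) ≈ a · n^(−α) in the number of demonstrations n; the data points and fitting code below are illustrative placeholders, not results from the thread:

# Illustrative sketch: fit a power law, loss(n) = a * n**(-alpha),
# to in-context-learning loss as a function of demonstration count n.
# The data points are synthetic placeholders, not real measurements.
import numpy as np
from scipy.optimize import curve_fit

def power_law(n, a, alpha):
    return a * n ** (-alpha)

shots = np.array([1, 2, 4, 8, 16, 32, 64])                # number of demonstrations
loss = np.array([2.1, 1.7, 1.4, 1.15, 0.95, 0.8, 0.67])   # hypothetical ICL loss

(a, alpha), _ = curve_fit(power_law, shots, loss)
print(f"fit: loss(n) ~ {a:.2f} * n^(-{alpha:.2f})")

On a log-log plot such a fit appears as a straight line with slope −α, which is one quick way to check whether a given model's in-context learning curve matches the claimed power-law behavior.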
