bilal 🇵🇸 (@bilalchughtai_) Twitter Tweets • TwiCopy

L Rudolf L

a year ago

If you're at NeurIPS, come see Kaivu Hariharan present our LLM situational awareness benchmark, the SAD paper, on Friday, 4:30-7:30pm, West Ballroom A-D #5101

8 likes, 0 replies, 2 reposts

SAD to announce i won't be at neurips this year, but Kaivu Hariharan will be presenting our work on situational awareness on friday from 4:30-7:30pm in west ballroom a-d, poster #5101 - go check it out!

10 likes, 0 replies, 0 reposts

bilal 🇵🇸

@bilalchughtai_

a year ago

nosetgauge.substack.com/p/capital-agi-… Rudolf has a good new blog post on the importance of capital and default decline of relevance of human labour post-AGI.

0 likes, 0 replies, 0 reposts

Fin Moorhouse

@finmoorhouse

a year ago

Gordon Moore based his prediction on 5 data points.

679 likes, 25 replies, 76 reposts

bilal 🇵🇸

@bilalchughtai_

10 months ago

new paper! we discuss open problems in
- methods and foundations of mech interp
- applications of mech interp towards scientific and engineering goals
- sociotechnical aspects of mech interp

36 likes, 2 replies, 4 reposts

Shakeel

@shakeelhashim

10 months ago

Pretty wild: Pope Leo XIV says that the potential existential risk from AI "demands serious attention"

140 likes, 5 replies, 20 reposts

bilal 🇵🇸

@bilalchughtai_

10 months ago

improving the public discourse surrounding AI development and its impacts seems incredibly important to me, yet very few people are working on it! i've been impressed with tarbell's work so far and would encourage applications!

5 likes, 0 replies, 1 repost

Neel Nanda

@neelnanda5

10 months ago

Apps are open for my MATS stream, where I try to teach how to do great mech interp research. Due Feb 28! I love mentoring and have had 40+ mentees, who’ve made valuable contributions to the field, incl 10 top conference papers! You don’t need to be at a big lab to do mech interp

332 likes, 11 replies, 31 reposts

bilal 🇵🇸

@bilalchughtai_

10 months ago

thoroughly endorse!

2 likes, 0 replies, 0 reposts

Max Nadeau

@maxnadeau_

10 months ago

🧵 Announcing Open Philanthropy's Technical AI Safety RFP! We're seeking proposals across 21 research areas to help make AI systems more trustworthy, rule-following, and aligned, even as they become more capable.

252 likes, 4 replies, 83 reposts

bilal 🇵🇸

@bilalchughtai_

10 months ago

another new paper! we build and evaluate the efficacy of simple probes, trained on internal model activations, in detecting instances of models acting strategically deceptive when placed in semi-realistic agentic scenarios.

17 likes, 2 replies, 0 reposts

Cas (Stephen Casper)

@stephenlcasper

10 months ago

Imagine if the 2015 Paris Climate Summit was renamed the "Energy Action Summit," invited leaders from across the fossil fuel industry, raised millions for fossil fuels, ignored IPCC reports, and produced an agreement that didn't even mention climate change. #AIActionSummit 🤦

409 likes, 11 replies, 64 reposts

L Rudolf L

@lrudl_

10 months ago

for years tech's had a meme: being a lawyer/doctor/engineer is the unambitious normie thing. but the AIs will shortly do all the coding. what's left? human legitimacy, human care, physically twisting the damn screws. full revenge of the normie career. checkmate, techies

17 likes, 6 replies, 3 reposts

Luke Drago

@luke_drago_

10 months ago

Everyone’s trying to build AGI, loosely defined as systems that could outperform humans at all work. What happens to you when it exists? Let’s talk about how AGI will take your (white collar) job. Allow me to introduce you to pyramid replacement:

540 likes, 15 replies, 55 reposts

L Rudolf L

@lrudl_

10 months ago

everyone says transformative AI is coming. but what might such a world actually look like when it comes to the most important questions: will Demis Hassabis win another Nobel? what does North Korea do? what's the future of academia? my insanely detailed scenario has the answers:

31 likes, 3 replies, 8 reposts

Daniel Kokotajlo

@dkokotajlo

8 months ago

"How, exactly, could AI take over by 2027?" Introducing AI 2027: a deeply-researched scenario forecast I wrote alongside Scott Alexander, Eli Lifland, and Thomas Larsen

4.4K likes, 348 replies, 876 reposts