Daniel Filan (@dfrsrchtwts)'s Twitter Profile

Want to usher in an era of human-friendly superintelligence, don't know how.
Podcast: axrp.net
Apply to MATS: matsprogram.org/apply

ID: 1276310243123720192

Joined: 26-06-2020 00:24:21

931 Tweets

1.1K Followers

159 Following

Owain Evans (@owainevans_uk)

Some recent talks/interviews:
Podcast on introspection, self-awareness and emergent misalignment youtu.be/3D4pgIKR4cQ?fe…
Emergent misalignment talk youtu.be/pimIny8jJd8?fe…
On giving AIs false beliefs youtu.be/0ONSOMf5jh4?fe…

Jaime Sevilla (@jsevillamol)

A couple of weeks ago I posted a summary of Epoch's mission, clearing up some common misunderstandings of what we are trying to achieve. Give it a read! epoch.ai/blog/what-is-e…

Agus 🔎 🔸 (@austinc3301)

🚀 We're launching mentor applications for SPAR's Fall 2025 round! SPAR is a part-time, remote research program where researchers tackle impactful three-month AI safety and policy projects alongside talented mentees. Applications open until July 15, see below!

AXRP - the AI X-risk Research Podcast (@axrpodcast)

My apologies: if you downloaded my most recent episode, my audio cut out around 0:57:40. The issue should be fixed if you re-download the audio. You can also watch on YouTube, which does not have the same problem.

jessicat (@jessi_cata)

There has been much criticism of the AI 2027 model. As a check, I ran a Monte Carlo model based on METR data (2032 median). It seems like a more straightforward extrapolation. (I still think it'll probably be slower than this in real life, though) getguesstimate.com/models/25870

Adam Shai (@adamimos)

How do transformers carry out recurrent computations while being fundamentally feedforward? Excited to present our work on Constrained Belief Updating at #ICML2025, where we show that attention carries out a spectral algorithm in order to parallelize Bayes updating.

Asterisk (@asteriskmgzn)

Asterisk is launching an AI blogging fellowship! We're looking for people with unique perspectives on AI who want to take the first step to writing in public. We'll help you build a blog — and provide editorial feedback, mentorship from leading bloggers, a platform, & $1K

Neel Nanda (@neelnanda5)

My Winter MATS applications are open! You'll work full-time writing a mech interp paper supervised by me. Due Aug 29.

I've supervised 30+ papers by now (incl 15 top conference papers) but cohorts still get better each time. I'm hyped to see what this cohort achieves!

Highlights:

AI Security Institute (@aisecurityinst)

📢 Introducing the Alignment Project: A new fund for research on urgent challenges in AI alignment and control, backed by over £15 million.
▶️ Up to £1 million per project
▶️ Compute access, venture capital investment, and expert support
Learn more and apply ⬇️