Joshua Saxe (@joshua_saxe)'s Twitter Profile
Joshua Saxe

@joshua_saxe

AI+cybersecurity at Meta; past lives in academic history, labor / community organizing, classical/jazz piano, hacking scene

ID: 1397281496

Link: https://www.malwaredatascience.com/
Joined: 02-05-2013 14:00:35

3.3K Tweets

3.3K Followers

1.1K Following

Alex Dimakis (@alexgdimakis)

There are still posts about 'new papers showing AI models cannot reason'. There are unfortunately problems with how these evaluations were done, and many of those limitations are known, peer-reviewed and published. Here is a simplified version of what's going on as far as I

François Fleuret (@francoisfleuret)

BTW, if you don't know those terms: - "aleatoric randomness" is real randomness, and - "epistemic randomness" is apparent randomness due to an incorrect understanding / modeling. That the first exists is for the reader to decide.

Kyle Corbitt (@corbtt)

I wouldn't have said this 6 months ago, but I now believe all serious agents will be RL'd on their specific task. The gains are too easy and too huge to ignore. Either OpenAI et al. will provide APIs to do this on-platform, or open source will win.

Edward Raff (@edwardraffml)

Hark, a book has appeared! "How Large Language Models Work" with Drew Farris & Stella Biderman @ ICML, is finally officially done and printed! When my mom called me and asked me if I had heard about "this ChatGPT thing", I knew it was going to be necessary to make something accessible!

Xin Eric Wang @ ICLR 2025 (@xwang_lk)

The teasing LeCun gets from some LLM believers today might be nothing compared to the skepticism he faced in the 90s. Back then, few believed in neural nets. If Yann LeCun weren’t stubborn, he wouldn’t have won the Turing Award.

Nathan Lambert (@natolambert)

Helen (Helen Toner) is one of those people who you should be following closely if you care about where AI is heading and the (geo)political implications. Knows how the bigger world works but very up to speed & articulate on cutting edge advancements. youtube.com/watch?v=dzwi7s…

Nabeel S. Qureshi (@nabeelqu)

Ok, a few reflections on the book: 1. qntm defines antimemes as self-erasing information, but this book has a different (but related) definition of the concept: antimemes are (a) high-impact and (b) low transmissibility. Roughly, they are "important secrets". 2. The low

Mikita Balesni 🇺🇦 (@balesni)

A simple AGI safety technique: AI’s thoughts are in plain English, just read them. We know it works, with OK (not perfect) transparency! The risk is fragility: RL training, new architectures, etc. threaten transparency. Experts from many orgs agree we should try to preserve it:

Jason Wei (@_jasonwei)

Becoming an RL diehard in the past year and thinking about RL for most of my waking hours inadvertently taught me an important lesson about how to live my own life. One of the big concepts in RL is that you always want to be “on-policy”: instead of mimicking other people’s

Helen Toner (@hlntnr)

Got back last night from the World AI Conference in Shanghai. Megathread with photos/videos/thoughts from the conf itself + giant expo next door (ended up going back to the expo 3 times bc there were so many interesting booths). First up: robots robots robots (yes, inc Unitree)
