Jonathan Balloch (@jonathanballoch) 's Twitter Profile
Jonathan Balloch

@jonathanballoch

I mostly tweet about #ai, #robots, #science, @packers...

Robotics PhD student @GeorgiaTech studying #reinforcementlearning and #AI

Thought/opinions are mine

ID: 891231270

linkhttps://jballoch.com calendar_today19-10-2012 15:44:51

2,2K Tweet

376 Takipçi

1,1K Takip Edilen

Seohong Park (@seohong_park) 's Twitter Profile Photo

Is RL really scalable like other objectives? We found that just scaling up data and compute is *not* enough to enable RL to solve complex tasks. The culprit is the horizon. Paper: arxiv.org/abs/2506.04168 Thread ↓

Scott Manley (@djsnm) 's Twitter Profile Photo

Casey Handmer Marin county has argued it can’t build more housing because of water supply constraints, desalination was one solution considered, but then of course people complained that if the water supply problem was solved they’d have to be more honest as to why they didn’t want more

Taelin (@victortaelin) 's Twitter Profile Photo

Something about this kind of prompt is simply unfathomable to LLMs. They just can't perform better than chance, and I'm not sure why. Most people will dismiss this as just being "hard math stuff", but it is not, I swear. It is just alien to you because it is *niche*, thus, it

Something about this kind of prompt is simply unfathomable to LLMs. They just can't perform better than chance, and I'm not sure why.

Most people will dismiss this as just being "hard math stuff", but it is not, I swear. It is just alien to you because it is *niche*, thus, it
François Chollet (@fchollet) 's Twitter Profile Photo

When we're able to delegate something, to have some of our work done by an automation process or someone else, we automatically *feel* more productive. "Well, that was easy! At the very least it saved me a bunch of typing!" But the relationship between task delegation and

Eugene Vinitsky 🍒🦋 (@eugenevinitsky) 's Twitter Profile Photo

Huge number of accepted papers declining to present their work because they're scared to leave the United States. Great way to make people feel welcome and a part of the country.

Lisan al Gaib (@scaling01) 's Twitter Profile Photo

And that kids, is why we don't do drugs. You might not like it, but Grok-4 didn't get us any closer to AGI or ASI than o3. It's an incredible model, but it doesn't solve any of the previous models problems and just scaling RL won't get us there

Jonathan Balloch (@jonathanballoch) 's Twitter Profile Photo

Its wild that this works, but try appending to your LLM system prompts: "if you don't know say so, and then try to guess what it is most likely". Corrects the artificial confidence of models extremely well

Dhruv Batra (@dhruvbatradb) 's Twitter Profile Photo

Just finished reading this — brilliant paper! An important question, an empirical observation with an explanation, and rejection of alternative plausible explanations — all the necessary pieces of the scientific method. Kudos to the authors!

Georgia Channing (@cgeorgiaw) 's Twitter Profile Photo

Way too many people think that AlphaFold "solved" ML for proteins. It didn't. It did revolutionize protein structure prediction, but that’s just one part of a much bigger puzzle. This is Part 1 of a series on what AlphaFold did (and didn’t) solve—and what comes next. ⬇️

Way too many people think that AlphaFold "solved" ML for proteins.

It didn't.
It did revolutionize protein structure prediction, but that’s just one part of a much bigger puzzle.

This is Part 1 of a series on what AlphaFold did (and didn’t) solve—and what comes next. ⬇️
Jonathan Balloch (@jonathanballoch) 's Twitter Profile Photo

I think people underestimate how important and how challenging of a assertion this is. Symbolic logic and planning took the world very, very far, and it remains an open question how to seamlessly combine this with the embedding spaces learned in deep networks

Jonathan Balloch (@jonathanballoch) 's Twitter Profile Photo

hey remember when the Browns traded away Baker Mayfield for a conditional pick while continuing to pay $10mil of his salary? lol

Jonathan Balloch (@jonathanballoch) 's Twitter Profile Photo

it is wild that we haven't figured out a more efficient way to make electricity from heat than: 1) make thing hot 2) use heat to make water into steam 3) steam turns turbine Like how are we still depending on steam turbines??

Javi (@jvrsanch) 's Twitter Profile Photo

I went from believing OpenAI pursued AGI to realizing that Sam is creating a new YC I can't believe how simple it is: 1. Sell the promise of AGI to raise billions 2. Build great models but not AGI and offer them via API 3. Let startups struggle to figure out what works and what