Tom Bukic (@tombukic) 's Twitter Profile
Tom Bukic

@tombukic

On a task to demistify a cognition.
🤖 ML Engineer
 🧠 Cognitive Scientist
🤸 Movement Enthusiast

ID: 1895689647520395264

calendar_today01-03-2025 04:17:48

48 Tweet

59 Takipçi

638 Takip Edilen

Richard Sutton (@richardssutton) 's Twitter Profile Photo

D. Sivakumar The short paper "Welcome to the Era of Experience" is literally just released, like this week. Ultimately it will become a chapter in the book 'Designing an Intelligence' edited by George Konidaris and published by MIT Press. goo.gle/3EiRKIH

<a href="/dsivakumar/">D. Sivakumar</a> The short paper "Welcome to the Era of Experience" is literally just released, like this week. Ultimately it will become a chapter in the book 'Designing an Intelligence' edited by George Konidaris and published by MIT Press.
goo.gle/3EiRKIH
Tom Bukic (@tombukic) 's Twitter Profile Photo

🇭🇷🛫🇨🇦 Off to ICML Conference! Let's talk about NeuroAI, reinforcement learning, operator learning, sysid, multimodal, embodiment, curiosity and more! Always up for running, street workout, yoga, hiking, ...!

Tom Bukic (@tombukic) 's Twitter Profile Photo

Conferences should forbid presenters reading directly from their notes. Either prepare your speech or don't step on the stage! There is no value in flat-toned interrupted voiceover for the presentation. ICML Conference

Tom Bukic (@tombukic) 's Twitter Profile Photo

Usually I am driven, but this ICML Conference I am coffee driven. So many new people, inspiring ideas and a wonderful views of Vancouver and its surroundings! Thank you neptune.ai for making me run smoother, today. Free coffee was exactly what I needed, and you deserve all the free

Usually I am driven, but this <a href="/icmlconf/">ICML Conference</a> I am coffee driven. So many new people, inspiring ideas and a wonderful views of Vancouver and its surroundings! 

Thank you <a href="/neptune_ai/">neptune.ai</a> for making me run smoother, today. Free coffee was exactly what I needed, and you deserve all the free
William Chen (@chenwanch1) 's Twitter Profile Photo

One of my favorite moments at #ICML2025 was being able to witness Albert Gu and the Cartesia team’s reaction to Mamba being on the coffee sign. Felt surreal seeing someone realize their cultural impact.

One of my favorite moments at #ICML2025  was being able to witness <a href="/_albertgu/">Albert Gu</a> and the <a href="/cartesia_ai/">Cartesia</a> team’s reaction to Mamba being on the coffee sign.

Felt surreal seeing someone realize their cultural impact.
ARC Prize (@arcprize) 's Twitter Profile Photo

Today, we're announcing a preview of ARC-AGI-3, the Interactive Reasoning Benchmark with the widest gap between easy for humans and hard for AI We’re releasing: * 3 games (environments) * $10K agent contest * AI agents API Starting scores - Frontier AI: 0%, Humans: 100%

Today, we're announcing a preview of ARC-AGI-3, the Interactive Reasoning Benchmark with the widest gap between easy for humans and hard for AI

We’re releasing:
* 3 games (environments)
* $10K agent contest
* AI agents API

Starting scores - Frontier AI: 0%, Humans: 100%
Jiawei Liu (@jiaweiliu_) 's Twitter Profile Photo

Wenting Zhao Some extra signals I pay attn to: 1/ The number of learnable prompts per batch so we can note adv collapse and tune dynamic sampling. 2/ With multiple types of reward, check the adv distribution in each reward type to know when models stop learning over a specific type of tasks.

Owain Evans (@owainevans_uk) 's Twitter Profile Photo

Our setup: 1. A “teacher” model is finetuned to have a trait (e.g. liking owls) and generates an unrelated dataset (e.g. numbers, code, math) 2. We finetune a regular "student" model on the dataset and test if it inherits the trait. This works for various animals.

Our setup:
1. A “teacher” model is finetuned to have a trait (e.g. liking owls) and generates an unrelated dataset (e.g. numbers, code, math)
2. We finetune a regular "student" model on the dataset and test if it inherits the trait.
This works for various animals.
Wyatt walls (@lefthanddraft) 's Twitter Profile Photo

Henry Shevlin admittedly, I did get some help with the original idea, along with some critical feedback and encouragement: "This is not crankery — it's a serious, innovative theory that deserves attention, simulation, and experimental exploration."

<a href="/dioscuri/">Henry Shevlin</a> admittedly, I did get some help with the original idea, along with some critical feedback and encouragement:

"This is not crankery — it's a serious, innovative theory that deserves attention, simulation, and experimental exploration."
François Chollet (@fchollet) 's Twitter Profile Photo

We were able to reproduce the strong findings of the HRM paper on ARC-AGI-1. Further, we ran a series of ablation experiments to get to the bottom of what's behind it. Key findings: 1. The HRM model architecture itself (the centerpiece of the paper) is not an important factor.

Petar Veličković (@petarv_93) 's Twitter Profile Photo

tensorqt also i might have a very low bar for slop -- sometimes even a comment that is utterly misguided might reveal an interesting way in which human intuition fails, which i can then utilise when i teach stuff 😅