Grant Watson (@grhwatson)'s Twitter Profile
Grant Watson

@grhwatson

ML @RecursionPharma. Previous: ML Engineer @dewpoint_tx @PhenomicAI. Into ML, physics, math, music, computer-generated art, and Dungeons & Dragons.

ID: 894635129485897729

Joined: 07-08-2017 19:03:31

500 Tweets

218 Followers

1.1K Following

Majdi Hassan (@majdi_has)

(1/n)🚨You can train a model that solves DFT for any geometry almost without training data!🚨 Introducing Self-Refining Training for Amortized Density Functional Theory — a variational framework for learning a DFT solver that predicts the ground-state solutions for different

Amil Dravid (@_amildravid)

Artifacts in your attention maps? Forgot to train with registers? Use 𝙩𝙚𝙨𝙩-𝙩𝙞𝙢𝙚 𝙧𝙚𝙜𝙞𝙨𝙩𝙚𝙧𝙨! We find that a sparse set of activations sets artifact positions. We can shift them anywhere ("Shifted") — even outside the image into an untrained token. Clean maps, no retrain.

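As a rough sketch of that idea (my own illustration, not the authors' code), the shift can be thought of as a forward hook that appends one extra untrained token to the sequence and moves the highest-norm patch activations into it; the shape convention, top-k selection, and hook placement below are all assumptions.

import torch

def attach_test_time_register(block: torch.nn.Module, top_k: int = 1):
    # Assumes the block output is a plain tensor of shape (batch, tokens, dim)
    # and that the caller has already appended one extra token as the last position.
    def shift_to_register(module, inputs, output):
        patches, register = output[:, :-1], output[:, -1:]
        norms = patches.norm(dim=-1)                       # high token norms mark artifact positions
        idx = norms.topk(top_k, dim=-1).indices
        b = torch.arange(output.size(0), device=output.device).unsqueeze(-1)
        register = register + patches[b, idx].sum(dim=1, keepdim=True)  # absorb the activations
        patches = patches.clone()
        patches[b, idx] = 0.0                              # clean the original positions
        return torch.cat([patches, register], dim=1)
    return block.register_forward_hook(shift_to_register)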
Corin Wagen (@corinwagen)

Gabriele Corso, Patrick Walters: Various forms of this discussion are playing out in a lot of different "AI x science" areas right now. (I'm team extrapolation-is-good, but open to being wrong.) I wrote about closely related topics previously, albeit in an esoteric format: corinwagen.github.io/public/blog/20…

Sakana AI (@sakanaailabs)

We’re excited to introduce AB-MCTS!

Our new inference-time scaling algorithm enables collective intelligence for AI by allowing multiple frontier models (like Gemini 2.5 Pro, o4-mini, DeepSeek-R1-0528) to cooperate.

Blog: sakana.ai/ab-mcts
Paper: arxiv.org/abs/2503.04412
Giannis Daras (@giannis_daras)

Announcing Ambient Protein Diffusion, a state-of-the-art 17M-params generative model for protein structures.

Diversity improves by 91% and designability by 26% over previous 200M SOTA model for long proteins.

The trick? Treat low pLDDT AlphaFold predictions as low-quality data
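One way to read that trick (a minimal sketch under my own assumptions, not the released training code): treat pLDDT as a data-quality score and let low-confidence structures supervise the diffusion model only at high noise levels, where their errors matter less. The linear pLDDT-to-noise mapping below is purely illustrative.

import numpy as np

def sample_training_timestep(plddt: float, t_max: float = 1.0, rng=np.random) -> float:
    # Map AlphaFold confidence (pLDDT in [0, 100]) to a minimum diffusion time,
    # then sample a training timestep uniformly above it.
    t_min = (1.0 - plddt / 100.0) * t_max
    return rng.uniform(t_min, t_max)

# Example: a pLDDT-55 prediction only supervises t >= 0.45, while a
# high-confidence structure (pLDDT ~ 100) supervises all noise levels.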
Albert Gu (@_albertgu)

Tokenization is just a special case of "chunking" - building low-level data into high-level abstractions - which is in turn fundamental to intelligence.

Our new architecture, which enables hierarchical *dynamic chunking*, is not only tokenizer-free, but simply scales better.
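A toy picture of dynamic chunking (my own simplification, not the H-Net architecture): a learned scorer marks where chunk boundaries fall, and features are pooled within each chunk to form higher-level units. The threshold rule and mean pooling here are illustrative choices.

import torch

def dynamic_chunk(embeddings: torch.Tensor, boundary_scores: torch.Tensor,
                  threshold: float = 0.5) -> torch.Tensor:
    # embeddings: (seq, dim) byte-level features; boundary_scores: (seq,) in [0, 1].
    # A position whose score crosses the threshold starts a new chunk.
    chunks, current = [], [embeddings[0]]
    for i in range(1, embeddings.size(0)):
        if boundary_scores[i] > threshold:
            chunks.append(torch.stack(current).mean(dim=0))
            current = [embeddings[i]]
        else:
            current.append(embeddings[i])
    chunks.append(torch.stack(current).mean(dim=0))
    return torch.stack(chunks)   # (num_chunks, dim)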
Bilawal Sidhu (@bilawalsidhu)

Damn it worked! Genie 3 world --> inpaint UI --> 4x Topaz AI upscale --> train 3D Gaussian splat. You can step inside a painting of Socrates from 1787. Better than any image-to-3D model I've seen. I think Google has stumbled upon the killer app for VR -- the literal holodeck.

Yilun Du (@du_yilun)

Excited to share Equilibrium Matching (EqM)! EqM simplifies and outperforms flow matching, enabling strong generative performance of FID 1.96 on ImageNet 256x256. EqM learns a single static EBM landscape for generation, enabling a simple gradient-based generation procedure.
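The "static landscape" framing suggests a sampler as simple as gradient descent on the learned energy; the sketch below is a generic version with placeholder step counts and step sizes, not the paper's exact procedure.

import torch

def generate_by_gradient_descent(energy_fn, shape, steps=200, step_size=0.01):
    # Start from Gaussian noise and follow the learned energy E(x) downhill;
    # samples settle near low-energy, data-like points.
    x = torch.randn(shape, requires_grad=True)
    for _ in range(steps):
        (grad,) = torch.autograd.grad(energy_fn(x).sum(), x)
        x = (x - step_size * grad).detach().requires_grad_(True)
    return x.detach()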

Ivan Skorokhodov (@isskoro)

I think this paper [arxiv.org/abs/2510.08570] wins the "strangest" (in a good sense) 1-step diffusion award of this year. They parametrize a model as an invertible network, which maps from the sample space to the representation space, which is assumed to be linear: i.e. we assume

Saining Xie (@sainingxie)

three years ago, DiT replaced the legacy unet with a transformer-based denoising backbone. we knew the bulky VAEs would be the next to go -- we just waited until we could do it right.

today, we introduce Representation Autoencoders (RAE).

>> Retire VAEs. Use RAEs. 👇(1/n)
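One hedged reading of "representation autoencoder" is a frozen pretrained representation encoder paired with a decoder trained only for reconstruction; the class below sketches that reading with illustrative names and is not the released RAE implementation.

import torch
import torch.nn as nn

class RepresentationAutoencoder(nn.Module):
    def __init__(self, pretrained_encoder: nn.Module, decoder: nn.Module):
        super().__init__()
        self.encoder = pretrained_encoder.eval()
        for p in self.encoder.parameters():
            p.requires_grad_(False)          # encoder stays frozen
        self.decoder = decoder               # only the decoder is trained

    def forward(self, images: torch.Tensor) -> torch.Tensor:
        with torch.no_grad():
            latents = self.encoder(images)   # representation tokens, not a VAE posterior
        return self.decoder(latents)         # reconstruction loss trains the decoder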
Sakana AI (@sakanaailabs)

Introducing Petri Dish Neural Cellular Automata (PD-NCA) 🦠 The search for open-ended complexification, a north star of Artificial Life (ALife) simulations, fascinates us deeply. In this work we explore the role of continual adaptation in ALife simulation,

Oriol Vinyals (@oriolvinyalsml)

The secret behind Gemini 3?

Simple: Improving pre-training & post-training 🤯

Pre-training: Contra the popular belief that scaling is over—which we discussed in our NeurIPS '25 talk with Ilya Sutskever and Quoc Le—the team delivered a drastic jump. The delta between 2.5 and 3.0 is
hardmaru (@hardmaru)

Excited to announce our MIT Press book “Neuroevolution: Harnessing Creativity in AI Agent Design” by Sebastian Risi, Yujin Tang, Risto Miikkulainen, and myself. We explore decades of work on evolving intelligent agents and show how neuroevolution can

sway (@swaystar123)

Speedrunning ImageNet Diffusion - 360x faster training

There have been many new techniques demonstrating convergence speedups compared to DiT in the past few years; however, all of these have been studied in isolation, against increasingly outdated baselines.

I present SR-DiT
Sakana AI (@sakanaailabs)

Our AI agent has achieved 1st place in a competitive optimization programming contest against over 800 human participants.

Blog: sakana.ai/ahc058

In AtCoder Heuristic Contest 058, Sakana AI’s ALE-Agent took the top spot. For context on the difficulty of these challenges,
Sakana AI (@sakanaailabs)

Introducing Digital Red Queen (DRQ): Adversarial Program Evolution in Core War with LLMs Blog: sakana.ai/drq Core War is a programming game where self-replicating assembly programs, called warriors, compete for control of a virtual machine. In this dynamic

机器之心 JIQIZHIXIN (@synced_global)

New paradigm from Kaiming He's team: Drifting Models!

With this approach, you can generate a perfect image in a single step.

The team trains a "drifting field" that smoothly moves samples toward equilibrium with the real data distribution.

The result? A one-step generator that
OpenAI (@openai)

We worked with Ginkgo Bioworks to connect GPT-5 to an autonomous lab, so it could propose experiments, run them at scale, learn from the results, and decide what to try next. That closed loop brought protein production cost down by 40%.
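The loop itself is easy to picture; the sketch below is illustrative only, with placeholder function names rather than any actual OpenAI or Ginkgo Bioworks API.

def closed_loop(propose, run_in_lab, update_model, rounds=5, batch_size=96):
    # Propose experiments, run them at scale, learn from the results, repeat.
    history = []
    for _ in range(rounds):
        candidates = propose(history, n=batch_size)   # model suggests the next experiments
        results = run_in_lab(candidates)              # automated lab executes them
        history.extend(zip(candidates, results))
        update_model(history)                         # model learns before the next round
    return history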

Rohan Paul (@rohanpaul_ai)

Terence Tao: AI isn’t hype anymore in Math discovery. Terence Tao is one of the greatest living mathematicians; in his new lecture he explains how AI and human professional mathematicians are now complementary. "There has been a really visible increase in capability. It is not

Martin Bauer (@martinmbauer)

Yes, this is a significant result and a solid research paper. And it would’ve been much harder to achieve without GPT. While I understand the instinct, I think it is more interesting to evaluate what type of contribution the AI has made as opposed to focussing on how relevant