Amy Lu (@amyxlu)'s Twitter Profile
Amy Lu

@amyxlu

CS PhD student @berkeley_ai, AI for drug discovery @PrescientDesign. Prev: @GoogleAI @insitro @UofT 🇨🇦

ID: 741271352439672833

Link: https://amyxlu.github.io/ · Joined: 10-06-2016 14:10:37

571 Tweets

2.2K Followers

1.1K Following

Amy Lu (@amyxlu):

FWIW: I personally don't think a scaling "wall" is conclusive, but I do think that the signal-to-noise ratio in language >> images >>> proteins >> DNA. So the LLM / "compressor" should be more intentional, esp. since mutation assays look at such remarkably fine-grained details

Amy Lu (@amyxlu):

I think we started using transformers in ~2019 because protein and DNA also have global and local patterns, like language. Maybe somewhere along the way we forgot that some biological tasks don’t neatly fit self-attention’s strengths and are mostly local patterns (for ex.,
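
For illustration (a generic sketch, not from this thread): a mostly-local biological pattern, like a short sequence motif, is naturally captured by a small 1D convolution over one-hot DNA, with no global self-attention involved. All names and sizes below are placeholders.

```python
# Minimal sketch: modeling *local* sequence patterns with a 1D convolution
# whose receptive field is motif-sized, rather than global self-attention.
import torch
import torch.nn as nn

VOCAB = "ACGT"

def one_hot(seq: str) -> torch.Tensor:
    idx = torch.tensor([VOCAB.index(c) for c in seq])
    return nn.functional.one_hot(idx, num_classes=len(VOCAB)).float().T  # (4, L)

class LocalMotifModel(nn.Module):
    """Stack of convolutions with a small, fixed receptive field."""
    def __init__(self, channels: int = 32, kernel_size: int = 9):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv1d(4, channels, kernel_size, padding="same"),
            nn.ReLU(),
            nn.Conv1d(channels, channels, kernel_size, padding="same"),
            nn.ReLU(),
        )
        self.head = nn.Linear(channels, 1)  # e.g. a per-position score

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        h = self.net(x)                                   # (B, C, L)
        return self.head(h.transpose(1, 2)).squeeze(-1)   # (B, L)

x = one_hot("ACGTACGTGGCATTACGT").unsqueeze(0)            # (1, 4, L)
print(LocalMotifModel()(x).shape)                         # torch.Size([1, 18])
```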

Amy Lu (@amyxlu):

Deadline is on May 26 AoE!! Best Paper awards are given for each track (incl. the AI for Science track) ✏️🤖🧪🗺️

Ahmed Alaa (@_ahmedmalaa):

Our Rising Stars series is back with Paula Nicoleta Gradu sharing a control-theoretic perspective on medication tapering problems! Link: youtu.be/IP9unMmvbNE?si…

Amy Lu (@amyxlu):

It’s finally happening!!! Diffusion is so much more satisfying than autoregressive for protein & DNA sequences that don’t really have directionality 🥹 Waiting for this to empirically land & replace BERT/one-step discrete diffusion for protein foundation models 👀
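
A minimal sketch of what order-agnostic training looks like for a masked (absorbing-state) discrete diffusion model over protein/DNA tokens — a generic illustration, not the specific models referenced above; the tiny denoiser and all hyperparameters are placeholders.

```python
# Minimal sketch: one training step of masked (absorbing-state) discrete
# diffusion for a sequence model. Note there is no left-to-right ordering
# anywhere in this objective.
import torch
import torch.nn as nn

VOCAB_SIZE, MASK_ID, D = 21, 20, 64   # 20 amino acids + [MASK]

class TinyDenoiser(nn.Module):
    def __init__(self):
        super().__init__()
        self.embed = nn.Embedding(VOCAB_SIZE, D)
        layer = nn.TransformerEncoderLayer(D, nhead=4, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, num_layers=2)
        self.head = nn.Linear(D, VOCAB_SIZE)

    def forward(self, tokens):                       # (B, L) -> (B, L, V)
        return self.head(self.encoder(self.embed(tokens)))

def diffusion_loss(model, x0):
    """x0: (B, L) clean tokens. Sample a corruption level t, mask that
    fraction of positions, and predict the originals at masked positions."""
    B, L = x0.shape
    t = torch.rand(B, 1)                             # corruption level per sample
    is_masked = torch.rand(B, L) < t
    xt = torch.where(is_masked, torch.full_like(x0, MASK_ID), x0)
    logits = model(xt)
    return nn.functional.cross_entropy(
        logits[is_masked], x0[is_masked]             # loss only on masked slots
    )

model = TinyDenoiser()
x0 = torch.randint(0, 20, (8, 128))                  # toy batch of sequences
print(diffusion_loss(model, x0).item())
```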

Biology+AI Daily (@biologyaidaily):

Flash Invariant Point Attention

1. FlashIPA introduces a linear-scaling reformulation of Invariant Point Attention (IPA), a core algorithm in protein and RNA structure modeling. It achieves SE(3)-invariant geometry-aware attention with dramatically reduced memory and runtime,
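
For context, here is a rough NumPy sketch of the geometric "point" term that IPA-style attention computes, with a numerical check that the attention logits are unchanged under a global rotation and translation of all frames. This shows only the invariance idea — it is not FlashIPA's linear-scaling formulation, and all shapes and names are illustrative.

```python
# Rough sketch of the "point" term in IPA-style attention and a numerical
# check of its SE(3) invariance. NOT the FlashIPA algorithm itself.
import numpy as np

rng = np.random.default_rng(0)
L, P = 6, 4                                   # residues, points per residue

def random_rotation():
    q, _ = np.linalg.qr(rng.normal(size=(3, 3)))
    return q * np.sign(np.linalg.det(q))      # proper rotation, det = +1

# Per-residue rigid frames (R_i, t_i) and learned local query/key points.
R = np.stack([random_rotation() for _ in range(L)])        # (L, 3, 3)
t = rng.normal(size=(L, 3))                                # (L, 3)
q_local = rng.normal(size=(L, P, 3))
k_local = rng.normal(size=(L, P, 3))

def point_logits(R, t):
    """Logits from squared distances between globally-placed points."""
    q_glob = np.einsum("lij,lpj->lpi", R, q_local) + t[:, None, :]
    k_glob = np.einsum("lij,lpj->lpi", R, k_local) + t[:, None, :]
    d2 = ((q_glob[:, None] - k_glob[None, :]) ** 2).sum(-1)   # (L, L, P)
    return -d2.sum(-1)                                        # (L, L)

# Apply one global rigid transform to every frame; logits should not change.
Rg, tg = random_rotation(), rng.normal(size=3)
R2 = np.einsum("ij,ljk->lik", Rg, R)
t2 = t @ Rg.T + tg
print(np.allclose(point_logits(R, t), point_logits(R2, t2)))  # True
```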
Amy Lu (@amyxlu):

also to be clear!! some of the best academic works in discrete diffusion have been for proteins/molecules/DNA. The point here is that Google/Inception etc. fully investing in this could bear more fruit than is apparent at first glance. Also, this idea of scratch pad / baked-in error correction /

Kevin Frans (@kvfrans):

Over the past year, I've been compiling some "alchemist's notes" on deep learning. Right now it covers basic optimization, architectures, and generative models.

Focus is on learnability -- each page has nice graphics and an end-to-end implementation.

notes.kvfrans.com
Amy Lu (@amyxlu):

Submission deadline is now **May 31 AoE**! Best Paper awards are given for the robotics, RL theory, language modeling, and AI for Science tracks. Exploration is an evolving and expanding area of research -- excited to explore (ha ha) the intersections together 🗺️🧭🤖🧪

Amy Lu (@amyxlu):

MFW people are surprised that scaling up transformer-based protein language models didn't help with hella high-resolution variant effect fitness prediction tasks

Albert Gu (@_albertgu):

Tokenization is just a special case of "chunking" - building low-level data into high-level abstractions - which is in turn fundamental to intelligence.

Our new architecture, which enables hierarchical *dynamic chunking*, is not only tokenizer-free, but simply scales better.
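
As a toy illustration of the chunking idea (a hypothetical sketch, not the architecture from the paper): a small scorer predicts chunk boundaries directly from byte embeddings, and each chunk is pooled into a single vector, so no fixed tokenizer is needed.

```python
# Toy illustration of "dynamic chunking" over raw bytes: a tiny scorer marks
# chunk boundaries, and bytes are mean-pooled into one vector per chunk.
import torch
import torch.nn as nn

D = 32
embed = nn.Embedding(256, D)            # byte-level input, no tokenizer
boundary_scorer = nn.Linear(D, 1)       # learned boundary logits

def dynamic_chunk(byte_ids: torch.Tensor):
    """byte_ids: (L,) byte values 0-255 as a long tensor.
    Returns (num_chunks, D) chunk embeddings."""
    h = embed(byte_ids)                                        # (L, D)
    is_boundary = torch.sigmoid(boundary_scorer(h)).squeeze(-1) > 0.5
    is_boundary[0] = True                                      # first byte starts a chunk
    chunk_id = torch.cumsum(is_boundary.long(), dim=0) - 1     # (L,)
    num_chunks = int(chunk_id.max()) + 1
    # Mean-pool the bytes belonging to each chunk.
    sums = torch.zeros(num_chunks, D).index_add_(0, chunk_id, h)
    counts = torch.zeros(num_chunks).index_add_(
        0, chunk_id, torch.ones_like(chunk_id, dtype=torch.float)
    )
    return sums / counts[:, None]

byte_ids = torch.tensor(list(b"hello dynamic chunking"), dtype=torch.long)
chunks = dynamic_chunk(byte_ids)
print(byte_ids.shape[0], "bytes ->", chunks.shape[0], "chunks")
```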
Deepak Pathak (@pathak2206):

Thrilled to finally release this study! 🚀 We view (discrete) diffusion models as implicitly doing data augmentation over autoregressive. Through this lens, we find that diffusion outperforms AR in data-constrained settings, but it requires larger models and way more epochs to

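A quick back-of-the-envelope illustration of the data-augmentation view (a generic sketch, not the paper's setup): one sequence admits a single left-to-right factorization for AR training, but exponentially many masked views for diffusion training, so extra epochs keep showing the model new corruptions of the same data.

```python
# Counting training "views" of one sequence under AR vs. masked diffusion.
import random

seq = list("MKTAYIAKQR")          # one toy protein sequence
L = len(seq)

# AR: one fixed left-to-right ordering -> L (prefix -> next token) pairs.
ar_views = [(tuple(seq[:i]), seq[i]) for i in range(L)]
print("AR training pairs per sequence:", len(ar_views))

# Diffusion: any nonempty subset of positions can be masked, so the same
# sequence has exponentially many possible corrupted views.
print("possible masked views per sequence:", 2 ** L - 1)

# Each epoch samples fresh mask patterns, so many epochs over a small
# dataset keep producing views the model has not seen before.
rng = random.Random(0)

def sample_mask_pattern():
    t = rng.uniform(0.1, 0.9)                  # corruption level for this view
    return frozenset(i for i in range(L) if rng.random() < t)

views = {sample_mask_pattern() for _ in range(1000)}
print("distinct mask patterns in 1000 sampled views:", len(views))
```
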
Anshul Kundaje (anshulkundaje@bluesky) (@anshulkundaje):

Data diversity, quality & relevance rule over model size any day of the week. Very clever approach of generating synthetic protein sequences from backbone structures to give big boosts to pLMs.
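
A hypothetical sketch of the augmentation loop the tweet describes, with `sample_sequences_from_backbone` standing in for whatever inverse-folding model is actually used; the real sampling, filtering, and training details are not specified here.

```python
# Hypothetical sketch: sample synthetic sequences from backbone structures
# and mix them into the pLM training corpus. The inverse-folding call below
# is a placeholder stub, not a real model.
import random

def sample_sequences_from_backbone(backbone, n: int, rng: random.Random):
    """Placeholder: in practice this would call an inverse-folding model
    (structure -> sequence) conditioned on `backbone` coordinates."""
    alphabet = "ACDEFGHIKLMNPQRSTVWY"
    length = backbone["length"]
    return ["".join(rng.choice(alphabet) for _ in range(length)) for _ in range(n)]

def augment_corpus(natural_seqs, backbones, per_structure=4, seed=0):
    rng = random.Random(seed)
    synthetic = []
    for backbone in backbones:
        synthetic.extend(sample_sequences_from_backbone(backbone, per_structure, rng))
    # Diversity/relevance of the added data is the point, not raw volume.
    return natural_seqs + synthetic

corpus = augment_corpus(
    natural_seqs=["MKTAYIAKQR"],
    backbones=[{"length": 12}, {"length": 30}],
)
print(len(corpus), "training sequences after augmentation")
```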