Danqi7 (@danqiliao73090)'s Twitter Profile
Danqi7

@danqiliao73090

AI for BioScience. CS Ph.D. @ Yale.

Prev: Northwestern '18, Meta '20, Princeton '22.

ID: 1739396171393536000

Joined: 25-12-2023 21:22:38

50 Tweets

43 Followers

198 Following

Danqi7 (@danqiliao73090):

Tim is an incredible mentor that I've learned a lot from. If you’re interested in a PhD in ML, AI safety, or statistical ML, don’t miss this opportunity. :)

Danqi7 (@danqiliao73090):

Listening to Ilya Sutskever's recent chat with Dwarkesh Patel on how we are moving from the age of scaling to the age of research, I just realized that research is literally "re"-"search". If existing ideas are an explored local minimum, the job now is to search again, to explore new directions.

Krishnaswamy Lab (@krishnaswamylab):

(1/n) Just in time for New Year's! #ImmunoStruct, our multimodal model that predicts class I peptide-MHC immunogenicity, is out in Nature Machine Intelligence! nature.com/articles/s4225…

Danqi7 (@danqiliao73090):

Coding in the AI agent era is framed more as the discriminator than the generator. But, to be a solid discriminator, you have to become a good generator first. You cannot build the muscles to become either if you outsource the learning, which is often tempting because it's the

Ming "Tommy" Tang (@tangming2005):

The real bottlenecks in drug development:
- Target validation. Is this protein actually the right one to go after?
- Clinical trials. Does the drug work in actual patients?

Yulu Gan (@yule_gan):

Simply adding Gaussian noise to LLMs (one step—no iterations, no learning rate, no gradients) and ensembling them can achieve performance comparable to or even better than standard GRPO/PPO on math reasoning, coding, writing, and chemistry tasks. We call this algorithm RandOpt.

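The core mechanic described above, one-step Gaussian weight perturbation plus ensembling, can be sketched in a few lines. This is a hedged illustration, not RandOpt's actual implementation: the tiny linear "model", the `perturb`/`predict` helper names, and the majority vote are all my own stand-ins for an LLM forward pass and whatever aggregation the paper uses.

```python
# Sketch of the idea in the tweet: perturb a model's weights once with
# Gaussian noise (no iterations, no learning rate, no gradients), build
# several perturbed copies, and ensemble their predictions.
import numpy as np

rng = np.random.default_rng(0)

def perturb(weights, sigma=0.01):
    """One-step Gaussian perturbation of the weights."""
    return weights + rng.normal(0.0, sigma, size=weights.shape)

def predict(weights, x):
    """Toy stand-in for an LLM forward pass: a linear scorer."""
    return int(x @ weights > 0)

base = np.array([0.5, -0.2, 0.1])   # "pretrained" weights (toy)
x = np.array([1.0, 1.0, 1.0])       # one input

# Ensemble of K independently perturbed copies; majority vote.
K = 16
votes = [predict(perturb(base), x) for _ in range(K)]
ensemble_pred = int(sum(votes) > K / 2)
print(ensemble_pred)
```

The appeal of the approach is that each ensemble member costs only a noise draw, so all the usual RL machinery (rollouts, advantages, optimizer state) drops away.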
Danqi7 (@danqiliao73090):

The hype around OpenClaw (especially in China) reminded me that fear is still the #1 driver of behavior. FOMO. Fear of being replaced. Fear of missing the next wave. Social media amplifies and profits from fear far more effectively than hope or curiosity.

Danqi7 (@danqiliao73090):

“Engineers will be replaced.” “Doctors won’t be needed.” “AI will replace humans.” Fear grabs attention → attention drives adoption → adoption fuels hype. In StarCraft lore the Zerg need an Overmind to coordinate the swarm. Humans don’t need an Overmind. Fear works just fine.

Danqi7 (@danqiliao73090):

More perturbation data 👀 With the recent surge in cell perturbation / state transition models, excited to see what people build with it

Danqi7 (@danqiliao73090):

This is interesting and kinda reminds me of this recent talk from Jeff Dean on making multiple passes on the same data youtube.com/watch?v=g8BuAt…

Krishnaswamy Lab (@krishnaswamylab):

(1/n) 🎉 Excited to share our paper "HEIST: Hierarchical Embeddings for Spatial Transcriptomics" to be presented at #ICLR202! Heist is a foundation model for spatial-omics data, trained on 22.3M cells from 124 tissues across 15 organs, that jointly models spatial proximity AND

Danqi7 (@danqiliao73090):

Tried the same query on three AI-scientist platforms, and the results and reasoning steps are very similar. 👀

Danqi7 (@danqiliao73090):

The GPT-2 replication tutorial by Andrej Karpathy might be the best technical video on the internet. I watched every second. One thing that surprised me: padding the tokenizer to an even number of total tokens actually speeds up training. The whole speedup section is packed with gems

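The padding trick mentioned above can be sketched as follows: round the vocabulary size up so the embedding and output-projection matrices get GPU-friendly dimensions. A minimal sketch, assuming a multiple of 64 as the target (the `pad_vocab_size` helper name is my own); GPT-2's tokenizer has 50257 tokens, which rounds up to 50304.

```python
# Round a tokenizer's vocabulary size up to the nearest multiple of
# `multiple`, so matrix dimensions in the embedding and final linear
# layer are friendlier to GPU kernels.
def pad_vocab_size(vocab_size: int, multiple: int = 64) -> int:
    """Round vocab_size up to the nearest multiple of `multiple`."""
    return ((vocab_size + multiple - 1) // multiple) * multiple

print(pad_vocab_size(50257))  # GPT-2's 50257 tokens -> 50304
```

The extra token slots are simply never produced by the tokenizer; the model learns to assign them negligible probability, and the throughput win comes purely from better-shaped matrix multiplies.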
Danqi7 (@danqiliao73090):

I think model/training is just as important as data here. There's no shortage of bio data (quality aside), but the modeling paradigm, especially for cells, hasn't even converged yet. LLMs were able to scale up even on low-quality data once the field converged on the right

Danqi7 (@danqiliao73090):

Yeah we need more than just 150M cells from CELLxGENE. And we need more temporal data in addition to static cell snapshots