Vinay Ramasesh (@vinayramasesh) 's Twitter Profile
Vinay Ramasesh

@vinayramasesh

Research scientist @DeepMind working towards a better understanding of deep learning. Physics PhD @UCBerkeley

ID: 853041007553675266

linkhttps://ramasesh.github.io calendar_today15-04-2017 00:23:19

397 Tweet

756 Followers

759 Following

Stephan Hoyer (@shoyer) 's Twitter Profile Photo

For the record: Gemini's insistence on producing diverse images of people is a slightly glitchy feature, not a bug. If ahistorical images bother you much more than the myriad other generative AI issues, maybe this would be a good opportunity for self-reflection...

Tamay Besiroglu (@tamaybes) 's Twitter Profile Photo

The Chinchilla scaling paper by Hoffmann et al. has been highly influential in the language modeling community. We tried to replicate a key part of their work and discovered discrepancies. Here's what we found. (1/9)

The Chinchilla scaling paper by Hoffmann et al. has been highly influential in the language modeling community. We tried to replicate a key part of their work and discovered discrepancies. Here's what we found. (1/9)
Oriol Vinyals (@oriolvinyalsml) 's Twitter Profile Photo

Today we have published our updated Gemini 1.5 Model Technical Report. As Jeff Dean highlights, we have made significant progress in Gemini 1.5 Pro across all key benchmarks; TL;DR: 1.5 Pro > 1.0 Ultra, 1.5 Flash (our fastest model) ~= 1.0 Ultra. As a math undergrad, our drastic

Today we have published our updated Gemini 1.5 Model Technical Report. As <a href="/JeffDean/">Jeff Dean</a> highlights, we have made significant progress in Gemini 1.5 Pro across all key benchmarks; TL;DR: 1.5 Pro &gt; 1.0 Ultra, 1.5 Flash (our fastest model) ~= 1.0 Ultra.

As a math undergrad, our drastic
Michael Newman (@mikenewmquantum) 's Twitter Profile Photo

Houston, we are below the quantum error correction threshold! 🚀 In “Quantum error correction below the surface code threshold” (arxiv.org/abs/2408.13687), we implement a 101-qubit surface code. Each time we increase the distance by two, the logical error rate is cut in half!

Houston, we are below the quantum error correction threshold! 🚀

In “Quantum error correction below the surface code threshold” (arxiv.org/abs/2408.13687), we implement a 101-qubit surface code. 

Each time we increase the distance by two, the logical error rate is cut in half!
kat (@katclone) 's Twitter Profile Photo

Friends — Do you know anyone with personal or professional experience who has dealt with ADENOID CYSTIC CARCINOMA? Please send any leads to Ashlee Vance, contact and additional details in QT’d tweet 🤍

roon (@tszzl) 's Twitter Profile Photo

Nora Belrose what I meant here is that we do not use our full understanding of physics to build a weather model. you approximate a handful of effects on a coarse area and run the engine. this doesn’t mean that deep learning is more true than physics, it means that running detailed physical

Enrique Piqueras (@epiqueras1) 's Twitter Profile Photo

Another day another tool. JAX Rooflines! When evaluating different chips or topologies for a workload you have to use a bunch of rules of thumb and flops/bandwidth calculations to arrive at relative performance numbers. Now you can free some brain flops and let JAX do the math

Another day another tool. JAX Rooflines!

When evaluating different chips or topologies for a workload you have to use a bunch of rules of thumb and flops/bandwidth calculations to arrive at relative performance numbers.

Now you can free some brain flops and let JAX do the math
Stanislav Fort (@stanislavfort) 's Twitter Profile Photo

Richard Y. Chappell🔸 This is not a fact. It is only true if you are reasoning very short term. The bulk of everything good long term comes from research. But research looks quite unproductive short term. If you take what you say too seriously, you'll cause more harm in aggregate over the long term.

Siqi Chen (@blader) 's Twitter Profile Photo

if you could press a button that cures your child’s brain tumor in exchange for ending your life immediately, every parent would hesitate for zero seconds before fighting to be the first to press it the cruelest thing is that no such button exists. but there is always a move 👇

if you could press a button that cures your child’s brain tumor in exchange for ending your life immediately, every parent would hesitate for zero seconds before fighting to be the first to press it

the cruelest thing is that no such button exists.

but there is always a move 👇
Elon Musk (@elonmusk) 's Twitter Profile Photo

Anyone – of any race, creed or nationality – who came to America and worked like hell to contribute to this country will forever have my respect. America is the land of freedom and opportunity. Fight with every fiber of your being to keep it that way! 🇺🇸🇺🇸🇺🇸🇺🇸🇺🇸🇺🇸🇺🇸🇺🇸🇺🇸🇺🇸🇺🇸🇺🇸🇺🇸

Jacob Austin (@jacobaustin132) 's Twitter Profile Photo

Making LLMs run efficiently can feel scary, but scaling isn’t magic, it’s math! We wanted to demystify the “systems view” of LLMs and wrote a little textbook called “How To Scale Your Model” which we’re releasing today. 1/n

Making LLMs run efficiently can feel scary, but scaling isn’t magic, it’s math! We wanted to demystify the “systems view” of LLMs and wrote a little textbook called “How To Scale Your Model” which we’re releasing today. 1/n
Nima Alidoust (@nalidoust) 's Twitter Profile Photo

Sharing to see if someone can help. Sri Kosuri is searching for clinicians or patients that have encountered pediatric interdigitating dendritic cell sarcoma (IDCS). Leads would be appreciated.

John Preskill (@preskill) 's Twitter Profile Photo

In their roadmap, Microsoft described a protocol for demonstrating a topologically protected qubit. There is no publicly available evidence that this test has been conducted successfully. I hope we will hear more soon. arxiv.org/abs/2502.12252

Sam Bowman (@s8mb) 's Twitter Profile Photo

We can use a harmless wavelength of light to kill nearly all airborne pathogens. The technology exists! It works! And now there is a plan for bringing it out into daily life.

We can use a harmless wavelength of light to kill nearly all airborne pathogens. 

The technology exists! It works! And now there is a plan for bringing it out into daily life.
Anselm Levskaya (@anselmlevskaya) 's Twitter Profile Photo

Armand Domalewski There's just no evidence for the lab leak, and quite a lot for zoonosis. (I used to engineer viral vectors for a living.) The rootclaim debate with Peter Miller and Saar Wilf is long but the best public airing of the evidence. youtube.com/channel/UCAkFd…

Neel Nanda (@neelnanda5) 's Twitter Profile Photo

LLM evals have the glaring blind spot of being focused on "things AI researchers understand". I appreciate people like Adam using their expertise to help, like in this post on evaluating how useful LLMs are at instructing on how to make metal parts. Spoiler: They're terrible

Mary Wang (@maryxw) 's Twitter Profile Photo

Very excited to share this resource that Adam Marblestone and I have been working on for the past few months! It’s an interactive map of scientific problems and potential solutions. What’s most exciting? What’s missing? Let us know! 👇

David Pfau (@pfau) 's Twitter Profile Photo

We desperately need to take all the people in SF who talk about accelerating biology but have only ever done math or CS and have them do a six month rotation in a wet lab.