tgeo92
@tjoskz
ID: 863938419428175874
15-05-2017 02:05:45
1,1K Tweet
441 Followers
4,4K Following
I wrote a blogpost "On the speed of ViTs and CNNs". Addresses the following concerns I often hear: - worry about ViTs speed at high resolution. - how high resolution do I need? - is it super important to keep the aspect ratio? I think Yann LeCun might like it too! Link below
New Video: How to write an okay research paper. Reviewers all agree! Sasha Rush's papers are "reasonably structured" and "somewhat clear, despite other flaws". youtu.be/qNlwVGxkG7Q?si…
I'm thrilled that the paper Kaiqing Zhang and I co-authored has made it onto the reading list for this course! ambujtewari.com/LLM-fall2024/ Any questions would be appreciated.
Giving your models more time to think before prediction, like via smart decoding, chain-of-thoughts reasoning, latent thoughts, etc, turns out to be quite effective for unblocking the next level of intelligence. New post is here :) “Why we think”: lilianweng.github.io/posts/2025-05-…