Mick (@mickvanhulst) 's Twitter Profile
Mick

@mickvanhulst

Staff Applied Scientist @TomTom, working on automated mapmaking (HD Maps). Opinions my own.

ID: 904051307350196224

Link: https://github.com/mickvanhulst | Joined: 02-09-2017 18:40:02

341 Tweets

100 Followers

1.1K Following

Bingyi Kang (@bingyikang) 's Twitter Profile Photo

After a year of team work, we're thrilled to introduce Depth Anything 3 (DA3)! 🚀 Aiming for human-like spatial perception, DA3 extends monocular depth estimation to any-view scenarios, including single images, multi-view images, and video. In pursuit of minimal modeling, DA3

Yunhao Luo (@yluo_y) 's Twitter Profile Photo

🚀Excited to share our NeurIPS Conference paper Compositional Diffuser (CompDiffuser)! CompDiffuser scales planning horizons at test time, able to construct long-horizon plans while only trained on short-horizon data. Project page: comp-diffuser.github.io Thread 👇(1/n)

Chieh-Hsin (Jesse) Lai (@jcjesselai) 's Twitter Profile Photo

🎉 Our diffusion-model monograph 《The Principles of Diffusion Models》 now has an official website! 🔗 Link in thread 💡 Two highlights: 1️⃣ Blog Post: a big-picture walkthrough from diffusion models → fast generation & flow map models; 2️⃣ Teaching Guide: lightweight course

sway (@swaystar123) 's Twitter Profile Photo

To everyone who was questioning the muon results: You are right! The LR was badly tuned, and weight decay was also hampering it. Using lr 1e-3 and 0 wd, got a new best, 3.39 FID @ 400k

Mick (@mickvanhulst) 's Twitter Profile Photo

Whenever I read about LLMs as a jury, I wonder how different these LLMs really are. Ideally their errors would be uncorrelated: if two judges each err independently with probability epsilon, the joint error rate drops to epsilon squared. But their base models have all been trained on the
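The tweet's point can be made concrete with a toy calculation. This is a hedged sketch, not anything from the tweet itself: `eps` is a hypothetical per-judge error rate, and the jury is assumed to be wrong only when both judges err on the same input.

```python
# Toy model of an LLM jury: two judges, each wrong with probability eps.
eps = 0.05  # hypothetical per-judge error rate

# If errors are independent, both must fail together: joint error = eps^2.
independent_joint_error = eps ** 2

# If errors are fully correlated (e.g. same base model, same blind spots),
# the second judge adds no information: joint error stays at eps.
correlated_joint_error = eps

print(independent_joint_error)  # far below eps
print(correlated_joint_error)
```

The gap between the two numbers is exactly what's at stake in the tweet: the epsilon-squared benefit evaporates to the extent that the judges share failure modes.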

Hamel Husain (@hamelhusain) 's Twitter Profile Photo

This deserved its own flashcard b/c I've seen bus stop ads from eval vendors encouraging the opposite in San Francisco 🤣 The only thing generic metrics do is waste your time. Links in reply.

Niels Rogge (@nielsrogge) 's Twitter Profile Photo

One of the best visual explanations I've ever seen for why scaling Transformers works, but is suboptimal, as it's just brute-forcing things, by Llion Jones (co-author of the Transformer) on Machine Learning Street Talk "In the (rejected) paper "Intelligent Matrix Exponentiation", they show

Mick (@mickvanhulst) 's Twitter Profile Photo

Quite curious about Grokipedia! May be a great opportunity to make it a verifiable large-scale knowledge graph for better context management via graph traversal. Reduces LLM need to memorize the world, enabling a "cognitive core"-like approach (see Karpathy's tweets). Do wonder

Google (@google) 's Twitter Profile Photo

We’re launching full-length, on demand practice exams for standardized tests in Google Gemini, starting with the SAT, available now at no cost. Practice SATs are grounded in rigorously vetted content in partnership with The Princeton Review, and Gemini will provide immediate feedback

Wall Street Mav (@wallstreetmav) 's Twitter Profile Photo

If science class explained everything with Midwest emo music, we would have a lot more people paying attention. Nuclear power explained. It’s just a fancy way to heat water and make electricity. 🔊

Boring_Business (@boringbiz_) 's Twitter Profile Photo

This was an eye opener from Jensen Huang When asked whether he would rather relive his 20s or be 20 years old today, this is what he had to say: "I thought our 20s were happier than these 20s. I think everyone deserves some time to be oblivious, and not wear all of the world's

Robert Cincotta (@drrobcincotta) 's Twitter Profile Photo

I have been testing the new Obsidian CLI with Claude Code on my research vault (4,663 files, 16 GB)... I know, too many notes!! Early results are significant. It's going to change the way in which Claude Code can interact with Obsidian. The way I see it, there are three ways

Dimitri (@dimitrikennedy) 's Twitter Profile Photo

The next big level up for me won't be a better model, just boring infra. Better container orchestration. Personal compute networking. Secret management. Environment replication. Designed for a world where one person is running 100+ parallel workstreams.

Om Patel (@om_patel5) 's Twitter Profile Photo

I taught Claude to talk like a caveman to use 75% less tokens. normal claude: ~180 tokens for a web search task caveman claude: ~45 tokens for the same task "I executed the web search tool" = 8 tokens caveman version: "Tool work" = 2 tokens every single grunt swap saves 6-10

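The tweet's numbers are easy to sanity-check. This sketch only reuses the figures quoted above (180 vs. 45 tokens per task, 8 vs. 2 tokens per phrase); they are the tweet's claims, not independently measured values.

```python
# Checking the claimed savings from terse ("caveman") tool narration.
normal_tokens = 180   # claimed tokens for a verbose web-search task
caveman_tokens = 45   # claimed tokens for the terse version

# Fractional saving: 1 - 45/180 = 0.75, matching the "75% less tokens" claim.
savings = 1 - caveman_tokens / normal_tokens
print(f"{savings:.0%} fewer tokens")  # 75% fewer tokens

# Per-phrase example: "I executed the web search tool" (8 tokens)
# replaced by "Tool work" (2 tokens) saves 6 tokens per swap.
per_swap_saving = 8 - 2
```

So the headline 75% figure is internally consistent with the per-task numbers, and the 8→2 phrase swap sits at the low end of the quoted 6-10 token range.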
James Zou (@james_y_zou) 's Twitter Profile Photo

Big Update🤩: #paperclip now includes full papers from all of arXiv, PubMed Central and 150 million abstracts!🖇️ You can give your LLM all that knowledge in one line—all optimally indexed for AI agents. Much more thorough and ~100x faster than web search, and free.
