Jeremy Howard (@jeremyphoward)'s Twitter Profile
Jeremy Howard

@jeremyphoward

🇦🇺 Co-founder: @AnswerDotAI & @FastDotAI ;
Prev: professor @ UQ; Stanford fellow; @kaggle president; @fastmail/@enlitic/etc founder
jeremy.fast.ai

ID: 175282603

Link: http://answer.ai
Joined: 06-08-2010 04:58:18

61K Tweets

247K Followers

5K Following

Hunter📈🌈📊 (@statisticurban)

- Steve Jobs: son of a Syrian immigrant.
- Jeff Bezos: father is a Cuban immigrant.
- Sergey Brin: Soviet immigrant, born in Moscow.
- Jensen Huang: Taiwanese immigrant.
- Elon Musk: South African immigrant.

5 of the magnificent 7.

Casper Hansen (@casper_hansen_)

One of the core limitations of language models: they struggle with true novelty.

LLMs remix known information. If something isn’t in the pretraining or finetuning data, they have very little to work with.

This is why there are a million papers chasing +5 points on GSM8k.
Qwen (@alibaba_qwen)

🚀 Excited to launch Qwen3 models in MLX format today!

Now available in 4 quantization levels: 4bit, 6bit, 8bit, and BF16, optimized for the MLX framework.

👉 Try it now!

Huggingface: huggingface.co/collections/Qw…
ModelScope: modelscope.cn/collections/Qw…
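
For anyone who wants to try this from Python on Apple silicon, below is a minimal sketch using the mlx-lm package (pip install mlx-lm). The repo id is an assumption for illustration; take a real one from the Hugging Face collection linked above.

    # Minimal sketch: load a 4-bit Qwen3 MLX build and generate text.
    # The repo id below is a guess -- pick an actual one from the collection.
    from mlx_lm import load, generate

    model, tokenizer = load("Qwen/Qwen3-4B-MLX-4bit")  # hypothetical repo id

    prompt = "Explain the trade-off between 4-bit and 8-bit quantization in two sentences."
    print(generate(model, tokenizer, prompt=prompt, max_tokens=200))
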
Dieter (@kagglingdieter)

“If you want things to be done safely and responsibly, you do it in the open … Don’t do it in a dark room and tell me it’s safe.” Jensen Huang

MiniMax (official) (@minimax__ai)

Day 1/5 of #MiniMaxWeek: We’re open-sourcing MiniMax-M1, our latest LLM — setting new standards in long-context reasoning.

- World’s longest context window: 1M-token input, 80k-token output
- State-of-the-art agentic use among open-source models
- RL at unmatched efficiency:
htmx.org / CEO of div tags (same thing) (@htmx_org)

100% web servers should be gzipping the js on the way out & it should be cached forever afterwards

minification destroys debuggability & the (underappreciated) #ViewSource affordance: htmx.org/essays/right-c…

/cc Cory Doctorow NONCONSENSUAL BLUE TICK
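
To make the claim concrete, here is a small, self-contained sketch (standard library only, toy numbers) showing how much of the size win gzip already gives you on readable, unminified JS; the "cache forever" part is just a long-lived Cache-Control header on fingerprinted URLs.

    # Illustrative only: gzip removes most of the redundancy in readable JS,
    # so you keep View Source and debuggability without paying much on the wire.
    import gzip

    readable_js = """
    function debounce(callback, waitMilliseconds) {
        let timeoutHandle = null;
        return function debounced(...args) {
            clearTimeout(timeoutHandle);
            timeoutHandle = setTimeout(() => callback.apply(this, args), waitMilliseconds);
        };
    }
    """ * 50  # repeat the snippet to mimic a larger file

    raw = readable_js.encode("utf-8")
    compressed = gzip.compress(raw)
    print(f"readable: {len(raw)} bytes, gzipped: {len(compressed)} bytes "
          f"({100 * len(compressed) / len(raw):.1f}% of original)")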

Charlie Marsh (@charliermarsh)

The Python Steering Council has voted to remove the "experimental" label from the free-threaded ("nogil") builds for Python 3.14. Big step towards making them the default in a future version of CPython!
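
If you want to check what you are running locally, a small probe like the one below works on recent CPython (3.13+); both calls are CPython-specific, and the free-threaded binaries are typically installed as python3.13t / python3.14t.

    # Probe whether this is a free-threaded ("nogil") build and whether the
    # GIL is actually disabled at runtime.
    import sys
    import sysconfig

    free_threaded_build = bool(sysconfig.get_config_var("Py_GIL_DISABLED"))
    gil_enabled = sys._is_gil_enabled() if hasattr(sys, "_is_gil_enabled") else True

    print(f"free-threaded build: {free_threaded_build}")
    print(f"GIL enabled at runtime: {gil_enabled}")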

Jonathan Whitaker (@johnowhitaker)

New video, starting to look at Diffusion Language Models. This one introduces some ideas, then shows how I turn ModernBERT into a LLaDA-style generative model. Lots of avenues to explore from here! Join me in playing with this? Project ideas in thread :) youtube.com/watch?v=Ds_cTc…
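
For a rough idea of what "LLaDA-style" means here: generation is treated as iterative unmasking with a masked LM. The sketch below is not the code from the video, just a simplified illustration against the base (non-finetuned) ModernBERT checkpoint, so expect rough output; it assumes a recent transformers release with ModernBERT support.

    # Toy LLaDA-style loop: append a block of [MASK] tokens to a prompt and
    # repeatedly fill in the single most confident masked position.
    import torch
    from transformers import AutoModelForMaskedLM, AutoTokenizer

    model_id = "answerdotai/ModernBERT-base"
    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForMaskedLM.from_pretrained(model_id).eval()

    prompt = "The key idea behind diffusion language models is"
    prompt_ids = tokenizer(prompt, return_tensors="pt", add_special_tokens=False).input_ids[0]
    num_new_tokens = 16

    ids = torch.cat([prompt_ids, torch.full((num_new_tokens,), tokenizer.mask_token_id)])

    for _ in range(num_new_tokens):
        with torch.no_grad():
            logits = model(ids.unsqueeze(0)).logits[0]
        masked = (ids == tokenizer.mask_token_id).nonzero(as_tuple=True)[0]
        if len(masked) == 0:
            break
        probs = logits[masked].softmax(dim=-1)
        confidence, predictions = probs.max(dim=-1)
        best = confidence.argmax()  # unmask the most confident position first
        ids[masked[best]] = predictions[best]

    print(tokenizer.decode(ids, skip_special_tokens=True))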

Mark Saroufim (@marksaroufim)

Still feels surreal to have been on stage with one of the greatest CEOs of our time, Dr Lisa Su, where she explicitly called out GPU MODE and the work we did to enable the world's first $100K competitive kernel competition

I never in a thousand years would have imagined that a
Alex Zhang (@a1zhang)

btw a shit ton of amazing learning material + open-source code for GPU programming ($150K worth) is linked on the latest GPU MODE news post

a year ago when I was an undergrad I was scouring the internet for these kinds of resources, plz take advantage of it!
Eric Hartford (@cognitivecompai)

My trick: I copy the 2nd LLM's feedback and tell the first LLM, "My drunk friend, who is usually wrong, has this feedback:" and then I paste the feedback. If the model still agrees with the feedback from the drunk, usually-wrong friend, then you can trust that it's good feedback.
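
A minimal sketch of that cross-check as code, assuming the OpenAI Python SDK and an OPENAI_API_KEY in the environment; the prompt wording and model name are illustrative, not anyone's exact setup.

    # Present the second model's critique as low-credibility and see whether
    # the first model still endorses it.
    from openai import OpenAI

    client = OpenAI()

    def drunk_friend_check(original_answer: str, second_model_feedback: str) -> str:
        prompt = (
            f"Here is my draft answer:\n\n{original_answer}\n\n"
            "My drunk friend, who is usually wrong, has this feedback:\n\n"
            f"{second_model_feedback}\n\n"
            "Do you agree with any of it? Keep only the points that genuinely hold up."
        )
        response = client.chat.completions.create(
            model="gpt-4o",  # any chat model works for this pattern
            messages=[{"role": "user", "content": prompt}],
        )
        return response.choices[0].message.content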

Oriol Vinyals (@oriolvinyalsml)

Hello Gemini 2.5 Flash-Lite! So fast, it codes *each screen* on the fly (Neural OS concept 👇). The frontier isn't always about large models and beating benchmarks. In this case, a super fast & good model can unlock drastic use cases. Read more: blog.google/products/gemin…
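
If you want to poke at the model yourself, a minimal sketch with the google-genai Python SDK (pip install google-genai) looks like the following; the model id may need to be the current preview name from the docs, and a GEMINI_API_KEY environment variable is assumed.

    # Ask the fast model to generate a single "screen" as HTML, in the spirit
    # of the Neural OS concept above.
    from google import genai

    client = genai.Client()
    response = client.models.generate_content(
        model="gemini-2.5-flash-lite",  # check the docs for the exact current id
        contents="Generate self-contained HTML for a simple settings screen of a fictional OS.",
    )
    print(response.text)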

MiniMax (official) (@minimax__ai)

Day 2/5 of #MiniMaxWeek: Introducing Hailuo 02, World-Class Quality, Record-Breaking Cost Efficiency 🎥

- Best-in-class instruction following
- Handles extreme physics (yes, it does acrobatics 🤹)
- Native 1080p