Alan Dao (@alandao_ai)'s Twitter Profile
Alan Dao

@alandao_ai

AI Researcher at Menlo Research. Author of Jan, Lucy, Jan-nano, Ichigo, AlphaMaze, and various other works.

ID: 1247124079271751680

Link: https://alandao.net · Joined: 06-04-2020 11:28:55

546 Tweets

324 Followers

23 Following

Alan Dao (@alandao_ai):

😱Unreasonable efficiency of GPT-OSS-20B reasoning trace 😱

Yeah… it’s really good. This is exactly what we wanted to achieve with the Lucy model, a natural and effective reasoning trace that is less prone to hallucination.

Well, Lucy is a 1.7 B model after all, so it’s
Mitko Vasilev (@iotcoi):

Alan Dao Ivan Fioravanti ᯅ: It's too bad that M-chips can't use FP4. To achieve the same quality of output as a Blackwell chip, Apple requires four times the memory. This is yet another strike from NVIDIA's software ecosystem monopoly.
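(For context on the memory figure: FP4 packs two weights into a byte while FP16 uses two bytes per weight, so the weights-only footprint differs by a factor of four. A back-of-the-envelope sketch, weights only, ignoring KV cache and activations:)

```python
# Rough weights-only memory footprint of a 20B-parameter model at
# different precisions. FP4 packs two weights per byte (0.5 bytes/param),
# FP16 uses 2 bytes/param -- a 4x gap, which is the figure in the tweet.
params = 20e9
bytes_per_param = {"FP16": 2.0, "FP8": 1.0, "FP4": 0.5}

for fmt, nbytes in bytes_per_param.items():
    print(f"{fmt}: {params * nbytes / 1e9:.0f} GB")
# FP16: 40 GB
# FP8:  20 GB
# FP4:  10 GB
```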

Kieran Klaassen (@kieranklaassen):

Claude Code can run GPT-5.

GPT-5 is good at fixing nasty bugs and doing research, in a different way than Claude, and they can work together.

Just create a Claude Code agent called gpt5:
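(The agent definition that presumably followed is not in this capture. Purely as a hypothetical illustration of the delegation idea, a small Python helper that such an agent could shell out to in order to ask GPT-5 about a bug might look like the sketch below; the model name and the OpenAI client usage are assumptions, not the setup from the original post.)

```python
# Hypothetical helper a "gpt5" Claude Code agent could invoke to delegate
# a debugging question to GPT-5. Model name and client usage are assumptions.
import sys
from openai import OpenAI  # pip install openai

def ask_gpt5(bug_report: str) -> str:
    client = OpenAI()  # reads OPENAI_API_KEY from the environment
    resp = client.chat.completions.create(
        model="gpt-5",  # assumed model identifier
        messages=[
            {"role": "system",
             "content": "You are a debugging assistant. Find the root cause "
                        "and propose a minimal fix."},
            {"role": "user", "content": bug_report},
        ],
    )
    return resp.choices[0].message.content

if __name__ == "__main__":
    # Pipe a bug report in on stdin, e.g.: cat bug.txt | python ask_gpt5.py
    print(ask_gpt5(sys.stdin.read()))
```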
Shuangfei Zhai (@zhaisf):

Unlike an RNN, a single attention block alone cannot model anything interesting; it's the stacking that does wonders. Understanding this compositionality should be at least as important as understanding the attention module itself.
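(As a rough illustration of that point, my sketch rather than the author's: a single attention block is just one layer of mixing, and the depth comes from composing many attention + MLP blocks with residual connections, assuming PyTorch.)

```python
# Minimal sketch: one attention block vs. a stack of them.
import torch
import torch.nn as nn

class Block(nn.Module):
    def __init__(self, d_model=64, n_heads=4):
        super().__init__()
        self.attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)
        self.mlp = nn.Sequential(nn.Linear(d_model, 4 * d_model), nn.GELU(),
                                 nn.Linear(4 * d_model, d_model))
        self.ln1 = nn.LayerNorm(d_model)
        self.ln2 = nn.LayerNorm(d_model)

    def forward(self, x):
        h = self.ln1(x)
        x = x + self.attn(h, h, h, need_weights=False)[0]  # self-attention + residual
        x = x + self.mlp(self.ln2(x))                      # position-wise MLP + residual
        return x

single = Block()                                      # one block in isolation
stack = nn.Sequential(*[Block() for _ in range(12)])  # stacking = composition of blocks

x = torch.randn(2, 16, 64)              # (batch, seq_len, d_model)
print(single(x).shape, stack(x).shape)  # both torch.Size([2, 16, 64])
```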
Sebastian Raschka (@rasbt):

Pretty cool. I think 2025-2026 will bring a stronger focus on these in open-source tooling, i.e. having LLMs delegate knowledge-based queries to search, which in turn frees up model capacity for reasoning and tool use.
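(A minimal sketch of that delegation pattern using OpenAI-style function calling; the `web_search` helper, the model name, and the example query are placeholders rather than any particular project's API.)

```python
# Sketch: let the model delegate knowledge lookups to a search tool.
import json
from openai import OpenAI  # pip install openai

def web_search(query: str) -> str:
    # Placeholder: in practice this would call a real search API.
    return f"[search results for: {query}]"

tools = [{
    "type": "function",
    "function": {
        "name": "web_search",
        "description": "Look up facts on the web instead of answering from memory.",
        "parameters": {
            "type": "object",
            "properties": {"query": {"type": "string"}},
            "required": ["query"],
        },
    },
}]

client = OpenAI()
messages = [{"role": "user", "content": "Who maintains the Jan app?"}]
resp = client.chat.completions.create(model="gpt-4o-mini",  # any tool-calling model
                                      messages=messages, tools=tools)
msg = resp.choices[0].message

# If the model chose to search, run the tool and hand the results back.
if msg.tool_calls:
    call = msg.tool_calls[0]
    result = web_search(**json.loads(call.function.arguments))
    messages += [msg, {"role": "tool", "tool_call_id": call.id, "content": result}]
    resp = client.chat.completions.create(model="gpt-4o-mini",
                                          messages=messages, tools=tools)

print(resp.choices[0].message.content)
```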

@levelsio (@levelsio):

I really really like 👋 Jan

It's a very friendly app to locally run LLMs, great for privacy

I've tried others like LM Studio and Ollama and they're nice but very engineer-built, a bit too difficult for me

Jan is simple and cute and pretty and a great alternative to talk to
👋 Jan (@jandotai):

Jan v1 is trending on Hugging Face today.

Huge thanks to everyone trying it out, giving feedback, and sharing your setups.

We see you 💙
👋 Jan (@jandotai):

Have your own version of Perplexity Pro's Deep Research with Jan v1, the open-source alternative. Cookbook: jan.ai/post/jan-v1-fo…