Zhongwen Xu (@zhongwen2009) Twitter Tweets • TwiCopy

Looool

5 months ago

Simon Used NotebookLM to recreate slides for Professor David MacKay’s YouTube course on information theory, using transcripts and slides, which are basically images of him deriving equations and showing examples on a blackboard. The slides generated are so information dense and with

Likes: 3 · Replies: 0 · Reposts: 1

-Zho-

@zho_zho_zho

5 months ago

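Holy crap, the ceiling on Nano Banana Pro is so high!!!!!! The original author Kris's idea is so much fun! I extended it a bit: “I want to see how this was designed”. I gave my Tsinghua image to 🍌Pro, and the result, holy crap, is incredible; it even drew the plans, axonometric views, and construction details for me, ahhhhhh. ZHNO | Creative Series | Nano Banana Pro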

Likes: 431 · Replies: 9 · Reposts: 70

Dwarkesh Patel

@dwarkesh_sp

5 months ago

Tomorrow

Likes: 11,11K · Replies: 643 · Reposts: 467

(((ل()(ل() 'yoav))))👾

@yoavgo

5 months ago

the fascinating (to me) quality of hard-core RL researchers (e.g. Sutton, but also many others) is the ability to have this very broad, all encompassing view of RL as the principle basis of intelligence, while at the same time working on super low level stuff like temporal

Likes: 367 · Replies: 17 · Reposts: 15

zhyncs

@zhyncs42

5 months ago

Hard to believe ChatGPT is only three years old. The world feels completely different now.

Likes: 22 · Replies: 0 · Reposts: 1

DeepSeek

@deepseek_ai

5 months ago

🏆 World-Leading Reasoning 🔹 V3.2: Balanced inference vs. length. Your daily driver at GPT-5 level performance. 🔹 V3.2-Speciale: Maxed-out reasoning capabilities. Rivals Gemini-3.0-Pro. 🥇 Gold-Medal Performance: V3.2-Speciale attains gold-level results in IMO, CMO, ICPC World

Likes: 567 · Replies: 13 · Reposts: 67

Chujie Zheng

@chujiezheng

5 months ago

Glad to introduce our research on understanding the "mathematical principles" behind reinforcement learning (RL) with LLMs, and how stabilization techniques work 🧠 📄 huggingface.co/papers/2512.01… 👇 Thread below

Likes: 613 · Replies: 15 · Reposts: 101

Kevin Patrick Murphy

@sirbayes

5 months ago

I am pleased to announce another update to my RL tutorial (arxiv.org/abs/2412.05265). This time I have added code for RLFT for multi-turn LLM agents, using the awesome Tinker library from Thinking Machines, and the simple ReBN training loop from GEM by Zichen Liu et al. With ~100

Likes: 1,1K · Replies: 14 · Reposts: 151

Gautam Kamath

@thegautamkamath

5 months ago

#NeurIPS2026 will be held in Sydney, Australia! #ICML2017 was also in Sydney and was an absolute blast

Likes: 524 · Replies: 18 · Reposts: 27

Logan Kilpatrick

@officiallogank

4 months ago

Gemini 3 Flash punches way above its weight class, surpassing 2.5 Pro on many benchmarks, while being much cheaper, faster, and more token efficient.

Likes: 2,2K · Replies: 146 · Reposts: 169

Zhongwen Xu

@zhongwen2009

4 months ago

Pleased to share our engineering practices for medium-sized LLMs in multi-turn agentic search, where we boosted Qwen3 8B and Qwen3 A3B from 1-2 turn search and 10% accuracy on Browsecomp-Plus to 15+ and 20+ turns with 30% accuracy. The devils are in the details; we hope our

Likes: 512 · Replies: 13 · Reposts: 54

isaac 🧩

@isaacbmiller1

4 months ago

Having played with Qwen-3 32B (not the 30B-A3 version as they do here) on BrowseComp Plus quite a bit, the funniest thing is that it just gives up really easily. The original paper had it at ~3% recall and less than one(!!!) tool call per question. LESS THAN ONE! IT JUST WASNT

Likes: 4 · Replies: 0 · Reposts: 1

Zhongwen Xu

@zhongwen2009

4 months ago

We just uploaded the trained weights to HF. Feel free to play with the models! A3B: huggingface.co/aidenjhwu/Sear… 8B: huggingface.co/aidenjhwu/Sear…

Likes: 82 · Replies: 1 · Reposts: 11

九原客

@9hills

4 months ago

yan5xu blog.vllm.ai/2025/10/28/Kim…

Likes: 106 · Replies: 2 · Reposts: 20

Andrej Karpathy

@karpathy

4 months ago

I've never felt this much behind as a programmer. The profession is being dramatically refactored as the bits contributed by the programmer are increasingly sparse and between. I have a sense that I could be 10X more powerful if I just properly string together what has become

Likes: 37,37K · Replies: 1,1K · Reposts: 4,4K

Boris Cherny

@bcherny

4 months ago

Replying to Andrej Karpathy: I feel this way most weeks tbh. Sometimes I start approaching a problem manually, and have to remind myself “claude can probably do this”. Recently we were debugging a memory leak in Claude Code, and I started approaching it the old-fashioned way: connecting a profiler, using the

Likes: 7,7K · Replies: 155 · Reposts: 484

Jackson Kernion

@jacksonkernion

4 months ago

I'm trying to figure out what to care about next. I joined Anthropic 4+ years ago, motivated by the dream of building AGI. I was convinced from studying philosophy of mind that we're approaching sufficient scale and that anything that can be learned can be learned in an RL env.

Likes: 1,1K · Replies: 157 · Reposts: 47

Boris Cherny

@bcherny

4 months ago

Anas Daniel Simon Willison I built it just now, it's a great feature request

Likes: 1,1K · Replies: 37 · Reposts: 23

Jaana Dogan ヤナドガン

@rakyll

4 months ago

I'm not joking and this isn't funny. We have been trying to build distributed agent orchestrators at Google since last year. There are various options, not everyone is aligned... I gave Claude Code a description of the problem, it generated what we built last year in an hour.

Likes: 24,24K · Replies: 791 · Reposts: 2,2K

Jerry Tworek

@millionint

3 months ago

It’s the best moment in history for a small team to make a gigantic difference

Likes: 493 · Replies: 15 · Reposts: 28
