Zitong Yang (@zitongyang0)'s Twitter Profile
Zitong Yang

@zitongyang0

Statistician

ID: 1063135984399740928

Link: https://zitongyang.github.io/ · Joined: 15-11-2018 18:25:45

259 Tweets

701 Followers

377 Following

Thinking Machines (@thinkymachines)'s Twitter Profile Photo

LoRA makes fine-tuning more accessible, but it's unclear how it compares to full fine-tuning. We find that the performance often matches closely---more often than you might expect. In our latest Connectionism post, we share our experimental results and recommendations for LoRA.

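The LoRA idea the tweet compares against full fine-tuning can be sketched in a few lines of PyTorch. This is a generic illustration, not code from the Thinking Machines post; the layer sizes, rank `r=8`, and `alpha/r` scaling are assumptions following the common LoRA convention:

```python
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    """Wrap a frozen linear layer with a trainable low-rank update:
    y = W x + (alpha / r) * B A x, with A of shape (r, d_in) and B of shape (d_out, r)."""
    def __init__(self, base: nn.Linear, r: int = 8, alpha: float = 16.0):
        super().__init__()
        self.base = base
        for p in self.base.parameters():
            p.requires_grad = False          # full weights stay frozen
        self.A = nn.Parameter(torch.randn(r, base.in_features) * 0.01)
        self.B = nn.Parameter(torch.zeros(base.out_features, r))  # zero init: no change at step 0
        self.scale = alpha / r

    def forward(self, x):
        return self.base(x) + self.scale * (x @ self.A.t() @ self.B.t())

layer = LoRALinear(nn.Linear(512, 512), r=8)
trainable = sum(p.numel() for p in layer.parameters() if p.requires_grad)
total = sum(p.numel() for p in layer.parameters())
```

Only `A` and `B` receive gradients (here ~8K parameters against ~263K frozen ones), which is the accessibility win the tweet refers to; because `B` starts at zero, training begins exactly at the base model's behavior.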
Berkeley Physics (@berkeleyphysics)'s Twitter Profile Photo

Nobel laureate George Smoot, UC Berkeley physicist whose work with satellite experiments confirmed the Big Bang theory, has died at 80. news.berkeley.edu/2025/09/29/nob…

Thinking Machines (@thinkymachines)'s Twitter Profile Photo

Introducing Tinker: a flexible API for fine-tuning language models. Write training loops in Python on your laptop; we'll run them on distributed GPUs. Private beta starts today. We can't wait to see what researchers and developers build with cutting-edge open models!

Ruiqi Zhong (@zhongruiqi)'s Twitter Profile Photo

Very excited about this release!! As a former grad student, I struggled to fine-tune LLMs. Even when the GPUs were enough, it was painful to set up the infra correctly. Tinker allows more researchers to understand and improve language models, beyond a few well-funded labs.

Sam Buchanan (@_sdbuchanan)'s Twitter Profile Photo

We wrote a book about representation learning! It’s fully open source, available and readable online, and covers everything from theoretical foundations to practical algorithms. 👷‍♂️ We’re hard at work updating the content for v2.0, and would love your feedback and contributions

Druv Pai (@druv_pai)'s Twitter Profile Photo

🚨 We wrote a new AI textbook "Learning Deep Representations of Data Distributions"! TL;DR: We develop principles for representation learning in large scale deep neural networks, show that they underpin existing methods, and build new principled methods.

Druv Pai (@druv_pai)'s Twitter Profile Photo

Why and how do diffusion models memorize vs generalize? Can we have scaling laws for memorization? This is increasingly relevant scientifically and pragmatically (e.g. Sora 2). 🚨 Our new preprint "On the Edge of Memorization in Diffusion Models" addresses this timely question!

CLS (@chengleisi)'s Twitter Profile Photo

I’ll be at #COLM2025 this week! I’ll give a lightning talk at the Visions Workshop at 11am on Friday and hang around our LM4SCI @ COLM2025 workshop! DM me if you wanna chat. We have some exciting ongoing projects on automating post-/pre-training research.

Zitong Yang (@zitongyang0)'s Twitter Profile Photo

The passing of the physicist Chen-Ning Yang (nytimes.com/2025/10/18/sci…) saddens me. He has been a long-time hero and role model for me. Below is a short essay I wrote yesterday about Yang that I shared with many of my friends. I translated it into English using Gemini: ``` The

John Schulman (@johnschulman2)'s Twitter Profile Photo

Fine-tuning APIs are becoming more powerful and widespread, but they're harder to safeguard against misuse than fixed-weight sampling APIs. Excited to share a new paper: Detecting Adversarial Fine-tuning with Auditing Agents (arxiv.org/abs/2510.16255). Auditing agents search

Simon Guo 🦝 (@simonguozirui)'s Twitter Profile Photo

Wrote a 1-year retrospective with Alex L Zhang on KernelBench and the journey toward automated GPU/CUDA kernel generations! Since my labmates (Anne Ouyang, Simran Arora, William Hu) and I first started working towards this vision around last year’s @GPU_mode hackathon, we have

Thinking Machines (@thinkymachines)'s Twitter Profile Photo

Our latest post explores on-policy distillation, a training approach that unites the error-correcting relevance of RL with the reward density of SFT. When training it for math reasoning and as an internal chat assistant, we find that on-policy distillation can outperform other

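As a rough intuition for the method the tweet describes (not the post's actual recipe), here is a toy numpy sketch of on-policy distillation on a single-token "vocabulary": tokens are sampled from the student (on-policy, as in RL), and each sampled token receives a dense per-token reward from the teacher's log-probability (SFT-like supervision density). The vocabulary size, logits, and REINFORCE-style estimator are all illustrative assumptions:

```python
import numpy as np

rng = np.random.default_rng(0)

def softmax(z):
    z = z - z.max()
    e = np.exp(z)
    return e / e.sum()

# Toy setup: a fixed "teacher" distribution and a trainable "student".
teacher_logits = np.array([2.0, 1.0, 0.0, -1.0, -2.0])
log_p_t = teacher_logits - np.log(np.exp(teacher_logits).sum())
student_logits = np.zeros(5)

def reverse_kl(logits):
    """KL(student || teacher), the quantity on-policy distillation drives down."""
    p = softmax(logits)
    return float((p * (np.log(p) - log_p_t)).sum())

kl_start = reverse_kl(student_logits)

lr, batch = 0.5, 16
for _ in range(300):
    p_s = softmax(student_logits)
    # On-policy: sample tokens from the *student*, then score each with the teacher.
    acts = rng.choice(5, size=batch, p=p_s)
    grad = np.zeros(5)
    for a in acts:
        # Dense per-token reward: teacher log-prob minus student log-prob.
        reward = log_p_t[a] - np.log(p_s[a])
        onehot = np.eye(5)[a]
        grad += reward * (onehot - p_s)   # REINFORCE estimate of -d(KL)/d(logits)
    student_logits += lr * grad / batch

kl_end = reverse_kl(student_logits)
```

The two ingredients the tweet names are visible here: the sampling step is on-policy (the student only gets feedback on tokens it would actually produce, the error-correcting relevance of RL), while the reward is available at every sampled token rather than only at episode end (the density of SFT).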
Kevin Lu (@_kevinlu)'s Twitter Profile Photo

In our new post, we walk through great prior work from Rishabh Agarwal & the Qwen team exploring on-policy distillation using an open-source recipe: you can run our experiments on Tinker today! github.com/thinking-machi… I'm especially excited by the use of on-policy

Judy Shen (@judyhshen)'s Twitter Profile Photo

I DEFENDED MY PHD THIS WEEK! 🎉 So grateful for the guidance of my advisor and committee! Special thanks to my friends and family who supported me through every up and down 🥺🥰

Sarah Cen (@cen_sarah)'s Twitter Profile Photo

In the AI ecosystem, who supplies the data? the compute? the models? We just released a new tool on the AI Supply Chain. Our dataset reveals how AI models, data, compute, capital, and even talent change hands. Here’s why you should care 👇
