David Cardozo 🇨🇦 (@_davidcardozo) 's Twitter Profile
David Cardozo 🇨🇦

@_davidcardozo

GDE ML in JAX/FLAX | Machine Learning Scientist in Quebec | Kubeflow Community member. Just build it!

ID: 139483711

Link: https://davidcardozo.com/ | Date: 02-05-2010 19:23:51

1.1K Tweets

628 Followers

1.1K Following

merve (@mervenoyann) 's Twitter Profile Photo

MatchAnything is an insane framework: the authors have tried to get every view that they can and dump them into modern keypoint matching models. For instance, you can match an iPhone map view to a Google aerial view, or thermal camera views to day views, even if the images are warped!

Sayak Paul (@risingsayak) 's Twitter Profile Photo

Poco, who uses Framework H, is good friends with Kutu, who uses Framework X. Kutu prefers Framework X as it apparently delivers better speedups than H when Framework X code is written in a proper manner. But Kutu has never ever demonstrated it IRL. Just buzzword keywords. Poco, on

Divya Makkar (@_divyamakkar) 's Twitter Profile Photo

I spent the past few months building JAXformer: One of the first open source guides on how to scale modern transformers in JAX. Trained entirely on TPUs, it supports distributed ML, Ray tokenization, MoE, n-D parallelism and end-to-end inference. Here's how to do it:

Stas Bekman (@stasbekman) 's Twitter Profile Photo

Someone has finally published nuances of how NCCL algorithms and protocols work. Thank you so much to the authors since documentation is so scarce! arxiv.org/abs/2507.04786

Sasha Rush (@srush_nlp) 's Twitter Profile Photo

Fun tensor-puzzle in the wild in the recent anthropic blog post. Can anyone do it in 1 line? anthropic.com/engineering/a-…

Vipul Gupta (@vipul_1011) 's Twitter Profile Photo

Took me 4 days to read this blog, but totally worth it. A great detailed guide to post training. Why aren't many people talking about it? Kudos to Han Fang and Karthik A Sankararaman 🇮🇳🇺🇸 for sharing their thoughts. tokens-for-thoughts.notion.site/post-training-…

Sayak Paul (@risingsayak) 's Twitter Profile Photo

Today, we're shipping native support for context-parallelism to help make diffusion inference go brrr on multiple GPUs 🚀 Our CP API is made to work with two flavors of distributed attention: Ring & Ulysses. Huge thanks to Aryan V S for shipping this! Deets ⬇️

Sayak Paul (@risingsayak) 's Twitter Profile Photo

Feeling so happy that we got accepted to #NeurIPS2025 😭 This was a genuinely fulfilling piece of work, and a lot of knobs needed tinkering with. Check out the thread below for more details!

Alexia Jolicoeur-Martineau (@jm_alexia) 's Twitter Profile Photo

New paper 📜: Tiny Recursion Model (TRM) is a recursive reasoning approach with a tiny 7M parameters neural network that obtains 45% on ARC-AGI-1 and 8% on ARC-AGI-2, beating most LLMs. Blog: alexiajm.github.io/2025/09/29/tin… Code: github.com/SamsungSAILMon… Paper: arxiv.org/abs/2510.04871

Adam Paszke (@apaszke) 's Twitter Profile Photo

Want to improve GPU compute/comms overlap? We just published a new short tutorial for you! A few small changes to the Pallas:MGPU matmul kernel is all it takes to turn it into an all-gather collective matmul that overlaps NVLINK comms with local compute: docs.jax.dev/en/latest/pall…