Roy Frostig (@froystig)'s Twitter Profile
Roy Frostig

@froystig

research scientist at @googledeepmind. co-author of JAX (github.com/jax-ml/jax)

ID: 14295070

Website: https://cs.stanford.edu/~rfrostig/
Joined: 03-04-2008 17:26:40

123 Tweets

1.1K Followers

603 Following

Mathieu Blondel (@mblondel_ml)'s Twitter Profile Photo

After 6 months of hard work, happy to share JAXopt: hardware-accelerated, batchable, and differentiable optimizers in JAX github.com/google/jaxopt We aim to cover many ML use cases: stochastic optimization of DL models, constrained/non-smooth optimization, bi-level optimization, optimization layers...
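The "differentiable optimizers" idea above means you can take gradients *through* an optimization solve itself. Below is a minimal hand-rolled sketch of that idea in plain JAX (not JAXopt's actual API): an inner ridge-regression solve by unrolled gradient descent, differentiated with respect to its regularization hyperparameter. The data `X`, `y` and all step counts are hypothetical.

```python
import jax
import jax.numpy as jnp

# Toy data (hypothetical): fit w so that X @ w ≈ y.
X = jnp.eye(2)
y = jnp.array([1.0, 2.0])

def ridge_solve(lam, steps=200, lr=0.1):
    """Inner problem: plain gradient descent on a ridge-regression loss."""
    def loss(w):
        return jnp.mean((X @ w - y) ** 2) + lam * jnp.sum(w ** 2)
    w = jnp.zeros_like(y)
    for _ in range(steps):
        w = w - lr * jax.grad(loss)(w)  # unrolled, hence differentiable
    return w

def val_loss(lam):
    # The whole unrolled solve is a differentiable function of lam,
    # so we can take a "hypergradient" of a downstream loss.
    w = ridge_solve(lam)
    return jnp.mean((X @ w - y) ** 2)

hypergrad = jax.grad(val_loss)(0.1)
```

JAXopt implements this more efficiently (e.g. via implicit differentiation rather than unrolling), but the sketch shows what "differentiable optimizer" buys you: hyperparameters become just another input to `jax.grad`.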

David Hall (@dlwh)'s Twitter Profile Photo

Today, I’m excited to announce the release of Levanter 1.0, our new JAX-based framework for training foundation models, which we’ve been working on at the Center for Research on Foundation Models. Levanter is designed to be legible, scalable and reproducible. crfm.stanford.edu/2023/06/16/lev…

Daniel Johnson (@_ddjohnson)'s Twitter Profile Photo

Excited to share Penzai, a JAX research toolkit from Google DeepMind for building, editing, and visualizing neural networks! Penzai makes it easy to see model internals and lets you inject custom logic anywhere. Check it out on GitHub: github.com/google-deepmin…

Adam Paszke (@apaszke)'s Twitter Profile Photo

Many of you are excited about H100 attention, so it’s a good time to show you Mosaic GPU: a Python DSL for H100s. The attention example matches FA3 performance, while being only ~200 lines of Python: github.com/google/jax/blo… It's easy to install too! Latest JAX packages have it.

Sharad Vikram (@sharadvikram)'s Twitter Profile Photo

Finally got around to writing a guide for matrix multiplication on TPUs using Pallas. Check it out! jax.readthedocs.io/en/latest/pall…
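To give a flavor of what the guide covers: Pallas lets you write a kernel against block references and launch it with `pl.pallas_call`. Here is a deliberately minimal sketch (not the guide's tuned TPU kernel) in which a single block covers each whole operand; `interpret=True` runs it in interpreter mode so it works without a TPU. Shapes and names are made up for illustration.

```python
import jax
import jax.numpy as jnp
from jax.experimental import pallas as pl

def matmul_kernel(a_ref, b_ref, o_ref):
    # Each ref is a block of the operand; here one block spans the whole array.
    o_ref[...] = a_ref[...] @ b_ref[...]

def matmul(a, b):
    return pl.pallas_call(
        matmul_kernel,
        out_shape=jax.ShapeDtypeStruct((a.shape[0], b.shape[1]), a.dtype),
        interpret=True,  # interpreter mode: runs off-TPU for experimentation
    )(a, b)

a = jnp.ones((8, 16), jnp.float32)
b = jnp.ones((16, 4), jnp.float32)
out = matmul(a, b)
```

A real TPU matmul kernel, as in the guide, would add a `grid` and `BlockSpec`s so tiles of `a` and `b` stream through fast memory with an accumulation loop; the structure above is the starting point.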

Dan F-M (@exoplaneteer)'s Twitter Profile Photo

I've finally landed my first proper JAX feature since joining the team: a supported "foreign function interface", which makes it easier to call into external libraries from within JAX code. Check it out: jax.readthedocs.io/en/latest/ffi.…

Jeremy Bernstein (@jxbz)'s Twitter Profile Photo

Modula x JAX = Modulax. theseriousadult is cracked and ported Modula to JAX in a few days. I haven't had a chance to test it yet, but I'm really excited about this project. Tagging Roy Frostig and Matthew Johnson github.com/GallagherComma… (1/3)

Sharad Vikram (@sharadvikram)'s Twitter Profile Photo

We now have a guide to writing distributed communication on TPU using Pallas, written by Justin Fu! jax.readthedocs.io/en/latest/pall… Overlapping comms + compute is a crucial performance optimization for large scale ML. Write your own custom overlapped kernels in Python!

Roy Frostig (@froystig)'s Twitter Profile Photo

Our online book on systems principles of LLM scaling is live. We hope that it helps you make the most of your computing resources. Enjoy!
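The kind of back-of-the-envelope reasoning the book teaches can be sketched in a few lines. One widely used rule of thumb is that dense-transformer training costs roughly 6 × parameters × tokens FLOPs (2 for the forward matmuls, 4 for backward); combined with an assumed hardware utilization, this bounds wall-clock training time. All concrete numbers below are hypothetical.

```python
def training_flops(n_params: float, n_tokens: float) -> float:
    """Rule-of-thumb total training FLOPs: ~6 * params * tokens."""
    return 6.0 * n_params * n_tokens

def training_days(flops: float, chip_flops_per_sec: float, n_chips: int,
                  utilization: float = 0.4) -> float:
    """Wall-clock days at an assumed model FLOPs utilization (MFU)."""
    seconds = flops / (chip_flops_per_sec * n_chips * utilization)
    return seconds / 86400.0

# Hypothetical run: 70B params on 15T tokens.
flops = training_flops(70e9, 15e12)  # ≈ 6.3e24 FLOPs
days = training_days(flops, chip_flops_per_sec=1e15, n_chips=1000)
```

The book's actual analysis goes well beyond this, accounting for memory bandwidth, interconnect, and parallelism strategy, but the 6ND estimate is the usual first step.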

Jeff Dean (@jeffdean)'s Twitter Profile Photo

Training our most capable Gemini models relies heavily on our JAX software stack + Google's TPU hardware platforms. If you want to learn more, see this awesome book "How to Scale Your Model": jax-ml.github.io/scaling-book/ It was put together by my Google DeepMind colleagues.