Gilad (@gil2rok) Twitter Tweets • TwiCopy

Gilad

@gil2rok

stats + probabilistic ml researcher (sampling, generative models). I sort in exponential time. novelty seeker. @FlatironInst @Columbia. DMs open.

ID: 1727791812884963328

Link: https://gil2rok.github.io/

Joined: 23-11-2023 20:51:12

939 Tweets

746 Followers

2.2K Following

Gilad

@gil2rok

4 months ago

I wish Docker was as elegant as JAX. Is there no such tool for containerization?

Likes: 3 · Replies: 2 · Retweets: 0

Gilad

@gil2rok

4 months ago

I dunno… Dating apps + LaTeX is a dangerous combination!

Likes: 4 · Replies: 0 · Retweets: 0

- Probabilistic ML: An Introduction by Kevin Murphy
- Probabilistic ML: Advanced Topics by also Kevin Murphy
- Reinforcement Learning: An Overview by also also Kevin Murphy
- A First Course in Monte Carlo Methods by D. Sanz-Alonso and O. Al-Ghattas

Likes: 6 · Replies: 0 · Retweets: 2

Gilad

@gil2rok

4 months ago

SF looking beautiful today!

Likes: 1 · Replies: 0 · Retweets: 0

Nathan Lambert

@natolambert

4 months ago

America needs to take open models more seriously. This summer the early lead in open model adoption of the US via Llama has been overtaken by Chinese models. With The American Truly Open Models (ATOM) Project we're looking to build support and express the urgency of this issue.

Likes: 596 · Replies: 28 · Retweets: 105

Gilad

@gil2rok

4 months ago

This is my favorite NeurIPS workshop. Such cool work!

Likes: 3 · Replies: 0 · Retweets: 0

Gilad

@gil2rok

4 months ago

TIL: Big tech companies often use *monorepos* (a single repo for all code) to manage dependencies across *microservices* (independently deployable services). Counter-intuitive but powerful: centralized code, distributed deployment.

Likes: 2 · Replies: 0 · Retweets: 0

Gilad

@gil2rok

4 months ago

oh you're an ai influencer? Name every ai researcher.

Likes: 1 · Replies: 2 · Retweets: 0

Gilad

@gil2rok

4 months ago

JAX FOR THE WIN!! It’s a production-ready world modeling codebase.

Likes: 6 · Replies: 0 · Retweets: 0

Gilad

@gil2rok

4 months ago

One of the most significant parts of GPT-5 (and other top LLMs) is that it takes us closer to “on-demand software”. This will radically transform the economy.

Likes: 0 · Replies: 0 · Retweets: 0

Gilad

@gil2rok

4 months ago

Spotted Cluley in SF

Likes: 4 · Replies: 0 · Retweets: 0

Gilad

@gil2rok

4 months ago

Any advice for when to use Claude Code vs Cursor, Simon Willison? Claude Code: requires high trust, large code changes, non-critical code. Cursor: more granular changes, encourages more hands-on approvals, when you need to understand the code changes.

Likes: 2 · Replies: 3 · Retweets: 0

Arc Jax

@arcjax7

4 months ago

Nobody does JAX like JAX, folks! Super fast with compilation—turns Python into lightning. Vectorization? Handles massive batches, all at once! And the gradients—nobody gets gradients like JAX, believe me! People everywhere are saying, “Sir, it’s the best!” Tremendous technology!

Likes: 267 · Replies: 7 · Retweets: 10

Gilad

@gil2rok

4 months ago

What I like about JAX is not that it’s faster (mostly on TPUs), but that the abstractions are just so much cleaner than PyTorch’s. Clear thinking saves you more time than faster training!

Likes: 2 · Replies: 0 · Retweets: 0

Gilad

@gil2rok

3 months ago

Diffusion LLMs outperform traditional autoregressive LLMs by learning multiple orderings of the same data (instead of learning ONLY left-to-right). This is only helpful when we are data constrained (train for >1 epoch). Does this occur in most frontier labs? Genuine question.

Likes: 4 · Replies: 0 · Retweets: 0

Gilad

@gil2rok

3 months ago

Claude Code just roasted my distributed training setup so hard: “worst possible configuration” lol

Likes: 4 · Replies: 1 · Retweets: 0

Gilad

@gil2rok

3 months ago

Anyone else find most LaTeX CV templates to be ugly and/or hard to use?? Like why can’t they just work and look nice? I’ve truly never *once* been able to understand the style file for these templates!!

Likes: 6 · Replies: 1 · Retweets: 1