Tamay Besiroglu (@tamaybes) Twitter Tweets • TwiCopy

Tamay Besiroglu

@tamaybes

+ Follow

Thinking about economics, computing and machine learning @EpochAIResearch @MIT_CSAIL

ID:995052639602839552

linkhttps://besiroglu.github.io/webpage/ calendar_today11-05-2018 21:26:51

1,0K Tweets

3,0K Followers

720 Following

Tom Adamczewski

3 days ago

I've been working on a new product: MakeDistribution.com!

It's the best solution to creating a subjective probability distribution, i.e. one that reflects human judgement rather than being fit to data.

Sounds simple, but most existing solutions have important flaws (1/15)

thumb_up_off_alt395

chat_bubble_outline0

account_circle

Maria de la Lama

6 days ago

Come work with me! You might be a great fit for this role if you have an operations mindset, strong communications skills, are service-minded and organized, and care deeply about Epoch’s mission.

thumb_up_off_alt19

chat_bubble_outline0

account_circle

Tamay Besiroglu

1 week ago

Who are some of the top historians of the field of AI?

thumb_up_off_alt18

chat_bubble_outline0

account_circle

Tamay Besiroglu

1 week ago

Cool to see our replication of Chinchilla amongst the top ML papers of the week in what was a packed week for AI.

thumb_up_off_alt26

chat_bubble_outline0

account_circle

Tamay Besiroglu

1 week ago

This is how you would train compute-optimal models to match Llama3 using our updated Chinchilla scaling law.

Models are clearly being overtrained 5x to 10x further than is *pre-training* compute optimal.

thumb_up_off_alt20

chat_bubble_outline0

account_circle

Gabriel Synnaeve

1 week ago

Algorithmic progress is faster than hardware progress.

arxiv.org/abs/2403.05812

Algorithmic progress is faster than hardware progress. arxiv.org/abs/2403.05812

thumb_up_off_alt72

chat_bubble_outline0

account_circle

Tamay Besiroglu

1 week ago

I'm thrilled to see that our work has apparently unified the Chinchilla scaling laws. It's great to hear that they're making the data open source!

thumb_up_off_alt50

chat_bubble_outline0

account_circle

finbarr

@finbarrtimbers

1 week ago

great to see a major paper like Chinchilla be updated and releasing data

thumb_up_off_alt20

chat_bubble_outline0

account_circle

(((ل()(ل() 'yoav))))👾

1 week ago

data forensics 101

thumb_up_off_alt26

chat_bubble_outline0

account_circle

Sasha Doubov

1 week ago

we are in the csi miami 'enhance' era of reproducibility research

we are in the csi miami 'enhance' era of reproducibility research

thumb_up_off_alt88

chat_bubble_outline0

account_circle

Mathieu Acher

2 weeks ago

I don't know and can't assess the impact of the results on the topic of scaling laws, but the reproducibility effort is remarkable! We need more work like this, in many fields of CS.

thumb_up_off_alt5

chat_bubble_outline0

account_circle

Matthew Barnett

2 weeks ago

tl;dr: the parametric Chinchilla scaling law appears to have been poorly fit, undermining any analysis that relied on its exact fitted values. We fit the same scaling law to a reconstruction of their data, getting different and IMO better results.

tl;dr: the parametric Chinchilla scaling law appears to have been poorly fit, undermining any analysis that relied on its exact fitted values. We fit the same scaling law to a reconstruction of their data, getting different and IMO better results.

thumb_up_off_alt80

chat_bubble_outline0

account_circle

Gabriele Sarti

2 weeks ago

Exhibit #49864 on the absurd lengths reproducibility studies must go in the era of proprietary LLMs 😭

thumb_up_off_alt24

chat_bubble_outline0

account_circle

Matt Clifford

@matthewclifford

2 weeks ago

Intriguing and potentially important work!

thumb_up_off_alt3

chat_bubble_outline0

account_circle

fpc ok :)