David Marx (@digthatdata.bsky.social) (@digthatdata) Twitter Tweets • TwiCopy

David Marx (@digthatdata.bsky.social)

@digthatdata

Generative AI MLE, FOSS toolmaker, innovation catalyst @CoreWeave + @AiEleuther. bsky.app/profile/digtha…

ID: 2211601081

Link: https://github.com/dmarx · Joined: 24-11-2013 00:56:11

10,10K Tweet

4,4K Followers

1,1K Following

Dylan Foster 🐢

@canondetortugas

a year ago

Is KL-regularization the right tool for language model alignment? The χPO algorithm: We show that a one-line change to DPO—moving from KL to chi-squared regularization—is sufficient to achieve state-of-the-art theoretical guarantees, provably alleviating over-optimization.

Likes: 199 · Replies: 3 · Reposts: 23

Tanishq Kumar

@tanishqkumar07

a year ago

[1/7] New paper alert! Heard about the BitNet hype or that Llama-3 is harder to quantize? Our new work studies both! We formulate scaling laws for precision, across both pre and post-training arxiv.org/pdf/2411.04330. TLDR; - Models become harder to post-train quantize as they

Likes: 854 · Replies: 21 · Reposts: 160

Rohan Choudhury

@rchoudhury997

a year ago

Excited to finally release our NeurIPS 2024 (spotlight) paper! We introduce Run-Length Tokenization (RLT), a simple way to significantly speed up your vision transformer on video with no loss in performance!

Likes: 1,1K · Replies: 22 · Reposts: 173

David Marx (@digthatdata.bsky.social)

@digthatdata

a year ago

what a time to be alive

Likes: 2 · Replies: 0 · Reposts: 1

David Marx (@digthatdata.bsky.social)

@digthatdata

a year ago

BRRRRRRRRRRRRRR

Likes: 0 · Replies: 0 · Reposts: 0

LAION

@laion_ai

a year ago

We announce LAION-DISCO-12M - a collection of 12 million links to publicly available YouTube samples paired with metadata to support basic machine learning research in foundation models for generic audio and music. laion.ai/blog/laion-dis…

Likes: 191 · Replies: 0 · Reposts: 44

Jeremy Howard

@jeremyphoward

a year ago

Narrative on X: 🦋 has no AI/ML and just talks about itself My actual feed on 🦋:

Likes: 1,1K · Replies: 61 · Reposts: 68

Jeremy Howard

@jeremyphoward

a year ago

You can choose what feeds you have on your homepage -- I personally like the "popular with friends" feed (in the above tweet). The default is "following" - which looks like this for me:

Likes: 127 · Replies: 4 · Reposts: 5

Jeremy Howard

@jeremyphoward

a year ago

Because X has tended to censor discussion of social networks I won't link directly, but look for this post to get an instant AI/ML feed thanks to M A Osborne

Likes: 201 · Replies: 10 · Reposts: 15

David Marx (@digthatdata.bsky.social)

@digthatdata

10 months ago

And just like that, the AI/ML migration off twitter finally happened. RIP, this shitty pay-to-play platform.

Likes: 4 · Replies: 0 · Reposts: 0

David Marx (@digthatdata.bsky.social)

@digthatdata

10 months ago

Likes: 16 · Replies: 4 · Reposts: 1

Fern

@hi_tysam

10 months ago

New NanoGPT training speed record: 3.28 FineWeb val loss in 4.66 minutes
Previous record: 5.03 minutes
Changelog:
- FlexAttention blocksize warmup
- hyperparameter tweaks

Likes: 244 · Replies: 8 · Reposts: 22

Keller Jordan

@kellerjordan0

10 months ago

This is officially the new record! Congrats Fern (who is also an OG of CIFAR-10 speedrunning) x.com/hi_tysam/statu…

Likes: 144 · Replies: 3 · Reposts: 11

David Marx (@digthatdata.bsky.social)

@digthatdata

10 months ago

Yo this paper is wild.

Likes: 1 · Replies: 0 · Reposts: 1

Haiwen Huang

@haiwenhuang_

10 months ago

🔥 Are you ever dissatisfied with the imprecise names in vision-language datasets? 🚀 At #NeurIPS2024, we introduce 𝐑𝐄𝐍𝐎𝐕𝐀𝐓𝐄, showing how better segmentation dataset names lead to 𝐛𝐞𝐭𝐭𝐞𝐫 𝐭𝐫𝐚𝐢𝐧𝐢𝐧𝐠 & 𝐞𝐯𝐚𝐥𝐮𝐚𝐭𝐢𝐨𝐧. Let’s dive in! 🧵👇

Likes: 34 · Replies: 1 · Reposts: 6

Visu_AI_Poetry

@visu_ai_poetry

9 months ago

Happy Christmas friends

Likes: 22 · Replies: 12 · Reposts: 5

David Marx (@digthatdata.bsky.social)

@digthatdata

9 months ago

sora can't handle my prompts

Likes: 7 · Replies: 1 · Reposts: 1

David Marx (@digthatdata.bsky.social)

@digthatdata

8 months ago

> Republicans: "We love America! It has the greatest system of governance. Look, I even carry the constitution next to my heart like a little bible :*) "
> Also republicans: "DISMANTLE THE GOVERNMENT! FUCK THE SEPARATION OF POWERS! GOD KING PRESIDENT CULT OF PERSONALITY!"

Likes: 2 · Replies: 0 · Reposts: 0

anton

@atroyn

8 months ago

'we're in this bizarre world where the best way to learn about llms... is to read papers by chinese companies. i do not think this is a good state of the world' - us labs keeping their architectures and algorithms secret is ultimately hurting ai development in the us.

Likes: 127 · Replies: 6 · Reposts: 27

David Marx (@digthatdata.bsky.social)

@digthatdata

7 months ago

TAXATION WITHOUT REPRESENTATION.

Likes: 1 · Replies: 0 · Reposts: 0