Zach Mueller (@TheZachMueller) Twitter Tweets • TwiCopy

Zach Mueller

@TheZachMueller

+ Follow

🤗 Technical Lead for the Accelerate Project | Passionate about Open Source | Nerd who enjoys touching the grass | #ADHD | He/Him

ID:721018777664626688

linkhttps://muellerzr.github.io/ calendar_today15-04-2016 16:54:07

14,6K Tweets

9,5K Followers

388 Following

Jonathan Whitaker

10 hours ago

QDoRA strikes a nice balance - efficient like QLoRA but performs more like full finetuning.

I hope 'quant. base + trainable adapters' becomes the default way to share models. We can train QDoRA w/ FSDP now, the next piece is fast inference without merging in adapters...

thumb_up_off_alt42

chat_bubble_outline0

account_circle

Mark Saroufim

12 hours ago

dev-discuss.pytorch.org/t/how-to-measu…

thumb_up_off_alt142

chat_bubble_outline0

account_circle

clem 🤗

@ClementDelangue

20 hours ago

The GPT4 of datasets took down Hugging Face, sorry all 😅😅😅

thumb_up_off_alt861

chat_bubble_outline0

account_circle

Maria Khalusova

@mariaKhalusova

21 hours ago

Hey, somewhat unexpectedly, looks like I'll be at PyCon US this year. Who should I meet while there?

thumb_up_off_alt7

chat_bubble_outline0

account_circle

Thomas Wolf

1 day ago

Julia Turc Hugging Face I like the training examples of accelerate better personally. Much lighter: github.com/huggingface/ac…

thumb_up_off_alt35

chat_bubble_outline0

account_circle

Zach Mueller

@TheZachMueller

22 hours ago

That’s it folks, AI is done for the day. No new SOTA, no new releases, that’s it.

Time to go get sunlight

That’s it folks, AI is done for the day. No new SOTA, no new releases, that’s it. Time to go get sunlight

thumb_up_off_alt38

chat_bubble_outline0

account_circle

Clémentine Fourrier 🍊

23 hours ago

⚠️We've decided to pause the Open LLM Leaderboard temporarily (hopefully till the end of day) to prevent evaluation failures due to network problems on the hub.

If your model failed this morning, tell us, we'll relaunch once everything's good.

Infra/hub teams are on it! 💪

thumb_up_off_alt35

chat_bubble_outline0

account_circle

Hamel Husain

1 day ago

Who is creating a pytest-like tool for testing LLMs (where you can also track metrics, version data alongside code, etc)?

looking for OSS

thumb_up_off_alt261

chat_bubble_outline0

account_circle

anton

1 day ago

Zuck releasing a billion dollar model is actually wild, like really undermining what OAI is doing. flexing compute like “yea we can do that not a big deal”

thumb_up_off_alt2,2K

chat_bubble_outline0

account_circle

Zach Mueller

@TheZachMueller

1 day ago

For those curious, biggest drive I could find at Micro Center is 18TB, you’d need 3 of those so that’s near $1,000 to download a dataset 🤯🤯🤯

microcenter.com/product/637516…

thumb_up_off_alt39

chat_bubble_outline0

account_circle

Guilherme Penedo

2 days ago

We have just released 🍷 FineWeb: 15 trillion tokens of high quality web data.
We filtered and deduplicated all CommonCrawl between 2013 and 2024.
Models trained on FineWeb outperform RefinedWeb, C4, DolmaV1.6, The Pile and SlimPajama!

We have just released 🍷 FineWeb: 15 trillion tokens of high quality web data. We filtered and deduplicated all CommonCrawl between 2013 and 2024. Models trained on FineWeb outperform RefinedWeb, C4, DolmaV1.6, The Pile and SlimPajama!

thumb_up_off_alt1,4K

chat_bubble_outline0

account_circle

Xenova

2 days ago

Meta's Segment Anything Model (SAM) can now run in your browser w/ WebGPU (+ fp16), meaning up to 8x faster image encoding (10s → 1.25s)! 🤯⚡️

Video is not sped up! Everything runs 100% locally thanks to 🤗 Transformers.js and onnxruntime-web!

🔗 Demo: hf.co/spaces/Xenova/…

thumb_up_off_alt1,1K

chat_bubble_outline0

account_circle

Gergely Orosz

2 days ago

I know I am late to the party but HuggingFace is such an amazing platform for LLMs.

If I had to describe my impression after using it for a little time:

“GitHub, but for AI models.”

thumb_up_off_alt387

chat_bubble_outline0

account_circle

Programmer Humor

@PR0GRAMMERHUM0R

3 days ago

thisWillBeTheLastTimeReally reddit.com/r/programmerhu…

thisWillBeTheLastTimeReally reddit.com/r/programmerhu…

thumb_up_off_alt4,2K

chat_bubble_outline0

account_circle

Zach Mueller

@TheZachMueller

3 days ago

There is an art to being truly helpful on forums. It’s a careful balance of:

1. What is the critical information a user needs answers to (in as short and direct answer as possible)
2. What information can you give them to go investigate and learn more of on their own (spark

thumb_up_off_alt14

chat_bubble_outline0

account_circle

Zach Mueller

@TheZachMueller

3 days ago

My job… is just… Accelerator

thumb_up_off_alt9

chat_bubble_outline0

account_circle

fpc ok :)