Thomas Wolf (@Thom_Wolf) Twitter Tweets • TwiCopy

repeat1

account_circle

Thomas Wolf

3 days ago

here we go again with the usual set of meeting options between SF and Europe – time to disrupt time zones with quantum mechanics or something

thumb_up_off_alt25

repeat1

account_circle

Thomas Wolf

4 days ago

This grew quite quickly!

thumb_up_off_alt40

repeat3

account_circle

Remi Cadene

@RemiCadene

6 days ago

Proof of concept that you can do a lot with low-cost hardware (200$) and a smart robot brain. Is robotics a software problem?

account_circle

OpenELM: a family of Open-source Efficient Language Models

Welcome Apple Inc. in the family of open-source LLM trainers!

🤯
huggingface.co/collections/ap…

And together with a new library: CoreNet
github.com/apple/corenet

account_circle

Remi Cadene

@RemiCadene

1 week ago

Do you have recommendation on papers for robot navigation in home? Is end-to-end navigation a thing? Is it possible to avoid SLAM or use it only as conditioning/input?

thumb_up_off_alt55

repeat8

account_circle

Thomas Wolf

1 week ago

This take on the FineWeb release is one of the most interesting feedback and also a reason FineWeb is very different from even larger datasets like RedPajama-V2 (which is double its size!)

Surprisingly, the size of the dataset of 15T tokens is not very important, what is much…

account_circle

Thomas Wolf

1 week ago

Most exciting paper of the week? Clearly this one 👇 Finally a successor to the super impressive phi-1.5/2 models – so much looking forward to playing with the weights, come help me encourage the authors to share them in the comments 😅
huggingface.co/papers/2404.14…

account_circle

Quentin Gallouédec

@QGallouedec

1 week ago

🆕 Introducing JAT, the first open-source multi-modal, multi-task multi-domain agent! 🤖 A step toward open generalist agents! 🚀

📰 Blog: huggingface.co/blog/jat

account_circle

Thomas Wolf

1 week ago

Llama3 was trained on 15 trillion tokens of public data. But where can you find such datasets and recipes??

Here comes the first release of 🍷Fineweb. A high quality large scale filtered web dataset out-performing all current datasets of its scale. We trained 200+ ablation…

account_circle

Thomas Wolf

1 week ago

Llama3 was trained on 15 trillion tokens of public data. But where can you find such datasets and recipes??

Here comes the first release of 🍷Fineweb. A high quality large scale filtered web dataset out-performing all current datasets of its scale. We trained 200+ ablation…

account_circle

Guilherme Penedo

@gui_penedo

1 week ago

We have just released 🍷 FineWeb: 15 trillion tokens of high quality web data.
We filtered and deduplicated all CommonCrawl between 2013 and 2024.
Models trained on FineWeb outperform RefinedWeb, C4, DolmaV1.6, The Pile and SlimPajama!

account_circle

Thomas Wolf

1 week ago

🍿 quite enjoying this recent data-visualization battle where open-source model teams showcase a 2D mapping of the open-source AI models field along interestingly different axes

good data viz skills becoming crucial in any AI model training team these days

account_circle

Julien Chaumond

@julien_c

1 week ago

and remember

that Llama 3 is in HuggingChat 🔥

running on fast optimized Hugging Face inference

thank you AI at Meta

and remember that Llama 3 is in HuggingChat 🔥 running on fast optimized @huggingface inference thank you @metaai

account_circle

Thibault Schrepel

@ProfSchrepel

1 week ago

“I don’t want (AI) to be the property of two US tech companies” Thomas Wolf, co-founder of Hugging Face. Listen to the full conversation:
➝ youtu.be/dGR0vAJAlmI
➝ spoti.fi/49zlhr3
➝ apple.co/3xBZuSk
#scalingtheory

thumb_up_off_alt8

repeat2