Clem Delangue 🤗 (@clemdelangue)'s Twitter Profile
Clem Delangue 🤗

@clemdelangue

Co-founder & CEO at Hugging Face 🤗. We teach computers to understand human language.

ID: 1049442839346724864

Link: http://www.huggingface.co · Joined: 08-10-2018 23:34:05

68 Tweets

1.1K Followers

5 Following

Clem Delangue 🤗 (@clemdelangue)'s Twitter Profile Photo

Best Long Paper #naacl2019 BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. Jacob Devlin, Ming-Wei Chang, Kenton Lee and Kristina Toutanova #NLProc

Julien Chaumond (@julien_c)'s Twitter Profile Photo

🔥 Thrilled to release our Swift Core ML implementation of BERT for question answering. 🔥🔥 Transformer models now also live on the edge. 📱📲 You now CAN do state-of-the-art NLP on mobile devices! github.com/huggingface/sw… Built w/ Lysandre and Thomas Wolf at Hugging Face

Thomas Wolf (@thom_wolf)'s Twitter Profile Photo

New release of Transformers repo is shaping up & I'm very excited! Gifts for all: -SOTA Lovers: new XLNet & XLM archi + 6 new Bert/GPT trained chkpt -Research Lovers: unified model API, attention/hidden-state outputs to swap/study models -Speed Lovers: Torchscript & head pruning!

Thomas Wolf (@thom_wolf)'s Twitter Profile Photo

A question I get from time to time is how to convert a pretrained TensorFlow model in PyTorch easily and reliably. We're starting to be quite familiar with the process so I've written a short blog post summarizing our workflow and some lessons learned 👇 medium.com/huggingface/fr…
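The linked post covers the full workflow; as a toy illustration of one recurring gotcha it discusses (this is a hypothetical sketch with made-up variable names, not the actual Hugging Face conversion code): TF stores a Dense kernel as `(in_features, out_features)`, while `torch.nn.Linear` stores its weight as `(out_features, in_features)`, so dense kernels must be transposed when copied across.

```python
import numpy as np

def convert_tf_weights(tf_weights):
    """Map a dict of TF variable arrays to PyTorch-style names/arrays."""
    converted = {}
    for name, array in tf_weights.items():
        pt_name = name.replace("/", ".")
        if pt_name.endswith(".kernel"):
            # TF Dense kernels are (in, out); torch.nn.Linear weights are (out, in)
            converted[pt_name[:-len(".kernel")] + ".weight"] = array.T
        else:
            converted[pt_name] = array  # biases, embeddings: shapes already match
    return converted

tf_weights = {"encoder/dense/kernel": np.zeros((768, 3072)),
              "encoder/dense/bias": np.zeros(3072)}
pt_weights = convert_tf_weights(tf_weights)
print(pt_weights["encoder.dense.weight"].shape)  # (3072, 768)
```

Numerically comparing the two models' outputs on the same input, as the post recommends, is what catches a missed transpose like this.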

Hugging Face (@huggingface)'s Twitter Profile Photo

💃PyTorch-Transformers 1.1.0 is live💃 It includes RoBERTa, the transformer model from @facebookai, current state-of-the-art on the SuperGLUE leaderboard! Thanks to Myle Ott Julien Chaumond Lysandre and all the 100+ contributors!

Kyosuke Nishida (@kyoun)'s Twitter Profile Photo

DistilBERT (huggingface): distilled from BERT base down to 6 layers (40% smaller). Inference is 60% faster while retaining ~95% of BERT's accuracy on GLUE. Trained in about 3.5 days on 8×16GB V100 GPUs. The hidden size stays at 768; reducing the number of layers is reportedly more effective for speed-up. github github.com/huggingface/py… blog medium.com/huggingface/di…
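As a toy numpy sketch of the knowledge-distillation objective behind DistilBERT (illustrative only, not Hugging Face's training code): the student is trained to match the teacher's temperature-softened output distribution via a KL-divergence loss.

```python
import numpy as np

def softmax(logits, temperature=1.0):
    z = logits / temperature
    z = z - z.max()                 # subtract max for numerical stability
    e = np.exp(z)
    return e / e.sum()

def distillation_loss(student_logits, teacher_logits, temperature=2.0):
    """KL(teacher || student) between temperature-softened distributions."""
    p = softmax(teacher_logits, temperature)   # soft targets from the teacher
    q = softmax(student_logits, temperature)
    return float(np.sum(p * (np.log(p) - np.log(q))))

teacher = np.array([3.0, 1.0, 0.2])
# A student that exactly matches the teacher incurs zero loss:
print(distillation_loss(teacher, teacher))                        # 0.0
# A student with the wrong ranking incurs a positive loss:
print(distillation_loss(np.array([0.2, 1.0, 3.0]), teacher) > 0)  # True
```

The temperature smooths the teacher's distribution so the student also learns from the relative probabilities of the "wrong" classes, not just the argmax.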

Julien Chaumond (@julien_c)'s Twitter Profile Photo

GPT-2 on device is blazing fast on iPhone 11 ⚡️ Core ML 3 is officially out so we can do state-of-the-art text generation on mobile (117M parameters running ~3 times per second on the neural engine!) We put together a small video benchmark ⬇️

Timothy Liu (@timothy_lkh_)'s Twitter Profile Photo

Happy to have a small PR accepted to the Hugging Face Transformers library demonstrating substantial mixed-precision speed-up with @NVIDIA Tensor Core #GPU, even at small batch size, in the demo script github.com/huggingface/tr…
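A toy numpy sketch of the idea behind mixed precision (the actual PR uses the framework's automatic mixed-precision support on Tensor Cores; this just illustrates the numerics): do the bulk compute in float16, but keep accumulators and master weights in float32 so small updates aren't lost to fp16 rounding.

```python
import numpy as np

rng = np.random.default_rng(0)
a = rng.standard_normal((64, 64)).astype(np.float32)
b = rng.standard_normal((64, 64)).astype(np.float32)

# Half-precision matmul (the part Tensor Cores accelerate) stays close
# to the full-precision result:
c_fp16 = a.astype(np.float16) @ b.astype(np.float16)
err = float(np.abs(c_fp16.astype(np.float32) - a @ b).max())
print(err < 1.0)   # True: fp16 compute is a close approximation here

# But an update smaller than fp16's resolution near 1.0 simply vanishes,
# which is why master weights are kept in float32:
w = np.float16(1.0)
print(w + np.float16(1e-4) == w)                               # True: update lost
print(np.float32(1.0) + np.float32(1e-4) == np.float32(1.0))   # False: preserved
```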

Thomas Wolf (@thom_wolf)'s Twitter Profile Photo

The @SustaiNLP2020 workshop at #EMNLP2020 will try to remove a little bit of SOTA addiction from NLP research 😉 We'll promote sensible trade-offs between performance and models that are computationally more efficient and conceptually simpler ... [1/2] x.com/DoingJobs/stat…

chansung (@algo_diver)'s Twitter Profile Photo

Some more results. It now fully supports all kinds of models and vocabs. Good experience using Hugging Face with Slack. And it looks pretty smart

Thomas Wolf (@thom_wolf)'s Twitter Profile Photo

Interesting work (and a nice large, clean dataset as well, looking forward to seeing it released): "Compressive Transformers for Long-Range Sequence Modelling" by Jack W. Rae, Anna Potapenko, Siddhant M. Jayakumar, Timothy P. Lillicrap (at DeepMind) Paper: arxiv.org/abs/1911.05507

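A toy numpy sketch of the paper's memory scheme (illustrative names and shapes, not the authors' code): instead of discarding the oldest activations that fall out of the regular attention memory, compress them, here by mean-pooling, one of the compression functions the paper considers, into a coarser secondary memory.

```python
import numpy as np

def update_memories(memory, comp_memory, new_states, mem_size, rate=2):
    """Append new activations; compress evicted ones instead of dropping them."""
    memory = np.concatenate([memory, new_states])
    if len(memory) > mem_size:
        evicted, memory = memory[:-mem_size], memory[-mem_size:]
        # mean-pool groups of `rate` evicted vectors into one compressed slot
        n = len(evicted) // rate * rate
        pooled = evicted[:n].reshape(-1, rate, evicted.shape[-1]).mean(axis=1)
        comp_memory = np.concatenate([comp_memory, pooled])
    return memory, comp_memory

d = 4
mem, cmem = np.zeros((0, d)), np.zeros((0, d))
for _ in range(3):                      # process three segments of 4 timesteps
    mem, cmem = update_memories(mem, cmem, np.ones((4, d)), mem_size=4, rate=2)
print(mem.shape, cmem.shape)            # (4, 4) (4, 4)
```

With compression rate 2, eight evicted timesteps become four compressed slots, extending the effective context at a fraction of the memory cost.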
Soumith Chintala (@soumithchintala)'s Twitter Profile Photo

The first full paper on @pytorch after 3 years of development. It describes our goals, design principles, and technical details up to v0.4. Catch the poster at #NeurIPS2019 Authored by Adam Paszke, Sam Gross et al. arxiv.org/abs/1912.01703