fairseq (@fairseq)'s Twitter Profile
fairseq

@fairseq

Sequence modeling toolkit for @PyTorch

ID:1257733655503437829

Link: https://github.com/pytorch/fairseq/ · Joined: 05-05-2020 18:07:55

12 Tweets

1.6K Followers

11 Following

Mikel Artetxe (@artetxem):

We are releasing a family of dense and MoE language models with up to 13B and 1.1T parameters. We find that MoEs are more efficient, but the gap narrows at scale and varies greatly across domains and tasks.

Paper: arxiv.org/abs/2112.10684

Models & code: github.com/pytorch/fairse…

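The efficiency claim above comes from sparse routing: in a mixture-of-experts layer, each token activates only one (or a few) of the expert feed-forward blocks, so parameter count grows without a matching growth in per-token compute. Below is a minimal top-1 routing sketch in numpy; it is illustrative only, not the fairseq implementation, and all names and dimensions are made up.

```python
import numpy as np

rng = np.random.default_rng(0)
d_model, n_experts, n_tokens = 8, 4, 16

# One tiny feed-forward "expert" per index (d_model -> d_model),
# plus a learned router that scores experts per token.
experts = [rng.standard_normal((d_model, d_model)) * 0.1 for _ in range(n_experts)]
router = rng.standard_normal((d_model, n_experts)) * 0.1

def moe_layer(x):
    """Route each token to its single highest-scoring expert (top-1)."""
    logits = x @ router                                    # (n_tokens, n_experts)
    probs = np.exp(logits) / np.exp(logits).sum(-1, keepdims=True)
    choice = probs.argmax(-1)                              # expert index per token
    out = np.empty_like(x)
    for e in range(n_experts):
        mask = choice == e
        # Only tokens routed to expert e pay for expert e's FLOPs.
        out[mask] = (x[mask] @ experts[e]) * probs[mask, e][:, None]
    return out, choice

x = rng.standard_normal((n_tokens, d_model))
y, choice = moe_layer(x)
```

Adding experts grows `experts` (total parameters) while each token still multiplies against exactly one expert matrix, which is the dense-vs-MoE trade-off the tweet describes.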
fairseq (@fairseq):

Mixture of experts training in fairseq is now 40% faster thanks to Microsoft's Tutel library!
Blog: microsoft.com/en-us/research…
Fairseq code: github.com/pytorch/fairse…
Tutel code: github.com/microsoft/tutel

AI at Meta (@AIatMeta):

We’re introducing GSLM, the first language model that breaks free completely of the dependence on text for training. This “textless NLP” approach learns to generate expressive speech using only raw audio recordings as input. Learn more and get the code:
ai.facebook.com/blog/textless-…

fairseq (@fairseq):

fairseq now supports CPU offloading and full parameter+optimizer state sharding via fairscale's FullyShardedDataParallel module. See our tutorial to train a 13B parameter LM on 1 GPU: fb.me/fairseq_fsdp
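The idea behind full parameter sharding is that each of `world_size` workers permanently stores only `1/world_size` of the flattened parameters (and optimizer state) and gathers the rest just in time for compute. This is a conceptual numpy sketch of that bookkeeping, not fairscale's `FullyShardedDataParallel` API; the function names are made up for the example.

```python
import numpy as np

def shard(flat_params, world_size):
    """Split a flat parameter vector into equal per-rank shards (zero-padded)."""
    pad = (-len(flat_params)) % world_size
    padded = np.concatenate([flat_params, np.zeros(pad)])
    return np.split(padded, world_size), pad

def all_gather(shards, pad):
    """Reassemble the full parameter vector from every rank's shard."""
    full = np.concatenate(shards)
    return full[:len(full) - pad] if pad else full

params = np.arange(10, dtype=np.float64)
shards, pad = shard(params, world_size=4)   # each rank stores ~1/4 of the params
restored = all_gather(shards, pad)
assert np.array_equal(restored, params)
```

CPU offloading extends the same idea: the resident shard lives in host memory and is moved to the GPU only around the compute that needs it, which is how a 13B-parameter model can be trained on a single GPU.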

fairseq (@fairseq):

We just released 0.10.0, which is our last significant release before 1.0.0, when we will migrate to Hydra. Changelog: github.com/pytorch/fairse…

PyTorch (@PyTorch):

Fairseq includes support for sequence-to-sequence learning for speech and audio recognition tasks, enabling faster exploration and prototyping of new research ideas while offering a clear path to production. bit.ly/2WfP85X

PyTorch (@PyTorch):

fairseq now supports the training of gated convolutional language models (arxiv.org/abs/1612.08083). It can train a Google Billion Word language model on 128 GPUs in less than a day.
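The linked paper (Dauphin et al., arxiv.org/abs/1612.08083) gates each convolutional layer's output with a gated linear unit: h = (X·W + b) ⊗ σ(X·V + c), where one projection carries content and a second, sigmoid-squashed projection controls how much of it passes through. A small numpy sketch of the gating, with illustrative shapes (not fairseq's actual layer):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def glu(x, W, b, V, c):
    """Gated linear unit: content projection modulated by a sigmoid gate."""
    return (x @ W + b) * sigmoid(x @ V + c)

rng = np.random.default_rng(0)
d_in, d_out, n = 6, 3, 5
W = rng.standard_normal((d_in, d_out))
V = rng.standard_normal((d_in, d_out))
b, c = np.zeros(d_out), np.zeros(d_out)

y = glu(rng.standard_normal((n, d_in)), W, b, V, c)
```

Because the gate lies in (0, 1), gradients flow through the ungated linear path, which the paper credits for easier optimization than tanh-gated alternatives.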

PyTorch (@PyTorch):

FairSeq Toolkit - Major Update
- Distributed Training
- Transformer models (big Transformer on WMT Eng-German in < 5 hours on DGX-1)
- Fast Inference: translations @ 92 sent/sec for big Transformer
- Story Generation
Read more at Michael Auli's post: facebook.com/photo.php?fbid…

Yann LeCun (@ylecun):

Fairseq, now in PyTorch!
The open-source convolutional sequence-to-sequence engine from FAIR is now available in... fb.me/1gCPauX6V
