Manuel Mager (Turatemai) (@pywirrarika)'s Twitter Profile
Manuel Mager (Turatemai)

@pywirrarika

Applied Scientist | Amazon AWS
Posts are my own opinion.

ID:223694721

Link: http://code.kiutz.com
Date: 07-12-2010 02:33:19

2,9K Tweets

893 Followers

1,0K Following

Manuel Mager (Turatemai) (@pywirrarika)

Mexico has way better freedom of speech in our universities than in the USA. Don't get me wrong. We have our own really bad issues.

Graham Neubig (@gneubig)

We have created a web site and open-source toolkit to try to make NLP more accessible for people who speak or perform linguistic research on traditionally underrepresented languages.

If you're interested in using it, please take a look and reach out to me and Zaid for details!

Shruti Rijhwani (@shrutirij)

Gemini 1.5 Pro is now available!

✨Try it out: aistudio.google.com✨

Grateful to have been a small part of this effort and working together with an amazing team!

Philipp Schmid (@_philschmid)

Easily Fine-tune AI at Meta Llama 3 70B! 🦙 I am excited to share a new guide on how to fine-tune Llama 3 70B with PyTorch FSDP, Q-Lora, and Flash Attention 2 (SDPA) using Hugging Face build for consumer-size GPUs (4x 24GB). 🚀

Blog: philschmid.de/fsdp-qlora-lla…

The blog covers:
👨‍💻…

Luciana Benotti (@LucianaBenotti)

A t-shirt that says 'Hallucinations in LLMs are here to stay'.

'Our work shows that even in an ideal, unchanging world with perfect training data and no prompts,
one should expect hallucinations from LLMs' arxiv.org/pdf/2311.14648…

Thomas Wolf (@Thom_Wolf)

Llama3 was trained on 15 trillion tokens of public data. But where can you find such datasets and recipes??

Here comes the first release of 🍷Fineweb. A high quality large scale filtered web dataset out-performing all current datasets of its scale. We trained 200+ ablation…

Manuel Mager (Turatemai) (@pywirrarika)

If you have time, read our open letter on our proposal to change the name of our conference. Also feel free to share your views and concerns. :)

Manuel Mager (Turatemai) (@pywirrarika)

Come to the US, they said, it's a free and democratic country, they said... you will have free speech rights, they said... students are not going to be arrested because of their ideas, they said...

Manuel Mager (Turatemai) (@pywirrarika)

Hi Twitter! I am trying to find datasets with explicit annotations for harmful/toxic content. An example of such a dataset is huggingface.co/datasets/lmsys… Any recommendations?

NAACL (@naacl)

Dear Members, to better explain some of the arguments for a possible name change, several members of our community who reside or originate from the Americas outside of US/Canada have written an open letter 👉 naacl.org/posts/2024-04-…

Original survey forms.gle/r8SWiu8goG79kw…

Momchil Hardalov (@mhardalov)

Looking for an internship for autumn 2024? Our Amazon Science team at Amazon Web Services in Barcelona 🌴 has multiple open positions.

Some of the relevant topics are building Guardrails for LLMs, preventing prompt attacks, etc. ⤵️

 amazon.jobs/en/jobs/248179…

Devendra Chaplot (@dchaplot)

We just released Mixtral-8x22B-v0.1 and Mixtral-8x22B-Instruct-v0.1:
- Free to use under Apache 2.0 license
- Outperforms all open models
- Native function calling
- Masters English, French, Italian, German and Spanish.
- Seq_len = 64K

mistral.ai/news/mixtral-8…

Luciana Benotti (@LucianaBenotti)

I enjoyed your paper Abeba Birhane! Thank you for it!

As you say, names are important. It would be great if you could say 'US' and not 'America' in the paper when you mean the US. We Latin Americans are also suffering from colonialism, and from data extractivism through undersea cables.
