Instruction Workshop, NeurIPS 2023 (@itif_workshop) 's Twitter Profile
Instruction Workshop, NeurIPS 2023

@itif_workshop

The official account of the 1st Workshop on Instruction Tuning and Instruction Following (ITIF), co-located with NeurIPS in December 2023.

ID: 1689312241542430721

Link: https://an-instructive-workshop.github.io/ | Joined: 09-08-2023 16:27:08

162 Tweets

261 Followers

26 Following

Shayne Longpre (@shayneredford) 's Twitter Profile Photo

📢 Check out Anthony Chen's and my invited talk at the USC ISI Natural Language Seminar: 📜 "The Data Provenance Initiative: A Large Scale Audit of Dataset Licensing & Attribution in AI" youtube.com/watch?v=np9HeJ… Thank you Justin Cho 조현동 for hosting!

SambaNova Systems (@sambanovaai) 's Twitter Profile Photo

🚀🌟🚀Excited to announce Samba-CoE v0.2, which outperforms DBRX by Databricks Mosaic Research and Databricks, Mixtral-8x7B from Mistral AI, and Grok-1 by Grok at a breakneck speed of 330 tokens/s. These breakthrough speeds were achieved without sacrificing precision and only on 8 sockets,

Shayne Longpre (@shayneredford) 's Twitter Profile Photo


📢 Want to automatically generate your bibtex for 1000s of Hugging Face text datasets?

Minh Chien Vu just added this feature + data summaries for:

➡️ huge collections like Flan, P3, Aya...
➡️ popular OpenAI-generated datasets
➡️ ~2.5k+ datasets & growing

🔗:
Shayne Longpre (@shayneredford) 's Twitter Profile Photo

Excited to see our 🍮Flan-PaLM🌴 work finally published in the Journal of Machine Learning Research 2024! Looking back, I see this work as pushing hard on scaling: post-training data, models, prompting, & eval. We brought together the methods and findings of many awesome prior works, scaled them up, and

Jack Jingyu Zhang (@jackjingyuzhang) 's Twitter Profile Photo

Thanks @elvis for sharing our work! 🤔 LLMs often generate fluent but hallucinated text. How can we reliably ✨verify✨ their correctness against trusted sources? We tackle the verifiability goal by aligning LLMs to generate verbatim quotes from their pre-training data 📚.

Shayne Longpre (@shayneredford) 's Twitter Profile Photo

A 🧵 on my favorite, influential works on "Data Measurements" 🚂 Datasets drive AI progress 📚 But... massive datasets remain impenetrable & poorly understood for *years* 🔍 Data forensics uncover their mysteries 1/

Seungone Kim (@seungonekim) 's Twitter Profile Photo


#NLProc
Introducing 🔥Prometheus 2, an open-source LM specialized in evaluating other language models.

✅ Supports both direct assessment & pairwise ranking.
✅ Improved evaluation capabilities compared to its predecessor.
✅ Can assess based on user-defined evaluation criteria.
Shayne Longpre (@shayneredford) 's Twitter Profile Photo


🚨 New #ICML2024 position piece.

The most overlooked risks of AI stem from autonomous weaponry

For 4 reasons:
1⃣ Arms race w/ ⬇️ human oversight
2⃣ Reduces cost of starting conflicts
3⃣ Evades accountability
4⃣ Battlefield errors aren’t considered costly

See our work led by
Seungone Kim (@seungonekim) 's Twitter Profile Photo


🤔How can we systematically assess an LM's proficiency in a specific capability without using summary measures like helpfulness or simple proxy tasks like multiple-choice QA?

Introducing the ✨BiGGen Bench, a benchmark that directly evaluates nine core capabilities of LMs.
Shayne Longpre (@shayneredford) 's Twitter Profile Photo


✨New Preprint ✨ How are shifting norms on the web impacting AI?

We find:

📉 A rapid decline in the consenting data commons (the web)

⚖️ Differing access to data by company, due to crawling restrictions (e.g.🔻26% OpenAI, 🔻13% Anthropic)

⛔️ Robots.txt preference protocols
Nayan Saxena (@saxenanayan) 's Twitter Profile Photo

✨Incredibly proud to share our new paper led by the MIT Media Lab showing a rapid decline in consenting data for AI, asymmetries in data access by company (🔻26% OpenAI, 🔻13% Anthropic), and inefficiencies in robots.txt preference protocols. dataprovenance.org/consent-in-cri…

Minh Chien Vu (@chien_vu1692) 's Twitter Profile Photo

The Data Provenance Initiative led by the MIT Media Lab is releasing a large-scale audit of 1800+ LLM training datasets! We found significant data access asymmetries by company (🔻26% OpenAI, 🔻13% Anthropic). See Shayne Longpre's thread for more ⬇️ x.com/ShayneRedford/…

Daphne Ippolito (@daphneipp) 's Twitter Profile Photo

In the past, I've studied how curation decisions for pre-training data influence what LMs are good and bad at. In our new preprint, we look at how the fabric of the internet (the primary source of most of these datasets) is itself changing, and the effects this might have.

Shayne Longpre (@shayneredford) 's Twitter Profile Photo

Headed to 🛬🇦🇹 Vienna for #ICML2024! Reach out if you'd like to chat or catch up!

Work together w/ collaborators:
- A Safe Harbor for AI Evaluation ⛴️ (arxiv.org/abs/2403.04893) -- Tuesday 10:30 am Oral
- On the Societal Impact of Open Foundation Models (arxiv.org/abs/2403.07918) --

Shayne Longpre (@shayneredford) 's Twitter Profile Photo

📢 AI is increasingly (mis)used in the context of autonomous weaponry. Fantastic to see this covered by Catherine Caruso in Harvard Medical School news. Also see the #ICML2024 Oral led by Riley Simmons-Edler @RyanBadman1 and Kanaka Rajan.

Shayne Longpre (@shayneredford) 's Twitter Profile Photo


Honored for the Data Provenance Initiative to be awarded the Infrastructure Grant Award by Mozilla! 🎉🎉🎉

As part of this grant, we were invited to present at MozFest House Amsterdam, where we gave an early look at trends in the AI data supply chain:

📽️
Shayne Longpre (@shayneredford) 's Twitter Profile Photo

📢 Excited to see our piece, "The Data Provenance Initiative: A large-scale audit of dataset licensing and attribution in AI," now in:

📜 Nature Machine Intelligence ➡️ nature.com/articles/s4225…
🗞️ MIT News ➡️ news.mit.edu/2024/study-lar…

1/