Dynabench (@dynabenchai) 's Twitter Profile
Dynabench

@dynabenchai

Official dynabench account

ID: 1308975744035573767

linkhttp://dynabench.org calendar_today24-09-2020 03:45:33

90 Tweet

500 Takipçi

5 Takip Edilen

Davis Blalock (@davisblalock) 's Twitter Profile Photo

"DataPerf: Benchmarks for Data-Centric AI Development" What if instead of holding the data constant and benchmarking different models, we held the model constant and benchmarked different data pipelines? [1/7]

"DataPerf: Benchmarks for Data-Centric AI Development"

What if instead of holding the data constant and benchmarking different models, we held the model constant and benchmarked different data pipelines? [1/7]
Adina Williams (@adinamwilliams) 's Twitter Profile Photo

How can we improve benchmarking? The Dynabench experiment aims to make faster progress with dynamic data collection, and today, we are pleased to introduce our next stage: @MetaAI has funded 5 exciting research proposals on the theme of "Rethinking Benchmarking”! Congrats to:

GenBench (@genbench) 's Twitter Profile Photo

Ever wanted to know more about generalisation in NLP but overwhelmed with the number of papers on ArXiv? Fear not! We read 400+ papers, 600+ experiments, and designed a taxonomy 📝 to categorise the research for you! (1/n) 🧵 arxiv.org/abs/2210.03050

Ever wanted to know more about generalisation in NLP but overwhelmed with the number of papers on ArXiv? Fear not! We read 400+ papers, 600+ experiments, and designed a taxonomy 📝 to categorise the research for you! (1/n) 🧵 

arxiv.org/abs/2210.03050
Aaron Mueller (@amuuueller) 's Twitter Profile Photo

Announcing the BabyLM 👶 Challenge, the shared task at CoNLL 2025 and CMCL'23! We’re calling on researchers to pre-train language models on (relatively) small datasets inspired by the input given to children learning language. babylm.github.io arxiv.org/abs/2301.11796

Announcing the BabyLM 👶 Challenge,
the shared task at <a href="/conll_conf/">CoNLL 2025</a>  and CMCL'23!

We’re calling on researchers to pre-train language models on (relatively) small datasets inspired by the input given to children learning language.

babylm.github.io
arxiv.org/abs/2301.11796
AI at Meta (@aiatmeta) 's Twitter Profile Photo

Today, Meta researchers together with MLCommons working group, are launching DataPerf, the first platform for building data & data-centric AI algorithm leaderboards. We're excited for how DataPerf will help to push the data-centric AI field forward ⬇️

Douwe Kiela (@douwekiela) 's Twitter Profile Photo

Progress in AI continues to outpace benchmarks. Check out this new plot, inspired by Dynabench, that shows just how quickly it's happening. Read more about it here: contextual.ai/plotting-progr…

Progress in AI continues to outpace benchmarks.

Check out this new plot, inspired by <a href="/DynabenchAI/">Dynabench</a>,  that shows just how quickly it's happening.

Read more about it here: contextual.ai/plotting-progr…