Michal Valko(@misovalko) 's Twitter Profileg
Michal Valko

@misovalko

Llama @AIatMeta Paris & Inria & MVA - Ex: Gemini and BYOL @GoogleDeepMind

ID:14346204

linkhttps://misovalko.github.io/ calendar_today09-04-2008 22:16:09

1,3K Tweets

5,1K Followers

2,2K Following

Victor Sanh(@SanhEstPasMoi) 's Twitter Profile Photo

New multimodal model in town: Idefics2!

💪 Strong 8B-parameters model: often on par with open 30B counterparts.
🔓Open license: Apache 2.0.
🚀 Strong improvement over Idefics1: +12 points on VQAv2, +30 points on TextVQA while having 10x fewer parameters.
📚 Better data:…

account_circle
ICLR 2024(@iclr_conf) 's Twitter Profile Photo

Announcing the Invited Speakers:
blog.iclr.cc/2024/04/15/ann…

Kyunghyun Cho (Kyunghyun Cho)
Priya Donti (@priyald17)
Kate Downing (katedowninglaw.com)
Raia Hadsell (@RaiaHadsell)
Moritz Hardt (mrtz.org)
Devi Parikh (@deviparikh)
Jie Tang (@jietang)

Announcing the #ICLR2024 Invited Speakers: blog.iclr.cc/2024/04/15/ann… Kyunghyun Cho (@kchonyc) Priya Donti (@priyald17) Kate Downing (katedowninglaw.com) Raia Hadsell (@RaiaHadsell) Moritz Hardt (mrtz.org) Devi Parikh (@deviparikh) Jie Tang (@jietang)
account_circle
Max Welling(@wellingmax) 's Twitter Profile Photo

After two good years at Microsoft Research AI4Science, I am very excited to announce that as of this month I have, together with Chad Edwards, co-founded a new startup in the field of molecular and materials discovery.

account_circle
Omar Sanseviero(@osanseviero) 's Twitter Profile Photo

Introducing: Zephyr 141B-A35B 🥁

🔥Mixtral-8x22B fine-tune
🤯 Using DORPO: new alignment algorithm (no SFT, open )
🚀 With 7k instances of (open) data

Very strong IFEval, BBH, AGIEval... Enjoy! 🤗

hf.co/HuggingFaceH4/…

account_circle
Michal Valko(@misovalko) 's Twitter Profile Photo

Come and join us within Naila Murray’s org who leads the foundational and exploratory research in Europe!

Apply at:
lnkd.in/eC3RaWND
lnkd.in/e5R5-QJr

AI at Meta AI at Meta Meta Open Source

account_circle
Google DeepMind(@GoogleDeepMind) 's Twitter Profile Photo

Our generative technology Imagen 2 can now create short, 4-second live images from a single prompt. 🖼

It’s available to use in @GoogleCloud’s platform. → dpmd.ai/43QZrOt

account_circle
Quentin Berthet(@qberthet) 's Twitter Profile Photo

The 2024 Google PhD Fellowship awards is accepting student nominations here through May 8!

This program supports graduate students doing exceptional and innovative research in computer science and related fields as they pursue their PhD.

See details: research.google/programs-and-e…

account_circle
Mistral AI(@MistralAI) 's Twitter Profile Photo

magnet:?xt=urn:btih:9238b09245d0d8cd915be09927769d5f7584c1c9&dn=mixtral-8x22b&tr=udp%3A%2F%2Fopen.demonii.com%3A1337%2Fannounce&tr=http%3A%2F%https://t.co/OdtBUsbeV5%3A1337%2Fannounce

account_circle
Clément(@clmt) 's Twitter Profile Photo

Gemma is expanding.... we just announced CodeGemma, a version of Gemma tuned for code generation. And bonus... Gemma is now bumped to v1.1, addressing lots of feedback we got.

Congrats Gemma team for one more amazing release!

developers.googleblog.com/2024/04/gemma-…

account_circle
Samuel L Smith(@SamuelMLSmith) 's Twitter Profile Photo

Announcing RecurrentGemma!
github.com/google-deepmin…

- A 2B model with open weights based on Griffin
- Replaces transformer with mix of gated linear recurrences and local attention
- Competitive with Gemma-2B on downstream evals
- Higher throughput when sampling long sequences

Announcing RecurrentGemma! github.com/google-deepmin… - A 2B model with open weights based on Griffin - Replaces transformer with mix of gated linear recurrences and local attention - Competitive with Gemma-2B on downstream evals - Higher throughput when sampling long sequences
account_circle
Omar Sanseviero(@osanseviero) 's Twitter Profile Photo

This week was one of the most fun ever 😍

- Met Jonathan Frankle (DBRX), Sophia Yang, Ph.D. (Mistral) and Yann LeCun (🐐) in person
- Met super interesting people from the Llama team (@misovalko @thomasscialom) and collaborators (@christiankeller, Code Llama team, and more)
- Saw Sara Hooker

This week was one of the most fun ever 😍 - Met @jefrankle (DBRX), @sophiamyang (Mistral) and @ylecun (🐐) in person - Met super interesting people from the Llama team (@misovalko @thomasscialom) and collaborators (@christiankeller, Code Llama team, and more) - Saw @sarahookr…
account_circle
cohere(@cohere) 's Twitter Profile Photo

Today, we’re introducing Command R+: a state-of-the-art RAG-optimized LLM designed to tackle enterprise-grade workloads and speak the languages of global business.

Our R-series model family is now available on Microsoft Azure, and coming soon to additional cloud providers.

Today, we’re introducing Command R+: a state-of-the-art RAG-optimized LLM designed to tackle enterprise-grade workloads and speak the languages of global business. Our R-series model family is now available on Microsoft Azure, and coming soon to additional cloud providers.
account_circle