Stéphane Clinchant (@sclincha)'s Twitter Profile
Stéphane Clinchant

@sclincha

ID: 2873971593

Joined: 12-11-2014 17:30:54

86 Tweets

118 Followers

206 Following

Matei Zaharia (@matei_zaharia)

Want to build your own chat AI from scratch? We're launching a Building LLMs course at #DataAISummit to teach everyone how to build a Dolly clone: databricks.com/dataaisummit. Tiny model, big attitude, for anyone. #DemocratizeAI

Aran Komatsuzaki (@arankomatsuzaki)

Multimodal C4: An Open, Billion-scale Corpus of Images Interleaved With Text. 103M documents containing 585M images interleaved with 43B English tokens. github.com/allenai/mmc4

Databricks Mosaic Research (@dbrxmosaicai)

📢 Introducing MPT: a new family of open-source, commercially usable LLMs from @MosaicML. Trained on 1T tokens of text+code, MPT models match and, in many ways, surpass LLaMa-7B. This release includes 4 models: MPT-Base, Instruct, Chat, & StoryWriter (🧵) mosaicml.com/blog/mpt-7b

NAVER LABS Europe (@naverlabseurope)

Before signing off for the weekend, sign up for next week's 📢 open virtual seminar with Alireza Mohammadshahi (Ph.D., EPFL and @idiap_ch-@uzh_en): "Reference-Free Metric for Evaluating Question Generation by Answering the Question". 📅 Tue 16th May, 9.30am CEST. Register: tinyurl.com/uxzvaddr

AToMiC@TREC2023 (@trec_atomic)

📢 REMINDER for TREC-AToMiC participants! News: Test topics are out! 🎉 Check them here: trec-atomic.github.io/annoucements/t…. We've carefully selected 200 sections from vital Wikipedia articles. Get ready for some fascinating exploration! Happy searching! 🚀

Yeskendir 🇰🇿 (@yeskendir_k)

Excited to share our #ACL2023 paper "Memory-efficient NLLB-200: Language-specific Expert Pruning of a Massively Multilingual Machine Translation Model". arXiv: arxiv.org/abs/2212.09811 Joint work w/ Alexandre Berard and Vassilina Nikoulina during my internship at NAVER LABS Europe. 1/6

Nadia Chirkova (@nadiinchi)

I will present our #ACL paper “Should you marginalize over possible tokenizations?” on Wednesday, Jul 12, 11:00-12:30, at Poster session 7! Come to chat about tokenization in LMs. w/ Germán Kruszewski, Jos Rozen, and Marc Dymetman. arxiv.org/abs/2306.17757

Laure Soulier (@lauresoulier)

What a great pleasure and honor to share this session about generative AI, ethics, bias, and politics with 3 passionate speakers Philippe Limantour, Andrew Wyckoff, and Juha Heikkilä. Thanks AI2S2 Symposium for the invitation. See you in Geneva on Monday!

Stéphane Clinchant (@sclincha)

😀 We're looking for a talented researcher to join our team at NAVER LABS Europe, working on LLMs and Retrieval! 😃 Please apply here: europe.naverlabs.com/job/research-s…

Vaibhav (VB) Srivastav (@reach_vb)

AllenAI COOKED, Llama 3.1 Tulu 405B beats DeepSeek V3 - all whilst being 40% SMALLER! 🔥 Fully open model weights, data and training pipeline 🤗

Nadia Chirkova (@nadiinchi)

Arrived in Singapore for #ICLR2025 and will be presenting PROVENCE on Friday, Poster session 3 at 10am, poster #255! Blogpost: huggingface.co/blog/nadiinchi… Will be happy to meet & chat about #LLMs, #RAG, #InformationRetrieval and #MultilingualNLP :) #NLProc NAVER LABS Europe

Xin Eric Wang @ ICLR 2025 (@xwang_lk)

𝘏𝘶𝘮𝘢𝘯𝘴 𝘵𝘩𝘪𝘯𝘬 𝘧𝘭𝘶𝘪𝘥𝘭𝘺—𝘯𝘢𝘷𝘪𝘨𝘢𝘵𝘪𝘯𝘨 𝘢𝘣𝘴𝘵𝘳𝘢𝘤𝘵 𝘤𝘰𝘯𝘤𝘦𝘱𝘵𝘴 𝘦𝘧𝘧𝘰𝘳𝘵𝘭𝘦𝘴𝘴𝘭𝘺, 𝘧𝘳𝘦𝘦 𝘧𝘳𝘰𝘮 𝘳𝘪𝘨𝘪𝘥 𝘭𝘪𝘯𝘨𝘶𝘪𝘴𝘵𝘪𝘤 𝘣𝘰𝘶𝘯𝘥𝘢𝘳𝘪𝘦𝘴. But current reasoning models remain constrained by discrete tokens, limiting their full
