Stéphane Clinchant (@sclincha) Twitter Tweets • TwiCopy


Matei Zaharia

Want to build your own chat AI from scratch? We're launching a Building LLMs course at #DataAISummit to teach everyone how to build a Dolly clone: databricks.com/dataaisummit. Tiny model, big attitude, for anyone. #DemocratizeAI

209 likes, 5 replies, 38 reposts

Jonathan Frankle

@jefrankle

2 years ago

Sequence length 65k, anyone?

35 likes, 1 reply, 6 reposts

Aran Komatsuzaki

@arankomatsuzaki

2 years ago

Multimodal C4: An Open, Billion-scale Corpus of Images Interleaved With Text 103M documents containing 585M images interleaved with 43B English tokens github.com/allenai/mmc4

308 likes, 6 replies, 71 reposts

Databricks Mosaic Research

@dbrxmosaicai

2 years ago

📢 Introducing MPT: a new family of open-source commercially usable LLMs from @MosaicML. Trained on 1T tokens of text+code, MPT models match and - in many ways - surpass LLaMa-7B. This release includes 4 models: MPT-Base, Instruct, Chat, & StoryWriter (🧵) mosaicml.com/blog/mpt-7b

1.1K likes, 22 replies, 213 reposts

NAVER LABS Europe

@naverlabseurope

2 years ago

Before signing off for the weekend, sign up for next week's 📢 open virtual seminar w/ Alireza Mohammadshahi, Ph.D. EPFL and @idiap_ch-@uzh_en! Reference-Free Metric for Evaluating Question Generation by Answering the Question 📅 Tue 16th May, 9.30am CEST. Register: tinyurl.com/uxzvaddr

8 likes, 0 replies, 3 reposts

AToMiC@TREC2023

@trec_atomic

2 years ago

📢 REMINDER for TREC-AToMiC participants! News: Test topics are out! 🎉 Check them here: trec-atomic.github.io/annoucements/t…. We've carefully selected 200 sections from vital Wikipedia articles. Get ready for some fascinating exploration! Happy searching! 🚀

1 like, 0 replies, 2 reposts

Yeskendir 🇰🇿

@yeskendir_k

2 years ago

Excited to share our #ACL2023 paper "Memory-efficient NLLB-200: Language-specific Expert Pruning of a Massively Multilingual Machine Translation Model". arXiv: arxiv.org/abs/2212.09811 Joint work w/ Alexandre Berard, Vassilina Nikoulina during my internship at NAVER LABS Europe 1/6

14 likes, 2 replies, 3 reposts

Jos Rozen

@josrzn

2 years ago

Outstanding!

10 likes, 1 reply, 3 reposts

Nadia Chirkova

@nadiinchi

2 years ago

I will present our #ACL paper “Should you marginalize over possible tokenizations?” on Wednesday, Jul 12, 11:00-12:30, at Poster session 7! Come to chat about tokenization in LMs. w/ Germán Kruszewski Jos Rozen Marc Dymetman. arxiv.org/abs/2306.17757

23 likes, 3 replies, 5 reposts

Laure Soulier

@lauresoulier

2 years ago

What a great pleasure and honor to share this session about generative AI, ethics, bias, and politics with 3 passionate speakers Philippe Limantour, Andrew Wyckoff, and Juha Heikkilä. Thanks AI2S2 Symposium for the invitation. See you in Geneva on Monday!

14 likes, 0 replies, 2 reposts

Stéphane Clinchant

@sclincha

a year ago

😀 We're looking for a talented researcher to join our team at NAVER LABS Europe, working on LLMs and Retrieval! 😃 Please apply here: europe.naverlabs.com/job/research-s…

42 likes, 0 replies, 20 reposts

Vaibhav (VB) Srivastav

@reach_vb

6 months ago

AllenAI COOKED, Llama 3.1 Tulu 405B beats DeepSeek V3 - all whilst being 40% SMALLER! 🔥 Fully open model weights, data and training pipeline 🤗

435 likes, 18 replies, 59 reposts

Nadia Chirkova

@nadiinchi

3 months ago

Arrived in Singapore for #ICLR2025 and will be presenting PROVENCE on Friday, Poster session 3 at 10am, poster #255! Blogpost: huggingface.co/blog/nadiinchi… Will be happy to meet & chat about #LLMs, #RAG, #InformationRetrieval and #MultilingualNLP :) #NLProc NAVER LABS Europe

15 likes, 2 replies, 4 reposts

Xin Eric Wang @ ICLR 2025

@xwang_lk

2 months ago

Humans think fluidly, navigating abstract concepts effortlessly, free from rigid linguistic boundaries. But current reasoning models remain constrained by discrete tokens, limiting their full

931 likes, 27 replies, 136 reposts
