Thibaut Thonet
@tthonet
Research scientist at @naverlabseurope. Interested in NLProc, Machine Learning, Information Retrieval. Any pronouns.
ID: 2876226376
14-11-2014 09:12:06
538 Tweets
411 Followers
816 Following
After spending just 20 minutes with the Mistral AI model, I am shocked by how unsafe it is. It is very rare these days to see a new model so readily reply to even the most malicious instructions. I am super excited about open-source LLMs, but this can't be it! Examples below 🧵
😀 We're looking for a talented researcher to join our team at NAVER LABS Europe, working on LLMs and Retrieval! 😃 Please apply here: europe.naverlabs.com/job/research-s… !
What’s a good baseline for RAG? 🤔 The literature shows consistent differences in experimental setups, retrievers, datasets, and metrics. So, we built the BERGEN library (github.com/naver/bergen) to enhance reproducibility and identify strong baselines: 🧵 NAVER LABS Europe
🧵 Excited to share PRISM, our new methodology for auditing biases in Large Language Models, as part of @PHAWM_project w/ Leif Azzopardi. This flexible, role-based framework can be tailored to audit biases along political, social, or any other dimensions. Responsible Ai UK arxiv.org/abs/2410.18906
📢 Job Alert 📢 NAVER LABS Europe My team is looking for a Research Scientist in Visual Representation Learning - More info tinyurl.com/3jz935fk
If you are at #EMNLP2024 and interested in RAG, come to discuss our BERGEN library! Vassilina Nikoulina will present the BERGEN poster tomorrow, Nov 13, 16:00-17:30, location: Jasmine Repo: github.com/naver/bergen Paper: aclanthology.org/2024.findings-… NAVER LABS Europe #NLProc #RAG
I'll present two papers at the main COLING 2025 conference next week: (1) ELITR-Bench: A Meeting Assistant Benchmark for Long-Context Language Models (w/ Thibaut Thonet and Jos Rozen) - see arxiv.org/abs/2403.20262 and github.com/utter-project/…
We are #hiring for Internship: Advanced Constraint Processing in LLMs at Naver Labs Europe. We are attending COLING 2025 if you would like to meet!
Excited to share that Provence is accepted to #ICLR2025! Provence is a method for training an efficient & high-performing context pruner for #RAG, either standalone or combined with a reranker huggingface.co/blog/nadiinchi… w/ @thibault_formal Vassilina Nikoulina Stéphane Clinchant NAVER LABS Europe
🚀 AutoMin 2025 – 3rd Edition of the Automatic Minuting Shared Task at #SIGDIAL2025! NAVER LABS Europe & UTTER push LLMs in: 📌 Meeting Summarization 📌 Meeting QA 📅 Aug 28, 2025 Join the future of NLP, AI & Meeting Understanding! 🔗 ufal.github.io/automin-2025/ #NLP #LLM #UTTER
🎉 Delighted to share that our work “FaST: Feature-aware Sampling and Tuning for Personalized Preference Alignment with Limited Data” has been accepted to #EMNLP2025 main! 🧑🏻‍💻 Joint work with Germán Kruszewski, Jos Rozen, Pierre Erbacher, Marc Dymetman 📄 Preprint: arxiv.org/abs/2508.04698