Virginie Do (@gini_do) 's Twitter Profile
Virginie Do

@gini_do

gêne AI researcher 😬 @AIatMeta

ID: 916940326132224000

linkhttps://virginie-do.github.io calendar_today08-10-2017 08:16:24

127 Tweet

666 Followers

399 Following

Hailey Nguyen (@hailey_huong) 's Twitter Profile Photo

Thrilled to see this result, showing our dedication to align Llama to not only be helpful but also capable of understanding safety nuances, with strong generalization to unseen external benchmarks!

Andrei Lupu (@_andreilupu) 's Twitter Profile Photo

Eagle-eyed readers might have spotted 🌈 Rainbow Teaming among the key prompt generation techniques used for Llama 3 This was always the North Star for our project, and I'm thrilled that Mikayel Samvelyan, Sharath Raparthy and I pulled it off! Here's what we learned along the way 🧵

Eagle-eyed readers might have spotted 🌈 Rainbow Teaming among the key prompt generation techniques used for Llama 3

This was always the North Star for our project, and I'm thrilled that <a href="/_samvelyan/">Mikayel Samvelyan</a>, <a href="/sharathraparthy/">Sharath Raparthy</a> and I pulled it off!

Here's what we learned along the way 🧵
kyutai (@kyutai_labs) 's Twitter Profile Photo

Today, we release several Moshi artifacts: a long technical report with all the details behind our model, weights for Moshi and its Mimi codec, along with streaming inference code in Pytorch, Rust and MLX. More details below 🧵 ⬇️ Paper: kyutai.org/Moshi.pdf Repo:

Jamelle Watson-Daniels (@jamellewd) 's Twitter Profile Photo

🚨We are hiring PhD Interns at Meta Ai for summer 2025! 🚨 Broadly, my team is working on understanding and improving the intersection between Ai and Society (social systems). We have an excellent team of researchers excited to have you! … 🧵1/3

Neel Nanda (@neelnanda5) 's Twitter Profile Photo

This seems like really cool work, I'm excited to give it a read! It's awesome that our paper on interpretable jailbreaks, led by @littlefish3625, can be leveraged to make models more robust. This seems like a nice case study in the real world applications of interpretability!

Wassim (Wes) Bouaziz (@_vassim) 's Twitter Profile Photo

Want to know if a ML model was trained on your dataset? Introducing ✨Data Taggants✨! We use data poisoning to leave a harmless and stealthy signature on your dataset that radiates through trained models. Learn how to protect your dataset from unauthorized use... A 🧵

Virginie Do (@gini_do) 's Twitter Profile Photo

My twitter feed this weekend is full of people praising Roberta and recommending this internship opportunity 😂 I agree so I’m adding one more tweet to this list!

Adrien Bardes (@adrienbardes) 's Twitter Profile Photo

Job alert 🚨 My team AI at Meta is looking for a PhD intern to join us in 2025 in Paris. We are working on self-supervised learning from video, world modelling and JEPA ! Apply here or reach out directly: metacareers.com/jobs/168411027…

Roberta Raileanu (@robertarail) 's Twitter Profile Photo

Super excited to share 🧠MLGym 🦾 – the first Gym environment for AI Research Agents 🤖🔬 We introduce MLGym and MLGym-Bench, a new framework and benchmark for evaluating and developing LLM agents on AI research tasks. The key contributions of our work are: 🕹️ Enables the

Virginie Do (@gini_do) 's Twitter Profile Photo

I am at #ICLR and honored to present this work on Saturday afternoon at the poster session. Thanks Lei Yu Karen Hambardzumyan Nicola Cancedda for this wonderful collaboration! I am also happy to chat about Llama / agents / safety 👋

elvis (@omarsar0) 's Twitter Profile Photo

Very cool work from Meta Superintelligence Lab. They are open-sourcing Meta Agents Research Environments (ARE), the platform they use to create and scale agent environments. Great resource to stress-test agents in environments closer to real apps. Read on for more:

Very cool work from Meta Superintelligence Lab.

They are open-sourcing Meta Agents Research Environments (ARE), the platform they use to create and scale agent environments.

Great resource to stress-test agents in environments closer to real apps.

Read on for more:
Rohan Paul (@rohanpaul_ai) 's Twitter Profile Photo

🧠Great research from Meta Superintelligence Labs. Proposes Meta Agents Research Environments (ARE) for scaling up agent environments and evaluations. ARE lets researchers build realistic agent environments, run agents asynchronously, and verify them cleanly. On top of it

🧠Great research from <a href="/Meta/">Meta</a> Superintelligence Labs.

Proposes Meta Agents Research Environments (ARE) for scaling up agent environments and evaluations.

ARE lets researchers build realistic agent environments, run agents asynchronously, and verify them cleanly.

On top of it
clem 🤗 (@clementdelangue) 's Twitter Profile Photo

We need better agent evaluations! Glad to have collaborated with Meta Super Intelligence Lab to release Gaia2 and ARE! GPT5 (high) from OpenAI is leading on execution, search, ambiguity, adaptability and noise. Kimi-K2 from Kimi.ai is leading open weight. Full

We need better agent evaluations! Glad to have collaborated with <a href="/Meta/">Meta</a> Super Intelligence Lab to release Gaia2 and ARE! 

GPT5 (high) from <a href="/OpenAI/">OpenAI</a> is leading on execution, search, ambiguity, adaptability and noise.

Kimi-K2 from <a href="/Kimi_Moonshot/">Kimi.ai</a> is leading open weight.

Full
Romain Froger (@froger_romain) 's Twitter Profile Photo

Gaia2 Leaderboard Update: DeepSeek is leading OSS models! We’ve added fresh models (DeepSeek v3.1, Qwen 235B thinking, GPT-OSS 120B) and uncovered cool insights on cost, reasoning, and efficiency. Blogpost 👉 tinyurl.com/ybvxtmny

Gaia2 Leaderboard Update: DeepSeek is leading OSS models!  
We’ve added fresh models (DeepSeek v3.1, Qwen 235B thinking, GPT-OSS 120B) and uncovered cool insights on cost, reasoning, and efficiency.  

Blogpost 👉 tinyurl.com/ybvxtmny