Romain Froger (@froger_romain) Twitter Tweets • TwiCopy

elvis

3 months ago

Very cool work from Meta Superintelligence Lab. They are open-sourcing Meta Agents Research Environments (ARE), the platform they use to create and scale agent environments. Great resource to stress-test agents in environments closer to real apps. Read on for more:

Likes: 1.1K, Replies: 39, Reposts: 185

Grégoire Mialon

@mialon_gregoire

3 months ago

🏗️ ARE: scaling up agent environments and evaluations. In the LLM+RL era, evals and envs are the bottleneck. Happy to release Gaia2, an extensible benchmark for agents aiming to reduce the sim2real gap, plus ARE, the platform in which Gaia2 is built. Enjoy evaluating your agents! 👇

Likes: 102, Replies: 1, Reposts: 28

Clémentine Fourrier 🍊

@clefourrier

3 months ago

Did you see that the Agent Research Environment is MCP compatible? Using any MCP tool with any agent is now completely trivial! Check it out! We've used an LLM agent to 1) move a robot arm remotely 2) depending on real-time web search results! :D How-to in thread ^^

Likes: 31, Replies: 1, Reposts: 9

Rohan Paul

@rohanpaul_ai

3 months ago

🧠Great research from Meta Superintelligence Labs. Proposes Meta Agents Research Environments (ARE) for scaling up agent environments and evaluations. ARE lets researchers build realistic agent environments, run agents asynchronously, and verify them cleanly. On top of it

Likes: 27, Replies: 2, Reposts: 11

Virginie Do

@gini_do

3 months ago

We released Gaia2 and ARE, our platform for agent environments and evals in the LLM+RL era! This was such a fun project to work on, I hope you like it too :)
Paper: arxiv.org/abs/2509.17158
Demo (Hugging Face): huggingface.co/spaces/meta-ag…
Blog post: huggingface.co/blog/gaia2

Likes: 13, Replies: 2, Reposts: 1

Virginie Do

@gini_do

3 months ago

Thanks to the dream team ✨🫶 Grégoire Mialon, Romain Froger, Dheeraj Mekala, Amine Benhalloum ✈️ NeurIPS, Pierre, Maxime, Ulyana Piterbarg, Thomas Scialom and everyone else!

Likes: 6, Replies: 0, Reposts: 1

clem 🤗

@clementdelangue

3 months ago

We need better agent evaluations! Glad to have collaborated with Meta Super Intelligence Lab to release Gaia2 and ARE! GPT5 (high) from OpenAI is leading on execution, search, ambiguity, adaptability and noise. Kimi-K2 from Kimi.ai is leading open weight. Full

Likes: 494, Replies: 19, Reposts: 53

Gabriel Synnaeve

@syhw

3 months ago

(🧵) Today, we release Meta Code World Model (CWM), a 32-billion-parameter dense LLM that enables novel research on improving code generation through agentic reasoning and planning with world models. ai.meta.com/research/publi…

Likes: 1.1K, Replies: 56, Reposts: 262

echen

@echen

3 months ago

Two years ago, I had the privilege of collaborating with Thomas Scialom and Grégoire Mialon at @MetaAI on GAIA, one of the first agentic benchmarks, designed to measure progress towards useful-in-the-real-world AGI. This week, the team launched Gaia2, built inside their new

Likes: 12, Replies: 3, Reposts: 4

Surge AI

@hellosurgeai

3 months ago

Thrilled to see @MetaAI launch 𝗚𝗮𝗶𝗮𝟮, built inside their new 𝗔𝗴𝗲𝗻𝘁 𝗥𝗟 𝗘𝗻𝘃𝗶𝗿𝗼𝗻𝗺𝗲𝗻𝘁 platform! 🚀 Proud that Surge AI helped contribute — just as we did with GAIA two years ago. A quick story 🧵

Likes: 11, Replies: 1, Reposts: 1

Axel Darmouni

@adarmouni

3 months ago

Read the ARE & GAIA2 paper from AI at Meta by Grégoire Mialon, Romain Froger, Amine Benhalloum and Thomas Scialom. Very, very interesting eval setup! Basically, they created the updated GAIA2 using a specific pattern of setup, and ARE (Agents Research

Likes: 9, Replies: 1, Reposts: 2

Grégoire Mialon

@mialon_gregoire

2 months ago

We released ARE and Gaia2 one week ago, time to share some observations and add new models to the leaderboard! huggingface.co/blog/meta-agen…

Likes: 23, Replies: 0, Reposts: 7

Tiezhen WANG

@xianbao_qian

22 days ago

Welcome Nex-N1, a new series of agentic foundational models, to Hugging Face

- available in different sizes: 8B, 30B, 32B and 671B
- strong in tool use, web search and real-world agentic workflows
- some of the SFT datasets have been open-sourced

Technical report coming soon!

Likes: 466, Replies: 19, Reposts: 75

Grégoire Mialon

@mialon_gregoire

13 days ago

I am at #NeurIPS2025! I am hiring an intern for our Paris team to succeed Dheeraj Mekala and Ulyana Piterbarg. DM me if you want to work on what's next for agents. I will also look back on Gaia and introduce Gaia2 at the Scaling Environments for Agents workshop on Sunday!

Likes: 94, Replies: 4, Reposts: 8

Romain Froger

@froger_romain

13 days ago

I'll be at the NeurIPS Conference in San Diego this week, together with the co-authors of ARE/Gaia2, Grégoire Mialon & Amine Benhalloum. Would love to connect: let's talk about what's next for agents!

Likes: 8, Replies: 0, Reposts: 2

elvis

@omarsar0

8 days ago

// THE CASE FOR ENVIRONMENT SCALING // Environment scaling may be as important as model scaling for agentic AI. Current AI research suggests that building a powerful agentic AI model isn't just about better reasoning. It's also about better environments. The default approach

Likes: 103, Replies: 8, Reposts: 13

Romain Froger

@froger_romain

7 days ago

Great to see Gaia2 being adopted by the community as a frontier eval! 😄 At #NeurIPS2025, we were so pleased to meet so many people building on top of ARE!

Likes: 8, Replies: 0, Reposts: 0