SEA Workshop (@seaworkshop) 's Twitter Profile
SEA Workshop

@seaworkshop

Scaling Environments for Agents (SEA) Workshop (NeurIPS 2025)

ID: 1997445184502837248

linkhttps://sea-workshop.github.io/ calendar_today06-12-2025 23:17:29

58 Tweet

68 Followers

3 Following

Changyu Chen (@cameron_chann) 's Twitter Profile Photo

Would be around the scaling environments for agents workshop #NeurIPS2025 today 🌊 Come say hi and chat about GEM 💎, RL and experience scaling 💡 gem: arxiv.org/pdf/2510.01051

Would be around the scaling environments for agents workshop #NeurIPS2025 today 🌊

Come say hi and chat about GEM 💎, RL and experience scaling 💡

gem: arxiv.org/pdf/2510.01051
Rebecca Qian (@rebeccatqian) 's Twitter Profile Photo

Come see Darshan Deshpande and Varun Gangal present MEMTRACK at the Scaling Environments for Agents workshop right now at Upper Level 23ABC at #NeurIPS2025 San Diego🔥 We built a RL environment that measures long-term memory and state tracking by putting agents in a workplace with

Come see <a href="/getdarshan/">Darshan Deshpande</a> and <a href="/VarunGangal/">Varun Gangal</a> present MEMTRACK at the Scaling Environments for Agents workshop right now at Upper Level 23ABC at #NeurIPS2025 San Diego🔥

We built a RL environment that measures long-term memory and state tracking by putting agents in a workplace with
Ayush Noori (@ayushnoori) 's Twitter Profile Photo

Now presenting our second poster, “Enabling multi-agent collaboration in knowledge graph environments,” at the NeurIPS Conference Scaling Environments for Agents workshop, wrapping up soon! ft. Yusuf who stopped by to say hi, always nice to catch up with friends at NeurIPS 😁

Now presenting our second poster, “Enabling multi-agent collaboration in knowledge graph environments,” at the <a href="/NeurIPSConf/">NeurIPS Conference</a> Scaling Environments for Agents workshop, wrapping up soon!

ft. Yusuf who stopped by to say hi, always nice to catch up with friends at NeurIPS 😁
SEA Workshop (@seaworkshop) 's Twitter Profile Photo

Congrats to the following paper authors attaining Outstanding Paper Awards at SEA Workshop! RPGBENCH: Evaluating Large Language Models as Role-Playing Game Engines Pengfei Yu, Dongming Shen, Silin Meng, Jaewon Lee, Weisu Yin, Andrea Yaoyun Cui, Zhenlin Xu, Yi Zhu, Xingjian Shi,

Congrats to the following paper authors attaining Outstanding Paper Awards at <a href="/SEAWorkshop/">SEA Workshop</a>!

RPGBENCH: Evaluating Large Language Models as Role-Playing Game Engines

Pengfei Yu, Dongming Shen, Silin Meng, Jaewon Lee, Weisu Yin, Andrea Yaoyun Cui, Zhenlin Xu, Yi Zhu, Xingjian Shi,
SEA Workshop (@seaworkshop) 's Twitter Profile Photo

Congrats to the following paper authors attaining Outstanding Paper Awards at SEA Workshop! GEM: A Gym for Agentic LLMs Zichen Liu, Anya Sims, Keyu Duan, Changyu Chen, Haotian Xu, Simon Yu, Chenmien Tan, Shaopan Xiong, Weixun Wang, Bo Liu, Hao Zhu, Weiyan Shi, Diyi Yang, Wee

Congrats to the following paper authors attaining Outstanding Paper Awards at <a href="/SEAWorkshop/">SEA Workshop</a>!

GEM: A Gym for Agentic LLMs

Zichen Liu, Anya Sims, Keyu Duan, Changyu Chen, Haotian Xu, Simon Yu, Chenmien Tan, Shaopan Xiong, Weixun Wang, Bo Liu, Hao Zhu, Weiyan Shi, Diyi Yang, Wee
SEA Workshop (@seaworkshop) 's Twitter Profile Photo

The best poster awards go to: 1. Go-Browse: Training Web Agents with Structured Exploration Apurva Gandhi, Graham Neubig 2. Scaling Open-Ended Reasoning to Predict the Future Nikhil Chandak, Shashwat Goel, Ameya Prabhu, Moritz Hardt, Jonas Geiping 🎉Congrats!

The best poster awards go to:

1. Go-Browse: Training Web Agents with Structured Exploration
Apurva Gandhi, Graham Neubig

2. Scaling Open-Ended Reasoning to Predict the Future
Nikhil Chandak, Shashwat Goel, Ameya Prabhu, Moritz Hardt, Jonas Geiping

🎉Congrats!
vincent sunn chen (@vincentsunnchen) 's Twitter Profile Photo

Excellent discussion about closing the agent post-training → evaluation gap SEA Workshop today: 1. Scaling complexity: To truly test unseen/real-world capabilities, we need custom environments that scale on specific axes, e.g. tools, open-endedness, dynamic/continuous

Changyu Chen (@cameron_chann) 's Twitter Profile Photo

🏆 Excited to share GEM received the Outstanding Paper Award SEA Workshop of #NeurIPS2025 . What a great way to wrap up this amazing neurips journey! Huge thanks to the workshop committee and organizers for the recognition. Grateful for our incredible collaborators and advisors

🏆 Excited to share GEM received the Outstanding Paper Award <a href="/SEAWorkshop/">SEA Workshop</a> of #NeurIPS2025 . What a great way to wrap up this amazing neurips journey!

Huge thanks to the workshop committee and organizers for the recognition.

Grateful for our incredible collaborators and advisors
Shuyan Zhou (@shuyanzhxyc) 's Twitter Profile Photo

It was really fun to meet new people and discuss agent environments. Thanks to the workshop organizers for putting together such a great event! Here is the slide deck from the talk: shuyanzhou.com/assets/slides/…

Eigent AI (@eigent_ai) 's Twitter Profile Photo

Excited to collaborate with Z.ai using GLM-4.7 on Eigent. We asked Eigent to organize today’s work files and generate a daily report. - The task is split across agents - The developer agent creates a local folder and moves the files - The document agent generates the report

Daniel Furelos-Blanco (@_danielfb) 's Twitter Profile Photo

🚀 Last December we presented ATLAS at NeurIPS SEA Workshop. Now thrilled to share at #AAAI2026 AAAI! ⚠️ RL policy generalization across tasks & levels is hard, especially when some tasks aren't realizable. 🗺️ ATLAS tackles this via autocurricula over tasks AND levels 🧵

CAMEL-AI.org (@camelaiorg) 's Twitter Profile Photo

🚨 CAMEL-AI Live Talk: We have invited Zixuan Ke (Research Scientist at Salesforce AI Research) to share MAS-ZERO: Designing Multi-Agent Systems with Zero Supervision This paper has been accepted to the SEA Workshop at NeurIPS 2025, proposing a zero-supervision framework for

🚨 CAMEL-AI Live Talk: We have invited <a href="/KeZixuan/">Zixuan Ke</a> (Research Scientist at Salesforce AI Research) to share MAS-ZERO: Designing Multi-Agent Systems with Zero Supervision 

This paper has been accepted to the <a href="/SEAWorkshop/">SEA Workshop</a> at NeurIPS 2025, proposing a zero-supervision framework for
CAMEL-AI.org (@camelaiorg) 's Twitter Profile Photo

Open AI and Anthropic just release GPT-5.3-Codex and Opus 4.6 model, terminal capability is now on top of their list evaluating modal capability. But terminal training hits a wall fast: there aren’t enough high-quality environments. In SETA, we just shipped 1,376 validated

Prime Intellect (@primeintellect) 's Twitter Profile Photo

Introducing Lab: A full-stack platform for training your own agentic models Build, evaluate and train on your own environments at scale without managing the underlying infrastructure. Giving everyone their own frontier AI lab.

Snorkel AI (@snorkelai) 's Twitter Profile Photo

We’re incredibly excited to launch Open Benchmarks Grants, a new program committing $3M in grants to fund new open source benchmarks advancing agentic AI. We’re partnering up with @HuggingFace, Together AI, Prime Intellect, Factory HQ, Harbor Framework, and PyTorch to

CAMEL-AI.org (@camelaiorg) 's Twitter Profile Photo

🚨 CAMEL AI Live Talk this Friday: Dr. Kernel: Reinforcement Learning Done Right for Triton Kernel Generations with Wei Liu Join the CAMEL-AI 🐫 Community to explore Dr. Kernel, a reinforcement learning framework for generating high-performance Triton kernels, enabling scalable

🚨 CAMEL AI Live Talk this Friday: Dr. Kernel: Reinforcement Learning Done Right for Triton Kernel Generations with Wei Liu

Join the CAMEL-AI 🐫 Community to explore Dr. Kernel, a reinforcement learning framework for generating high-performance Triton kernels, enabling scalable
CAMEL-AI.org (@camelaiorg) 's Twitter Profile Photo

MiroFish recently ranked #1 on GitHub Trending. The multi-agent social simulation behind it? Built on OASIS. 🏝️ This Friday, we're bringing OASIS first author Ziyi Yang to CAMEL Live Talk 📢 to break down how you simulate 1 million LLM agents and what it reveals about how

MiroFish recently ranked #1 on GitHub Trending. The multi-agent social simulation behind it? Built on OASIS. 🏝️

This Friday, we're bringing OASIS first author Ziyi Yang to CAMEL Live Talk 📢 to break down how you simulate 1 million LLM agents and what it reveals about how