SEA Workshop (@seaworkshop) Twitter Tweets • TwiCopy

Changyu Chen

5 months ago

Would be around the scaling environments for agents workshop #NeurIPS2025 today 🌊 Come say hi and chat about GEM 💎, RL and experience scaling 💡 gem: arxiv.org/pdf/2510.01051

thumb_up_off_alt99

chat_bubble_outline2

repeat15

shareShare

Come see Darshan Deshpande and Varun Gangal present MEMTRACK at the Scaling Environments for Agents workshop right now at Upper Level 23ABC at #NeurIPS2025 San Diego🔥 We built a RL environment that measures long-term memory and state tracking by putting agents in a workplace with

Come see <a href="/getdarshan/">Darshan Deshpande</a> and <a href="/VarunGangal/">Varun Gangal</a> present MEMTRACK at the Scaling Environments for Agents workshop right now at Upper Level 23ABC at #NeurIPS2025 San Diego🔥

We built a RL environment that measures long-term memory and state tracking by putting agents in a workplace with

thumb_up_off_alt33

chat_bubble_outline4

repeat6

shareShare

Ayush Noori

@ayushnoori

5 months ago

Now presenting our second poster, “Enabling multi-agent collaboration in knowledge graph environments,” at the NeurIPS Conference Scaling Environments for Agents workshop, wrapping up soon! ft. Yusuf who stopped by to say hi, always nice to catch up with friends at NeurIPS 😁

Now presenting our second poster, “Enabling multi-agent collaboration in knowledge graph environments,” at the <a href="/NeurIPSConf/">NeurIPS Conference</a> Scaling Environments for Agents workshop, wrapping up soon!

ft. Yusuf who stopped by to say hi, always nice to catch up with friends at NeurIPS 😁

thumb_up_off_alt28

chat_bubble_outline0

repeat4

shareShare

SEA Workshop

@seaworkshop

5 months ago

Congrats to the following paper authors attaining Outstanding Paper Awards at SEA Workshop! RPGBENCH: Evaluating Large Language Models as Role-Playing Game Engines Pengfei Yu, Dongming Shen, Silin Meng, Jaewon Lee, Weisu Yin, Andrea Yaoyun Cui, Zhenlin Xu, Yi Zhu, Xingjian Shi,

Congrats to the following paper authors attaining Outstanding Paper Awards at <a href="/SEAWorkshop/">SEA Workshop</a>!

RPGBENCH: Evaluating Large Language Models as Role-Playing Game Engines

Pengfei Yu, Dongming Shen, Silin Meng, Jaewon Lee, Weisu Yin, Andrea Yaoyun Cui, Zhenlin Xu, Yi Zhu, Xingjian Shi,

thumb_up_off_alt9

chat_bubble_outline3

repeat2

shareShare

SEA Workshop

@seaworkshop

5 months ago

Congrats to the following paper authors attaining Outstanding Paper Awards at SEA Workshop! GEM: A Gym for Agentic LLMs Zichen Liu, Anya Sims, Keyu Duan, Changyu Chen, Haotian Xu, Simon Yu, Chenmien Tan, Shaopan Xiong, Weixun Wang, Bo Liu, Hao Zhu, Weiyan Shi, Diyi Yang, Wee

Congrats to the following paper authors attaining Outstanding Paper Awards at <a href="/SEAWorkshop/">SEA Workshop</a>!

GEM: A Gym for Agentic LLMs

Zichen Liu, Anya Sims, Keyu Duan, Changyu Chen, Haotian Xu, Simon Yu, Chenmien Tan, Shaopan Xiong, Weixun Wang, Bo Liu, Hao Zhu, Weiyan Shi, Diyi Yang, Wee

thumb_up_off_alt22

chat_bubble_outline3

repeat4

shareShare

SEA Workshop

@seaworkshop

5 months ago

The best poster awards go to: 1. Go-Browse: Training Web Agents with Structured Exploration Apurva Gandhi, Graham Neubig 2. Scaling Open-Ended Reasoning to Predict the Future Nikhil Chandak, Shashwat Goel, Ameya Prabhu, Moritz Hardt, Jonas Geiping 🎉Congrats!

thumb_up_off_alt20

chat_bubble_outline2

repeat8

shareShare

vincent sunn chen

@vincentsunnchen

5 months ago

Excellent discussion about closing the agent post-training → evaluation gap SEA Workshop today: 1. Scaling complexity: To truly test unseen/real-world capabilities, we need custom environments that scale on specific axes, e.g. tools, open-endedness, dynamic/continuous

thumb_up_off_alt14

chat_bubble_outline2

repeat2

shareShare

Changyu Chen

@cameron_chann

5 months ago

🏆 Excited to share GEM received the Outstanding Paper Award SEA Workshop of #NeurIPS2025 . What a great way to wrap up this amazing neurips journey! Huge thanks to the workshop committee and organizers for the recognition. Grateful for our incredible collaborators and advisors

🏆 Excited to share GEM received the Outstanding Paper Award <a href="/SEAWorkshop/">SEA Workshop</a> of #NeurIPS2025 . What a great way to wrap up this amazing neurips journey!

Huge thanks to the workshop committee and organizers for the recognition.

Grateful for our incredible collaborators and advisors

thumb_up_off_alt66

chat_bubble_outline7

repeat11

shareShare

SEA Workshop

@seaworkshop

5 months ago

Congratulations also to Daniel Furelos-Blanco for winning the best tweet award! Thanks for featuring SEA Workshop and for the nice content!

thumb_up_off_alt9

chat_bubble_outline1

repeat2

shareShare

Shuyan Zhou

@shuyanzhxyc

5 months ago

It was really fun to meet new people and discuss agent environments. Thanks to the workshop organizers for putting together such a great event! Here is the slide deck from the talk: shuyanzhou.com/assets/slides/…

thumb_up_off_alt62

chat_bubble_outline3

repeat9

shareShare

CAMEL-AI.org

@camelaiorg

4 months ago

x.com/i/article/2009…

thumb_up_off_alt97

chat_bubble_outline5

repeat13

shareShare

Eigent AI

@eigent_ai

3 months ago

Excited to collaborate with Z.ai using GLM-4.7 on Eigent. We asked Eigent to organize today’s work files and generate a daily report. - The task is split across agents - The developer agent creates a local folder and moves the files - The document agent generates the report

thumb_up_off_alt74

chat_bubble_outline5

repeat8

shareShare

Daniel Furelos-Blanco

@_danielfb

3 months ago

🚀 Last December we presented ATLAS at NeurIPS SEA Workshop. Now thrilled to share at #AAAI2026 AAAI! ⚠️ RL policy generalization across tasks & levels is hard, especially when some tasks aren't realizable. 🗺️ ATLAS tackles this via autocurricula over tasks AND levels 🧵

thumb_up_off_alt17

chat_bubble_outline1

repeat2

shareShare

CAMEL-AI.org

@camelaiorg

3 months ago

🚨 CAMEL-AI Live Talk: We have invited Zixuan Ke (Research Scientist at Salesforce AI Research) to share MAS-ZERO: Designing Multi-Agent Systems with Zero Supervision This paper has been accepted to the SEA Workshop at NeurIPS 2025, proposing a zero-supervision framework for

🚨 CAMEL-AI Live Talk: We have invited <a href="/KeZixuan/">Zixuan Ke</a> (Research Scientist at Salesforce AI Research) to share MAS-ZERO: Designing Multi-Agent Systems with Zero Supervision

This paper has been accepted to the <a href="/SEAWorkshop/">SEA Workshop</a> at NeurIPS 2025, proposing a zero-supervision framework for

thumb_up_off_alt16

chat_bubble_outline3

repeat2

shareShare

CAMEL-AI.org

@camelaiorg

3 months ago

Open AI and Anthropic just release GPT-5.3-Codex and Opus 4.6 model, terminal capability is now on top of their list evaluating modal capability. But terminal training hits a wall fast: there aren’t enough high-quality environments. In SETA, we just shipped 1,376 validated

thumb_up_off_alt36

chat_bubble_outline3

repeat6

shareShare

Prime Intellect

@primeintellect

3 months ago

Introducing Lab: A full-stack platform for training your own agentic models Build, evaluate and train on your own environments at scale without managing the underlying infrastructure. Giving everyone their own frontier AI lab.

thumb_up_off_alt2,2K

chat_bubble_outline111

repeat251

shareShare

Snorkel AI

@snorkelai

3 months ago

We’re incredibly excited to launch Open Benchmarks Grants, a new program committing $3M in grants to fund new open source benchmarks advancing agentic AI. We’re partnering up with @HuggingFace, Together AI, Prime Intellect, Factory HQ, Harbor Framework, and PyTorch to

thumb_up_off_alt69

chat_bubble_outline6

repeat8

shareShare

CAMEL-AI.org

@camelaiorg

2 months ago

🚨 CAMEL AI Live Talk this Friday: Dr. Kernel: Reinforcement Learning Done Right for Triton Kernel Generations with Wei Liu Join the CAMEL-AI 🐫 Community to explore Dr. Kernel, a reinforcement learning framework for generating high-performance Triton kernels, enabling scalable

thumb_up_off_alt12

chat_bubble_outline1

repeat3

shareShare

CAMEL-AI.org

@camelaiorg

2 months ago

MiroFish recently ranked #1 on GitHub Trending. The multi-agent social simulation behind it? Built on OASIS. 🏝️ This Friday, we're bringing OASIS first author Ziyi Yang to CAMEL Live Talk 📢 to break down how you simulate 1 million LLM agents and what it reveals about how

thumb_up_off_alt16

chat_bubble_outline1

repeat4

shareShare

Eigent AI

@eigent_ai

2 months ago

x.com/i/article/2031…

thumb_up_off_alt55

chat_bubble_outline3

repeat8

shareShare

SEA Workshop

Changyu Chen

Rebecca Qian

Ayush Noori

SEA Workshop

SEA Workshop

SEA Workshop

vincent sunn chen

Changyu Chen

SEA Workshop

Shuyan Zhou

CAMEL-AI.org

Eigent AI

Daniel Furelos-Blanco

CAMEL-AI.org

CAMEL-AI.org

Prime Intellect

Snorkel AI

CAMEL-AI.org

CAMEL-AI.org

Eigent AI