
Kevin Wang
@kevinwang_111
PhD student @UTAustin | 3D Foundation model, VLM, LLM Planning
ID: 1730054209867796480
https://www.kevin-ai.com/ 30-11-2023 02:41:02
33 Tweet
155 Takipรงi
84 Takip Edilen

๐ถ Excited to introduce SPIN-Bench! ๐ TL;DR: Benchmarking LLM capabilities across various strategic planning game environments. ๐ ๐ Project Page: spinbench.github.io ๐ arXiv: arxiv.org/abs/2503.12349 ๐ฎ Interact with PDDL domains: spinbench.github.io/tools/pddl/traโฆ ๐ LLM