
Maxim Saplin
@msmxm








When prompted to play chess, LLMs can't score a single win against a random player, Andrej Karpathy: maxim-saplin.github.io/llm_chess/

Do you mind trying o3 in this eval: maxim-saplin.github.io/llm_chess/ - Greg Brockman?



