Chelsea Zou (@boson2photon)'s Twitter Profile
Chelsea Zou

@boson2photon

@StanfordAILab @StanfordNLP

ID: 1251170582806306816

Link: https://bosonphoton.github.io
Joined: 17-04-2020 15:28:16

21 Tweets

59 Followers

185 Following

Chelsea Zou (@boson2photon)

How I’ve been feeling about the current state of ML research: bosonphoton.github.io/thoughts/itsdo… TLDR; It has converged to the same few topics within NLP. And it’s boring.

Steven Feng (@stevenyfeng)

We are bringing back Stanford’s CS 25 Transformers Course (cs25.stanford.edu) today! It’s open to everybody! This is one of Stanford University's hottest seminar courses. We open the course through Zoom to the public. Lectures start today (Tuesdays), 3-4:20pm PDT, at

Steven Feng (@stevenyfeng)

Stanford CS25 Talk Today: Eshed Margalit, Noetik.AI Today (Tues, 5/20) at 3pm PDT, Eshed Margalit from Noetik.AI will be giving a talk for CS25 (cs25.stanford.edu). The talk will be livestreamed on Zoom at stanford.zoom.us/j/91661468474?…. Presentation

Chelsea Zou (@boson2photon)

Vibe coded an AI poker coach. Track the hands you play during live sessions and get feedback from AI. gbowizard.vercel.app I am not liable for any losses, questionable poker advice, or sudden gambling addictions that arise from the use of this app.

Andy (@4ndyxu)

Everyone loves Claude Opus. Poker did not. Introducing Poker Agent ❤️♠️, a new LLM poker benchmark built with Vals AI. We pit 17 frontier LLM agents against each other across over 20,000+ hands of poker. Claude 4.5 Opus placed in the bottom half while GPT 5.2 and Gemini 3

fly51fly (@fly51fly)

[CL] A Unified Definition of Hallucination, Or: It's the World Model, Stupid E Liu, V Gangal, C Zou, X Huang... [CMU & Patronus AI & Stanford University] (2025) arxiv.org/abs/2512.21577

Steven Feng (@stevenyfeng)

Dan Roy We have the v1 of a (hopefully) grand unified defn. by requiring added specification of a reference world model here: arxiv.org/abs/2512.21577 Glad to hear thoughts or feedback!