George Tsoukalas (@gtsoukal) 's Twitter Profile
George Tsoukalas

@gtsoukal

PhD student at UT Austin interested in automatic theorem proving.

ID: 1569401509879889922

calendar_today12-09-2022 19:04:25

44 Tweet

210 Takipçi

258 Takip Edilen

Amitayush Thakur (@amitayushthakur) 's Twitter Profile Photo

1/🧵Excited to share CLEVER — a new benchmark for end-to-end verified code generation in Lean. Can we go from natural language to a formally verified Lean program? CLEVER puts this to the test. 📄 arxiv.org/abs/2505.13938 💻 github.com/trishullab/cle…

1/🧵Excited to share CLEVER — a new benchmark for end-to-end verified code generation in Lean.
Can we go from natural language to a formally verified Lean program? CLEVER puts this to the test.
📄 arxiv.org/abs/2505.13938
💻 github.com/trishullab/cle…
Renaissance Philanthropy (@renphil21) 's Twitter Profile Photo

Today XTX Markets and Renaissance Philanthropy announce 29 grantees and $18M to accelerate the discovery of AI tools for mathematics — one of the largest philanthropic investments in AI+math to date. Learn more and explore the projects: renaissancephilanthropy.org/news-and-insig… #AIforMath

George Tsoukalas (@gtsoukal) 's Twitter Profile Photo

New PutnamBench leader! Hilbert solves 462 of 660 problems (70%) through a combination of a large reasoner (Gemini 2.5) and specialized formal proving models (DeepSeekProverv2 + Goedel Prover). Read more about their approach here: arxiv.org/pdf/2509.22819