David Liu
@davidwnliu
PhD Machine Learning & Computational Neuroscience @Cambridge_Eng | BA/MSci Computational and Theoretical Physics @DeptofPhysics
dev @thedavindicode
ID: 1446743206624903168
http://davindicode.github.io 09-10-2021 07:44:41
52 Tweets
106 Followers
131 Following
Do current LLMs perform simple tasks (e.g., grade school math) reliably? We know they don't (is 9.9 larger than 9.11?), but why? Turns out one reason is that benchmarks are too noisy to pinpoint such lingering failures. w/ Josh Vendrow, Eddie Vendrow, Sara Beery 1/5
LLMs have complex joint beliefs about all sorts of quantities. And my postdoc James Requeima visualized them! In this thread we show LLM predictive distributions conditioned on data and free-form text. LLMs pick up on all kinds of subtle and unusual structure: 🧵