Ben Recht (@beenwrekt)'s Twitter Profile
Ben Recht

@beenwrekt

optimization. machine learning. uc berkeley.

I blog at argmin.substack.com

The world won't end.

ID: 352013269

Link: http://eecs.berkeley.edu/~brecht

Joined: 10-08-2011 01:17:04

8.8K Tweets

30.30K Followers

339 Following

All decisions are made under uncertainty. Almost no decisions can be reduced to gambling. argmin.net/p/all-bets-are…

Demos play an outsized role in the complex world of artificial intelligence evaluation. Maybe that's not a bad thing. argmin.net/p/demo-or-die

It turns out to be hard to evaluate natural language with natural language. What should we take away from the conundrum of LLM evaluation? argmin.net/p/evaluation-o…

What if instead of The Terminator and The Matrix, we used Severance and South Park to think about AI? argmin.net/p/maybe-just-b…

Reading Marion Fourcade and Kieran Healy's *The Ordinal Society* as a broader critique of quantitative social science. argmin.net/p/pretending-n…

Rossi’s Metallic Rules and why it’s hard to measure benefit when evaluating social programs. argmin.net/p/rossis-metal…

Guest post! Deb Raji reflects on our machine learning evaluation class (and the chaos of co-teaching with me). argmin.net/p/to-measure-i…

On the cultivation of a "pure" computer science and why we should reject this concept. argmin.net/p/computationa…

Louis Fein’s utopian vision of academic computer science was delightfully boring. argmin.net/p/may-you-live…

I weigh in on the Trump administration’s newfound obsession with Gold Standard Science and reproducibility. Though it’s not all in bad faith, it’s likely to backfire. argmin.net/p/this-is-fine

I wrote a defense of peer review, as it will be the system academia uses to reinvent itself. argmin.net/p/a-defense-of…

p-values supposedly measure the correlation between interventions and outcomes. But what happens when a measure becomes a target? argmin.net/p/milton-fried…

In a new paper, Juanky Perdomo and I ask when forecasting and prediction can be solved by sneaky accounting. The answer turns out to be “more often than you’d expect.” argmin.net/p/in-defense-o…