Josh Greaves (@joshua_gre63805) 's Twitter Profile
Josh Greaves

@joshua_gre63805

Tech Lead @withmartian | RL, LLMs, Routing, ML for Science | Ex-@google Brain @googledeepmind

ID: 1915299696006094851

calendar_today24-04-2025 07:00:27

1 Tweet

7 Followers

56 Following

Josh Greaves (@joshua_gre63805) 's Twitter Profile Photo

The more time I spend on production AI, the more I think eval is harder than training. You can always throw more RL at a problem. The hard part is knowing what to optimize for—and trusting that your eval actually captures it.

Josh Greaves (@joshua_gre63805) 's Twitter Profile Photo

CRB now tracks deep review agents - agents that take the time they need to perform a thorough review. Does more compute actually mean better reviews? Now we can measure it.