Gagan Madan
@_gaganm
Research Eng @GoogleDeepMind. Probably approximately incorrect
ID: 2411332483
https://gaganm.github.io/ 25-03-2014 17:25:25
336 Tweets
356 Followers
629 Following
rohan anil I think a lot of good agentic harnesses (e.g. coding) are highly tuned for specialized use cases, which makes them significantly better than the sum of their parts. Viewing them purely through the lens of model building misses the complexities in the harness that actually make it awesome.
rohan anil Building useful agentic harnesses is hard and severely underrated IMO. It requires a deep understanding of model behavior, often not captured in evals. A lot of people building models are far too comfortable just looking at numbers instead of deeper analysis.