
Shishir Patil
@shishirpatil_
CS PhD @ UC Berkeley. Creator of Gorilla, GoEx, RAFT, OpenFunctions and Berkeley Function Calling Leaderboard. Previously researcher @GoogleAI @MSFTResearch
ID: 55854264
https://shishirpatil.github.io/ 11-07-2009 15:31:33
296 Tweet
3,3K Followers
992 Following

Super excited to share š§ MLGym 𦾠ā the first Gym environment for AI Research Agents š¤š¬ We introduce MLGym and MLGym-Bench, a new framework and benchmark for evaluating and developing LLM agents on AI research tasks. The key contributions of our work are: š¹ļø Enables the

we shippād š on-device lms and frontier cloud lms. andā¦they were a matchāŗļø. 98% accuracy, just 17.5% the cloud API costs beyond excited to dropĀ minions: where local lms meet cloud lms š joint work w/Sabri Eyuboglu & Dan Biderman at @hazyresearch. ty Together AI,








Astasia Myers Good observation! I had done some work with Shishir Patil and Raluca Ada Popa on using LLMs to evaluate the credentials (and risks) of potential tool calls.




