
Jonathan Lim
@jonathanlimsc
ML Engineer, MSc @Mila_Quebec. Multimodal foundation models 🏛️ and generalist agents 🤖
ID: 1210338440
http://jonathanlimsc.com 23-02-2013 01:59:46
384 Tweet
387 Takipçi
1,1K Takip Edilen

















The PhD thesis of my 14th PhD student, Khurram Javed (Khurram Javed), is now available. Title: Real-time Reinforcement Learning for Achieving Goals in Big Worlds Url: incompleteideas.net/papers/javed_k… Abstract: In this dissertation, I motivate the need for real-time learning and

cat OpenAI Google Gemini App Paul Jankura David Hershey Okay clear definitions for Agents, when to use Agents vs workflows, tips on Evals! Not much new here but always a good reminder to always build your evals early and llm as a judge goes fairly far.


