Arnav Garg
@grg_arnav
Leading ML @Predibase | Previously @Atlassian @Tesla @UCLA | Co-Founder of @DataresUcla
ID: 819976407941926912
13-01-2017 18:36:24
162 Tweet
128 Takipçi
262 Takip Edilen
New Course: Reinforcement Fine-Tuning LLMs with GRPO! Learn to use reinforcement learning to improve your LLM performance in this short course, built in collaboration with @Predibase, and taught by Travis Addair, its Co-Founder and CTO, and Arnav Garg, its Senior Engineer and
It was an honor getting to work together with the DeepLearning.ai team and my colleague Arnav Garg on this course covering all things Reinforcement Fine-Tuning and GRPO. Similar to our last course on efficient LLM inference, we wanted to really drill into the intuition
I had a blast working with the DeepLearning.AI team and my colleague Travis Addair over the last few months to put this course together on Reinforcement Fine-Tuning with GRPO! We’ve tried to make this course as practical as possible and help you build intuition. Hope you enjoy!
🚀 Fresh off our hit DeepLearning.AI course on RFT + #GRPO, we’re going live! 🎙️ Let’s Talk Tokens: Live #AMA on Reinforcement Fine-Tuning with the Experts Who Built the Definitive Course! #RFT isn’t just research any more—it’s driving real-world GenAI with tighter feedback
🧠 Join the 10k developers supercharging their #LLM skills with Reinforcement Fine-tuning—and it's free! 🧠 Reinforcement Fine-Tuning (#RFT) and #GRPO are fast becoming popular techniques to teach LLMs how to reason. We teamed up with DeepLearning.AI to build the definitive