
Travis Addair
@travisaddair
Co-Founder & CTO @Predibase
OSS: LoRAX (loraexchange.ai) | horovod.ai | @ludwig_ai
ID: 2702302872
https://predibase.com/ 03-08-2014 01:40:09
354 Tweet
582 Takipçi
223 Takip Edilen
















It was an honor getting to work together with the DeepLearning.ai team and my colleague Arnav Garg on this course covering all things Reinforcement Fine-Tuning and GRPO. Similar to our last course on efficient LLM inference, we wanted to really drill into the intuition