
Tahsin Mayeesha
@tahsin_mayeesha
Exploring Ph.D. opportunities for Fall '24 in NLP,HCI, Social Computing, AI Ethics.
ID: 922849429836533760
https://tahsin-mayeesha.github.io/ 24-10-2017 15:37:04
4,4K Tweet
693 Followers
4,4K Following




My Reinforcement Learning (RL) & Agents 3 hour workshop is out! I talk about: 1. RL fundamentals & hacks 2. "Luck is all you need" 3. Building smart agents with RL 4. Closed vs Open-source 5. Dynamic 1bit GGUFs & RL in Unsloth AI 6. The Future of Training youtube.com/watch?v=OkEGJ5…







🧵When training reasoning models, what's the best approach? SFT, Online RL, or perhaps Offline RL? At KRAFTON AI and SK telecom, we've explored this critical question, uncovering interesting insights! Let’s dive deeper, starting with the basics first. 1) SFT SFT (aka hard





I am sharing a podcast describing my ”Convex Optimization Theory” book from 2009: notebooklm.google.com/notebook/d69a4… The .pdf copy of the book can be found at web.mit.edu/dimitrib/www/C… and includes a 180-pages supplement of mathematical optimization exercises with detailed solutions

