Gabriel Goh (@gabeeegoooh) 's Twitter Profile
Gabriel Goh

@gabeeegoooh

4o Image Gen Lead and other things.

ID: 702344724519096320

linkhttp://gabgoh.github.io calendar_today24-02-2016 04:10:06

158 Tweet

15,15K Followers

348 Following

Aran Komatsuzaki (@arankomatsuzaki) 's Twitter Profile Photo

RL’s Razor: On-policy RL forgets less than SFT. Even at matched accuracy, RL shows less catastrophic forgetting Key factor: RL’s on-policy updates bias toward KL-minimal solutions Theory + LLM & toy experiments confirm RL stays closer to base model

RL’s Razor: On-policy RL forgets less than SFT.

Even at matched accuracy, RL shows less catastrophic forgetting

Key factor: RL’s on-policy updates bias toward KL-minimal solutions

Theory + LLM & toy experiments confirm RL stays closer to base model
gabriel (@gabrielpeterss4) 's Twitter Profile Photo

we just reached number 3 on app store! i have had so many friends tell me sora 2 is the first time ai made them laugh, this is truly a new experience!

we just reached number 3 on app store!

i have had so many friends tell me sora 2 is the first time ai made them laugh, this is truly a new experience!
Gabriel Goh (@gabeeegoooh) 's Twitter Profile Photo

i did not contribute much to sora 2 - but I had the pleasure of witnessing it's development. the level of care, attention and love put into a single model and a single product was amazing to witness. i've always believed deep learning rewards care, but I am now convinced the

OpenAI (@openai) 's Twitter Profile Photo

We’ve developed a new way to train small AI models with internal mechanisms that are easier for humans to understand. Language models like the ones behind ChatGPT have complex, sometimes surprising structures, and we don’t yet fully understand how they work. This approach