Theophile Sautory
@tsautory
How to combine diffusion models and physics?
ID: 1227889234805170176
https://theosau.github.io/ 13-02-2020 09:36:34
27 Tweet
25 Followers
160 Following
Ambience's RFT-tuned model boosted ICD-10 coding accuracy 27% over expert clinicians—and it's now spotlighted in OpenAI Developers' Reinforcement Fine-Tuning use case guide. Excited to see what the community builds with this powerful new customization technique: platform.openai.com/docs/guides/rf…
new guide on “exploring model graders for reinforcement fine-tuning” on openai’s platform. h/t Theophile Sautory final result was a high-signal, domain-sensitive grader that guided the model toward better explanations. cookbook.openai.com/examples/reinf…