
Gabriel Synnaeve
@syhw
Nerd & Dad
syhw.bsky.social
ID: 79440047
https://syhw.github.io/ 03-10-2009 11:33:30
8,8K Tweet
15,15K Followers
1,1K Following












🚨 Your RL only improves pass@1, not pass@k? 🚨 That's not a bug, it's a feature of the objective you're optimizing. You get what you optimize for. If you want better pass@k, you need to optimize for pass@k at training time. 🧵 How?



Hello World: My team at FAIR / AI at Meta (AI Research Agent) is looking to hire contractors across software engineering and ML. If you are interested and based in the UK, please fill in the following short EoI form: docs.google.com/forms/d/e/1FAI…



