Shashank Gupta
@shashank27392
PhD at @irlab_amsterdam | Prev. @AIatMeta (NYC '24, London '23), @Flipkart | Interested in ML & IR.
ID: 3102522680
http://shashank-gupta.com 22-03-2015 04:03:26
4.4K Tweets
1.1K Followers
2.2K Following
PPO vs. DPO? 🤔 Our new paper proves that it depends on whether your models can represent the optimal policy and/or reward. Paper: arxiv.org/abs/2505.19770 Led by Ruizhe Shi and Minhak Song.
Delip Rao e/σ Amey | अमेय I have often failed interviews. I have even failed an interview where I was asked an interview question I used regularly at Google, which I knew inside out. I fail coding interviews not because I can't code, but because the stressful synthetic nature of the situation causes my brain
Shah Rukh Khan wins his first-ever National Award, 33 years after debuting on the big screen, for his performance in the action thriller film Jawan at the 2023 National Awards. The veteran actor shares the accolade with Vikrant Massey, who won the award