Shashank Gupta (@shashank27392) 's Twitter Profile
Shashank Gupta

@shashank27392

PhD at @irlab_amsterdam | Prev. @AIatMeta (NYC '24, London '23), @Flipkart | Interested in ML & IR.

ID: 3102522680

linkhttp://shashank-gupta.com calendar_today22-03-2015 04:03:26

4,4K Tweet

1,1K Followers

2,2K Following

Ji-Ha (@ji_ha_kim) 's Twitter Profile Photo

I got recommended Terence Tao's YouTube channel created in 2010, where he uploaded his first video just yesterday! He showcases his process of formalizing a proof in Lean 4 with the help of GitHub Copilot and the "canonical" tactic in Lean.

I got recommended Terence Tao's YouTube channel created in 2010, where he uploaded his first video just yesterday!
He showcases his process of formalizing a proof in Lean 4 with the help of GitHub Copilot and the "canonical" tactic in Lean.
Kevin Patrick Murphy (@sirbayes) 's Twitter Profile Photo

I am pleased to announce a new version of my RL tutorial. Major update to the LLM chapter (eg DPO, GRPO, thinking), minor updates to the MARL and MBRL chapters and various sections (eg offline RL, DPG, etc). Enjoy! arxiv.org/abs/2412.05265

I am pleased to announce a new version of my RL tutorial. Major update to the LLM chapter (eg DPO, GRPO, thinking), minor updates to the MARL and MBRL chapters and various sections (eg offline RL, DPG, etc). Enjoy!
arxiv.org/abs/2412.05265
Ahmad Beirami @ ICLR 2025 (@abeirami) 's Twitter Profile Photo

As we go through a lot of excitement about RL recently with lots of cool work/results, here is a reminder that RL with a reverse KL-regularizer to the base model cannot learn new skills that were not already present in the base model. It can only amplify the existing weak skills.

As we go through a lot of excitement about RL recently with lots of cool work/results, here is a reminder that RL with a reverse KL-regularizer to the base model cannot learn new skills that were not already present in the base model. It can only amplify the existing weak skills.
Simon Shaolei Du (@simonshaoleidu) 's Twitter Profile Photo

PPO vs. DPO? 🤔 Our new paper proves that it depends on whether your models can represent the optimal policy and/or reward. Paper: arxiv.org/abs/2505.19770 Led by Ruizhe Shi Minhak Song

Tianyuan Zhang (@tianyuanzhang99) 's Twitter Profile Photo

Bored of linear recurrent memories (e.g., linear attention) and want a scalable, nonlinear alternative? Our new paper “Test-Time Training Done Right” propose LaCT (Large Chunk Test-Time Training) — a highly efficient, massively scalable nonlinear memory with: 💡 Pure PyTorch

Charlie London (@charlielondon02) 's Twitter Profile Photo

I believe that policy gradient methods with only terminal rewards will have to break down at some level of task ood-ness/complexity, and PRMs will be necessary. This looks like a really cool addition to the theory of PRMs and CoT

Niloofar (on faculty job market!) (@niloofar_mire) 's Twitter Profile Photo

🧵 Academic job market season is almost here! There's so much rarely discussed—nutrition, mental and physical health, uncertainty, and more. I'm sharing my statements, essential blogs, and personal lessons here, with more to come in the upcoming weeks! ⬇️ (1/N)

Satnam Singh (@satnam6502) 's Twitter Profile Photo

Delip Rao e/σ Amey | अमेय I have often failed interviews. I have even failed interview where I was asked an interview question I used regularly at Google which I knew inside out. I fail coding interviews not because I can't code, but because the stressful synthetic nature of the situation causes my brain

Fortune India (@fortuneindia) 's Twitter Profile Photo

Shah Rukh Khan (Shah Rukh Khan) wins his first-ever National Award—33 years after debuting on the big screen—for his performance in the action thriller film Jawan at the 2023 National Awards. The veteran actor shares the accolade with Vikrant Massey (Vikrant Massey), who won the award

Shah Rukh Khan (<a href="/iamsrk/">Shah Rukh Khan</a>) wins his first-ever National Award—33 years after debuting on the big screen—for his performance in the action thriller film Jawan at the 2023 National Awards. The veteran actor shares the accolade with Vikrant Massey (<a href="/VikrantMassey/">Vikrant Massey</a>), who won the award
Tanishq Mathew Abraham, Ph.D. (@iscienceluvr) 's Twitter Profile Photo

The awesome Welch Labs released an incredible YouTube video about how AI image/video generation works! This great intro video discusses CLIP, diffusion models, and classifier-free guidance in a visually easy-to-understand, approachable, concise way Definitely check it out!

The awesome Welch Labs released an incredible YouTube video about how AI image/video generation works!

This great intro video discusses CLIP, diffusion models, and classifier-free guidance in a visually easy-to-understand, approachable, concise way

Definitely check it out!
Shashank Gupta (@shashank27392) 's Twitter Profile Photo

🚀 Excited to give a talk at Lossfunk, this Friday evening on Reinforcement Learning for Recommender Systems and Foundational Models! If you’re in Bangalore, I’d love to see you there! Please feel free to join in person or online. Registration link: lnkd.in/eN2RSxmG.

𝑺𝒉𝒆𝒃𝒂𝒔 (@shebas_10dulkar) 's Twitter Profile Photo

A few more Answers from 𝗦𝗮𝗰𝗵𝗶𝗻 𝗧𝗲𝗻𝗱𝘂𝗹𝗸𝗮𝗿 during today's ‘Ask Me Anything’ session on Reddit 💙 (1/7)

A few more Answers from 𝗦𝗮𝗰𝗵𝗶𝗻 𝗧𝗲𝗻𝗱𝘂𝗹𝗸𝗮𝗿 during today's  ‘Ask Me Anything’ session on Reddit 💙

(1/7)
Maria Heuß (@maria_heuss) 's Twitter Profile Photo

The call for papers for the #ECIR2026 IR-for-Good Track is now online here: ecir2026.eu/calls/call-for… Abstracts due: October 21 Papers due: October 28 We are revamping this track. For a summary of the changes that we are introducing this year, follow Bhaskar's thread.

Maria Heuß (@maria_heuss) 's Twitter Profile Photo

Submissions are now OPEN for the #IR4Good track at #ECIR2026! Submit your societally-motivated papers to this special track: easychair.org/conferences/?c…. Abstracts due October 21, Papers due October 28 Call: ecir2026.eu/calls/call-for…

Nan Jiang (@nanjiang_cs) 's Twitter Profile Photo

My 3rd blogpost on PG, the topic I am least familiar with but get asked a lot, so I thought I'd just put together the very limited stuff I know on this topic. Somehow the post gets cynical from time to time🙃 nanjiang.cs.illinois.edu/2025/09/29/pg.…

My 3rd blogpost on PG, the topic I am least familiar with but get asked a lot, so I thought I'd just put together the very limited stuff I know on this topic. Somehow the post gets cynical from time to time🙃

nanjiang.cs.illinois.edu/2025/09/29/pg.…
Caglar (@caglar_ee) 's Twitter Profile Photo

Video lectures, UCLA Reinforcement Learning of Large Language Models spring 2025, by Ernest Ryu ernestryu.com/courses/RL-LLM… youtube.com/playlist?list=…