Aditya Modi
@adityamodi94
A theoretician hoping to apply RL in the wild world!
ID: 946901247407169536
http://adityamodi.github.io 30-12-2017 00:30:24
32 Tweets
239 Followers
318 Following
RL in the real world: How to optimize computational pipelines on-the-fly! Eric Horvitz, Besmira Nushi 💙💛, Adith Swaminathan, @seanandrist, Alekh Agarwal, Aditya Modi, Microsoft Research
We just scheduled Thomas Steinke to talk about his recent paper "Reasoning About Generalization via Conditional Mutual Information" (with Lydia Zakynthinou) on March 11! Mark your calendars, and stay tuned for further details! arxiv.org/abs/2001.09122
Come to Hall J #315 at 11a and Jinglin & Aditya Modi will tell you abt general learnability of Reward-free RL! R-f RL exhaustively explores the env & thus has heavily relied on linear structures. We now can handle non-linear FA w/ Bellman-eluder dim. More findings👇(1/2)
The 2nd Workshop on Decision Making for Modern IR and Recsys at The Web Conference is calling for papers (decisionmaking4ir.github.io/WWW-2023/)! The paper submission deadline is Feb 6. #AI #ML #recsys #decisionmaking #bandits #reinforcementlearning #informationretrieval
Provably Learning from Language Feedback TLDR: RL theory can help us do better inference-time exploration with feedback. Work done with Wanqiao Xu, Ruijie Zheng, Ching-An Cheng @ICML2025, Aditya Modi, Adith Swaminathan 📰 arxiv.org/pdf/2506.10341 📍EXAIT Best Paper/Oral Sat 8:45-9:30 am
If you missed Wanqiao Xu’s presentation, here are some of our slides! (The workshop will post full slides later on their website) Paper: arxiv.org/abs/2506.10341