
Moksh Jain
@jainmoksh
PhD Student at Mila and Université de Montréal
ID: 1660514928
https://mj10.github.io
10-08-2013 17:01:10
26 Tweets
335 Followers
528 Following

You can now train agents to communicate in your browser using DIAL! Thanks to Moksh Jain for making Minqi Jiang's implementation more accessible and available 🙏




Happy to share our great collaborative effort “RL: Generic reinforcement learning codebase in TensorFlow” - joss.theoj.org/papers/10.2110…. Bryan M. Li, @AidanNGomez, Alexander Imani Cowen-Rivers, David Tao, Sid, Nitarshan Rajkumar, Hariharan Sezhiyan, Sicong (Sheldon) Huang

Excited to attend my first NeurIPS Conference this year, thanks to travel grants from NeurIPS Conference and MineRL Project! Looking forward to presenting my work and learning from the vast ocean of amazing work and talented people present there!





Excited to share our method "DROCC: Deep Robust One-Class Classification" for solving anomaly detection problems in any domain (proceedings.icml.cc/book/2020/file…). Work with awesome collaborators: Sachin Goyal, Moksh Jain, Aditi Raghunathan, Harsha Simhadri. 1/n



New work on scalable off-policy (MaxEnt) RL for LLM fine-tuning, led by Brian Bartoldson. We learned a lot about effective RL training of LLMs, so check out the thread and paper for more details! Excited about more work in this direction!


Bringing hierarchical structure to search-based inference in LLMs. Sharing our recent work "Search-Based Correction of Reasoning Chains for LMs". Work done with Jean-Pierre Falet, Oliver Richard, Xiaoyin Chen, Moksh Jain, Sungjin Ahn, Sungsoo Ahn, Yoshua Bengio.