Moksh Jain (@jainmoksh) 's Twitter Profile
Moksh Jain

@jainmoksh

PhD Student at Mila and Université de Montréal

ID: 1660514928

linkhttps://mj10.github.io calendar_today10-08-2013 17:01:10

26 Tweet

335 Followers

528 Following

Jakob Foerster (@j_foerst) 's Twitter Profile Photo

You can now train agents to communicate in your browser using DIAL! Thanks to Moksh Jain for making Minqi Jiang's implementation more accessible and available 🙏

Moksh Jain (@jainmoksh) 's Twitter Profile Photo

First blog post, where I discuss disentangled representations and basics of VAEs. The post is accompanied by a notebook which helps you get started with disentanglement_lib from Google AI Intelligent Systems for @disen_challenge mj10.github.io/blog/2019/07/1…

Olivier Bachem (@olivierbachem) 's Twitter Profile Photo

Pretty cool to see this blog post + notebook that illustrates how to use disentanglement_lib (github.com/google-researc…) for the NeurIPS 2019 competition on learning disentangled representations (aicrowd.com/challenges/neu…)!

Sham Kakade (@shamkakade6) 's Twitter Profile Photo

Also, we recently posted this work arxiv.org/abs/1908.00261 on the theory of policy gradients for reinforcement learning! A long time in the works, this paper finally gets a handle on function approximation with policy gradient methods.

Moksh Jain (@jainmoksh) 's Twitter Profile Photo

Excited to attend my first NeurIPS Conference this year, thanks to travel grants from NeurIPS Conference and MineRL Project! Looking forward to presenting my work and learning from the vast ocean of amazing work and talented people present there!

Subbarao Kambhampati (కంభంపాటి సుబ్బారావు) (@rao2z) 's Twitter Profile Photo

C.R. Rao, the doyen of Statistics, who populated his field (and thereby #AI/#ML) with such pleasing Rao-isms as Rao-Blackwell theorem, Cramer-Rao lower bound--turns 100 today. 🙏(en.wikipedia.org/wiki/C._R._Rao ) thehindu.com/opinion/open-p…

Prateek Jain (@jainprateek_) 's Twitter Profile Photo

Excited to share our method "DROCC: Deep Robust One-Class Classification" for solving anomaly detection problem in any domain (proceedings.icml.cc/book/2020/file…) Work with awesome collaborators: Sachin Goyal , Moksh Jain , Aditi Raghunathan, Harsha Simhadri. 1/n

Microsoft Research (@msftresearch) 's Twitter Profile Photo

The Microsoft Turing team & Microsoft Research have collaborated to create the Turing Universal Representation Language Model, which now leads the XTREME public leaderboard for cross-lingual transfer learning. Learn how T-ULRv2 sets new state of the art: aka.ms/AAa0r9z

Moksh Jain (@jainmoksh) 's Twitter Profile Photo

Excited to share this post on the M2D2 blog accompanying our recent paper arxiv.org/abs/2302.00615, which outlines the potential of GFlowNets to accelerate scientific discovery.

Moksh Jain (@jainmoksh) 's Twitter Profile Photo

New work on scalable off-policy (MaxEnt) RL for LLM fine-tuning led by Brian Bartoldson We learned a lot about effective RL training of LLMs, so checkout the thread and paper for more details! Excited about more work in this direction!

Moksh Jain (@jainmoksh) 's Twitter Profile Photo

New work on learning abstractions for amortized samplers led by Oussama Boussif and other brilliant collaborators! Even with simple tokenization schemes like BPE we can discover meaningful abstractions! Check out the paper and come chat with us ICLR 2026

Minsu Kim (@minsuuukim) 's Twitter Profile Photo

Bringing hierarchical structure for search-based inference in LLM. Sharing our recent work "Search-Based Correction of Reasoning Chains for LMs". Work done with Jean-Pierre Falet, Oliver Richard, Xiaoyin Chen , Moksh Jain , Sungjin Ahn , Sungsoo Ahn, Yoshua Bengio