Moksh Jain (@jainmoksh) Twitter Tweets • TwiCopy

Jakob Foerster

7 years ago

You can now train agents to communicate in your browser using DIAL! Thanks to Moksh Jain for making Minqi Jiang's implementation more accessible and available 🙏

First blog post, where I discuss disentangled representations and basics of VAEs. The post is accompanied by a notebook which helps you get started with disentanglement_lib from Google AI Intelligent Systems for @disen_challenge mj10.github.io/blog/2019/07/1…

Real Robot Challenge

@robo_challenge

6 years ago

For a tutorial and notebooks beyond the starter-kit with hands-on experience from a participant (thanks for the efforts!):

Olivier Bachem

@olivierbachem

6 years ago

Pretty cool to see this blog post + notebook that illustrates how to use disentanglement_lib (github.com/google-researc…) for the NeurIPS 2019 competition on learning disentangled representations (aicrowd.com/challenges/neu…)!

Sham Kakade

@shamkakade6

6 years ago

Also, we recently posted this work arxiv.org/abs/1908.00261 on the theory of policy gradients for reinforcement learning! A long time in the works, this paper finally gets a handle on function approximation with policy gradient methods.

Cohere Labs

@cohere_labs

6 years ago

Happy to share our great collaborative effort “RL: Generic reinforcement learning codebase in TensorFlow” - joss.theoj.org/papers/10.2110…. Bryan M. Li @AidanNGomez Alexander Imani Cowen-Rivers. David Tao Sid Nitarshan Rajkumar Hariharan Sezhiyan Sicong (Sheldon) Huang

Moksh Jain

@jainmoksh

6 years ago

Excited to attend my first NeurIPS Conference this year, thanks to travel grants from NeurIPS Conference and MineRL Project! Looking forward to presenting my work and learning from the vast ocean of amazing work and talented people present there!

Cohere Labs

@cohere_labs

6 years ago

FOR.ai NeurIPS Dinner 2019 ❤️

Moksh Jain

@jainmoksh

5 years ago

Looking forward to discussing our work at the #ICML2020 poster session!

Moksh Jain

@jainmoksh

5 years ago

Amazing blog post surveying adversarial attacks for Deep RL. Great work!

Subbarao Kambhampati (కంభంపాటి సుబ్బారావు)

@rao2z

5 years ago

C.R. Rao, the doyen of Statistics, who populated his field (and thereby #AI/#ML) with such pleasing Rao-isms as Rao-Blackwell theorem, Cramer-Rao lower bound--turns 100 today. 🙏(en.wikipedia.org/wiki/C._R._Rao ) thehindu.com/opinion/open-p…

Prateek Jain

@jainprateek_

5 years ago

Excited to share our method "DROCC: Deep Robust One-Class Classification" for solving anomaly detection problems in any domain (proceedings.icml.cc/book/2020/file…). Work with awesome collaborators: Sachin Goyal, Moksh Jain, Aditi Raghunathan, Harsha Simhadri. 1/n

Microsoft Research

@msftresearch

5 years ago

The Microsoft Turing team & Microsoft Research have collaborated to create the Turing Universal Representation Language Model, which now leads the XTREME public leaderboard for cross-lingual transfer learning. Learn how T-ULRv2 sets new state of the art: aka.ms/AAa0r9z

Moksh Jain

@jainmoksh

3 years ago

Excited to share this post on the M2D2 blog accompanying our recent paper arxiv.org/abs/2302.00615, which outlines the potential of GFlowNets to accelerate scientific discovery.

Moksh Jain

@jainmoksh

9 months ago

New work on scalable off-policy (MaxEnt) RL for LLM fine-tuning, led by Brian Bartoldson. We learned a lot about effective RL training of LLMs, so check out the thread and paper for more details! Excited about more work in this direction!

Moksh Jain

@jainmoksh

8 months ago

New work on learning abstractions for amortized samplers, led by Oussama Boussif and other brilliant collaborators! Even with simple tokenization schemes like BPE we can discover meaningful abstractions! Check out the paper and come chat with us at ICLR 2026

Minsu Kim

@minsuuukim

7 months ago

Bringing hierarchical structure to search-based inference in LLMs. Sharing our recent work "Search-Based Correction of Reasoning Chains for LMs". Work done with Jean-Pierre Falet, Oliver Richard, Xiaoyin Chen, Moksh Jain, Sungjin Ahn, Sungsoo Ahn, Yoshua Bengio
