Hoang M. Le (@hoangminhle) 's Twitter Profile
Hoang M. Le

@hoangminhle

Staff Scientist @ Argo AI. Previous research life @ MSR and Caltech. Here for the eavesdropping. Usual caveats

ID: 75905735

Link: http://hoangle.info

Joined: 20-09-2009 23:45:01

63 Tweets

427 Followers

177 Following

Michael Littman (@mlittmancs) 's Twitter Profile Photo

Mindblowing. Apple is now valued at a trillion dollars. CNN thinks other tech companies might not be far behind. I guess we are truly in the era of MAGA---Microsoft, Amazon, Google, Apple. money.cnn.com/2018/08/02/inv… Brown HCRI

Thang Luong (@lmthang) 's Twitter Profile Photo

This article speaks many, I believe, hidden truths about Quoc Le on GoogleBrain & seq2seq. Personally, I have enjoyed working with Quoc who cares less about credit assignment but rather teamwork and long-term vision :) medium.com/@aifrontiers/a…

Hoang M. Le (@hoangminhle) 's Twitter Profile Photo

Came across an old thread that seems to show folks struggle to pinpoint real-world success of contemporary RL (beyond game playing)

Yisong Yue (@yisongyue) 's Twitter Profile Photo

Accepted to #ICRA2019! One of the first instances of provably robust deep learning based controllers! We show Lyapunov stability while using a 4-layer deep neural net as part of the controller design. Experiments on a real robotic platform too.

Sebastian Ruder (@seb_ruder) 's Twitter Profile Photo

This is a super cool resource: Papers With Code now includes 950+ ML tasks, 500+ evaluation tables (including SOTA results) and 8500+ papers with code. Probably the largest collection of NLP tasks I've seen including 140+ tasks and 100 datasets. paperswithcode.com/sota

Blake Richards (@tyrell_turing) 's Twitter Profile Photo

I'm excited to announce that the 4th multi-disciplinary Reinforcement Learning and Decision Making conference (#RLDM2019) will be in Montreal this July 7-10! Check out the speaker list, and consider submitting an abstract (deadline March 1st)! Visit: rldm.org 🧠💻

Kevin K. Yang 楊凱筌 (@kevinkaichuang) 's Twitter Profile Photo

“It is not that machines are going to replace chemists...It’s that the chemists who use machines will replace those that don’t.” Happy to see Mohammed AlQuraishi featured as well! nytimes.com/2019/02/05/tec…

Michael Littman (@mlittmancs) 's Twitter Profile Photo

Compelling essay: Roughly, (1) RL (like contextual bandits) is having a big impact on the real world, (2) That impact is not positive for humanity, (3) We as the engineers of these systems can do more to push for making future impacts positive. medium.com/@francois.chol… Brown HCRI

Rodney Brooks (@rodneyabrooks) 's Twitter Profile Photo

Just posted "A Better Lesson", a short review of Rich Sutton's recent post "The Bitter Lesson". rodneybrooks.com/a-better-lesso…

Hal Daumé III (@haldaume3) 's Twitter Profile Photo

thanks! here's the talk: slideslive.com/38916832/beyon… first paper mentioned is arxiv.org/abs/1803.00590 second paper will be on arxiv soon (will reply here when it's posted)

Been Kim (@_beenkim) 's Twitter Profile Photo

An amazing source of learning by Roger Grosse #metacademy metacademy.org/browse to learn new concepts in ML, in the depth you like. Love this effort!

Hoang M. Le (@hoangminhle) 's Twitter Profile Photo

Today I officially turn into a doctor (of philosophy). Special thanks to @YisongYue for being an amazing advisor. I have been fortunate to work with many awesome collaborators. And of course I would be nowhere without the support of my family and Zeynep <3

Yisong Yue (@yisongyue) 's Twitter Profile Photo

Policy Optimization with Linear Temporal Logic Constraints: arxiv.org/abs/2206.09546 1st author: Cameron Voloshin Co-authors: Swarat Chaudhuri & Hoang M. Le LTL can capture expressive constraints that are hard to do with reward engineering, such as an infinite loop (e.g. patrolling).
