Hoang M. Le (@hoangminhle) Twitter Tweets • TwiCopy

Michael Littman

7 years ago

Mindblowing. Apple is now valued at a trillion dollars. CNN thinks other tech companies might not be far behind. I guess we are truly in the era of MAGA---Microsoft, Amazon, Google, Apple. money.cnn.com/2018/08/02/inv… Brown HCRI

thumb_up_off_alt7

chat_bubble_outline0

repeat2

shareShare

Thang Luong

@lmthang

7 years ago

This article speaks many, I believe, hidden truths about Quoc Le on GoogleBrain & seq2seq. Personally, I have enjoyed working with Quoc who cares less about credit assignment but rather teamwork and long-term vision :) medium.com/@aifrontiers/a…

thumb_up_off_alt429

chat_bubble_outline1

repeat90

shareShare

Hoang M. Le

@hoangminhle

7 years ago

Came across an old thread that seems to show folks struggle to pinpoint real-world success of contemporary RL (beyond game playing)

thumb_up_off_alt1

chat_bubble_outline1

repeat0

shareShare

Yisong Yue

@yisongyue

7 years ago

Accepted to #ICRA2019! One of the first instances of provably robust deep learning based controllers! We show Lyapunov stability while using a 4-layer deep neural net as part of the controller design. Experiments on a real robotic platform too.

thumb_up_off_alt110

chat_bubble_outline0

repeat18

shareShare

Sebastian Ruder

@seb_ruder

7 years ago

This is a super cool resource: Papers With Code now includes 950+ ML tasks, 500+ evaluation tables (including SOTA results) and 8500+ papers with code. Probably the largest collection of NLP tasks I've seen including 140+ tasks and 100 datasets. paperswithcode.com/sota

thumb_up_off_alt2,2K

chat_bubble_outline38

repeat1,1K

shareShare

Blake Richards

@tyrell_turing

7 years ago

I'm excited to announce that the 4th multi-disciplinary Reinforcement Learning and Decision Making conference (#RLDM2019) will be in Montreal this July 7-10! Check out the speaker list, and consider submitting an abstract (deadline March 1st)! Visit: rldm.org 🧠💻

thumb_up_off_alt103

chat_bubble_outline2

repeat49

shareShare

David Abel

@dabelcs

7 years ago

Had a wonderful time at AAAI! Some notes: david-abel.github.io/notes/aaai_201… #AAAI19

thumb_up_off_alt49

chat_bubble_outline4

repeat13

shareShare

Kevin K. Yang 楊凱筌

@kevinkaichuang

7 years ago

“It is not that machines are going to replace chemists...It’s that the chemists who use machines will replace those that don’t.” Happy to see Mohammed AlQuraishi featured as well! nytimes.com/2019/02/05/tec…

thumb_up_off_alt8

chat_bubble_outline0

repeat5

shareShare

Michael Littman

@mlittmancs

7 years ago

Compelling essay: Roughly, (1) RL (like contextual bandits) is having a big impact on the real world, (2) That impact is not positive for humanity, (3) We as the engineers of these systems can do more to push for making future impacts positive. medium.com/@francois.chol… Brown HCRI

thumb_up_off_alt26

chat_bubble_outline0

repeat2

shareShare

Hoang M. Le

@hoangminhle

7 years ago

BBC News - Robot teaches itself to ice-skate bbc.com/news/av/techno…

thumb_up_off_alt2

chat_bubble_outline1

repeat0

shareShare

Miles Brundage

@miles_brundage

7 years ago

Looks like a great overview of the area: "Algorithms for Verifying Deep Neural Networks," Liu et al.: arxiv.org/abs/1903.06758

thumb_up_off_alt127

chat_bubble_outline1

repeat36

shareShare

Rodney Brooks

@rodneyabrooks

7 years ago

Just posted "A Better Lesson", a short review of Ruch Sutton's recent post "The Bitter Lesson". rodneybrooks.com/a-better-lesso…

thumb_up_off_alt279

chat_bubble_outline12

repeat105

shareShare

Marc G. Bellemare

@marcgbellemare

7 years ago

Impressive application of deep reinforcement learning to turbine optimization! Talk by Michael May from Siemens at #RVIAQC2019

thumb_up_off_alt149

chat_bubble_outline3

repeat31

shareShare

Yisong Yue

@yisongyue

7 years ago

Real-World Decision Making Workshop at #icml2019! realworld-sdm.github.io

thumb_up_off_alt28

chat_bubble_outline1

repeat5

shareShare

Hal Daumé III

@haldaume3

6 years ago

thanks! here's the talk: slideslive.com/38916832/beyon… first paper mentioned is arxiv.org/abs/1803.00590 second paper will be on arxiv soon (will reply here when it's posted)

thumb_up_off_alt23

chat_bubble_outline0

repeat4

shareShare

David Abel

@dabelcs

6 years ago

Notes from ICML Conference here: david-abel.github.io/notes/icml_201… #ICML2019

Notes from <a href="/icmlconf/">ICML Conference</a> here: david-abel.github.io/notes/icml_201… #ICML2019

thumb_up_off_alt798

chat_bubble_outline16

repeat200

shareShare

Been Kim

@_beenkim

6 years ago

An amazing source of learning by Roger Grosse #metacademy metacademy.org/browse to learn new concepts in ML, in the depth you like. Love this effort!

thumb_up_off_alt89

chat_bubble_outline0

repeat23

shareShare

Hoang M. Le

@hoangminhle

5 years ago

Today I officially turn into a doctor (of philosophy). Special thanks to @YisongYue for being an amazing advisor. I have been fortunate to work with many awesome collaborators. And of course I would be no where without the support of my family and Zeynep <3

thumb_up_off_alt124

chat_bubble_outline10

repeat9

shareShare

Yisong Yue

@yisongyue

3 years ago

Policy Optimization with Linear Temporal Logic Constraints: arxiv.org/abs/2206.09546 1st author: Cameron Voloshin Co-authors: Swarat Chaudhuri & Hoang M. Le LTL can capture expressive constraints that are hard to do with reward engineering, such as an infinite loop (e.g. patrolling).

Policy Optimization with Linear Temporal Logic Constraints:
arxiv.org/abs/2206.09546
1st author: Cameron Voloshin
Co-authors: <a href="/swarat/">Swarat Chaudhuri</a> & <a href="/HoangMinhLe/">Hoang M. Le</a>

LTL can capture expressive constraints that are hard to do with reward engineering, such as an infinite loop (e.g. patrolling).

thumb_up_off_alt13

chat_bubble_outline1

repeat4

shareShare

Yue Lab Caltech

@yuelabcaltech

3 years ago

Go see our first 🤩 poster NeurIPS Conference tomorrow from Cameron Voloshin Hoang M. Le Swarat Chaudhuri Yisong Yue!! 👇👇👇👇👇👇

thumb_up_off_alt3

chat_bubble_outline0

repeat2

shareShare