Dimitri Bertsekas (@dbertsekas) Twitter Tweets • TwiCopy

Extended Brain

a year ago

Thanks. Also available from the same author: "Lessons from AlphaZero, for Optimal, Model Predictive, and Adaptive Control" web.mit.edu/dimitrib/www/R…

thumb_up_off_alt44

chat_bubble_outline0

repeat7

shareShare

Our new paper “Semilinear Dynamic Programming" arxiv.org/abs/2501.04668 deals with a new class of linearly structured problems with nice analytical and computational properties, such as certainty equivalence, and potential applications in #reinforcementlearning

thumb_up_off_alt148

chat_bubble_outline4

repeat30

shareShare

Dimitri Bertsekas

@dbertsekas

a year ago

My #reinforcementlearning Spring 2025 course at ASU starts tomorrow. Slides and videolectures to be posted on Fridays at the course website web.mit.edu/dimitrib/www/R… which also contains the 475 pp textbook and course material from past years' offerings

thumb_up_off_alt459

chat_bubble_outline2

repeat93

shareShare

The Postdoctoral

@thepostdoctoral

a year ago

Brilliant move: Mathematician's latest gambit is new chess AI | ASU News - Long before Bertsekas became a luminary in mathematics and computer science, authoring foundational textbooks on reinforcement learning, a type of ... - ift.tt/8ifyqQ2

thumb_up_off_alt4

chat_bubble_outline0

repeat1

shareShare

Dimitri Bertsekas

@dbertsekas

10 months ago

My videolecture on computer chess describes a meta-chess architecture MPC-MC that improves the play of any chess engine (including Stockfish Chess and one from Google DeepMind) using #reinforcementlearning See youtube.com/watch?v=88LDkH…

thumb_up_off_alt41

chat_bubble_outline0

repeat5

shareShare

Victor

@victor_explore

9 months ago

This is an amazing playlist to learn Reinforcement Learning by Dimitri Bertsekas

thumb_up_off_alt131

chat_bubble_outline2

repeat25

shareShare

Dimitri Bertsekas

@dbertsekas

9 months ago

A video lecture on #reinforcementlearning was posted at youtube.com/watch?v=mEjlnY… Originally delivered at an IEEE Symposium on ADPRL, Orlando, 2014. Several of the ideas await further exploration. Slides at mit.edu/~dimitrib/Gen_…

thumb_up_off_alt167

chat_bubble_outline3

repeat33

shareShare

Tom Silver

@tomssilver

9 months ago

This week's #PaperILike is "Model Predictive Control and Reinforcement Learning: A Unified Framework Based on Dynamic Programming" (Bertsekas 2024). If you know 1 of {RL, controls} and want to understand the other, this is a good starting point. PDF: arxiv.org/abs/2406.00592

thumb_up_off_alt302

chat_bubble_outline4

repeat56

shareShare

Dimitri Bertsekas

@dbertsekas

9 months ago

Just posted a videolecture on a Viterbi-like rollout/#reinforcementlearning algorithm for most likely sequence generation in Markov chains, and HMM inference, at youtu.be/KdX6o9Qi1vc Applies to large state spaces where the Viterbi algorithm is intractable

thumb_up_off_alt138

chat_bubble_outline1

repeat31

shareShare

Kim Hammar

@kimhammar1

8 months ago

A recording of my guest lecture at ASU on aggregation for approximating POMDPs is available here: youtube.com/watch?v=gsD2Jg… Thanks to Dimitri Bertsekas and Yuchao Li for organizing this!

thumb_up_off_alt10

chat_bubble_outline0

repeat2

shareShare

Dimitri Bertsekas

@dbertsekas

8 months ago

A free PDF of my book "Rollout, Policy Iteration, and #ReinforcementLearning " has been posted at my web site web.mit.edu/dimitrib/www/d… An extensive research account on rollout algorithms, including multiagent rollout, and the connection with Newton's method

thumb_up_off_alt282

chat_bubble_outline4

repeat52

shareShare

Dimitri Bertsekas

@dbertsekas

8 months ago

I am pleased to share the video from my yesterday's lecture "Abstract Dynamic Programming, Reinforcement Learning, Newton's Method, and Gradient Optimization" at the ASU Math Dept youtube.com/watch?v=JmQzj0… This is an overview lecture on the relations between DP and RL

thumb_up_off_alt437

chat_bubble_outline3

repeat90

shareShare

Dimitri Bertsekas

@dbertsekas

8 months ago

I am pleased to share the full set of videolectures, slides, textbook, and other supporting material of the 7th offering of my Reinforcement Learning class at ASU, which was completed two days ago; check web.mit.edu/dimitrib/www/R…

thumb_up_off_alt1,1K

chat_bubble_outline17

repeat239

shareShare

Dimitri Bertsekas

@dbertsekas

8 months ago

I am often asked about the relative merits of various #reinforcementlearning approaches, such as policy gradient and value-based methods. The last lecture of my RL course deals with this question, and related training issues, see: youtube.com/watch?v=43CXjD…

thumb_up_off_alt163

chat_bubble_outline0

repeat27

shareShare

Dimitri Bertsekas

@dbertsekas

7 months ago

I am pleased to share podcasts (<30 mins) describing two of my books: Neuro-Dynamic Programming. notebooklm.google.com/notebook/c21b0… A Course in Reinforcement Learning notebooklm.google.com/notebook/a4a87… Free PDF of both books can be found at web.mit.edu/dimitrib/www/b…

thumb_up_off_alt464

chat_bubble_outline3

repeat87

shareShare

Dimitri Bertsekas

@dbertsekas

7 months ago

I am pleased to share at web.mit.edu/dimitrib/www/R… High quality AI-generated podcast links for my books: 1) Lessons from AlphaZero ... 2) Parallel and Distributed Computation See web.mit.edu/dimitrib/www/b… for PDF copies #reinforcementlearning#machinelearning

thumb_up_off_alt35

chat_bubble_outline0

repeat6

shareShare