Markus Wulfmeier (@m_wulfmeier) 's Twitter Profile
Markus Wulfmeier

@m_wulfmeier

machine learning & the real world @GoogleDeepMind - european @ELLISforEurope - transfer and generalisation - priors: @oxfordrobots @berkeley_ai @ETH @MIT #LUH

ID: 4484386293

linkhttps://sites.google.com/view/mwulfmeier/bio calendar_today14-12-2015 19:55:48

2,2K Tweet

11,11K Followers

1,1K Following

Markus Wulfmeier (@m_wulfmeier) 's Twitter Profile Photo

Imitation is the foundation of #LLM training. And it is a #ReinforcementLearning problem! Compared to supervised learning, RL -here inverse RL- better exploits sequential structure, online data and further extracts rewards. Beyond thrilled for our Google DeepMind paper! A

Imitation is the foundation of #LLM training.

And it is a #ReinforcementLearning problem!

Compared to supervised learning, RL -here inverse RL- better exploits sequential structure, online data and further extracts rewards.

Beyond thrilled for our <a href="/GoogleDeepMind/">Google DeepMind</a> paper!

A