Thomas Möllenhoff (@tmoellenhoff) 's Twitter Profile
Thomas Möllenhoff

@tmoellenhoff

Research Scientist (tenured) @RIKEN_AIP @RIKEN_EN — PhD from @TU_Muenchen @tumcvg

ID: 1196487762851045376

linkhttp://thomasmoellenhoff.net calendar_today18-11-2019 17:58:14

141 Tweet

722 Followers

500 Following

Emtiyaz Khan (@emtiyazkhan) 's Twitter Profile Photo

We don't expect Bayesian methods to do so well at large scale, but we can now get decent improvements with variational learning to GPT-2. I wrote a blog about this (first one in a long time). Check it out! team-approx-bayes.github.io/blog/ivon/ Paper: arxiv.org/abs/2402.17641 A thread below.

We don't expect Bayesian methods to do so well at large scale, but we can now get decent improvements with variational learning to GPT-2. I wrote a blog about this (first one in a long time). Check it out!
team-approx-bayes.github.io/blog/ivon/

Paper: arxiv.org/abs/2402.17641

A thread below.
Emtiyaz Khan (@emtiyazkhan) 's Twitter Profile Photo

What a day we got to spend in the presence of a legend. Prof. Amari is 88 now and we heard, what he calls, his last talk today (but he is in very good health and I am sure there will more).

What a day we got to spend in the presence of a legend. Prof. Amari is 88 now and we heard, what he calls, his last talk today (but he is in very good health and I am sure there will more).
Yuesong Shen (@ysngshn) 's Twitter Profile Photo

Just wrote a (math-free) tutorial for the IVON optimizer proposed in our paper "Variational Learning is Effective for Large Deep Networks". Maybe it is just what you need for your next DL project 😉 ysngshn.github.io/research/why-i…

UKP Lab (@ukplab) 's Twitter Profile Photo

Model Merging has shown great success but key questions remain unresolved: ✅ Why does it work? ❌ When can it fail? We shed light on those questions by connecting inaccuracies of weighted-averaging to mismatches in the gradients. 🧵(1/9) #ICLR2024 📰 arxiv.org/abs/2310.12808

Model Merging has shown great success but key questions remain unresolved:

✅ Why does it work?
❌ When can it fail?

We shed light on those questions by connecting inaccuracies of weighted-averaging to mismatches in the gradients.

🧵(1/9) #ICLR2024

📰  arxiv.org/abs/2310.12808
Emtiyaz Khan (@emtiyazkhan) 's Twitter Profile Photo

It is my pleasure to announce the 2nd Bayes-Duality workshop, focusing on the design of AI that learns adaptively, robustly, and coninually, like humans. bayesduality.github.io/workshop_2024.… You can watch all 26 talks through zoom livestream (June 12-21). Register at …c59ed978213830355fc8978.doorkeeper.jp/events/172217

It is my pleasure to announce the 2nd Bayes-Duality workshop, focusing on the design of AI that learns adaptively, robustly, and coninually, like humans. 
bayesduality.github.io/workshop_2024.…

You can watch all 26 talks through zoom livestream (June 12-21). Register at …c59ed978213830355fc8978.doorkeeper.jp/events/172217
Thomas Möllenhoff (@tmoellenhoff) 's Twitter Profile Photo

I‘m in Venice at ISBA this week. I‘ll be giving a talk on making variational Bayes work for large neural nets (GPT-2, …), Friday 3pm in Julyan Arbel invited session. It‘s my first time at ISBA, happy to chat and connect. Send me a message! 😊

I‘m in Venice at ISBA this week. I‘ll be giving a talk on making variational Bayes work for large neural nets (GPT-2, …), Friday 3pm in <a href="/JulyanArbel/">Julyan Arbel</a> invited session. 

It‘s my first time at ISBA, happy to chat and connect. Send me a message! 😊
Thomas Möllenhoff (@tmoellenhoff) 's Twitter Profile Photo

I‘ll be presenting a poster on our work on scaling variational learning to GPT-2 sized models today at ICML. Feel free to drop by!

Yuesong Shen (@ysngshn) 's Twitter Profile Photo

The JAX implementation of IVON is now available at github.com/ysngshn/ivon-o… Let us know if you have any issues or success stories to share! #JAX #IVON #deeplearning

Thomas Möllenhoff (@tmoellenhoff) 's Twitter Profile Photo

Thanks JinYeong Bak and students for visiting us at RIKEN and giving excellent talks! If you haven’t seen it yet, check out their recent works on memory in neural nets. arxiv.org/abs/2310.03052 aclanthology.org/2024.naacl-lon…

Thanks <a href="/NoSyu/">JinYeong Bak</a> and students for visiting us at RIKEN and giving excellent talks! 

If you haven’t seen it yet, check out their recent works on memory in neural nets. arxiv.org/abs/2310.03052

aclanthology.org/2024.naacl-lon…
Thomas Möllenhoff (@tmoellenhoff) 's Twitter Profile Photo

Thanks Kai Arulkumaran for the inspiring talk on brain-controlled robotics and visiting us at RIKEN AIP. Looking forward to what is coming next 🤖⚡️

Thanks <a href="/kaixhin/">Kai Arulkumaran</a> for the inspiring talk on brain-controlled robotics and visiting us at RIKEN AIP. 

Looking forward to what is coming next 🤖⚡️
Emtiyaz Khan (@emtiyazkhan) 's Twitter Profile Photo

We have two open post-doc positions. You dont' have to be a Bayesian but somebody who is interested to work with at the intersection of DL, Bayes, and optimization. riken.jp/en/careers/res… Interest in understanding deep learning and continual lifelong learning is a plus!