Timo Kaufmann (@timokauf) Twitter Tweets • TwiCopy

Timo Kaufmann

@timokauf

+ Follow

PhD Student at LMU Munich (@AIML_LMU). Focus on RL, reward learning and learning from human preferences.

ID: 1547888339163918337

linkhttp://timokaufmann.com calendar_today15-07-2022 11:52:15

32 Tweet

88 Followers

208 Following

Artificial Intelligence and Machine Learning @ LMU

@aiml_lmu

2 years ago

👀At this year's ECML PKDD'2023, Timo Kaufmann @ ICLR 2025 and Sarah Ball gave a talk on the Challenges and Practices of Reinforcement Learning from Real Human Feedback #HLDM. 📷Check out the presentation on our YouTube channel youtu.be/6BxYMbsJavg!

thumb_up_off_alt5

chat_bubble_outline0

repeat1

shareShare

Timo Kaufmann

@timokauf

2 years ago

Very cool work! I love the focus on control tasks over LLMs, the creative use of AI feedback and NetHack as a benchmark.

thumb_up_off_alt2

chat_bubble_outline0

repeat0

shareShare

Timo Kaufmann

@timokauf

a year ago

Really happy with this collaboration. Great fun in the process, satisfying outcome. Give it a read!

thumb_up_off_alt4

chat_bubble_outline0

repeat0

shareShare

Artificial Intelligence and Machine Learning @ LMU

@aiml_lmu

a year ago

🚀 We’re excited to present at the MHFAIA@ICML2024 Workshop at ICML Conference! Dive into our novel approach to human feedback in reinforcement learning based on distinguishability queries. #RLHF #ICML2024 🧵1/6

🚀 We’re excited to present at the <a href="/mhf_icml2024/">MHFAIA@ICML2024</a> Workshop at <a href="/icmlconf/">ICML Conference</a>! Dive into our novel approach to human feedback in reinforcement learning based on distinguishability queries. #RLHF #ICML2024
🧵1/6

thumb_up_off_alt3

chat_bubble_outline1

repeat1

shareShare

Timo Kaufmann

@timokauf

a year ago

I'm at #ICML2024, reach out if you want to chat!

thumb_up_off_alt4

chat_bubble_outline0

repeat0

shareShare

Timo Kaufmann

@timokauf

a year ago

Excited to attend the first RLC, the @RLBRew_2024 workshop in particular, and the beyond rewarding post-workshop beers in very particular! 🍺

thumb_up_off_alt1

chat_bubble_outline0

repeat0

shareShare

Timo Kaufmann

@timokauf

a year ago

I'll be presenting this at @RLBRew_2024 this morning, join us at the workshop!

thumb_up_off_alt0

chat_bubble_outline0

repeat0

shareShare

Timo Kaufmann

@timokauf

a year ago

Had a good time at the @RLBRew_2024 workshop at RLC yesterday, thanks to the organizers!

thumb_up_off_alt4

chat_bubble_outline0

repeat1

shareShare

Abhishek Naik

@anaik96

a year ago

Learning about the history of RL at the birthplace of RL from the founders of RL at the first RL_Conference! What a privilege!

Learning about the history of RL at the birthplace of RL from the founders of RL at the first <a href="/RL_Conference/">RL_Conference</a>!

What a privilege!

thumb_up_off_alt362

chat_bubble_outline5

repeat25

shareShare

Timo Kaufmann

@timokauf

a year ago

I love how green the #RLC2024 venue (UMass Amherst) is

thumb_up_off_alt0

chat_bubble_outline0

repeat0

shareShare

Arduin Findeis @ ICLR2025

@arduinfindeis

9 months ago

🚀 Thrilled that our work on Inverse Constitutional AI has been accepted at ICLR 2025! We are continuing to extend the implementation, just released the latest version (v0.1.2) at github.com/rdnfn/icai

thumb_up_off_alt5

chat_bubble_outline0

repeat3

shareShare

Timo Kaufmann

@timokauf

9 months ago

Our paper on query-efficient reward learning will be at AAAI! Unfortunately I won’t be attending, but Xuening will be at the poster - stop by or reach out online!

thumb_up_off_alt1

chat_bubble_outline0

repeat0

shareShare

Arduin Findeis @ ICLR2025

@arduinfindeis

8 months ago

🕵🏻💬 Introducing Feedback Forensics: a new tool to investigate pairwise preference data. Feedback data is notoriously difficult to interpret and has many known issues – our app aims to help! Try it at app.feedbackforensics.com Three example use-cases 👇🧵