Timo Kaufmann (@timokauf) 's Twitter Profile
Timo Kaufmann

@timokauf

PhD Student at LMU Munich (@AIML_LMU). Focus on RL, reward learning and learning from human preferences.

ID: 1547888339163918337

linkhttp://timokaufmann.com calendar_today15-07-2022 11:52:15

32 Tweet

88 Takipçi

208 Takip Edilen

Artificial Intelligence and Machine Learning @ LMU (@aiml_lmu) 's Twitter Profile Photo

👀At this year's ECML PKDD'2023, Timo Kaufmann @ ICLR 2025 and Sarah Ball gave a talk on the Challenges and Practices of Reinforcement Learning from Real Human Feedback #HLDM. 📷Check out the presentation on our YouTube channel youtu.be/6BxYMbsJavg!

Timo Kaufmann (@timokauf) 's Twitter Profile Photo

Very cool work! I love the focus on control tasks over LLMs, the creative use of AI feedback and NetHack as a benchmark.

Artificial Intelligence and Machine Learning @ LMU (@aiml_lmu) 's Twitter Profile Photo

🚀 We’re excited to present at the MHFAIA@ICML2024 Workshop at ICML Conference! Dive into our novel approach to human feedback in reinforcement learning based on distinguishability queries. #RLHF #ICML2024 🧵1/6

🚀 We’re excited to present at the <a href="/mhf_icml2024/">MHFAIA@ICML2024</a>  Workshop at <a href="/icmlconf/">ICML Conference</a>! Dive into our novel approach to human feedback in reinforcement learning based on distinguishability queries.  #RLHF #ICML2024
🧵1/6
Timo Kaufmann (@timokauf) 's Twitter Profile Photo

Excited to attend the first RLC, the @RLBRew_2024 workshop in particular, and the beyond rewarding post-workshop beers in very particular! 🍺

Arduin Findeis @ ICLR2025 (@arduinfindeis) 's Twitter Profile Photo

🚀 Thrilled that our work on Inverse Constitutional AI has been accepted at ICLR 2025! We are continuing to extend the implementation, just released the latest version (v0.1.2) at github.com/rdnfn/icai

Timo Kaufmann (@timokauf) 's Twitter Profile Photo

Our paper on query-efficient reward learning will be at AAAI! Unfortunately I won’t be attending, but Xuening will be at the poster - stop by or reach out online!

Arduin Findeis @ ICLR2025 (@arduinfindeis) 's Twitter Profile Photo

🕵🏻💬 Introducing Feedback Forensics: a new tool to investigate pairwise preference data. Feedback data is notoriously difficult to interpret and has many known issues – our app aims to help! Try it at app.feedbackforensics.com Three example use-cases 👇🧵