Lorenz Wolf (@lorenz_wlf)'s Twitter Profile
Lorenz Wolf

@lorenz_wlf

PhD student in Foundational AI @ucl @ai_ucl @uclcs
Enrichment Fellow @turinginst

ID: 1886423073001181184

Joined: 03-02-2025 14:34:54

10 Tweets

32 Followers

136 Following

Lorenz Wolf (@lorenz_wlf)'s Twitter Profile Photo

Heading to #SaTML2025 this week to present our work on "Private Selection with Heterogeneous Sensitivities". arxiv.org/abs/2501.05309…

Xiaohang Tang (@xiaohang_tang)'s Twitter Profile Photo

Glad to introduce our new work "Game-Theoretic Regularized Self-Play Alignment of Large Language Models". arxiv.org/abs/2503.00030 🎉 We introduce RSPO, a general, provably convergent framework to bring different regularization strategies into self-play alignment. 🧵👇

UKRI CDT in Foundational AI (@faicdt1)'s Twitter Profile Photo

CDT student Lorenz Wolf has recently returned from the #SaTML2025 3rd IEEE Conference on Secure and Trustworthy Machine Learning in Copenhagen where he had the opportunity to present his work - more on our blog here ➡️ blogs.ucl.ac.uk/faicdt/2025/04…

Vasilios Mavroudis (@mavroudisv)'s Twitter Profile Photo

🚨 New update from our AI for Cyber Defence for Critical Infrastructure mission at The Alan Turing Institute: we're pushing the limits of protocol-aware deception using AI. Here’s what we’ve built👇 airgapped.substack.com/p/update-may-2… Btw this carries minimal dual-use risk. It's a defence-only use case.

Augustine Mavor-Parker (@mavorparker)'s Twitter Profile Photo

RL is the agent-environment loop and we currently do not have enough environments! At Vmax we're building a platform for environment creation.

Lorenz Wolf (@lorenz_wlf)'s Twitter Profile Photo

At RLDM this week to present our work on incorporating diverse prior knowledge in RL (sample efficiency, safety, interpretability, ...). Poster #94 on Thursday. Full paper here: arxiv.org/abs/2306.01158 #RLDM2025

Xiaohang Tang (@xiaohang_tang)'s Twitter Profile Photo

🧶1/ Diffusion-based LLMs (dLLMs) are fast & promising—but hard to fine-tune with RL. Why? Because their likelihoods are intractable, making common RL (like GRPO) inefficient & biased. 💡We present a novel method 𝐰𝐝𝟏, that mitigates these headaches. Let’s break it down.👇

darren (@darrenangle)'s Twitter Profile Photo

*sniff* *pulls shirt* You know, this is perfect - *gestures wildly* - this is the ultimate perversity of capitalism at its purest. Here we have Anthropic, this company claiming to build "AI for humanity," and what do they do? They create this digital cocaine, this Claude Code,

Lorenz Wolf (@lorenz_wlf)'s Twitter Profile Photo

Post-training methods like RLHF improve LLM quality but often collapse diversity. Check out DQO, a training objective using determinantal point processes (DPPs) that directly optimizes for semantic diversity and quality.
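
For intuition, here is a minimal sketch of a DPP-style diversity bonus over a set of candidate responses. This is not the DQO objective from the paper; the cosine kernel, the jitter term, and the trade-off weight `alpha` are illustrative assumptions.

```python
# Sketch of a DPP-style diversity bonus over candidate responses.
# Not the DQO objective itself; kernel choice and `alpha` are assumptions.
import numpy as np

def dpp_diversity_score(embeddings: np.ndarray) -> float:
    """Log-determinant of the similarity kernel of a set of response embeddings.

    A determinantal point process assigns higher probability to sets whose
    kernel determinant is large, i.e. to sets of semantically diverse items.
    """
    X = embeddings / np.linalg.norm(embeddings, axis=1, keepdims=True)
    L = X @ X.T                       # cosine-similarity Gram matrix
    L += 1e-6 * np.eye(L.shape[0])    # jitter for numerical stability
    _, logdet = np.linalg.slogdet(L)  # stable log|L|
    return float(logdet)

def diversity_quality_objective(embeddings, quality_scores, alpha=0.5):
    """Toy combined objective: mean quality reward plus a diversity bonus."""
    return float(np.mean(quality_scores)) + alpha * dpp_diversity_score(embeddings)

# Example with stand-in data: 4 candidate responses, 8-dim embeddings.
emb = np.random.randn(4, 8)
scores = np.array([0.7, 0.4, 0.9, 0.6])
print(diversity_quality_objective(emb, scores))
```

The design intuition is that the determinant of a similarity kernel shrinks toward zero when responses are near-duplicates, so maximizing it rewards sets that are both individually good (quality term) and mutually dissimilar (diversity term).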

Aldo Pacchiano (@aldopacchiano)'s Twitter Profile Photo

(1/4) Typical LLM post-training mechanisms have a hard time learning models that can produce diverse responses. To fix this we introduce 𝐃𝐐𝐎 (𝐃𝐢𝐯𝐞𝐫𝐬𝐢𝐭𝐲 𝐐𝐮𝐚𝐥𝐢𝐭𝐲 𝐎𝐩𝐭𝐢𝐦𝐢𝐳𝐚𝐭𝐢𝐨𝐧), a method for post-training LLMs to generate diverse high-quality

Charlie Westphal (@charliewestphai)'s Twitter Profile Photo

New pre-print: arxiv.org/abs/2509.26327. The information bottleneck offers useful but imperfect insights into deep learning. Mirco Musolesi, Steve Hailes, and I introduce a Generalized IB that fills the gaps left by the classic approach.
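
For context, the classic information bottleneck objective that a generalized IB would build on, in its standard textbook form (notation assumed here, not taken from the paper), trades off compressing the input X into a representation T against preserving information about the target Y:

```latex
% Classic IB Lagrangian; beta controls the compression/prediction trade-off.
\min_{p(t \mid x)} \; I(X; T) - \beta \, I(T; Y)
```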

UCL CSML (@uclcsml)'s Twitter Profile Photo

Our seminar is back! The next seminar is Wednesday (Oct 22) and starts at 12:30pm UK time! Arthur Gretton from UCL / Google DeepMind is going to talk about “Learning to Act in Noisy Contexts Using Deep Proxy Learning”! ucl.zoom.us/j/99748820264 ucl-ellis.github.io/jt_csml_semina…