Lukas Thede (@lukas_thede) 's Twitter Profile
Lukas Thede

@lukas_thede

PhD Student at IMPRS-IS | University of Tübingen | Helmholtz Munich

ID: 767088302633738241

calendar_today20-08-2016 19:57:58

43 Tweet

70 Followers

113 Following

Vishaal Udandarao (@vishaal_urao) 's Twitter Profile Photo

🚀New Paper! arxiv.org/abs/2504.07086 Everyone’s celebrating rapid progress in math reasoning with RL/SFT. But how real is this progress? We re-evaluated recently released popular reasoning models—and found reported gains often vanish under rigorous testing!! 👀 🧵👇

🚀New Paper!
arxiv.org/abs/2504.07086

Everyone’s celebrating rapid progress in math reasoning with RL/SFT. But how real is this progress?

We re-evaluated recently released popular reasoning models—and found reported gains often vanish under rigorous testing!! 👀

🧵👇
Explainable Machine Learning (@explainableml) 's Twitter Profile Photo

📢 We’ve landed in Singapore for #ICLR2025! The EML group is presenting 4 exciting papers — come say hi at our poster sessions! 👇Let's chat! More details in the thread — see you there! 🌟

Tom Hartvigsen (@tom_hartvigsen) 's Twitter Profile Photo

Excited we have some papers accepted to ICML Conference in collaborations with some tremendous folks 🎉 Looking forward to Vancouver to discuss model editing for LLMs/VLMs and improving medical benchmarking!

Excited we have some papers accepted to <a href="/icmlconf/">ICML Conference</a> in collaborations with some tremendous folks 🎉

Looking forward to Vancouver to discuss model editing for LLMs/VLMs and improving medical benchmarking!
Explainable Machine Learning (@explainableml) 's Twitter Profile Photo

🚨Happy to announce that one paper, "Understanding the Limits of Lifelong Knowledge Editing in LLMs", is accepted at #icml2025 ! Congrats to the wonderful authors Lukas Thede , Karsten Roth , Matthias Bethge ,Zeynep Akata , and Tom Hartvigsen. 👇 Highlights in the thread

🚨Happy to announce that one paper, "Understanding the Limits of Lifelong Knowledge Editing in LLMs", is accepted at #icml2025 ! Congrats to the wonderful authors <a href="/lukas_thede/">Lukas Thede</a> , <a href="/confusezius/">Karsten Roth</a> , <a href="/MatthiasBethge/">Matthias Bethge</a> ,<a href="/zeynepakata/">Zeynep Akata</a> , and <a href="/tom_hartvigsen/">Tom Hartvigsen</a>.  👇 Highlights in the thread
Karsten Roth (@confusezius) 's Twitter Profile Photo

In Nashville for my last PhD conference 🥲. Come join today 10:30-12:30 in Hall D (#391) to talk insights, tips and tricks to modify pretraining for representation reuse - scalably. 🚀Joint work w/ Zeynep Akata, Dima Damen @CVPR 2025, Ivana Balazevic & Olivier Hénaff while at Google DeepMind.

In Nashville for my last PhD conference 🥲.

Come join today 10:30-12:30 in Hall D (#391) to talk insights, tips and tricks to modify pretraining for representation reuse - scalably.

🚀Joint work w/ <a href="/zeynepakata/">Zeynep Akata</a>, <a href="/dimadamen/">Dima Damen @CVPR 2025</a>, <a href="/ibalazevic/">Ivana Balazevic</a> &amp; <a href="/olivierhenaff/">Olivier Hénaff</a> while at <a href="/GoogleDeepMind/">Google DeepMind</a>.
Ori Press (@ori_press) 's Twitter Profile Photo

Do language models have algorithmic creativity? To find out, we built AlgoTune, a benchmark challenging agents to optimize 100+ algorithms like gzip compression, AES encryption and PCA. Frontier models struggle, finding only surface-level wins. Lots of headroom here!🧵⬇️

Do language models have algorithmic creativity?

To find out, we built AlgoTune, a benchmark challenging agents to optimize 100+ algorithms like gzip compression, AES encryption and PCA. Frontier models struggle, finding only surface-level wins. Lots of headroom here!🧵⬇️
Jonathan Richard Schwarz (@schwarzjn_) 's Twitter Profile Photo

✨ New ACL'25 Oral: Transforming dense LLMs into semantic MoEs during IFT! 📜 🎯 Key wins: • SOTA performance vs regular IFT & upcycling • Input-dependent expert-routing & merging • Learning WHERE to upcycle & HOW to specialize - no manual design! 🔗tinyurl.com/yae55x5e

✨ New ACL'25 Oral: Transforming dense LLMs into semantic MoEs during IFT! 📜

🎯 Key wins:
• SOTA performance vs regular IFT &amp; upcycling
• Input-dependent expert-routing &amp; merging
• Learning WHERE to upcycle &amp; HOW to specialize - no manual design!

🔗tinyurl.com/yae55x5e
Adhiraj Ghosh (@adhiraj_ghosh98) 's Twitter Profile Photo

Excited to be in Vienna for #ACL2025🇦🇹! You'll find Sebastian Dziadzio and I by our ONEBench poster, so do drop by! 🗓️Wed, July 30, 11-12:30 CET 📍Hall 4/5 I’m also excited to talk about lifelong and personalised benchmarking, data curation and vision-language in general! Let’s connect!

Excited to be in Vienna for #ACL2025🇦🇹! You'll find <a href="/sbdzdz/">Sebastian Dziadzio</a> and I by our ONEBench poster, so do drop by!

🗓️Wed, July 30, 11-12:30 CET
📍Hall 4/5

I’m also excited to talk about lifelong and personalised benchmarking, data curation and vision-language in general! Let’s connect!
Luca Eyring @ICLR (@lucaeyring) 's Twitter Profile Photo

Reward hacking is challenging when fine-tuning few-step Diffusion models. Direct fine-tuning on rewards can create artifacts that game metrics while degrading visual quality. We propose Noise Hypernetworks as a theoretically grounded solution, inspired by test-time optimization.

Explainable Machine Learning (@explainableml) 's Twitter Profile Photo

🔥We celebrate 3 papers accepted to NeurIPS Conference 2025, see you in San Diego! 🥳Topics include diffusion models, sparse autoencoders (SAEs) and neural chunking. See the thread for highlights👇

🔥We celebrate 3 papers accepted to <a href="/NeurIPSConf/">NeurIPS Conference</a>  2025, see you in San Diego! 🥳Topics include diffusion models, sparse autoencoders (SAEs) and neural chunking. See the thread for highlights👇
Tom Hartvigsen (@tom_hartvigsen) 's Twitter Profile Photo

📢Please retweet: We are hiring a **Postdoc** at UVA to work on Continually Monitoring and Updating Multi-modal Medical AI Models! Great opportunity to design impactful methods alongside great collaborators Ahmed Alaa and Roxana Daneshjou MD/PhD More info: tinyurl.com/ad7ptmvp