Wieland Brendel (@wielandbr)'s Twitter Profile
Wieland Brendel

@wielandbr

Machine Learning Researcher and Social Entrepreneur | Group Lead at ELLIS Institute Tübingen | Co-Founder maddox.ai | Co-Initiator bw-ki.de | @ellis.eu scholar

ID: 3466980795

Link: https://robustml.is.mpg.de | Joined: 28-08-2015 09:29:49

503 Tweets

4.4K Followers

190 Following

ELIAS (@elias_project)'s Twitter Profile Photo

The 8 #ELIASNodes were unveiled at the Falling Walls AI Night! These hubs in Amsterdam, Barcelona, Cambridge, Copenhagen, Munich, Potsdam, Tübingen & Zurich will foster #AIinnovation, connect academia with business & inspire a new generation of AI&Science value creators. 🌍🚀

Intelligent Systems (@mpi_is)'s Twitter Profile Photo

Day 4 of our advent calendar, showcasing #Polybot #robot, developed by Wieland Brendel and his Robust #MachineLearning Group. This flexible small robot could one day work in swarms, making it possible to realize sustainable and cost-effective farming: tuebingen.ai/news/want-an-a… #AI #KI

Vishaal Udandarao (@vishaal_urao)'s Twitter Profile Photo

🚀New Paper arxiv.org/abs/2412.06712 Model merging is all the rage these days: simply fine-tune multiple task-specific models and merge them at the end. Guaranteed perf boost! But wait, what if you get new tasks over time, sequentially? How do you merge your models over time? 🧵👇

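For readers unfamiliar with the setup: the simplest flavor of model merging is plain parameter averaging across fine-tuned checkpoints that share one architecture. A minimal sketch below; `merge_models` and the toy parameter dicts are illustrative assumptions, not the paper's actual method, which studies the harder sequential setting.

```python
def merge_models(state_dicts, weights=None):
    """Average the parameters of several fine-tuned models.

    Each state dict maps parameter names to values; all models must
    share one architecture. With no weights given, this is a uniform
    average; task-specific weights let some models dominate.
    """
    if weights is None:
        weights = [1.0 / len(state_dicts)] * len(state_dicts)
    return {
        name: sum(w * sd[name] for w, sd in zip(weights, state_dicts))
        for name in state_dicts[0]
    }

# Toy example with scalar "parameters" in place of real tensors:
model_a = {"w": 2.0, "b": 0.0}
model_b = {"w": 4.0, "b": 2.0}
merged = merge_models([model_a, model_b])  # {"w": 3.0, "b": 1.0}
```

In practice the values would be tensors (e.g. a PyTorch `state_dict`), and the open question the thread raises is how to choose the merge when checkpoints arrive one task at a time rather than all at once.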
Wieland Brendel (@wielandbr)'s Twitter Profile Photo

Does anyone know how OpenAI gets o3-mini to exceed 700 tokens/sec? I’ve only seen such speeds on specialized chips from Cerebras, SambaNova, or Groq Inc—but not on standard NVIDIA GPUs, which I assumed power OpenAI’s inference.

ELIAS (@elias_project)'s Twitter Profile Photo

🚀Introducing the ELIAS Node Barcelona! A key hub for AI knowledge transfer & innovation in Catalonia, bridging research & industry. Led by Dimosthenis Karatzas, with Meritxell Bassolas & Victor Rotellar, it unites top AI expertise & strategic innovation. 🔗 Learn more: elias-ai.eu/elias-node-bar…

Andreas Hochlehnert (@ahochlehnert)'s Twitter Profile Photo

CuratedThoughts: Data Curation for RL Datasets 🚀 Since DeepSeek-R1 introduced reasoning-based RL, datasets like Open-R1 & OpenThoughts have emerged for fine-tuning & GRPO. Our deep dive found major flaws: 25% of OpenThoughts had to be eliminated through data curation. Here's why 👇🧵

Wieland Brendel (@wielandbr)'s Twitter Profile Photo

New preprint out! Shoutout to Thaddäus Wiedemer and Prasanna Mayilvahanan for this clean work on what truly shapes LLM train-to-downstream performance! Turns out, architecture plays a shockingly small role—it's all about the data. Must-read for anyone thinking about scaling and…

Vishaal Udandarao (@vishaal_urao)'s Twitter Profile Photo

🚀New Paper! arxiv.org/abs/2504.07086 Everyone’s celebrating rapid progress in math reasoning with RL/SFT. But how real is this progress? We re-evaluated recently released popular reasoning models—and found reported gains often vanish under rigorous testing!! 👀 🧵👇

Jack Brady (@jackhb98)'s Twitter Profile Photo

I'm at #ICLR2025 presenting our work on compositional generalization! (Sat. 10 AM; Hall 3 + Hall 2B, #310) We provide a general and unifying theory of compositional generalization, based on a new principle called interaction asymmetry! 📜 arxiv.org/abs/2411.07784 (See 🧵)
