Maximilian Beck (@maxmbeck) 's Twitter Profile
Maximilian Beck

@maxmbeck

ELLIS PhD Student @ JKU Linz Institute for Machine Learning & PhD Researcher @nx_ai_com

ID: 1401163561322389508

linkhttp://maxbeck.ai calendar_today05-06-2021 13:06:56

204 Tweet

824 Followers

703 Following

Julien Siems (@julien_siems) 's Twitter Profile Photo

1/9 There is a fundamental tradeoff between parallelizability and expressivity of Large Language Models. We propose a new linear RNN architecture, DeltaProduct, that can effectively navigate this tradeoff. Here's how!

1/9 There is a fundamental tradeoff between parallelizability and expressivity of Large Language Models. We propose a new linear RNN architecture, DeltaProduct, that can effectively navigate this tradeoff. Here's how!
KorbinianPoeppel (@korbipoeppel) 's Twitter Profile Photo

Hope to see you around at #ICLR2025 in #Singapore! I'm happy to present our work on xLSTM kernels, applications and scaling up to 7B parameters!

Hope to see you around at #ICLR2025 in #Singapore!
I'm happy to present our work on xLSTM kernels, applications and scaling up to 7B parameters!
Sepp Hochreiter (@hochreitersepp) 's Twitter Profile Photo

xLSTM for Multi-label ECG Classification: arxiv.org/abs/2504.16101 "This approach significantly improves ECG classification accuracy, thereby advancing clinical diagnostics and patient care." Cool.

xLSTM for Multi-label ECG Classification: arxiv.org/abs/2504.16101

"This approach significantly improves ECG classification accuracy, thereby advancing clinical diagnostics and patient care."
Cool.
Maximilian Beck (@maxmbeck) 's Twitter Profile Photo

Come by today at our posters in the Open Science for Foundation Models at 3pm (Hall4#5) #ICLR25 if you want to know more about Tiled Flash Linear Attention and xLSTM 7B!

Come by today at our posters in the Open Science for Foundation Models at 3pm (Hall4#5) #ICLR25 if you want to know more about Tiled Flash Linear Attention and xLSTM 7B!
Maximilian Beck (@maxmbeck) 's Twitter Profile Photo

Excited to share that 2 of our papers on efficient inference with #xLSTM are accepted at #ICML25. A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks (arxiv.org/abs/2410.22391) and xLSTM 7B: A Recurrent LLM for Fast and Efficient Inference:

Songlin Yang (@songlinyang4) 's Twitter Profile Photo

📢 (1/16) Introducing PaTH 🛣️ — a RoPE-free contextualized position encoding scheme, built for stronger state tracking, better extrapolation, and hardware-efficient training. PaTH outperforms RoPE across short and long language modeling benchmarks arxiv.org/abs/2505.16381

Sepp Hochreiter (@hochreitersepp) 's Twitter Profile Photo

Mein Buch “Was kann Künstliche Intelligenz?“ ist erschienen. Eine leicht zugängliche Einführung in das Thema Künstliche Intelligenz. LeserInnen – auch ohne technischen Hintergrund – wird erklärt, was KI eigentlich ist, welche Potenziale sie birgt und welche Auswirkungen sie hat.