Maximilian Beck (@maxmbeck) Twitter Tweets • TwiCopy

Maximilian Beck

@maxmbeck

ELLIS PhD Student @ JKU Linz Institute for Machine Learning & PhD Researcher @nx_ai_com

ID: 1401163561322389508

Link: http://maxbeck.ai

Joined: 05-06-2021 13:06:56

204 Tweets

824 Followers

703 Following

Julien Siems

@julien_siems

9 months ago

1/9 There is a fundamental tradeoff between parallelizability and expressivity of Large Language Models. We propose a new linear RNN architecture, DeltaProduct, that can effectively navigate this tradeoff. Here's how!

172 likes, 2 replies, 32 retweets

Maximilian Beck

@maxmbeck

8 months ago

Does SSMax in Llama4 avoid attention sinks?

1 like, 0 replies, 0 retweets

Maximilian Beck

@maxmbeck

8 months ago

I will talk about our xLSTM 7B, today! Tune in 💫

23 likes, 0 replies, 1 retweet

KorbinianPoeppel

@korbipoeppel

8 months ago

Hope to see you around at #ICLR2025 in #Singapore! I'm happy to present our work on xLSTM kernels, applications and scaling up to 7B parameters!

12 likes, 1 reply, 1 retweet

Sepp Hochreiter

@hochreitersepp

8 months ago

xLSTM for Multi-label ECG Classification: arxiv.org/abs/2504.16101 "This approach significantly improves ECG classification accuracy, thereby advancing clinical diagnostics and patient care." Cool.

72 likes, 4 replies, 10 retweets

Maximilian Beck

@maxmbeck

8 months ago

Come by today at our posters in the Open Science for Foundation Models at 3pm (Hall4#5) #ICLR25 if you want to know more about Tiled Flash Linear Attention and xLSTM 7B!

44 likes, 0 replies, 11 retweets

Maximilian Beck

@maxmbeck

7 months ago

Excited to share that 2 of our papers on efficient inference with #xLSTM are accepted at #ICML25. A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks (arxiv.org/abs/2410.22391) and xLSTM 7B: A Recurrent LLM for Fast and Efficient Inference:

70 likes, 2 replies, 10 retweets

Songlin Yang

@songlinyang4

7 months ago

📢 (1/16) Introducing PaTH 🛣️ — a RoPE-free contextualized position encoding scheme, built for stronger state tracking, better extrapolation, and hardware-efficient training. PaTH outperforms RoPE across short and long language modeling benchmarks arxiv.org/abs/2505.16381

424 likes, 9 replies, 79 retweets

Andreas Auer

@andauer

7 months ago

We’re excited to introduce TiRex — a pre-trained time series forecasting model based on an xLSTM architecture.

69 likes, 5 replies, 21 retweets

Sepp Hochreiter

@hochreitersepp

7 months ago

My book "Was kann Künstliche Intelligenz?" ("What Can Artificial Intelligence Do?") has been published. It is an easily accessible introduction to the topic of artificial intelligence. It explains to readers, including those without a technical background, what AI actually is, what potential it holds, and what impact it has.

25 likes, 2 replies, 4 retweets

rohan anil

@_arohan_

7 months ago

1.1K likes, 12 replies, 132 retweets