enundiagrisblanco (@enundiagris_)'s Twitter Profile
enundiagrisblanco

@enundiagris_

Unfathomable being, a cleft in the air

ID: 724197060807933953

Joined: 24-04-2016 11:23:29

2.2K Tweets

93 Followers

1.1K Following

Sepp Hochreiter (@hochreitersepp):

xLSTM is more expressive than Transformer and Mamba: arxiv.org/abs/2603.03612

* nonlinear RNNs: sLSTM, LSTM
* DPLR linear RNNs: mLSTM, RWKV, DeltaNet
* non-PNC1: Mamba, Transformer

“fundamental expressivity gaps between linear and nonlinear RNNs”

World models require nonlinear RNNs.
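The linear-vs-nonlinear gap the tweet points at can be illustrated with a toy sketch (not from the paper; all matrices and sizes below are made up): the final state of a linear recurrence collapses into a fixed weighted sum of its inputs, while putting a tanh between steps destroys that closed form.

```python
import numpy as np

rng = np.random.default_rng(0)
d = 4
A = rng.normal(size=(d, d)) * 0.3   # recurrence matrix (toy)
B = rng.normal(size=(d, d)) * 0.3   # input projection (toy)
xs = rng.normal(size=(5, d))        # a short input sequence

def linear_rnn(xs):
    # h_t = A h_{t-1} + B x_t : the state stays a linear function of the inputs
    h = np.zeros(d)
    for x in xs:
        h = A @ h + B @ x
    return h

def nonlinear_rnn(xs):
    # h_t = tanh(A h_{t-1} + B x_t) : a nonlinearity between steps
    h = np.zeros(d)
    for x in xs:
        h = np.tanh(A @ h + B @ x)
    return h

def linear_rnn_unrolled(xs):
    # For the linear RNN the recurrence unrolls exactly:
    # h_T = sum_t A^(T-1-t) B x_t, a fixed weighted sum over inputs
    T = len(xs)
    return sum(np.linalg.matrix_power(A, T - 1 - t) @ (B @ x)
               for t, x in enumerate(xs))

print(np.allclose(linear_rnn(xs), linear_rnn_unrolled(xs)))     # True
print(np.allclose(nonlinear_rnn(xs), linear_rnn_unrolled(xs)))  # False
```

The unrolled form is what makes linear recurrences parallelizable; the tanh version has no such input-weighted closed form, which is one intuition behind the expressivity gap the paper studies.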
Reza Bayat (@reza_byt):

Mythos is a looped transformer!? 😳 Should be a Mixture-of-Recursions (MoR) — 2× faster, controlled effort.

Dense → sparse MoE was the efficiency unlock of 2023.

Uniform loops → MoR is the same move for recursive transformers.

Paper reading list below. 🧵
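The "uniform loops → MoR" move can be sketched in toy form (my own illustration under stated assumptions, not the paper's architecture: `shared_block` and the depth rule here are hypothetical stand-ins): a looped transformer reuses one shared block a fixed number of times for every token, while a Mixture-of-Recursions-style router gives each token its own recursion depth, so easy tokens exit early.

```python
import numpy as np

rng = np.random.default_rng(1)
d, n_tokens, max_loops = 8, 6, 4
W = rng.normal(size=(d, d)) * 0.1   # toy stand-in for the shared block's weights
router_w = rng.normal(size=d)       # hypothetical per-token depth router

tokens = rng.normal(size=(n_tokens, d))

def shared_block(h):
    # stand-in for the single transformer block a looped model reuses
    return h + np.tanh(h @ W)

def uniform_loop(tokens):
    # looped transformer: every token passes through the block max_loops times
    h = tokens.copy()
    for _ in range(max_loops):
        h = shared_block(h)
    return h

def mixture_of_recursions(tokens):
    # MoR-style: a router assigns each token a recursion depth in 1..max_loops;
    # tokens stop recursing once their depth is reached, saving compute
    depths = 1 + (np.abs(tokens @ router_w) * 2).astype(int) % max_loops
    h = tokens.copy()
    for step in range(max_loops):
        active = depths > step          # tokens still recursing at this step
        h[active] = shared_block(h[active])
    return h, depths

h_uniform = uniform_loop(tokens)
h_mor, depths = mixture_of_recursions(tokens)
print(depths)  # per-token loop counts; sum(depths) <= n_tokens * max_loops
```

The saving is exactly the gap between `depths.sum()` and `n_tokens * max_loops` block applications, analogous to how sparse MoE spends FLOPs only on routed experts.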
DAIR.AI (@dair_ai):

NEW paper from Apple.

Interesting idea: "Attention to Mamba".

The paper introduces a two-stage recipe for cross-architecture distillation from Transformers into Mamba.

Naive distillation collapses teacher performance. Their trick: first distill the transformer into a
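The tweet cuts off before naming the intermediate stage, so nothing here reproduces Apple's recipe. As background only, a minimal sketch of the standard distillation objective such cross-architecture work builds on: a temperature-softened KL divergence between teacher and student token distributions (names and shapes below are illustrative).

```python
import numpy as np

def softmax(z, T=1.0):
    # temperature-softened softmax over the last axis
    z = z / T
    z = z - z.max(axis=-1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def distill_loss(teacher_logits, student_logits, T=2.0):
    # KL(teacher || student) over the vocabulary, averaged over positions,
    # scaled by T^2 as in the classic knowledge-distillation setup
    p = softmax(teacher_logits, T)
    q = softmax(student_logits, T)
    return float((p * (np.log(p) - np.log(q))).sum(axis=-1).mean() * T * T)

rng = np.random.default_rng(2)
t = rng.normal(size=(3, 10))        # teacher logits: 3 positions, vocab of 10
assert distill_loss(t, t) < 1e-9    # identical distributions -> zero loss
loss = distill_loss(t, rng.normal(size=(3, 10)))
print(loss > 0)                     # True: mismatched student -> positive loss
```

A two-stage recipe would apply a loss like this against different intermediate targets per stage; the specifics of Apple's stages are in the paper, not in this sketch.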
Clutch God (@xsports_1):

> be Yann LeCun > spend years building JEPA at Meta > company focuses on LLaMA instead > his idea stays complicated and unused > robotics plans get dropped > decides to leave and start AMI Labs > builds a much simpler version from scratch > trains it on normal hardware in just a

Alejo (@ecommartinez):

If you've made it this far, you're clearly one of those who will stay ahead. ⭐ I share this kind of content regularly here → Alejo. Shall we keep going?

324.cat (@324cat):

Losing your phone is easy. Getting it back can be easy too, if you've set it up in advance with these four preventive steps 3cat.cat/3catinfo/que-c…

Marta Peirano (@minipetite):

I'm joining the warning: Sci-Hub has pirated more than 85 million research articles, and now on top of that they've added a bot that answers questions using full, recent articles. This is a scandal. I'm leaving the link below so you know how to avoid it.

Santi Torres (@santitorai):

🚨 Karpathy just dropped 40 minutes of gold on AI agents in 2026. What to learn, what to build, and what to drop before it sinks you. 90% of today's tools won't survive 90 days. The filtering is already done. Free.