Stefano Massaroli (@massastrello) 's Twitter Profile
Stefano Massaroli

@massastrello

Researcher @ RIKEN. Founding Scientist @ liquid.ai. deep learning ∩ signal processing ∩ dynamical systems

ID: 1151541886521241603

linkhttps://github.com/massastrello?tab=overview&from=2020-07-01&to=2020-07-04 calendar_today17-07-2019 17:19:15

133 Tweet

829 Followers

160 Following

Joscha Bach (@plinz) 's Twitter Profile Photo

The core tech of the current AI breakthroughs is as old as AI itself: the perceptron was already invented in 1957! How can we improve on this? Liquid.ai is a new MIT startup that rethinks function approximation, using Liquid Neural Networks: linkedin.com/posts/joschaba…

Liquid AI (@liquidai_) 's Twitter Profile Photo

Hello World! Excited to be out of stealth mode! Let me introduce you to our Liquid team: Joscha Bach Joscha Bach (MIT, Harvard, Intel) Jimmy Smith Jimmy Smith (Stanford) Stefano Massaroli Stefano Massaroli (MILA) Paul Pak Paul Pak (NASA, UW-Madison) Noel Loo

Hello World! Excited to be out of stealth mode! Let me introduce you to our Liquid team:

Joscha Bach <a href="/Plinz/">Joscha Bach</a>  (MIT, Harvard, Intel)
Jimmy Smith <a href="/jimmysmith1919/">Jimmy Smith</a> (Stanford)
Stefano Massaroli <a href="/Massastrello/">Stefano Massaroli</a> (MILA)
Paul Pak <a href="/paulpak__/">Paul Pak</a> (NASA, UW-Madison)
Noel Loo
Oussama Boussif (@b0ussifo) 's Twitter Profile Photo

☀️Thrilled to unveil our latest work to appear at #NeurIPS2023, on improving solar irradiance forecasting! We present CrossViViT, an architecture that uses cross-attention to combine satellite data and time-series modalities for improved accuracy (arxiv.org/abs/2306.01112). 1/10🧵

Together AI (@togethercompute) 's Twitter Profile Photo

Announcing StripedHyena 7B — an open source model using an architecture that goes beyond Transformers achieving faster performance and longer context. It builds on the lessons learned in past year designing efficient sequence modeling architectures. together.ai/blog/stripedhy…

Announcing StripedHyena 7B — an open source model using an architecture that goes beyond Transformers achieving faster performance and longer context.

It builds on the lessons learned in past year  designing efficient sequence modeling architectures.

together.ai/blog/stripedhy…
Michael Poli (@michaelpoli6) 's Twitter Profile Photo

We've been hard at work pushing the frontiers of efficient architecture design and optimization. StripedHyena-7B is the result: the first alternative architecture truly competitive with the best Transformers of its size or larger. And it's very fast.

We've been hard at work pushing the frontiers of efficient architecture design and optimization. StripedHyena-7B is the result: the first alternative architecture truly competitive with the best Transformers of its size or larger. 

And it's very fast.
Sasha Rush (@srush_nlp) 's Twitter Profile Photo

TogetherAI just released StripedHyena 7B. At first look ~Mistral level model with significant memory and speed benefits: together.ai/blog/stripedhy… If you are interested in how it works, I surveyed this literature earlier this year in an MLSys keynote: youtube.com/watch?v=dKJEpO…

Stefano Massaroli (@massastrello) 's Twitter Profile Photo

Excited to announce StripedHyena!🚀 This is the pinnacle of a long line of efficient architecture research to which I’ve been honored to contribute. Kudos to my dear friend Michael Poli for bringing our crazy ideas to the world, and Together AI for fostering our research

Brandon Amos (@brandondamos) 's Twitter Profile Photo

My core ML team (AI at Meta) is hiring research interns! Our projects span optimization, optimal transport, optimal control, generative modeling, complex systems, and geometry. Please apply here and reach out ([email protected]) if you're interested: metacareers.com/jobs/627997209…

Eric Nguyen (@exnx) 's Twitter Profile Photo

HyenaDNA poster went awesome! Honestly had sooooo much fun sharing the work with folks interested and hanging out with the coolest, brightest teammates I could ask for. So proud and happy to work with them, and definitely a highlight of my PhD.

HyenaDNA poster went awesome! Honestly had sooooo much fun sharing the work with folks interested and hanging out with the coolest, brightest teammates I could ask for. So proud and happy to work with them, and definitely a highlight of my PhD.
Liquid AI (@liquidai_) 's Twitter Profile Photo

Today we announce our collaboration with Capgemini to build next-generation AI solutions for enterprises. For the last months, we've been working on this together and now following Capgemini's participation in Liquid AI's successful $37.6m seed round, we are committed to

Today we announce our collaboration with Capgemini to build next-generation AI solutions for enterprises. For the last months, we've been working on this together and now following Capgemini's participation in Liquid AI's successful $37.6m seed round, we are committed to
Michael Poli (@michaelpoli6) 's Twitter Profile Photo

📢New research on mechanistic architecture design and scaling laws. - We perform the largest scaling laws analysis (500+ models, up to 7B) of beyond Transformer architectures to date - For the first time, we show that architecture performance on a set of isolated token

📢New research on mechanistic architecture design and scaling laws.

- We perform the largest scaling laws analysis (500+ models, up to 7B) of beyond Transformer architectures to date

- For the first time, we show that architecture performance on a set of isolated token
Ramin Hasani (@ramin_m_h) 's Twitter Profile Photo

today, I want to share the core values that shape our culture at Liquid. here we go: no-bullshit meritocracy, burn the playbook, proactive execution and purposeful ownership, be white-box explainable and let's grow together. Allow me to elaborate: ------------- A CULTURE

today, I want to share the core values that shape our culture at Liquid. here we go: 

no-bullshit meritocracy, 
burn the playbook, 
proactive execution and purposeful ownership, 
be white-box explainable and 
let's grow together.

Allow me to elaborate:

-------------
A CULTURE
Ramin Hasani (@ramin_m_h) 's Twitter Profile Photo

scaling hypothesis holds strong! scaling laws have three components that work simultaneously with each other, and not alone: computation + algorithms + data. “hitting the wall” arguments around any of these components in isolation are bs.

Jürgen Schmidhuber (@schmidhuberai) 's Twitter Profile Photo

It has been said that AI is the new oil, the new electricity, and the new internet. And the once nimble and highly profitable software companies (MSFT, GOOG, ...) became like utilities, investing in nuclear energy, among other things, to run AI data centres. Open Source and the

It has been said that AI is the new oil, the new electricity, and the new internet. And the once nimble and highly profitable software companies (MSFT, GOOG, ...) became like utilities, investing in nuclear energy, among other things, to run AI data centres. Open Source and the
Maxime Labonne (@maximelabonne) 's Twitter Profile Photo

In the shadow of DeepSeek R1, our LFM-7B is performing really well on OpenRouter. 🥳 Best-in-class 7B model at $0.01/M tokens (the same price as Llama 3.2 1B), low latency, and high throughput.

In the shadow of DeepSeek R1, our LFM-7B is performing really well on OpenRouter. 🥳

Best-in-class 7B model at $0.01/M tokens (the same price as Llama 3.2 1B), low latency, and high throughput.
Stefano Massaroli (@massastrello) 's Twitter Profile Photo

LFM2 is live. We keep state coefficients time-invariant because Hyena gated short convs already supply the adaptive dynamics—and the data agree. Same accuracy, leaner compute budget. Efficiency in practice, not on paper. #LiquidAI