Mario Tormo (@mt0rm0) 's Twitter Profile
Mario Tormo

@mt0rm0

AI Engineer & Senior Data Scientist. Appasionate about #Math, #Jazz, #Cinema and #Books. He/him. #AI #NLP #DataScience #MLOps

ID: 1306505184911208453

calendar_today17-09-2020 08:08:45

1,1K Tweet

1,1K Followers

1,1K Following

Deedy (@deedydas) 's Twitter Profile Photo

DeepSeek just dropped the single best end-to-end paper on large model training. It covers — Software (MLA, training in FP8, DeepEP, LogFMT) — Hardware (Multi-Rail Fat Tree, Ethernet RoCE switches) — Mix (IBGDA, 3FS filesystem) DeepSeek's engineering depth is insane. Must read.

DeepSeek just dropped the single best end-to-end paper on large model training.

It covers
— Software (MLA, training in FP8, DeepEP, LogFMT)
— Hardware (Multi-Rail Fat Tree, Ethernet RoCE switches)
— Mix (IBGDA, 3FS filesystem)

DeepSeek's engineering depth is insane. Must read.
Kevin Patrick Murphy (@sirbayes) 's Twitter Profile Photo

I am pleased to announce a new version of my RL tutorial. Major update to the LLM chapter (eg DPO, GRPO, thinking), minor updates to the MARL and MBRL chapters and various sections (eg offline RL, DPG, etc). Enjoy! arxiv.org/abs/2412.05265

I am pleased to announce a new version of my RL tutorial. Major update to the LLM chapter (eg DPO, GRPO, thinking), minor updates to the MARL and MBRL chapters and various sections (eg offline RL, DPG, etc). Enjoy!
arxiv.org/abs/2412.05265
Ethan Mollick (@emollick) 's Twitter Profile Photo

Huh. Looks like Plato was right. A new paper shows all language models converge on the same "universal geometry" of meaning. Researchers can translate between ANY model's embeddings without seeing the original text. Implications for philosophy and vector databases alike.

Huh. Looks like Plato was right.

A new paper shows all language models converge on the same "universal geometry" of meaning. Researchers can translate between ANY model's embeddings without seeing the original text.

Implications for philosophy and vector databases alike.
Shubham Saboo (@saboo_shubham_) 's Twitter Profile Photo

This AI Agent can think, code, reason and browse in a single loop just like humans. Outperforms other AI Agent frameworks like Manus AI, Genspark AI, and OpenAI Deep Research. And it's 100% Opensource.

This AI Agent can think, code, reason and browse in a single loop just like humans. 

Outperforms other AI Agent frameworks like Manus AI, Genspark AI, and OpenAI Deep Research. 

And it's 100% Opensource.
elvis (@omarsar0) 's Twitter Profile Photo

New Lens on RAG Systems RAG systems are more brittle than you think, even when provided sufficient context. Great work from Google and collaborators. Good tips for devs included. Here are my notes:

New Lens on RAG Systems

RAG systems are more brittle than you think, even when provided sufficient context.

Great work from Google and collaborators.

Good tips for devs included.

Here are my notes:
The AI Timeline (@theaitimeline) 's Twitter Profile Photo

🚨This week's top AI/ML research papers: - Spurious Rewards - FLUX.1 Kontext - Learning to Reason without External Rewards - Reasoning LLMs are Wandering Solution Explorers - VLM-3R - Silence is Not Consensus - Beyond Markovian - The Entropy Mechanism of RL for Reasoning LMs -

🚨This week's top AI/ML research papers:

- Spurious Rewards
- FLUX.1 Kontext
- Learning to Reason without External Rewards
- Reasoning LLMs are Wandering Solution Explorers
- VLM-3R
- Silence is Not Consensus
- Beyond Markovian
- The Entropy Mechanism of RL for Reasoning LMs
-
Alfredo Canziani (@alfcnz) 's Twitter Profile Photo

Releasing the Energy-Book 🔋 from its first appendix's chapter, where I explain how I create my figures. 🎨 Feel free to report errors via the issues' tracker, contribute to the exercises, and show me what you can draw, via the discussion section. 🥳 github.com/Atcold/Energy-…

Releasing the Energy-Book 🔋 from its first appendix's chapter, where I explain how I create my figures. 🎨
Feel free to report errors via the issues' tracker, contribute to the exercises, and show me what you can draw, via the discussion section. 🥳
github.com/Atcold/Energy-…
Javi Lopez ⛩️ (@javilopen) 's Twitter Profile Photo

🔥 Midjourney video is almost here... And it has, somehow, that incredible artistic aesthetic that MJ is famous for. Yes, these are all MJ video generations!!! 🧵👇

Manuel Faysse (@manuelfaysse) 's Twitter Profile Photo

🚨Should We Still Pretrain Encoders with Masked Language Modeling? We have recently seen massively trained causal decoders take the lead in embedding benchmarks, surpassing encoders w/ bidirectional attention. We revisit whether Bert-style encoders are a thing of the past? (1/N)

🚨Should We Still Pretrain Encoders with Masked Language Modeling? We have recently seen massively trained causal decoders take the lead in embedding benchmarks, surpassing encoders w/ bidirectional attention.  We revisit whether Bert-style encoders are a thing of the past? (1/N)
Eugene Yan (@eugeneyan) 's Twitter Profile Photo

How do you build an LLM-evaluator / LLM-as-Judge? The book for "AI Evals for PMs and Engineers" has a chapter devoted to it (35% discount: maven.com/parlance-labs/…) First, we need to define the right metrics. For example, we can start by listing the failure modes from our error

How do you build an LLM-evaluator / LLM-as-Judge? The book for "AI Evals for PMs and Engineers" has a chapter devoted to it (35% discount: maven.com/parlance-labs/…)

First, we need to define the right metrics. For example, we can start by listing the failure modes from our error
Alfredo Canziani (@alfcnz) 's Twitter Profile Photo

My NYU Center for Data Science colleague, Carlos Fernandez-Granda, released the 700-page textbook «Probability and Statistics for Data Science», where he condenses 10 years of teaching experience at @NYUniversity. 200 exercises, 102 notebooks, 115 videos! 🥳🥳🥳 ps4ds.net

My <a href="/NYUDataScience/">NYU Center for Data Science</a> colleague, Carlos Fernandez-Granda, released the 700-page textbook «Probability and Statistics for Data Science», where he condenses 10 years of teaching experience at @NYUniversity.
200 exercises, 102 notebooks, 115 videos! 🥳🥳🥳
ps4ds.net
Gary Marcus (@garymarcus) 's Twitter Profile Photo

My work here is truly done. Nobody with intellectual integrity can still believe that pure scaling will get us to AGI. GPT-5 may be a moderate quantitative improvement (and it may be cheaper) but it still fails in all the same qualitative ways as its predecessors, on chess, on

Rohan Paul (@rohanpaul_ai) 's Twitter Profile Photo

Stanford Deep Learning for Computer Vision taught by Professor Fei-Fei Li (Fei-Fei Li) and Assistant Professor Ehsan Adeli. Such an enjoyable YT series. (link in comment)

Rohan Paul (@rohanpaul_ai) 's Twitter Profile Photo

The paper turns LLM role play into a retrieval task so the agent stays in character. When the researchers tested their system against "jailbreak" attempts (where someone tries to push the model out of its assigned role), the model relied more heavily on its reference examples

The paper turns LLM role play into a retrieval task so the agent stays in character. 

When the researchers tested their system against "jailbreak" attempts (where someone tries to push the model out of its assigned role), the model relied more heavily on its reference examples
FacFisicaUV (@facfisicauv) 's Twitter Profile Photo

Lamentem comunicar-vos el traspàs del professor José Bernabeu Alberola. En nom de tota la comunitat de la Facultat de Física volem transmetre el nostre més sentit condol als seus familiars, amics i col·laboradors, i acompanyar-los en tan difícils moments. Descanse en pau.

Lamentem comunicar-vos el traspàs del professor José Bernabeu Alberola.

En nom de tota la comunitat de la Facultat de Física volem transmetre el nostre més sentit condol als seus familiars, amics i col·laboradors, i acompanyar-los en tan difícils moments.

Descanse en pau.
Charly Wargnier (@datachaz) 's Twitter Profile Photo

Microsoft just killed the GPU mafia! 🤯 They've open-sourced bitnet.cpp, a blazing-fast 1-bit LLM inference framework optimized for CPUs. This is a major step forward for running large models locally, without expensive GPUs or cloud costs. Demo app + repo + paper in 🧵 ↓