Benjamin Warner (@benjamin_warner) Twitter Tweets • TwiCopy

Orion Weller @ ICLR 2025

3 months ago

XLM-R has been SOTA for 6 years for multilingual encoders. That's an eternity in AI 🤯 Time for an upgrade. Introducing mmBERT: 2-4x faster than previous models ⚡ while even beating o3 and Gemini 2.5 Pro 🔥 + open models & training data - try it now! How did we do it? 🧵

249 likes · 13 replies · 65 retweets

Horace He

@chhillee

3 months ago

Apologies that I haven't written anything since joining Thinking Machines but I hope this blog post on a topic very near and dear to my heart (reproducible floating point numerics in LLM inference) will make up for it!

2.2K likes · 57 replies · 139 retweets

Tanishq Mathew Abraham, Ph.D.

@iscienceluvr

3 months ago

Excited to announce that Sophont has raised $9.22M in combined pre-seed+seed rounds! 🚀🔥 Led by Kindred Ventures, with participation from @delphi_ventures, Upfront Ventures, AICONIC VENTURES, and also @jeffdean, @logankilpatrick, clem 🤗 (via Factorial Capital), Lukas Biewald & others

518 likes · 87 replies · 46 retweets

Tanishq Mathew Abraham, Ph.D.

@iscienceluvr

3 months ago

At MedARC we are building a comprehensive suite of medical LLM evals, and we already have tons of volunteers and lots of great progress! The project started less than a week ago! Are there other medical LLM evals we should include?

88 likes · 10 replies · 9 retweets

tomaarsen

@tomaarsen

3 months ago

🛒 RexBERT: ModernBERT except for E-commerce was just released by RAHUL BAJAJ et al! 4 base encoders (17M, 68M, 150M, 400M) trained on 2.3T tokens (with 350B E-commerce related tokens), easily outperforming base models on E-commerce tasks! Details in 🧵

45 likes · 1 reply · 9 retweets

Benjamin Warner

@benjamin_warner

3 months ago

Can one wire this kernel into an existing PyTorch model? Or is this locked to the Modular platform?

3 likes · 0 replies · 0 retweets

Benjamin Warner

@benjamin_warner

3 months ago

ModernBERT is competitive with EmbeddingGemma.

24 likes · 0 replies · 4 retweets

Helen Toner

@hlntnr

2 months ago

Many AI policy decisions are complicated. "Don't ban self-driving cars" is really not. Good new piece from Kelsey Piper, with a lede that pulls no punches:

390 likes · 17 replies · 47 retweets

Benjamin Warner

@benjamin_warner

2 months ago

It's great to see our efficient architecture improvements powering a new ecosystem of encoder models.

11 likes · 0 replies · 3 retweets

Benjamin Warner

@benjamin_warner

2 months ago

Vibe engineering also seems to better describe "LLM pair programming" than vibe coding.

1 like · 0 replies · 1 retweet

Nicholas Decker 🏳️‍🌈🌐🇺🇦

@captgouda24

2 months ago

This paper is one of the most astonishing feats of sustained data wizardry I have ever seen. Using data from Uber, they are able to estimate the roughness of every road in America and precisely estimate the value people place on it, and so much more. 1/

592 likes · 7 replies · 70 retweets

Benjamin Warner

@benjamin_warner

2 months ago

Nvidia missed the mark by not equipping the Spark with ~400GB/s memory bandwidth.

1 like · 0 replies · 0 retweets

Soumith Chintala

@soumithchintala

2 months ago

MacStudio you ask? Apple Engineering's **actual** time spent on PyTorch support hasn't given me confidence that PyTorch Mac experience would get anywhere close to NVIDIA's any time soon, if ever. The Meta engineers continue to do a huge amount of heavy-lifting for improving the

759 likes · 31 replies · 53 retweets

Sophont

@sophontai

2 months ago

Excited to share our first paper: "Scaling Vision Transformers for Functional MRI with Flat Maps". We introduce a new approach to training fMRI neuroimaging foundation models and demonstrate a strict dataset power scaling law!

79 likes · 5 replies · 15 retweets

Tanishq Mathew Abraham, Ph.D.

@iscienceluvr

2 months ago

We've released our first Sophont paper! 🔥 We're bullish on the potential of foundation models to help unlock novel clinical applications for brain & mental health. So we are working on training better neuroimaging foundation models. We develop a novel approach for training fMRI

284 likes · 10 replies · 33 retweets

Tanishq Mathew Abraham, Ph.D.

@iscienceluvr

2 months ago

I think it's awesome that we have an author contributing the benchmark they developed to our MedARC LLM eval suite!

12 likes · 0 replies · 3 retweets

Tanishq Mathew Abraham, Ph.D.

@iscienceluvr

2 months ago

META RESEARCHERS WHO WERE LAID OFF: Hit me up if you wanna work on open-source LLMs and multimodal models for medicine and healthcare! This includes working on reasoning/RLVR/etc. and self-supervised training. It's very exciting research and very impactful too, come join!

515 likes · 16 replies · 36 retweets

Austin Huang

@austinvhuang

2 months ago

Belated life update - I'm starting a company. No boundaries between low/high level systems engineering, research, design or product. No boundaries between AI compute and situated human collaboration and learning. More soon - DMs open if you want to connect.

121 likes · 19 replies · 12 retweets