Lewis Tunstall (@_lewtun) 's Twitter Profile
Lewis Tunstall

@_lewtun

🤗 LLM whisperer @huggingface
📖 Co-author of "NLP with Transformers" book
💥 Ex-particle physicist
🤘 Occasional guitarist
🇦🇺 in 🇨🇭

ID: 1029493180704714753

linkhttps://transformersbook.com/ calendar_today14-08-2018 22:21:16

4,4K Tweet

15,15K Followers

514 Following

Adina Yakup (@adinayakup) 's Twitter Profile Photo

Big events bring big moves. WAIC, one of China’s top AI events, starts tomorrow. Got a feeling we’ll see a wave of new models and fresh AI moves in the coming days👀

Arthur Zucker (@art_zucker) 's Twitter Profile Photo

With the latest release, I want to make sure I get this message to the community: we are listening! Hugging Face we are very ambitious and we want `transformers` to accelerate the ecosystem and enable all hardwares / platforms! Let's build AGI together 🫣 Unbloat and Enable!

With the latest release, I want to make sure I get this message to the community: we are listening! 

<a href="/huggingface/">Hugging Face</a> we are very ambitious and we want `transformers` to accelerate the ecosystem and enable all hardwares / platforms! 
Let's build AGI together 🫣
Unbloat and Enable!
Lewis Tunstall (@_lewtun) 's Twitter Profile Photo

One of the early SmolLM3 checkpoints had super weird vibes despite scoring well on benchmarks. Turned out we had a bug in our data processing script that nuked all system messages 😵‍💫

Nikhil Thorat (@nsthorat) 's Twitter Profile Photo

College is about socialization — most of my social & musical life came from college. Wouldn’t trade that for any startup opportunity. Life != your career.

Z.ai (@zai_org) 's Twitter Profile Photo

Introducing GLM-4.5 and GLM-4.5 Air: new flagship models designed to unify frontier reasoning, coding, and agentic capabilities. GLM-4.5: 355B total / 32B active parameters GLM-4.5-Air: 106B total / 12B active parameters API Pricing (per 1M tokens): GLM-4.5: $0.6 Input / $2.2

Introducing GLM-4.5 and GLM-4.5 Air: new flagship models designed to unify frontier reasoning, coding, and agentic capabilities.

GLM-4.5: 355B total / 32B active parameters
GLM-4.5-Air: 106B total / 12B active parameters

API Pricing (per 1M tokens):
GLM-4.5: $0.6 Input / $2.2
Yacine Jernite (@yjernite) 's Twitter Profile Photo

SmolLM3 now has an EU Summary of Public Content!!! 🇪🇺🤗 The Hugging Face SmolLM3 model is one of the strongest models its size while being fully open (incl. data!); it's now also (one of?) the first model to come with the AI Act-mandated summary of training content... 🧵👇1/4

SmolLM3 now has an EU Summary of Public Content!!! 🇪🇺🤗

The <a href="/huggingface/">Hugging Face</a>  SmolLM3 model is one of the strongest models its size while being fully open (incl. data!); it's now also (one of?) the first model to come with the AI Act-mandated summary of training content...
🧵👇1/4
Charlie Marsh (@charliermarsh) 's Twitter Profile Photo

The new Hugging Face jobs CLI is powered by uv 🤗 You can use `hf jobs uv run` to initiate a job from a standalone Python script.

The new Hugging Face jobs CLI is powered by uv 🤗

You can use `hf jobs uv run` to initiate a job from a standalone Python script.
Lewis Tunstall (@_lewtun) 's Twitter Profile Photo

We just released the training and evaluation code to reproduce SmolLM3 🔥 🏋️‍♀️ Pretraining scripts (nanotron) 🤠 Post-training code for mid-training + SFT + APO (TRL/alignment-handbook) 👩‍⚖️ Evaluation scripts to reproduce all reported metrics github.com/huggingface/sm… We're

Lucas Atkins (@lucasatkins7) 's Twitter Profile Photo

Today, we’re officially releasing the weights for AFM-4.5B and AFM-4.5B-Base on HuggingFace. This is a major milestone for Arcee.ai. AFM is designed to be flexible and high-performing across a wide range of deployment environments.

Today, we’re officially releasing the weights for AFM-4.5B and AFM-4.5B-Base on HuggingFace. This is a major milestone for <a href="/arcee_ai/">Arcee.ai</a>. AFM is designed to be flexible and high-performing across a wide range of deployment environments.