Sebastian Raschka (@rasbt) 's Twitter Profile
Sebastian Raschka

@rasbt

ML/AI researcher & former stats professor turned LLM research engineer. Author of "Build a Large Language Model From Scratch" (amzn.to/4fqvn0D).

ID: 865622395

linkhttps://sebastianraschka.com calendar_today07-10-2012 02:06:16

17,17K Tweet

326,326K Takipçi

1,1K Takip Edilen

Sebastian Raschka (@rasbt) 's Twitter Profile Photo

So, I did some coding this week... - Qwen3 Coder Flash (30B-A3B) - Mixture-of-Experts setup with 128 experts, 8 active per token - In pure PyTorch (optimized for human readability) - in a standalone Jupyter notebook - Runs on a single A100

So, I did some coding this week...
- Qwen3 Coder Flash (30B-A3B)
- Mixture-of-Experts setup with 128 experts, 8 active per token
- In pure PyTorch (optimized for human readability)
- in a standalone Jupyter notebook
- Runs on a single A100