swyx 🔜 AI.Engineer conf! (@swyx)'s Twitter Profile
swyx 🔜 AI.Engineer conf!

@swyx

Anti-ego ideas for anti-ergodic life.

Founder, @Smol_AI
▹ Listen: @latentspacepod
▹ Read: @coding_career
▹ Join: @aiDotengineer

ID:33521530

Link: https://swyx.io · Joined: 20-04-2009 14:04:41

64.3K Tweets

92.3K Followers

2.9K Following

swyx 🔜 AI.Engineer conf! (@swyx)

btw if you as a developer have not been paying attention to the tech earnings season, look out.

basically market is bifurcating ai winners and losers hard rn. if you’re not an ai arms dealer or ai consumer company we basically just flipped to a bear market this week

currently

AI Engineer (@aiDotEngineer)

Excited to announce Devendra Chaplot as a speaker! He's an AI Research Scientist at Mistral AI and will provide an inside look at the building blocks and training strategies that power Mistral's LLMs.

Join him June 25 - 27! 🎟️ link in next post

swyx 🔜 AI.Engineer conf! (@swyx)

quite incredible to see the goalposts of long context move in the past 1 year.

May 2023: asking Jonathan Frankle (@jefrankle) and Abhi Venigalla (@abhi_venigalla) about their 65k+ 'Llongboi' model

May 2024: Mark Huang (@markatgradient) casually extending Llama 3 to >1m tokens with ~perfect NIAH and mainstream ai engineers

Latent Space Podcast (@latentspacepod)

🆕 pod: How to train a Million Context LLM!

Yann LeCun says we should publish, or perish. We asked Mark Huang to spill ALL the beans on how his team extended Llama-3 to 1M+ context with ~perfect Greg Kamradt NIAH evals!

latent.space/p/gradient

Also covering:

- A Brief

swyx 🔜 AI.Engineer conf! (@swyx)

periodic reminder that i do maintain a list of technically focused ai podcasts and newsletters here

github.com/swyxio/ai-note…

obviously it's hard to cover math and images in voice format, but the people who do are heroes for helping the rest of us keep up

swyx 🔜 AI.Engineer conf! (@swyx)

Heads up comparison, @ElevenLabsio vs Cartesia

Cartesia almost completes before Elevenlabs starts lol

realtime SSMs are going to be -wild-

congrats Karan Goel Albert Gu

AI News by Smol AI (@Smol_AI)

AINews: 29 May 2024

What if you KNEW that we may soon have models that can continuously process and reason over text/audio/video with a TRILLION token 'context window'?

Real time? On device?

thanks to Cartesia (@cartesia_ai), Karan Goel (@krandiash), Albert Gu (@_albertgu)

Grace Isford (@graceisford)

Thrilled to unveil Maven AGI, a reimagined customer support experience platform emerging with $28M in funding from Lux Capital, M13, E14 Fund & OpenAI, Stripe, HubSpot execs! 🪄

Read more with their OpenAI case study 🧵
shorturl.at/ZXX3U

Latent Space Podcast (@latentspacepod)

🆕 ICLR 2024: Best Papers (Part 1)

We present our selections of outstanding papers and talks thematically introducing topics for AI Engineers to track:

Section A: ImageGen, Compression, Adversarial Attacks

- Auto-encoding Variational Bayes (Durk Kingma, @dpkingma)
- Würstchen: An Efficient

Andrej Karpathy (@karpathy)

# Reproduce GPT-2 (124M) in llm.c in 90 minutes for $20 ✨

The GPT-2 (124M) is the smallest model in the GPT-2 series released by OpenAI in 2019, and is actually quite accessible today, even for the GPU poor. For example, with llm.c you can now reproduce this model on one 8X

Aman Sanger (@amanrsanger)

These are the problems we believe matter most for the next year of ai-programming

anysphere.inc/blog/problems-…

My favorite is next action prediction, the limit of what's possible with low-latency models

(we press tab 11 times and all other keys 3 times to make these changes)
[1/7]

swyx 🔜 AI.Engineer conf! (@swyx)

ICLR episode part 1 just shipped! technically still in time for the long weekend haha

on a meta level it’s been great studying all the big ML conferences in prep for AI Engineer next month. Lots of good ideas for running >5000 person affairs with something for everyone. Any
