Sarah Chieng
@sarahchieng
@CerebrasSystems
prev. @ExaAiLabs, @shopthrifthouse, @MIT
@milksandmatcha on TikTok
Currently in SF | DMs Open
ID: 1502356865794985986
http://sarahchieng.com 11-03-2022 18:52:48
1.1K Tweets
12.12K Followers
837 Following
Over this last year, Amanda Huang and I started running 'Cafe Compute', SF's first regular late-night coffee shop for engineers, founders, writers... (anyone) to co-work or hang out late at night. We always bring in Bryan and Annie, two amazing baristas, to serve free coffee and
Super happy to be joining the Growth team with Sarah Chieng and Daniel Kim at Cerebras! Could not be joining a cooler team with cooler people. Keep an eye out for regularly shipped products and demos coming soon
Let's talk about MoE: How many experts should you use? How does dynamic routing actually behave in production? How do you debug a model that won't train? What does 8x7B actually mean for memory and compute? What hardware optimizations matter for sparse models?
'Jonathan Ross and I made this bet in 2017. Groq is now the fastest inference solution in market' Society would expect Chamath Palihapitiya to be truthful. I mean pick a model...any model. Look at independent benchmarks. These charts aren't hard to read.