Gavin Uberti(@UbertiGavin) 's Twitter Profileg
Gavin Uberti

@UbertiGavin

Building model-specific AI chips @ Etched

ID:1499947604037283841

linkhttps://etched.com/ calendar_today05-03-2022 03:19:34

53 Tweets

1,0K Followers

144 Following

Robert Wachen(@robertwachen) 's Twitter Profile Photo

Every company on this list is bottlenecked by compute cost and speed. Grateful to be a Thiel Fellow and be working on Etched with Gavin Uberti Chris Zhu

account_circle
Gavin Uberti(@UbertiGavin) 's Twitter Profile Photo

The 21st century will be the most important century ever for humanity, thanks to the rapid advances in artificial intelligence. It makes no sense to sit on the sidelines in university.

I'm excited to join the latest class of Thiel Fellows

account_circle
Gavin Uberti(@UbertiGavin) 's Twitter Profile Photo

Google open-sourced their Gemma models today, hyperparameters below. Both models have massive feed-forward hidden dimensions - almost every other model uses 3.5-4x the d_model (which would be 8192 and 12288). Not sure why the change.

Google open-sourced their Gemma models today, hyperparameters below. Both models have massive feed-forward hidden dimensions - almost every other model uses 3.5-4x the d_model (which would be 8192 and 12288). Not sure why the change.
account_circle
AK(@_akhaliq) 's Twitter Profile Photo

Chain-of-Thought Reasoning Without Prompting

paper page: huggingface.co/papers/2402.10…

In enhancing the reasoning capabilities of large language models (LLMs), prior research primarily focuses on specific prompting techniques such as few-shot or zero-shot chain-of-thought (CoT)…

Chain-of-Thought Reasoning Without Prompting paper page: huggingface.co/papers/2402.10… In enhancing the reasoning capabilities of large language models (LLMs), prior research primarily focuses on specific prompting techniques such as few-shot or zero-shot chain-of-thought (CoT)…
account_circle