Mahmoud Soliman (@mjsmlp)'s Twitter Profile

NaN. JAX @NVIDIA, opinions are my own.

ID: 1425853413598896132

Joined: 12-08-2021 16:15:34

413 Tweets

574 Followers

1.1K Following

Bryan Catanzaro (@ctnzr)

Nemotron-4-340B is released today!
* Base, Instruct, Reward models
* Permissive license
* Great for Synthetic Data Generation
* Designed to help others build their own models
* Sized for inference on 8 NVIDIA H100 GPUs
* Competitive across many tasks

Joana Carrasqueira (@jocarrasqueira)

At #GoogleIOConnect Berlin Gus (🤖🧠+🐍+🥑🗣️) introduced #Gemma2 which runs at incredible speed across different hardware and outperforms models twice its size! So excited to see what the community will build with #Gemma2! Congratulations team 👏🏻👏🏻

Robert Dadashi (@robdadashi)

I am so proud to announce that:
- Gemma 2 27B IT tops all open weights models on Chatbot Arena, with a pinch of optimism in the face of uncertainty :)
- Gemma 2 9B IT sets a new frontier for models of similar size.
1/n
Mahmoud Soliman (@mjsmlp)

I know that in big companies that care about their team & culture, the person would get a bonus actually (because guess what, they uncovered a massive systemic process hole the size of Jupiter). There are teams that tout blameless culture and there are teams that walk the talk.

Elon Musk (@elonmusk)

Nice work by @xAI team, X team, @Nvidia & supporting companies getting Memphis Supercluster training started at ~4:20am local time. With 100k liquid-cooled H100s on a single RDMA fabric, it’s the most powerful AI training cluster in the world!

Eric Steinberger (@ericsteinb)

We want to build an AI model that can design, code and secure the next version of itself. This is obviously really hard. I thought it’d take <=2y lol. It’s been 2y now. It will take >2y. But I still believe it’ll happen!

Max Jaderberg (@maxjaderberg)

Excited to share that the AlphaFold 3 model code and weights are now available for academic use. Looking forward to seeing what new research this unlocks and how the research community builds on AlphaFold 3 for scientific discoveries github.com/google-deepmin… 1/2

Jack Parker-Holder (@jparkerholder)

Introducing 🧞Genie 2 🧞 - our most capable large-scale foundation world model, which can generate a diverse array of consistent worlds, playable for up to a minute. We believe Genie 2 could unlock the next wave of capabilities for embodied agents 🧠.

Jaana Dogan ヤナ ドガン (@rakyll)

This month is my last at Google. In a month, I'm joining Google DeepMind. Our job will be making DeepMind's work publicly available in ways that weren't possible before. While this is one of the biggest challenges in my life, it's potentially going to be the most rewarding.

Bryan Catanzaro (@ctnzr)

Nemotron-H: A family of Hybrid Mamba-Transformer LLMs.
* Hybrid architecture means up to 3X faster at the same accuracy
* Trained in FP8
* Great for VLMs
* Weights and instruct versions to come soon.

research.nvidia.com/labs/adlr/nemo…