Zihang Dai (@zihangdai) 's Twitter Profile
Zihang Dai

@zihangdai

Working hard @xai

ID: 532791696

calendar_today22-03-2012 01:07:10

31 Tweet

16,16K Takipçi

235 Takip Edilen

xAI (@xai) 's Twitter Profile Photo

Announcing Grok! Grok is an AI modeled after the Hitchhiker’s Guide to the Galaxy, so intended to answer almost anything and, far harder, even suggest what questions to ask! Grok is designed to answer questions with a bit of wit and has a rebellious streak, so please don’t use

Yuhuai (Tony) Wu (@yuhu_ai_) 's Twitter Profile Photo

Coming to #NeurIPS23 now. Will be there until Friday night. DM me to chat about: reasoning, AI for math, and what we’re doing xAI. Also will be at #MATHAI workshop panel discussion on Friday morning. See you there!

lmarena.ai (formerly lmsys.org) (@lmarena_ai) 's Twitter Profile Photo

Woah, another exciting update from Chatbot Arena❤️‍🔥 The results for @xAI’s sus-column-r (Grok 2 early version) are now public**! With over 12,000 community votes, sus-column-r has secured the #3 spot on the overall leaderboard, even matching GPT-4o! It excels in Coding (#2),

Woah, another exciting update from Chatbot Arena❤️‍🔥

The results for @xAI’s sus-column-r (Grok 2 early version) are now public**!

With over 12,000 community votes, sus-column-r has secured the #3 spot on the overall leaderboard, even matching GPT-4o! It excels in Coding (#2),
lmarena.ai (formerly lmsys.org) (@lmarena_ai) 's Twitter Profile Photo

Chatbot Arena update❤️‍🔥 Exciting news—@xAI's Grok-2 and Grok-mini are now officially on the leaderboard! With over 6000 community votes, Grok-2 has claimed the #2 spot, surpassing GPT-4o (May) and tying with the latest Gemini! Grok-2-mini also impresses at #5. Grok-2 excels in

Chatbot Arena update❤️‍🔥

Exciting news—@xAI's Grok-2 and Grok-mini are now officially on the leaderboard!

With over 6000 community votes, Grok-2 has claimed the #2 spot, surpassing GPT-4o (May) and tying with the latest Gemini! Grok-2-mini also impresses at #5.

Grok-2 excels in
xAI (@xai) 's Twitter Profile Photo

We are excited to bring together a group of exceptional engineers and product builders who are intrigued by our mission to build maximally truth-seeking AI Join our open house to meet our team, learn more about xAI, and enjoy a fun evening brought you by the creators of the

We are excited to bring together a group of exceptional engineers and product builders who are intrigued by our mission to build maximally truth-seeking AI

Join our open house to meet our team, learn more about xAI, and enjoy a fun evening brought you by the creators of the
Devin Kim (@devindkim) 's Twitter Profile Photo

We're working on advanced autonomous agents! Every human will have an AI agent capable of performing complex tasks on their behalf. Join the Starfleet team xAI to help us build the future: boards.greenhouse.io/xai/jobs/45478…

Zihang Dai (@zihangdai) 's Twitter Profile Photo

Decide to start to learn more about product from now on. Not sure the answer is that good. But I guess the key words "real", "painful", and "specific" shouldn't be too off. x.com/i/grok/share/f…

Epoch AI (@epochairesearch) 's Twitter Profile Photo

Overall, Grok-3 appears to be the most capable non-reasoning model across these benchmarks, often competitive with reasoning models. Grok-3 mini is also strong, and with high reasoning effort outperforms Grok-3 at math. Note that we haven’t evaluated Gemini 2.5 Pro outside GPQA.

Mohit Reddy (@mohitreddy13) 's Twitter Profile Photo

This has been fun to work on! What began as a two-person effort months ago is now a small, talent-dense team striving to improve and ship new model versions. We've had great support from Yuhuai (Tony) Wu, Jimmy Ba, and Elon Musk, with exciting updates coming soon! I believe we