Hongyuan Mei (@roverhm) 's Twitter Profile
Hongyuan Mei

@roverhm

Core Contributor to Grok 4 & Grok 4 Heavy. Member of Technical Staff @xAI. Training knowledgeable AI reasoners. ex-@GoogleDeepMind, @TTIC_Connect, @jhuclsp.

ID: 2164964468

linkhttp://www.hongyuanmei.com calendar_today30-10-2013 15:22:11

28 Tweet

687 Followers

145 Following

Bo Wang (@bowang87) 's Twitter Profile Photo

🚀 how good is Grok-4 on biomedical applications ? Yesterday, xAI benchmarked Grok-4 on our Chest Agent benchmark—and it crushed it. With 72.8% accuracy, Grok-4 outperformed our MedRAX (63.1%) (the previous STOA) and all other models in chest X-ray interpretation. 🔗 MedRAX:

🚀 how good is Grok-4 on biomedical applications ? 

Yesterday, <a href="/xai/">xAI</a> benchmarked Grok-4 on our Chest Agent benchmark—and it crushed it.
With 72.8% accuracy, Grok-4 outperformed our MedRAX (63.1%) (the previous STOA) and all other models in chest X-ray interpretation.

đź”— MedRAX:
Mckay Wrigley (@mckaywrigley) 's Twitter Profile Photo

My thoughts on Grok 4 Heavy after 12hrs: Crazy good! “Create an animation of a crowd of people walking to form “Hello world, I am Grok” as camera changes to birds-eye.” And it 1-shotted the *entire* thing. No other model comes close. Watch the full clip.

Yuhuai (Tony) Wu (@yuhu_ai_) 's Twitter Profile Photo

Very proud of us xAI after seeing the GPT5 release. With a much smaller team, we are ahead in many. Grok4 world’s first unified model, and crushing GPT5 in benchmarks like ARC-AGI. OpenAI is a very respectful competitor and still the leader in many, but we’re fast and

Hongyuan Mei (@roverhm) 's Twitter Profile Photo

From first research prototype to production release — thrilled to have delivered this new Grok capability! Kudos to our small but talented team! Yiwen Yuan Dengfeng Li Shun Keiran Paster

xAI (@xai) 's Twitter Profile Photo

Grok 4 is now free for all users worldwide! Simply use Auto mode, and Grok will route complex queries to Grok 4. Prefer control? Choose "Expert" anytime to always use Grok 4. For a limited time, we are rolling out generous usage limits so you can explore Grok 4’s full

xAI (@xai) 's Twitter Profile Photo

Introducing Grok Code Fast 1, a speedy and economical reasoning model that excels at agentic coding. Now available for free on GitHub Copilot, Cursor, Cline, Kilo Code, Roo Code, opencode, and Windsurf. x.ai/news/grok-code…

Bill Yuchen Lin (@billyuchenlin) 's Twitter Profile Photo

We are thrilled to announce the release of our 1st model for coding and agentic tasks, grok-code-fast! You can now integrate it into all IDE and CLI tools for your daily coding needs. It’s fast, powerful, and affordable! We’re committed to continuous improvement based on your

Hongyuan Mei (@roverhm) 's Twitter Profile Photo

Working with Grok, all day all night, on a prediction problem that I am sure Grok has never seen. Grok suggested tons of tricks for feature engineering and modeling, which I might never know without Grok teaching me... They worked! AGI is coming...

Hongyuan Mei (@roverhm) 's Twitter Profile Photo

Give your PDF docs to Grok-4, not GPT-5. Here is why: - ✅Grok-4: grok.com/share/bGVnYWN5… - ❌GPT-5: chatgpt.com/share/68c1fd87…

Hongyuan Mei (@roverhm) 's Twitter Profile Photo

Humbled & excited: Grok is now the best AI for financial search & reasoning 🚀 Thanks to the ByteDance team for their great work! Guess what — Grok is still cooking 👨‍🍳 Soon it’ll be an even better AI expert across high-impact domains. Join us: job-boards.greenhouse.io/xai/jobs/48000…

Yuhuai (Tony) Wu (@yuhu_ai_) 's Twitter Profile Photo

Strongly recommend Grok 4 planning + Grok Code Fast execution. We will also ship out new coding products based on this concept soon.

Hongyuan Mei (@roverhm) 's Twitter Profile Photo

In RL for LLM reasoning, it’s not just about maximizing reward, but aligning policy to the reward distribution. Our new paper uses flow matching to boost rollout diversity—improving math & code reasoning across the board. Huge thanks to awesome coauthors!

In RL for LLM reasoning, it’s not just about maximizing reward, but aligning policy to the reward distribution. Our new paper uses flow matching to boost rollout diversity—improving math &amp; code reasoning across the board. Huge thanks to awesome coauthors!
xAI (@xai) 's Twitter Profile Photo

Introducing Grok 4 Fast, a multimodal reasoning model with a 2M context window that sets a new standard for cost-efficient intelligence. Available for free on grok.com, grok.x.com, iOS and Android apps, and OpenRouter. x.ai/news/grok-4-fa…

Yuhuai (Tony) Wu (@yuhu_ai_) 's Twitter Profile Photo

Hiring for a new team building computer control agents. Join us to build Grok5 / macrohard later this year. DM me! Will send out a job post soon too.