Byron Hsu (@hsu_byron) 's Twitter Profile
Byron Hsu

@hsu_byron

inference optimization @xAI | @lmsysorg @liger_kernel @flyteorg @theASF

ID: 931483504604454912

linkhttps://github.com/ByronHsu calendar_today17-11-2017 11:25:48

2,2K Tweet

3,3K Followers

1,1K Following

Yung-Sung Chuang (@yungsungchuang) 's Twitter Profile Photo

Scaling CLIP on English-only data is outdated now… 🌍We built CLIP data curation pipeline for 300+ languages 🇬🇧We train MetaCLIP 2 without compromising English-task performance (it actually improves! 🥳It’s time to drop the language filter! 📝arxiv.org/abs/2507.22062 [1/5] 🧵

Scaling CLIP on English-only data is outdated now…

🌍We built CLIP data curation pipeline for 300+ languages
🇬🇧We train MetaCLIP 2 without compromising English-task performance (it actually improves!
🥳It’s time to drop the language filter!

📝arxiv.org/abs/2507.22062

[1/5]

🧵
Cheng-Fu Joey Yang (@cfyang58) 's Twitter Profile Photo

🎉 Thrilled to share that our paper “Verbalized Representation Learning for Interpretable Few‑Shot Generalization” has been accepted to #ICCV2025! 🚀 Check out the details:

Guodong Zhang (@guodzh) 's Twitter Profile Photo

We are actively hiring for multimodal understanding and generation. Join us to build the future AI interfaces! job-boards.greenhouse.io/xai/jobs/47206… job-boards.greenhouse.io/xai/jobs/46816… job-boards.greenhouse.io/xai/jobs/43783…

Guodong Zhang (@guodzh) 's Twitter Profile Photo

We are hiring on pretraining as well. If you are passionate about improving training efficiency, pretraining data quality and training infra. Please apply here: job-boards.greenhouse.io/xai/jobs/45338… job-boards.greenhouse.io/xai/jobs/46846… job-boards.greenhouse.io/xai/jobs/46036…

Zeeshan Patel (@zeeshanp_) 's Twitter Profile Photo

Many people wonder what is the benefit of training video gen models. Video gen by itself doesn’t necessarily seem to provide as much raw intelligence to users as modern LLMs. However, in the long term, video gen models will be used as neural simulations of the universe within

Byron Hsu (@hsu_byron) 's Twitter Profile Photo

At xAI, we are managing traffic at an unprecedented scale. Our team is small, dedicated, and highly skilled. In this role, you will own a critical part of our production serving infrastructure, collaborating closely with the research inference team to ensure it is elastic,

Chaoqi Wang (@chaoqi_w) 's Twitter Profile Photo

We are hiring brilliant engineers to work on pretraining! Join us to tackle pretraining data, design cutting-edge data recipes, and build next-gen data infra. If you’re driven to accelerate human discovery and ready to change the world, apply now to join our galactic mission!

skcd (@skcd42) 's Twitter Profile Photo

Grok-code-fast-1 is now out and available for everyone to use 🚀🏎️💨 When I joined the coding team, the team was just 3 people and we very quickly built a model which was SOTA on SWEBench. But as things go, in the real world benchmarks matter less. Over the last few months we

Yung-Sung Chuang (@yungsungchuang) 's Twitter Profile Photo

🎉 Excited to share our MetaCLIP 2 is now accepted as Spotlight at #NeurIPS2025 and the models are available on HF: 🤗 huggingface.co/models?other=m… Pls use it if you want CLIP with: 🌏 1. diverse worldwide knowledge beyond English CLIP 🇬🇧 2. even better English ability See u in SD!