Zhihao Jia (@jiazhihao) Twitter Tweets • TwiCopy

Zhihao Jia

@jiazhihao

+ Follow

Assistant professor of Computer Science at Carnegie Mellon University. Research on systems and machine learning.

ID: 777869916

linkhttps://www.cs.cmu.edu/~zhihaoj2/ calendar_today24-08-2012 10:21:01

153 Tweet

2,2K Followers

640 Following

Gate.io

@gate_io

5 hours ago

🔥The 9th Round of Easy Loan, Earn $40 Reward is in progress❗️ ⏰ Promotion Period: January 15th - Feburary 15th, 2025 👉 Register now and check more details at gate.io/campaigns/358

thumb_up_off_alt34

chat_bubble_outline39

repeat6

shareShare

Matei Zaharia

@matei_zaharia

2 months ago

Super excited for some great launches at our largest Summit yet!

thumb_up_off_alt96

chat_bubble_outline2

repeat7

shareShare

Excited to launch Agent Bricks, a new way to build auto-optimized agents on your tasks. Agent Bricks uniquely takes a *declarative* approach to agent development: you tell us what you want, and we auto-generate evals and optimize the agent. databricks.com/blog/introduci…

thumb_up_off_alt239

chat_bubble_outline5

repeat44

shareShare

Yixin Dong

@yi_xin_dong

2 months ago

Databricks 's Agent Bricks is powered by XGrammar for structured generation, and achieves high quality and efficiency. It helps you complete AI tasks without needing to worry about the algorithmic details. Give it a try!

thumb_up_off_alt12

chat_bubble_outline0

repeat4

shareShare

Tianqi Chen

@tqchenml

2 months ago

Check out our work on parallel reasoning 🧠; We bring an AI-assisted curator that identifies parallel paths in sequential traces, then tune models into native parallel thinkers that runs efficiently with prefix sharing and batching. Really excited about this general direction

thumb_up_off_alt98

chat_bubble_outline1

repeat15

shareShare

Beidi Chen

@beidichen

2 months ago

Say hello to Multiverse — the Everything Everywhere All At Once of generative modeling. 💥 Lossless, adaptive, and gloriously parallel 🌀 Now open-sourced: multiverse4fm.github.io I was amazed how easily we could extract the intrinsic parallelism of even SOTA autoregressive

thumb_up_off_alt66

chat_bubble_outline2

repeat19

shareShare

Beidi Chen

@beidichen

2 months ago

wow 🤩 check this out!!!

thumb_up_off_alt73

chat_bubble_outline1

repeat9

shareShare

You Jiacheng

@youjiacheng

2 months ago

wow cool

thumb_up_off_alt34

chat_bubble_outline2

repeat4

shareShare

Zhihao Jia

@jiazhihao

2 months ago

📢Exciting updates from #MLSys2025! All session recordings are now available and free to watch at mlsys.org. We’re also thrilled to announce that #MLSys2026 will be held in Seattle next May—submissions open next month with a deadline of Oct 30. We look forward to

thumb_up_off_alt101

chat_bubble_outline2

repeat30

shareShare

Tianqi Chen

@tqchenml

2 months ago

#MLSys2026 will be led by the general chair Luis Ceze and PC chairs Zhihao Jia and Aakanksha Chowdhery. The conference will be held in Bellevue on Seattle's east side. Consider submitting and bringing your latest works in AI and systems—more details at mlsys.org.

thumb_up_off_alt57

chat_bubble_outline0

repeat12

shareShare

Anjiang Wei

@anjiangw

2 months ago

We introduce CodeARC, a new benchmark for evaluating LLMs’ inductive reasoning. Agents must synthesize functions from I/O examples—no natural language, just reasoning. 📄 arxiv.org/pdf/2503.23145 💻 github.com/Anjiang-Wei/Co… 🌐 anjiang-wei.github.io/CodeARC-Websit… #LLM #Reasoning #LLM4Code #ARC

thumb_up_off_alt88

chat_bubble_outline3

repeat29

shareShare

Jeff Dean

@jeffdean

2 months ago

Mark your calendars for #MLSys2026 in May, 2026 in Seattle. Submission deadline for papers is Oct 30 this year.

thumb_up_off_alt109

chat_bubble_outline7

repeat15

shareShare

NovaSky

@novaskyai

2 months ago

✨Release: We upgraded SkyRL into a highly-modular, performant RL framework for training LLMs. We prioritized modularity—easily prototype new algorithms, environments, and training logic with minimal overhead. 🧵👇 Blog: novasky-ai.notion.site/skyrl-v01 Code: github.com/NovaSky-AI/Sky…

thumb_up_off_alt202

chat_bubble_outline2

repeat43

shareShare

Francis Y. Yan

@francisyan_

a month ago

🚀 [OSDI ’25, Tue 11:10am] How do you “divide and conquer” large-scale resource allocation problems like GPU cluster scheduling or WAN traffic engineering? Our answer: “decouple and decompose” the underlying optimization using DeDe. (1/3)

thumb_up_off_alt49

chat_bubble_outline4

repeat5

shareShare

Wentao Guo

@wentaoguo7

a month ago

🦆🚀QuACK🦆🚀: new SOL mem-bound kernel library without a single line of CUDA C++ all straight in Python thanks to CuTe-DSL. On H100 with 3TB/s, it performs 33%-50% faster than highly optimized libraries like PyTorch's torch.compile and Liger. 🤯 With Ted Zadouri and Tri Dao

thumb_up_off_alt316

chat_bubble_outline11

repeat66

shareShare

Song Han

@songhan_mit

a month ago

NVILA is available in SGLang👏🏻

thumb_up_off_alt21

chat_bubble_outline1

repeat9

shareShare