Ce Zhang (@ce_zhang) 's Twitter Profile
Ce Zhang

@ce_zhang

CTO @ Together @togethercompute
Neubauer Associate Professor @UChicago

ID: 772945471979474945

Link: https://zhangce.github.io/

Joined: 05-09-2016 23:52:16

692 Tweets

2.2K Followers

1.1K Following

Together AI (@togethercompute) 's Twitter Profile Photo

Our Open Data Scientist agent is now ranked on the DABStep data analysis leaderboard!

We released everything - so you can try it yourself! 🔥:
• Full codebase
• Detailed workflow recipe
• Benchmarks

See how we created it. 🧵
Together AI (@togethercompute) 's Twitter Profile Photo

Announcing DeepSWE 🤖: our fully open-sourced, SOTA software engineering agent trained purely with RL on top of Qwen3-32B. DeepSWE achieves 59% on SWEBench-Verified with test-time scaling (and 42.2% Pass@1), topping the SWEBench leaderboard for open-weight models.

Built in
Together AI (@togethercompute) 's Twitter Profile Photo

🚀 We just launched speech-to-text APIs designed for real-time applications.

Our Whisper V3 Large deployment delivers transcription 15x faster than OpenAI while maintaining full accuracy.

Sub-second processing that actually keeps up with conversation speed ⚡
Together AI (@togethercompute) 's Twitter Profile Photo

We just launched a new "dictate" feature on Together Chat powered by our new Whisper model! The video is not sped up – it's really that fast!

Together AI (@togethercompute) 's Twitter Profile Photo

Together AI Sets a New Bar: Fastest Inference for DeepSeek-R1-0528

We've upgraded the Together Inference Engine to run on @NVIDIA Blackwell GPUs, and the results speak for themselves:
📈 Highest known serverless throughput: 334 tokens/sec
🏃 Fastest time to first answer token:
Hassan (@nutlope) 's Twitter Profile Photo

We now have the fastest speeds for DeepSeek R1 – up to 330 tokens/sec running on B200s! Here it is in action – video is not sped up!

Together AI (@togethercompute) 's Twitter Profile Photo

We built an open source voice note taking app using our fast Whisper implementation! Check it out -> usewhisper.io

Together AI (@togethercompute) 's Twitter Profile Photo

🛡️ VirtueGuard is LIVE on Together AI 🚀

AI security and safety model that screens input and output for harmful content:

⚡ Under 10ms response
🎯 89% accuracy vs 76% (AWS Bedrock)
🧠 Context-aware - adapts to your policies, not just keywords 👇
Drishan Arora (@drishanarora) 's Twitter Profile Photo

A small update - we had more traffic than anticipated. However, the endpoints are now scalable on Together AI for all models, including the 671B MoE. Test out the model here: together.ai/models/cogito-… (A huge thanks to the folks at Together AI for making this happen so

Together AI (@togethercompute) 's Twitter Profile Photo

🤖 OpenAI's open models are here. gpt-oss models just landed on Together AI. Achieves near-parity with o4-mini, trained using o3 techniques. Build anything, deploy anywhere 🔥

Together AI (@togethercompute) 's Twitter Profile Photo

Therapeutic AI isn't just "helpful" AI 🧠

Slingshot AI built a psychology foundation model that knows when to push back, stay silent, or offer new perspectives.

And now - 50,000+ people are getting specialized mental health support.
Together AI (@togethercompute) 's Twitter Profile Photo

Building AI agents for complex engineering tasks ≠ building chatbots 🧵

Most AI agents today excel at short, simple tasks. But automating multi-day engineering workflows? That's a whole different game.

At Together AI, we learned this the hard way while optimizing LLM
Hassan (@nutlope) 's Twitter Profile Photo

I'm building a realtime video analysis app! It takes a screenshot every 500ms, sends it to Llama 4 on Together AI, and streams back the results. I want to extend it to be able to perform actions too (record my screen & send me a text when a video finishes, for example).

Together AI (@togethercompute) 's Twitter Profile Photo

The Washington Post processes 1.79 billion tokens every month powering "Ask The Post AI"

They needed reliable inference without vendor lock-in. Fixed costs. Full model ownership.

Together AI's Dedicated Endpoints delivered.
Hassan (@nutlope) 's Twitter Profile Photo

Announcing ReceiptHero – an app to help people track their finances! It'll take in any receipts you have, extract the total $, and categorize it for you (dining, groceries, utilities, etc.). 100% free & open source. Powered by Llama 4 on Together AI.

Together AI (@togethercompute) 's Twitter Profile Photo

Breaking: VFS Global x Together AI announce strategic partnership.

We're partnering with VFS Global to scale secure, responsible, and high-performance AI solutions for global mobility.

Millions of visa applications. 160+ countries. One mission: faster, more transparent, and
Together AI (@togethercompute) 's Twitter Profile Photo

We're excited to host Apriel-1.5-15b-Thinker by ServiceNow's SLAM labs on Together AI!

👉 15B parameters, fits on single GPU
👉 On par with Deepseek-R1-0528 and Mistral-Medium-1.2 on the Artificial Analysis Intelligence Index

Built by Sathwik Tejaswi, ServiceNow AI Research