Chaoyu Yang (@chaoyu_) 's Twitter Profile
Chaoyu Yang

@chaoyu_

Founder/CEO @bentomlai, Infrastructure for AI Systems

ID: 203040327

linkhttps://bento.me/chaoyu calendar_today15-10-2010 11:53:17

3,3K Tweet

777 Followers

557 Following

BentoML - Infrastructure for Building AI Systems (@bentomlai) 's Twitter Profile Photo

๐Ÿ›’ย #BentoCloud is now available on #AWS Marketplace! We're thrilled to empower AWS customers with a complete platform to build and scale #CompoundAI systems! BentoCloud takes the infrastructure complexity out of production #AI workloads. It enables enterprise AI teams to run

๐Ÿ›’ย #BentoCloud is now available on #AWS Marketplace! We're thrilled to empower AWS customers with a complete platform to build and scale #CompoundAI systems!

BentoCloud takes the infrastructure complexity out of production #AI workloads. It enables enterprise AI teams to run
BentoML - Infrastructure for Building AI Systems (@bentomlai) 's Twitter Profile Photo

๐Ÿš€ We are excited to announce the #AGIBuildersMeetup is in NYC this month (lu.ma/3h9jb07l) and we will co-host it with Yext! We will have tech leaders from #Yext, #BentoML, and AI21 Labs to explore the latest AI topics! Also, expect insightful community demos from

BentoML - Infrastructure for Building AI Systems (@bentomlai) 's Twitter Profile Photo

๐Ÿ“ข #macOS users! Do you know you can now run #LLMs right on your macOS with #OpenLLM? โœจTry out some of the popular models with a simple command using `openllm serve`! openllm serve phi3:3.8b-ggml-q4 openllm serve llama3.2:1b-instruct-ggml-fp16-darwin openllm serve

Tony Wu (@tonywu_71) 's Twitter Profile Photo

BentoML makes deploying ColPali a breeze! ๐Ÿ˜Œ With features like adaptive batching and zero-copy I/O, it minimizes overhead even for large tensor dataโ€”perfect for fast and efficient deployments. Donโ€™t miss the quickstart example repo Iโ€™ve created!

BentoML - Infrastructure for Building AI Systems (@bentomlai) 's Twitter Profile Photo

Do you know you can take your #ComfyUI experience to the next level with #CustomNodes? Read our new blog post (bentoml.com/blog/a-guide-tโ€ฆ) to see the most popular ones - from advanced image enhancement to workflow optimization tools. Plus, find answers to FAQs: โœ… Why custom nodes

BentoML - Infrastructure for Building AI Systems (@bentomlai) 's Twitter Profile Photo

Structured decoding marks a fundamental shift in how we view and utilize LLM outputs. It is a crucial step towards building complex, agentic systems. In this joint blog post (bentoml.com/blog/structureโ€ฆ), we dive into structured decoding in vLLM. Key highlights: ๐Ÿ”ง vLLM now

KDnuggets (@kdnuggets) 's Twitter Profile Photo

Learn how to build, test, deploy, and monitor machine learning models in the cloud with the BentoML ecosystem. s.mtrbio.com/eqqjdilutz

BentoML - Infrastructure for Building AI Systems (@bentomlai) 's Twitter Profile Photo

๐Ÿš€ Mistral AI just dropped Mistral Small 3.1! Key highlights: ๐Ÿ“š 128k token context window ๐Ÿค– Improved text performance & multimodal understanding ๐Ÿ’ป Runs on a Mac with just 32GB RAM โšก Outperforms Gemma 3 & GPT-4o Mini ๐Ÿ›  Low-latency function calling ๐Ÿ”“ Fully open-source

BentoML - Infrastructure for Building AI Systems (@bentomlai) 's Twitter Profile Photo

Learn howย #Yextย cut time-to-market & compute costs by re-platforming #AI Inference onย #BentoML: Key wins: โœ… ๐Ÿฏร— ๐—ณ๐—ฎ๐˜€๐˜๐—ฒ๐—ฟย model delivery (70 % less dev time) โœ… ๐—จ๐—ฝ ๐˜๐—ผ ๐Ÿด๐Ÿฌ % ๐—š๐—ฃ๐—จ ๐˜€๐—ฎ๐˜ƒ๐—ถ๐—ป๐—ด๐˜€ย with cloudโ€‘agnostic autoscaling โœ… ๐Ÿฎร— ๐—บ๐—ผ๐—ฟ๐—ฒ ๐—บ๐—ผ๐—ฑ๐—ฒ๐—น๐˜€ ๐˜€๐—ต๐—ถ๐—ฝ๐—ฝ๐—ฒ๐—ฑ

Learn howย #Yextย cut time-to-market & compute costs by re-platforming #AI Inference onย #BentoML:

Key wins:
โœ… ๐Ÿฏร— ๐—ณ๐—ฎ๐˜€๐˜๐—ฒ๐—ฟย model delivery (70 % less dev time)
โœ… ๐—จ๐—ฝ ๐˜๐—ผ ๐Ÿด๐Ÿฌ % ๐—š๐—ฃ๐—จ ๐˜€๐—ฎ๐˜ƒ๐—ถ๐—ป๐—ด๐˜€ย with cloudโ€‘agnostic autoscaling
โœ… ๐Ÿฎร— ๐—บ๐—ผ๐—ฟ๐—ฒ ๐—บ๐—ผ๐—ฑ๐—ฒ๐—น๐˜€ ๐˜€๐—ต๐—ถ๐—ฝ๐—ฝ๐—ฒ๐—ฑ
BentoML - Infrastructure for Building AI Systems (@bentomlai) 's Twitter Profile Photo

#BentoFriday ๐Ÿฑ โ€” 20x Faster Iteration with BentoML Codespaces Modern #AI apps like #RAG or voice agents often require multiple powerful GPUs and complex dependencies. This often leads to: โŒ Painstaking delays with each code change โŒย Challenging environment setups โŒ