Ziang Xie (@zngxie) 's Twitter Profile
Ziang Xie

@zngxie

@saplingai — language model toolkit for enterprise applications.
Prev: PhD @stanfordailab @stanfordnlp, ugrad @berkeley_ai

ID: 3420488234

linkhttps://cs.stanford.edu/~zxie calendar_today02-09-2015 01:03:17

201 Tweet

579 Takipçi

189 Takip Edilen

Anthropic (@anthropicai) 's Twitter Profile Photo

Introducing Claude for Education. We're partnering with universities to bring AI to higher education, alongside a new learning mode for students.

elvis (@omarsar0) 's Twitter Profile Photo

NEW: Google announces Agent2Agent Agent2Agent (A2A) is a new open protocol that lets AI agents securely collaborate across ecosystems regardless of framework or vendor. Here is all you need to know:

Firebase (@firebase) 's Twitter Profile Photo

Meet Firebase Studio: A cloud-based, agentic dev environment powered by Gemini ✨💻✨ Find everything you need to prototype, build, and run production-quality full-stack AI apps quickly and safely. Learn more about building AI apps with Firebase → goo.gle/4j3MS9v

Unitree (@unitreerobotics) 's Twitter Profile Photo

Unitree Iron Fist King: Awakening!💪 Let's step into a new era of Sci-Fi, join the fun together! Unitree will be livestreaming robot combat in about a month, stay tuned! #Unitree #Fighting #Boxing #HumanoidRobot #Robot #AI #IronFist #Game

Sundar Pichai (@sundarpichai) 's Twitter Profile Photo

Introducing DolphinGemma, an LLM fine-tuned on many years of dolphin sound data 🐬 to help advance scientific discovery. We collaborated with Wild Dolphin Project to train a model that learns vocal patterns to predict what sound they might make next. It’s small enough (~400M params)

Anthropic (@anthropicai) 's Twitter Profile Photo

Today we’re launching Research, alongside a new Google Workspace integration. Claude now brings together information from your work and the web.

Physical Intelligence (@physical_int) 's Twitter Profile Photo

We got a robot to clean up homes that were never seen in its training data! Our new model, π-0.5, aims to tackle open-world generalization. We took our robot into homes that were not in the training data and asked it to clean kitchens and bedrooms. More below⤵️

Andrej Karpathy (@karpathy) 's Twitter Profile Photo

I attended a vibe coding hackathon recently and used the chance to build a web app (with auth, payments, deploy, etc.). I tinker but I am not a web dev by background, so besides the app, I was very interested in what it's like to vibe code a full web app today. As such, I wrote

I attended a vibe coding hackathon recently and used the chance to build a web app (with auth, payments, deploy, etc.). I tinker but I am not a web dev by background, so besides the app, I was very interested in what it's like to vibe code a full web app today. As such, I wrote
Anthropic (@anthropicai) 's Twitter Profile Photo

Today we're announcing Integrations, a new way to connect your apps and tools to Claude. We're also expanding Claude's Research capabilities with an advanced mode that searches the web, your Google Workspace, and now your Integrations too.

Demis Hassabis (@demishassabis) 's Twitter Profile Photo

Very excited to share the best coding model we’ve ever built! Today we’re launching Gemini 2.5 Pro Preview 'I/O edition' with massively improved coding capabilities. Ranks no.1 on LMArena in Coding and no.1 on the WebDev Arena Leaderboard. It’s especially good at building

Karan Singhal (@thekaransinghal) 's Twitter Profile Photo

📣 Proud to share HealthBench, an open-source benchmark from our Health AI team at OpenAI, measuring LLM performance and safety across 5000 realistic health conversations. 🧵 Unlike previous narrow benchmarks, HealthBench enables meaningful open-ended evaluation through 48,562

📣 Proud to share HealthBench, an open-source benchmark from our Health AI team at OpenAI, measuring LLM performance and safety across 5000 realistic health conversations. 🧵

Unlike previous narrow benchmarks, HealthBench enables meaningful open-ended evaluation through 48,562
Google DeepMind (@googledeepmind) 's Twitter Profile Photo

Introducing AlphaEvolve: a Gemini-powered coding agent for algorithm discovery. It’s able to: 🔘 Design faster matrix multiplication algorithms 🔘 Find new solutions to open math problems 🔘 Make data centers, chip design and AI training more efficient across Google. 🧵

cat (@_catwu) 's Twitter Profile Photo

Since we originally built Claude Code as an internal tool, we've heard a ton of questions about how our teams use it at Anthropic. Here’s an inside look on how our teams—from product engineering, to growth marketing, to legal—use Claude Code:

Since we originally built Claude Code as an internal tool, we've heard a ton of questions about how our teams use it at Anthropic.

Here’s an inside look on how our teams—from product engineering, to growth marketing, to legal—use Claude Code:
Deedy (@deedydas) 's Twitter Profile Photo

Most important tech blog this year: OpenAI engineer and ex-founder of $3.5B Segment wrote a tell all post about how OpenAI works internally. From obsession with X, devout use of Slack to engineering culture and tech stack. A peek under the hood of a generational company.

Most important tech blog this year: OpenAI engineer and ex-founder of $3.5B Segment wrote a tell all post about how OpenAI works internally.

From obsession with X, devout use of Slack to engineering culture and tech stack.

A peek under the hood of a generational company.
Rohan Paul (@rohanpaul_ai) 's Twitter Profile Photo

From various reports OpenAI really did bolt a “Universal Verifier” onto the GPT-5 training loop. And here's that paper, that OpenAI published earlier. "Prover-Verifier Games Improve Legibility of LLM", showing a production-ready pipeline where a verifier model scores each

From various reports OpenAI really did bolt a “Universal Verifier” onto the GPT-5 training loop.

And here's that paper, that OpenAI published earlier.

"Prover-Verifier Games Improve Legibility of LLM", showing a production-ready pipeline where a verifier model scores each
jack morris (@jxmnop) 's Twitter Profile Photo

curious about the training data of OpenAI's new gpt-oss models? i was too. so i generated 10M examples from gpt-oss-20b, ran some analysis, and the results were... pretty bizarre time for a deep dive 🧵

curious about the training data of OpenAI's new gpt-oss models? i was too. 

so i generated 10M examples from gpt-oss-20b, ran some analysis, and the results were... pretty bizarre

time for a deep dive 🧵
Andrej Karpathy (@karpathy) 's Twitter Profile Photo

Continuing the journey of optimal LLM-assisted coding experience. In particular, I find that instead of narrowing in on a perfect one thing my usage is increasingly diversifying across a few workflows that I "stitch up" the pros/cons of: Personally the bread & butter (~75%?) of