Aran Komatsuzaki (@arankomatsuzaki) 's Twitter Profile
Aran Komatsuzaki

@arankomatsuzaki

ID: 794433401591693312

linkhttps://arankomatsuzaki.wordpress.com/about-me/ calendar_today04-11-2016 06:57:37

5,5K Tweet

130,130K Followers

264 Following

Zohar Atkins (@zoharatkins) 's Twitter Profile Photo

My life has been enriched by joyful learning. Few things give me greater joy than sharing my love of learning with others. Which is why today is a momentous day. With gratitude to my Creator, Source of Life, for allowing me to reach this day, I'm announcing that we at

Jinjie Ni @ ICLR'25 πŸ‡ΈπŸ‡¬ (@nijinjie) 's Twitter Profile Photo

Token crisis: solved. βœ… We pre-trained diffusion language models (DLMs) vs. autoregressive (AR) models from scratch β€” up to 8B params, 480B tokens, 480 epochs. Findings: > DLMs beat AR when tokens are limited, with >3Γ— data potential. > A 1B DLM trained on just 1B tokens

Token crisis: solved. βœ…

We pre-trained diffusion language models (DLMs) vs. autoregressive (AR) models from scratch β€” up to 8B params, 480B tokens, 480 epochs.

Findings:
>  DLMs beat AR when tokens are limited, with >3Γ— data potential.
>  A 1B DLM trained on just 1B tokens
Z.ai (@zai_org) 's Twitter Profile Photo

Presenting the GLM-4.5 technical report!πŸ‘‡ arxiv.org/abs/2508.06471 This work demonstrates how we developed models that excel at reasoning, coding, and agentic tasks through a unique, multi-stage training paradigm. Key innovations include expert model iteration with

Presenting the GLM-4.5 technical report!πŸ‘‡
arxiv.org/abs/2508.06471

This work demonstrates how we developed models that excel at reasoning, coding, and agentic tasks through a unique, multi-stage training paradigm.

Key innovations include expert model iteration with
Longyue Wang (@wangly0229) 's Twitter Profile Photo

🎯 Check out Marco-Voice: A Unified Framework for Expressive Speech Synthesis with Voice Cloning 🎧 Key Features: πŸ”₯ Novel Methods: speaker-emotion disentanglement and rotational emotion embedding integration πŸ”₯ New Benchmark: high-quality emotional speech dataset (10 hours, 7

🎯 Check out Marco-Voice: A Unified Framework for Expressive Speech Synthesis with Voice Cloning 🎧

Key Features:
πŸ”₯ Novel Methods: speaker-emotion disentanglement and rotational emotion embedding integration
πŸ”₯ New Benchmark: high-quality emotional speech dataset (10 hours, 7
Sicong (@leon_l_s_c) 's Twitter Profile Photo

We are excited to officially release RynnVLA-001, a new open-source Vision-Language-Action model! πŸ€– Our model outperforms strong baselines like Pi-0 & GR00T-N1.5 in real-world robot manipulations. This is achieved through several key innovations: πŸ”Ή Generative Pre-training:

Dan Hendrycks (@danhendrycks) 's Twitter Profile Photo

Can AIs beat long video games? We made TextQuests to test GPT-5, Grok 4, Deepseek, etc. These games can often take people dozens of hours to beat. - AIs can't beat any of the games (without clues) - some AIs behave more viciously than others - AIs are getting better rapidly

Xinyuan Wang (@xywang626) 's Twitter Profile Photo

We are super excited to release OpenCUA β€” the first from 0 to 1 computer-use agent foundation model framework and open-source SOTA model OpenCUA-32B, matching top proprietary models on OSWorld-Verified, with full infrastructure and data. πŸ”— [Paper] arxiv.org/abs/2508.09123 πŸ“Œ

We are super excited to release OpenCUA β€” the first from 0 to 1 computer-use agent foundation model framework and open-source SOTA model OpenCUA-32B, matching top proprietary models on OSWorld-Verified, with full infrastructure and data.

πŸ”— [Paper] arxiv.org/abs/2508.09123 
πŸ“Œ
Aran Komatsuzaki (@arankomatsuzaki) 's Twitter Profile Photo

Used to sit all day on my MacBook. Tried a standing deskβ€”hated it for not moving all day. Now I voice-input on an 8" tablet while walking around a mall and outside and stop at random spots. Feels like going back to the time when people spent most of day walking and standing.

Aran Komatsuzaki (@arankomatsuzaki) 's Twitter Profile Photo

Being Japanese is funny sometimes. Every few years, America β€œdiscovers” something I grew up with. Suddenly I’m sophisticated, just for existing. Matcha ice cream? Been eating it since day one. Welcome to the party.

Google AI (@googleai) 's Twitter Profile Photo

Today, we're bringing agentic capabilities to AI Mode in Search for Google AI Ultra subscribers. But... what is actually different? Let's say you want to make a dinner reservation. Traditionally, that would require multiple searches, concurrent tabs, and a lot of manual

Neo AI (@withneo) 's Twitter Profile Photo

Introducing NEO: The first Autonomous Machine Learning Engineer. It works like a full-stack ML engineer that never sleeps: handling data exploration, feature engineering, training, tuning, deployment, and monitoring, end to end. Powered by 11 specialized agents, NEO runs

Daria Soboleva (@dmsobol) 's Twitter Profile Photo

Router wasn't learning at first, we debugged it step-by-step and showed you how despite perfect load balancing, routing can be completely useless. We root caused it and fixed the problem. Papers skip the methodology, but you can find all details in our part 3 of MoE 101 series