Chengzu Li (@li_chengzu)'s Twitter Profile
Chengzu Li

@li_chengzu

PhD student in NLP @CambridgeLTL @JesusCollegeCam

ID: 1571803139069673473

Link: https://chengzu-li.github.io/ · Joined: 19-09-2022 10:07:34

58 Tweets

327 Followers

162 Following

CambridgeLTL (@cambridgeltl)'s Twitter Profile Photo

Extremely happy to share that our PhD student Tiancheng Hu received the Apple Scholars in AI/ML PhD Fellowship! 🎉 The fellowship will support his research on LLM-based simulation and LLM personalisation. Congratulations again, Tiancheng Hu! 🥳 machinelearning.apple.com/updates/apple-…

Bowen Wang (@bowenwangnlp)'s Twitter Profile Photo

🎮 Computer Use Agent Arena is LIVE! 🚀 🔥 Easiest way to test computer-use agents in the wild without any setup 🌟 Compare top VLMs: OpenAI Operator, Claude 3.7, Gemini 2.5 Pro, Qwen 2.5 VL and more 🕹️ Test agents on 100+ real apps & websites with one-click config 🔒 Safe & free

Chengzu Li (@li_chengzu)'s Twitter Profile Photo

Happy to share that MVoT got accepted to ICML 2025 🎉🎉 #ICML If you are interested, do check out our paper; here are some other materials: 📰 Report on IEEE Spectrum: spectrum.ieee.org/visual-reasoni… 🎤 TWIML Podcast with Sam: twimlai.com/podcast/twimla…

Emile van Krieken (@emilevankrieken)'s Twitter Profile Photo

We propose Neurosymbolic Diffusion Models! We find diffusion is especially compelling for neurosymbolic approaches, combining powerful multimodal understanding with symbolic reasoning 🚀 Read more 👇

Benjamin Minixhofer (@bminixhofer)'s Twitter Profile Photo

We achieved the first instance of successful subword-to-byte distillation in our (just updated) paper. This enables creating byte-level models at a fraction of the cost of what was needed previously. As a proof-of-concept, we created byte-level Gemma2 and Llama3 models. 🧵

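
The tweet names the recipe but not the core difficulty: the teacher predicts subwords while the student predicts bytes, so their distributions live over different vocabularies. As a toy illustration of one way to bridge that gap (purely hypothetical, not the paper's actual method), a next-subword distribution can be projected onto next-byte targets by marginalising over subwords that share a first byte:

```python
import math

def project_subword_to_bytes(subword_probs, vocab):
    """Collapse a next-subword distribution into a next-byte distribution
    by summing the mass of every subword starting with a given byte."""
    byte_probs = {}
    for p, token in zip(subword_probs, vocab):
        first = token.encode("utf-8")[0]
        byte_probs[first] = byte_probs.get(first, 0.0) + p
    return byte_probs

def distill_loss(student_byte_probs, teacher_subword_probs, vocab, eps=1e-12):
    """Cross-entropy of the student's byte distribution against the
    projected teacher targets -- the generic distillation objective."""
    target = project_subword_to_bytes(teacher_subword_probs, vocab)
    return -sum(p * math.log(student_byte_probs.get(b, 0.0) + eps)
                for b, p in target.items())

vocab = ["the", "there", "cat", "dog"]
teacher = [0.5, 0.2, 0.2, 0.1]   # teacher's next-subword probabilities
target = project_subword_to_bytes(teacher, vocab)
# "the" and "there" share first byte 't', so P(byte 't') = 0.5 + 0.2 = 0.7
```

Training the byte student against such projected targets, instead of raw text, is what would let it reuse the subword teacher's knowledge at a fraction of the pretraining cost.
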
Wenhu Chen (@wenhuchen)'s Twitter Profile Photo

🚀 New Paper: Pixel Reasoner 🧠🖼️ How can Vision-Language Models (VLMs) perform chain-of-thought reasoning within the image itself? We introduce Pixel Reasoner, the first open-source framework that enables VLMs to “think in pixel space” through curiosity-driven reinforcement

Jiaang Li (@jiaangli)'s Twitter Profile Photo

🚀New Preprint Alert 🚀 Can Multimodal Retrieval Enhance Cultural Awareness in Vision-Language Models? Excited to introduce RAVENEA, a new benchmark aimed at evaluating cultural understanding in VLMs through RAG.

Yinya Huang ✈️ ICLR (@yinyahuang)'s Twitter Profile Photo

🤖⚛️Can AI truly see Physics? Test your model with the newly released SeePhys Benchmark! 🚀 🖼️Covering 2,000 vision-text multimodal physics problems spanning from middle school to doctoral qualification exams, the SeePhys benchmark systematically evaluates LLMs/MLLMs on tasks

Caiqi Zhang (@caiqizh)'s Twitter Profile Photo

🔥 We teach LLMs to say how confident they are on-the-fly during long-form generation. 🤩No sampling. No slow post-hoc methods. Not limited to short-form QA! ‼️Just output confidence in a single decoding pass. ✅Better calibration! 🚀 20× faster runtime. arXiv:2505.23912 👇

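
The key mechanism in the bullets is that confidence is emitted inline during the single decoding pass, not estimated afterwards by sampling or a post-hoc scorer. A toy sketch of the interface (hypothetical: it uses the geometric mean of token probabilities as a stand-in where the trained model would emit its own confidence tag):

```python
import math

def decode_with_confidence(stream):
    """Single-pass sketch: accumulate per-token probabilities and, at each
    sentence boundary, emit an inline confidence tag computed from the
    geometric mean of that sentence's token probabilities."""
    out, logps = [], []
    for token, prob in stream:
        out.append(token)
        logps.append(math.log(prob))
        if token.endswith("."):
            conf = math.exp(sum(logps) / len(logps))
            out.append(f"<conf={conf:.2f}>")
            logps = []
    return " ".join(out)

# Simulated (token, probability) pairs standing in for a real decoder.
stream = [("Paris", 0.9), ("is", 0.95), ("the", 0.99), ("capital.", 0.9)]
print(decode_with_confidence(stream))  # → Paris is the capital. <conf=0.93>
```

Because the tag is produced in the same forward pass as the text, the extra cost is a few tokens per claim, which is where a 20× speedup over sampling-based estimators would come from.
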
Han Zhou (@hanzhou032)'s Twitter Profile Photo

Automating Multi-Agent Design: 🧩Multi-agent systems aren’t just about throwing more LLM agents together. 🛠️They require mastering the subtle art of prompting and agent orchestration. Introducing MASS🚀- Our new agent optimization framework for better prompts and topologies!

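
The search space the tweet describes is the cross product of prompt designs and agent topologies. A minimal sketch of that joint search (brute force over a toy space with a hypothetical scoring function; MASS itself uses staged optimization, not exhaustive enumeration):

```python
import itertools

def optimize_agents(prompts, topologies, evaluate):
    """Score every (prompt, topology) combination on a validation task
    and keep the best -- illustrating the joint search space only."""
    return max(itertools.product(prompts, topologies),
               key=lambda cfg: evaluate(*cfg))

prompts = ["terse", "step-by-step"]
topologies = ["chain", "debate", "aggregate"]
# Hypothetical scores standing in for validation accuracy.
scores = {("step-by-step", "debate"): 0.9}
best = optimize_agents(prompts, topologies,
                       lambda p, t: scores.get((p, t), 0.5))
print(best)  # → ('step-by-step', 'debate')
```

The point of optimizing both axes together is that the best prompt depends on the topology it runs in, so tuning them independently can miss the best configuration.
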
Yifu Qiu (@yifuqiu98)'s Twitter Profile Photo

🔁 What if you could bootstrap a world model (state1 × action → state2) using a much easier-to-train dynamics model (state1 × state2 → action) in a generalist VLM? 💡 We show how a dynamics model can generate synthetic trajectories & serve for inference-time verification 🧵👇
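
The two signatures in the tweet can be made concrete with a toy example (integer states with action = difference; purely illustrative, not the paper's models): the easier-to-train dynamics model labels action-free state sequences into world-model training triples, and later verifies the world model's predictions at inference time.

```python
def dynamics_model(s1, s2):
    """Dynamics model (state1 x state2 -> action): infer the action that
    maps s1 to s2. Toy version: states are integers, action is the delta."""
    return s2 - s1

def label_trajectory(states):
    """Bootstrap world-model data: turn an action-free state sequence
    into (state1, action, state2) triples via the dynamics model."""
    return [(s1, dynamics_model(s1, s2), s2)
            for s1, s2 in zip(states, states[1:])]

def verify(world_model, s1, action):
    """Inference-time check: does the dynamics model recover the same
    action from the world model's predicted next state?"""
    return dynamics_model(s1, world_model(s1, action)) == action

triples = label_trajectory([0, 3, 5, 9])
print(triples)  # → [(0, 3, 3), (3, 2, 5), (5, 4, 9)]
ok = verify(lambda s, a: s + a, 2, 7)
```

The asymmetry being exploited is that inferring an action from two observed states is a much easier learning problem than predicting a full next state, so the cheap model can both generate and check data for the expensive one.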

Zhoujun (Jorge) Cheng (@chengzhoujun)'s Twitter Profile Photo

🤯What we know about RL for reasoning might not hold outside math and code? We revisit established findings on RL for LLM reasoning on six domains (Math, Code, Science, Logic, Simulation, Tabular) and found that previous conclusions drawn on math and code are surprisingly

Zhaochen Su (@suzhaochen0110)'s Twitter Profile Photo

Excited to share our new survey on the reasoning paradigm shift from "Think with Text" to "Think with Image"! 🧠🖼️ Our work offers a roadmap for more powerful & aligned AI. 🚀 📜 Paper: arxiv.org/pdf/2506.23918 ⭐ GitHub (400+🌟): github.com/zhaochen0110/A…

Tiancheng Hu (@tiancheng_hu)'s Twitter Profile Photo

Working on LLM social simulation and need data? Excited to announce our iNews paper is accepted to #ACL2025! 🥳 It's a large-scale dataset for predicting individualized affective responses to real-world, multimodal news. arxiv.org/abs/2503.03335 🤗 Data: huggingface.co/datasets/piteh…

Micah Goldblum (@micahgoldblum)'s Twitter Profile Photo

🚨Announcing Zebra-CoT, a large-scale dataset of high quality interleaved image-text reasoning traces 📜. Humans often draw visual aids like diagrams when solving problems, but existing VLMs reason mostly in pure text. 1/n
