Jiahui Gao (@jiahuigao3) 's Twitter Profile
Jiahui Gao

@jiahuigao3

ID: 1022503292792799232

Joined: 26-07-2018 15:25:56

41 Tweets

262 Followers

395 Following

Chengzu Li (@li_chengzu) 's Twitter Profile Photo

Forget just thinking in words.

🚀 New Era of Multimodal Reasoning🚨
🔍 Imagine While Reasoning in Space with MVoT

Multimodal Visualization-of-Thought (MVoT) revolutionizes reasoning by generating visual "thoughts" that transform how AI thinks, reasons, and explains itself.
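The core loop MVoT describes can be sketched as interleaving textual and visual thoughts. This is a toy illustration only; the function names and the integer "state" are illustrative placeholders, not the paper's API.

```python
# Hypothetical sketch of an MVoT-style interleaved reasoning loop: after each
# textual reasoning step the model also emits a visualization of the current
# state, so the trace alternates text thought / visual thought.

def text_step(state):
    # Stand-in for the model's next textual thought.
    return f"move from {state}"

def visualize(state):
    # Stand-in for the generated visual "thought" (e.g. an image token grid).
    return f"<image of state {state}>"

def mvot_reason(start, goal):
    trace, state = [], start
    while state != goal:
        trace.append(("text", text_step(state)))
        state += 1                      # toy state transition
        trace.append(("visual", visualize(state)))
    return trace

trace = mvot_reason(0, 3)
# Text and visual thoughts strictly alternate in the trace.
assert [kind for kind, _ in trace] == ["text", "visual"] * 3
```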
Wanru Zhao (@renee42581826) 's Twitter Profile Photo

🚀Excited to co-organize the #ICLR2025 Workshop on Modularity for Collaborative, Decentralized, and Continual Deep Learning (MCDC ICLR 2026). Big thanks to an amazing team and speaker lineup! We're calling for non-archival papers in either short papers (2 pages) or long papers

Jiahui Gao (@jiahuigao3) 's Twitter Profile Photo

A very interesting direction! We also had an early exploration in this area, where we enabled VLMs to localize a target object by reasoning over user instructions and then utilized a tool to further localize the object in the image. github.com/OptimalScale/D…

Zhihui Xie (@_zhihuixie) 's Twitter Profile Photo

Introducing CTRL, a new framework that trains LLMs to critique via RL without human supervision or distillation, enabling them to supervise stronger models and achieve test-time scaling through iterative critique-revisions. 1/

Paper: arxiv.org/abs/2502.03492
Website:
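The test-time scaling pattern described above, iterative critique-revision, can be sketched with toy stand-ins for the actor model and the RL-trained critic. Nothing here is CTRL's actual interface; the quality counter is a placeholder for a real solution.

```python
# Minimal sketch of test-time scaling via iterative critique-revision.
# `actor` and `critic` are toy placeholders: the actor improves one step per
# revision, and the critic stops the loop once quality is good enough.

def actor(prompt, critique=None):
    score = 0 if critique is None else critique["round"]
    return {"answer": f"draft-{score}", "quality": score}

def critic(solution):
    done = solution["quality"] >= 2
    return {"done": done, "round": solution["quality"] + 1}

def critique_revise(prompt, max_rounds=5):
    solution = actor(prompt)
    for _ in range(max_rounds):
        critique = critic(solution)
        if critique["done"]:
            break
        solution = actor(prompt, critique)   # revise using the critique
    return solution

assert critique_revise("prove X")["quality"] == 2
```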
Jianshu Zhang ✈️ICLR2025🇸🇬 (@sterzhang) 's Twitter Profile Photo

🚀 Introducing VLM²-Bench!

A simple yet essential ability that we use in daily life.

But when tackling vision-centric tasks without relying on prior knowledge, can VLMs perform well? 🤔

🔗 Project Page: vlm2-bench.github.io

More details below! 👇 (1/n)
Chuanyang Jin (@chuanyang_jin) 's Twitter Profile Photo

How to achieve human-level open-ended machine Theory of Mind?

Introducing #AutoToM: a fully automated and open-ended ToM reasoning method combining the flexibility of LLMs with the robustness of Bayesian inverse planning, achieving SOTA results across five benchmarks. 🧵[1/n]
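Bayesian inverse planning, the inference backbone AutoToM pairs with LLMs, can be illustrated with a tiny hand-written world model: infer an agent's goal from observed actions via Bayes' rule. The goals, priors, and likelihoods below are invented for illustration.

```python
# Toy Bayesian inverse planning: P(goal | actions) ∝ P(goal) * Π P(action | goal).

goals = ["coffee", "tea"]
prior = {"coffee": 0.5, "tea": 0.5}

# P(action | goal): an agent after coffee mostly walks toward the kitchen.
likelihood = {
    ("kitchen", "coffee"): 0.9, ("lounge", "coffee"): 0.1,
    ("kitchen", "tea"): 0.4,    ("lounge", "tea"): 0.6,
}

def posterior(actions):
    unnorm = {}
    for g in goals:
        p = prior[g]
        for a in actions:
            p *= likelihood[(a, g)]
        unnorm[g] = p
    z = sum(unnorm.values())
    return {g: p / z for g, p in unnorm.items()}

post = posterior(["kitchen", "kitchen"])
assert post["coffee"] > post["tea"]   # repeated kitchen moves imply coffee
```

AutoToM's contribution is automating the construction of such a model (variables, hypotheses, likelihoods) with an LLM instead of writing it by hand as done here.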
Zhijiang Guo (@zhijiangg) 's Twitter Profile Photo

🚀Exciting to see how recent advancements like OpenAI’s O1/O3 & DeepSeek’s R1 are pushing the boundaries! 
Check out our latest survey on Complex Reasoning with LLMs. Analyzed over 300 papers to explore the progress.
Paper: arxiv.org/pdf/2502.17419
Github: github.com/zzli2022/Aweso…
Lingpeng Kong (@ikekong) 's Twitter Profile Photo

Come to play chess with our diffusion reasoning model here: lichess.org/@/diffusearchv0 by Jiacheng Ye ! Check out our research on diffusion reasoning models (DREAMs) here: ikekonglp.github.io/dreams.html to learn how our discrete diffusion approach enables implicit search capabilities!

Jiacheng Ye (@jiachengye15) 's Twitter Profile Photo

🤔 Always wondering if a next-token prediction model is the end of planning and reasoning. 🎯 Now excited to announce our team's latest research on exploring a new paradigm to enhance the planning ability of LLMs with DiffuSearch. 🧵1/7

ZHANG Jipeng (@mircale2003) 's Twitter Profile Photo

🤔How can we obtain Long-CoT data for theorem proving? 🚀DeepSeek-R1 utilizes large-scale collected Long-CoT data interleaved with RL training to enhance the performance of large reasoning models. Given the importance of Long-CoT data and the challenges in generating them, our

Han Wu (@hahahawu2) 's Twitter Profile Photo

💡Unlocking Efficient Long-to-Short LLM Reasoning with Model Merging

We comprehensively study existing model merging methods on efficient Long-to-Short LLM reasoning tasks, and find their huge potential in the field.
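The simplest merging baseline in this space is linear interpolation of parameters between a long-CoT reasoning model and its short-answer counterpart. A minimal sketch, with plain dicts of floats standing in for real checkpoint state_dicts:

```python
# theta_merged = (1 - alpha) * theta_base + alpha * theta_long
# alpha trades off short-answer behavior (0) against long-CoT behavior (1).

def merge(base, long_cot, alpha=0.5):
    return {k: (1 - alpha) * base[k] + alpha * long_cot[k] for k in base}

base     = {"w1": 0.0, "w2": 2.0}
long_cot = {"w1": 1.0, "w2": 4.0}

merged = merge(base, long_cot, alpha=0.25)
assert merged == {"w1": 0.25, "w2": 2.5}
```

More sophisticated methods studied in such surveys (task arithmetic, TIES, DARE) refine which parameter deltas get kept, but share this weight-space averaging core.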
AI for Math Workshop @ ICML 2025 (@ai4mathworkshop) 's Twitter Profile Photo

📣🔊 Excited to announce the 2nd AI for Math Workshop at #ICML2025 ICML Conference!

🔍 Workshop details: sites.google.com/view/ai4mathwo…
📜 Submit your pioneering work: sites.google.com/view/ai4mathwo……
🙋 Reviewer nomination: goo.su/UlL3GJ
Jiahui Gao (@jiahuigao3) 's Twitter Profile Photo

Dream 7B: A general diffusion language model that happens to excel at planning. Without task-specific training, it outperforms Qwen2.5 7B and LLaMA3 8B on countdown and sudoku problems.
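Unlike an autoregressive LLM, a masked diffusion LM like Dream decodes by starting from an all-mask sequence and iteratively unmasking the positions it is most confident about, refining in parallel rather than strictly left-to-right. A toy sketch of that decoding schedule, with a deterministic stand-in for the model's confidences:

```python
MASK = "[MASK]"
TARGET = ["the", "cat", "sat", "down"]   # pretend model beliefs, for illustration

def model_confidences(seq):
    # Stand-in for per-position model confidence at still-masked positions.
    return {i: 1.0 / len(TARGET[i]) for i, t in enumerate(seq) if t == MASK}

def diffusion_decode(length, steps):
    seq = [MASK] * length
    per_step = max(1, length // steps)
    for _ in range(steps):
        conf = model_confidences(seq)
        if not conf:
            break
        # Unmask the most confident positions at this step.
        for i in sorted(conf, key=conf.get, reverse=True)[:per_step]:
            seq[i] = TARGET[i]
    return seq

assert diffusion_decode(4, steps=2) == TARGET
```

This any-order refinement is one intuition for why such models can do well on planning-flavored tasks like Sudoku, where constraints propagate in both directions.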

Jiahui Gao (@jiahuigao3) 's Twitter Profile Photo

Very cool! We also welcome everyone to check out our large diffusion language model called Dream-7B announced last month. We've open-sourced the checkpoint. Try our demo at: huggingface.co/spaces/multimo… For more details, please refer to our blog: hkunlp.github.io/blog/2025/drea…

Lingpeng Kong (@ikekong) 's Twitter Profile Photo

What happened after Dream 7B? First, Dream-Coder 7B: A fully open diffusion LLM for code delivering strong performance, trained exclusively on public data. Plus, DreamOn cracks the variable-length generation problem! It enables code infilling that goes beyond a fixed canvas.

Jiacheng Ye (@jiachengye15) 's Twitter Profile Photo

📢 Update: Announcing Dream's next-phase development. - Dream-Coder 7B: A fully open diffusion LLM for code delivering strong performance, trained exclusively on public data. - DreamOn: targeting the variable-length generation problem in dLLM!

Jiahui Gao (@jiahuigao3) 's Twitter Profile Photo

Dream-Coder, trained entirely on public data, achieves state-of-the-art coding performance among open diffusion code LLMs.

Jiahui Gao (@jiahuigao3) 's Twitter Profile Photo

To address variable‑length generation, DreamOn dynamically adjusts masked spans during infilling, expanding or contracting them to precisely match the target length.✅
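The mechanism described above can be illustrated with a toy editor over a token list: the masked span may grow (insert more masks) or shrink (drop a mask) during infilling, so the fill is not pinned to a fixed canvas. The action names and hard-coded decisions below are stand-ins for model predictions, not DreamOn's actual interface.

```python
MASK = "[MASK]"

def infill_step(seq, i, action, token=None):
    if action == "fill":        # commit a token at a mask position
        seq[i] = token
    elif action == "expand":    # the fill needs more room: add a mask
        seq.insert(i, MASK)
    elif action == "contract":  # the fill needs less room: drop a mask
        seq.pop(i)
    return seq

seq = ["def", "f():", MASK, MASK]
seq = infill_step(seq, 2, "expand")            # grow the canvas
seq = infill_step(seq, 2, "fill", "return")
seq = infill_step(seq, 3, "fill", "x")
seq = infill_step(seq, 4, "contract")          # unused slot removed
assert seq == ["def", "f():", "return", "x"]
```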

𝚐𝔪𝟾𝚡𝚡𝟾 (@gm8xx8) 's Twitter Profile Photo

Follow-up to Dream 7B, now focused on code: Dream-Coder 7B is a diffusion-based code LLM from HKU + Huawei Noah’s Ark, built on Qwen2.5-Coder and 322B open tokens. It replaces autoregressive decoding with denoising-based generation, enabling flexible infilling via DreamOn. A