Lingpeng Kong (@ikekong) Twitter Tweets • TwiCopy

Google DeepMind

7 months ago

We’ve developed Gemini Diffusion: our state-of-the-art text diffusion model. Instead of predicting text directly, it learns to generate outputs by refining noise, step-by-step. This helps it excel at coding and math, where it can iterate over solutions quickly. #GoogleIO

thumb_up_off_alt4,4K

chat_bubble_outline85

repeat663

shareShare

Lingpeng Kong

@ikekong

6 months ago

Constant memory long CoT is here!

thumb_up_off_alt9

chat_bubble_outline0

repeat1

shareShare

Lei Li

@_tobiaslee

6 months ago

MiMo-VL technical report, models, and evaluation suite are out! 🤗 Models: huggingface.co/XiaomiMiMo/MiM… (or RL) Report: arxiv.org/abs/2506.03569 Evaluation Suite: github.com/XiaomiMiMo/lmm… Looking back, it's incredible that we delivered such compact yet powerful vision-language

thumb_up_off_alt42

chat_bubble_outline2

repeat13

shareShare

Jing Xiong

@_june1126

6 months ago

🔬 The HKU team presents ParallelComp: a training-free technique for efficient context length extrapolation in LLMs—from 8K up to 128K tokens—on a single A100 GPU, with minimal performance loss. 📄 Paper: arxiv.org/abs/2502.14317 💻 Code: github.com/menik1126/Para…

thumb_up_off_alt14

chat_bubble_outline5

repeat9

shareShare

Lingpeng Kong

@ikekong

6 months ago

The RL recipe from us, with everything fully open! hkunlp.github.io/blog/2025/Pola…

thumb_up_off_alt20

chat_bubble_outline0

repeat9

shareShare

Stefano Ermon

@stefanoermon

5 months ago

Huge milestone from the team! A blazing-fast diffusion LLM built for chat, delivering real-time performance at commercial scale. If you liked Mercury Coder for code, you'll love this for conversation.

thumb_up_off_alt177

chat_bubble_outline8

repeat27

shareShare

Sansa Gong

@sansa19739319

5 months ago

Thanks for sharing our work!!!🙏Code release is in progress😺

thumb_up_off_alt27

chat_bubble_outline0

repeat7

shareShare

Lingpeng Kong

@ikekong

5 months ago

Fun especially considering this is a shiba :)

thumb_up_off_alt6

chat_bubble_outline0

repeat1

shareShare

Sansa Gong

@sansa19739319

5 months ago

🤖Can diffusion models write code competitively? Excited to share our latest 7B coding diffusion LLM!!💻 With DiffuCoder, we explore how they decode, why temperature🔥 matters, and how to improve them via coupled-GRPO that speaks diffusion!!📈 Code: github.com/apple/ml-diffu… 🧵

thumb_up_off_alt588

chat_bubble_outline5

repeat113

shareShare

JingqiZhou

@zhou_jingqi_

5 months ago

🌳TreeSynth: Synthesizing large-scale diverse data from scratch! Struggling with repetition and space collapse in data synthesis? 🤔 Our latest research mitigates this challenge through innovative tree-guided subspace partitioning. ✨Introducing TreeSynth—a novel framework

thumb_up_off_alt7

chat_bubble_outline0

repeat5

shareShare

Zirui Wu

@williamzr7

5 months ago

We present DreamOn: a simple yet effective method for variable-length generation in diffusion language models. Our approach boosts code infilling performance significantly and even catches up with oracle results.

thumb_up_off_alt120

chat_bubble_outline2

repeat29

shareShare

Zhihui Xie

@_zhihuixie

5 months ago

🚀 Thrilled to announce Dream-Coder 7B — the most powerful open diffusion code  LLM to date.

thumb_up_off_alt122

chat_bubble_outline3

repeat36

shareShare

Jiacheng Ye

@jiachengye15

5 months ago

📢 Update: Announcing Dream's next-phase development. - Dream-Coder 7B: A fully open diffusion LLM for code delivering strong performance, trained exclusively on public data. - DreamOn: targeting the variable-length generation problem in dLLM!

thumb_up_off_alt80

chat_bubble_outline1

repeat22

shareShare

HKUNLP

@hkunlp2020

5 months ago

Xinyu Yang from CMU will be giving a talk titled "Multiverse: Your Language Models Secretly Decide How to Parallelize and Merge Generation" at Friday July 25 11am HKT (Thursday July 24 8pm PDT). Link to talk: hku.zoom.us/j/92651812689?…

thumb_up_off_alt22

chat_bubble_outline1

repeat7

shareShare

Lei Li

@_tobiaslee

4 months ago

🚀 MiMo‑VL 2508 is live! Same size, much smarter. We’ve upgraded performance, thinking control, and overall user experience. 📈 Benchmark gains across image + video: MMMU 70.6, VideoMME 70.8. Consistent improvements across the board. 🤖 Thinking Control: toggle reasoning with

thumb_up_off_alt89

chat_bubble_outline2

repeat15

shareShare

HKUNLP

@hkunlp2020

4 months ago

Jinjie Ni Jinjie Ni from NUS will be giving a talk titled "Diffusion Language Models are Super Data Learners" at Friday Aug 22 11am HKT. link to talk: hku.zoom.us/j/94293996114?…

Jinjie Ni <a href="/NiJinjie/">Jinjie Ni</a> from NUS will be giving a talk titled "Diffusion Language Models are Super Data Learners" at Friday Aug 22 11am HKT. link to talk: hku.zoom.us/j/94293996114?…

thumb_up_off_alt43

chat_bubble_outline0

repeat11

shareShare