Lingpeng Kong (@ikekong) 's Twitter Profile
Lingpeng Kong

@ikekong

Assistant Professor @ The University of Hong Kong,
Previously Research Scientist @ DeepMind

ID: 117795954

linkhttp://ikekonglp.github.io calendar_today26-02-2010 16:57:57

82 Tweet

848 Followers

290 Following

Google DeepMind (@googledeepmind) 's Twitter Profile Photo

We’ve developed Gemini Diffusion: our state-of-the-art text diffusion model. Instead of predicting text directly, it learns to generate outputs by refining noise, step-by-step. This helps it excel at coding and math, where it can iterate over solutions quickly. #GoogleIO

Lei Li (@_tobiaslee) 's Twitter Profile Photo

MiMo-VL technical report, models, and evaluation suite are out! 🤗 Models: huggingface.co/XiaomiMiMo/MiM… (or RL) Report: arxiv.org/abs/2506.03569 Evaluation Suite: github.com/XiaomiMiMo/lmm… Looking back, it's incredible that we delivered such compact yet powerful vision-language

MiMo-VL technical report, models, and evaluation suite are out!  

 🤗 Models: huggingface.co/XiaomiMiMo/MiM… (or RL)
Report: arxiv.org/abs/2506.03569
Evaluation Suite: github.com/XiaomiMiMo/lmm…

Looking back, it's incredible that we delivered such compact yet powerful vision-language
Jing Xiong (@_june1126) 's Twitter Profile Photo

🔬 The HKU team presents ParallelComp: a training-free technique for efficient context length extrapolation in LLMs—from 8K up to 128K tokens—on a single A100 GPU, with minimal performance loss. 📄 Paper: arxiv.org/abs/2502.14317 💻 Code: github.com/menik1126/Para…

🔬 The HKU team presents ParallelComp: a training-free technique for efficient context length extrapolation in LLMs—from 8K up to 128K tokens—on a single A100 GPU, with minimal performance loss.

📄 Paper: arxiv.org/abs/2502.14317
💻 Code: github.com/menik1126/Para…
Stefano Ermon (@stefanoermon) 's Twitter Profile Photo

Huge milestone from the team! A blazing-fast diffusion LLM built for chat, delivering real-time performance at commercial scale. If you liked Mercury Coder for code, you'll love this for conversation.

Sansa Gong (@sansa19739319) 's Twitter Profile Photo

🤖Can diffusion models write code competitively? Excited to share our latest 7B coding diffusion LLM!!💻 With DiffuCoder, we explore how they decode, why temperature🔥 matters, and how to improve them via coupled-GRPO that speaks diffusion!!📈 Code: github.com/apple/ml-diffu… 🧵

🤖Can diffusion models write code competitively?
Excited to share our latest 7B coding diffusion LLM!!💻

With DiffuCoder, we explore how they decode, why temperature🔥 matters, and how to improve them via coupled-GRPO that speaks diffusion!!📈

Code: github.com/apple/ml-diffu…
🧵
JingqiZhou (@zhou_jingqi_) 's Twitter Profile Photo

🌳TreeSynth: Synthesizing large-scale diverse data from scratch! Struggling with repetition and space collapse in data synthesis? 🤔 Our latest research mitigates this challenge through innovative tree-guided subspace partitioning. ✨Introducing TreeSynth—a novel framework

🌳TreeSynth: Synthesizing large-scale diverse data from scratch!

Struggling with repetition and space collapse in data synthesis? 🤔 Our latest research mitigates this challenge through innovative tree-guided subspace partitioning.

✨Introducing TreeSynth—a novel framework
Zirui Wu (@williamzr7) 's Twitter Profile Photo

We present DreamOn: a simple yet effective method for variable-length generation in diffusion language models. Our approach boosts code infilling performance significantly and even catches up with oracle results.

Jiacheng Ye (@jiachengye15) 's Twitter Profile Photo

📢 Update: Announcing Dream's next-phase development. - Dream-Coder 7B: A fully open diffusion LLM for code delivering strong performance, trained exclusively on public data. - DreamOn: targeting the variable-length generation problem in dLLM!

HKUNLP (@hkunlp2020) 's Twitter Profile Photo

Xinyu Yang from CMU will be giving a talk titled "Multiverse: Your Language Models Secretly Decide How to Parallelize and Merge Generation" at Friday July 25 11am HKT (Thursday July 24 8pm PDT). Link to talk: hku.zoom.us/j/92651812689?…

Xinyu Yang from CMU will be giving a talk titled "Multiverse: Your Language Models Secretly
Decide How to Parallelize and Merge Generation" at Friday July 25 11am HKT  (Thursday July 24 8pm PDT). Link to talk: hku.zoom.us/j/92651812689?…
Lei Li (@_tobiaslee) 's Twitter Profile Photo

🚀 MiMo‑VL 2508 is live! Same size, much smarter. We’ve upgraded performance, thinking control, and overall user experience. 📈 Benchmark gains across image + video: MMMU 70.6, VideoMME 70.8. Consistent improvements across the board. 🤖 Thinking Control: toggle reasoning with

🚀 MiMo‑VL 2508 is live!  Same size, much smarter.

We’ve upgraded performance, thinking control, and overall user experience.

📈 Benchmark gains across image + video: MMMU 70.6, VideoMME 70.8.
Consistent improvements across the board.

🤖 Thinking Control: toggle reasoning with
HKUNLP (@hkunlp2020) 's Twitter Profile Photo

Jinjie Ni Jinjie Ni from NUS will be giving a talk titled "Diffusion Language Models are Super Data Learners" at Friday Aug 22 11am HKT. link to talk: hku.zoom.us/j/94293996114?…

Jinjie Ni <a href="/NiJinjie/">Jinjie Ni</a> from NUS will be giving a talk titled "Diffusion Language Models are Super Data Learners" at Friday Aug 22 11am HKT. link to talk: hku.zoom.us/j/94293996114?…