chang ma (@ma_chang_nlp)'s Twitter Profile
chang ma

@ma_chang_nlp

Ph.D. student @HKUNLP, previously @PKU1898. I work at the intersection of #AI4Science and NLP.

ID: 1525494657207123969

Link: https://chang-github-00.github.io/-changma · Joined: 14-05-2022 15:14:16

117 Tweets

650 Followers

1.1K Following

chang ma (@ma_chang_nlp)'s Twitter Profile Photo

Excited to share our work at ICLR 2025 in 🇸🇬. ICLR 2026 🥳 Happy to chat about LLM reasoning & planning, agents, and AI4Science!

📍 Sat 26 Apr, 3:00–5:30 p.m. CST, Hall 3 + Hall 2B #554
Zhihui Xie (@_zhihuixie)'s Twitter Profile Photo

Excited to be in Singapore 🇸🇬 for #ICLR2025! Thrilled for my first time attending after past visa issues kept me away 😢.

We'll be presenting our work on:

1️⃣ Jailbreaking as a Reward Misspecification Problem
🗓️ Thursday, April 24, 3:00–5:30 PM (SGT)
📍 Hall 3 + Hall 2B …
Shiqi Chen (@shiqi_chen17)'s Twitter Profile Photo

🚀🔥 Thrilled to announce our ICML25 paper: "Why Is Spatial Reasoning Hard for VLMs? An Attention Mechanism Perspective on Focus Areas"! We dive into the core reasons behind spatial reasoning difficulties for Vision-Language Models from an attention mechanism view. 🌍🔍 Paper:

chang ma (@ma_chang_nlp)'s Twitter Profile Photo

We are kicking off a series of seminars at HKUNLP. Siyan Zhao will be giving a talk titled "d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning" on ⏰ Friday, May 9, 11am HKT (Thursday, May 8, 8pm PDT). Link to talk: hku.zoom.us/j/97925412724?…

HKUNLP (@hkunlp2020)'s Twitter Profile Photo

Guanqi Jiang from UCSD will be giving a talk titled "Robots Pre-Train Robots: Manipulation-Centric Robotic Representation from Large-Scale Robot Datasets" on ⏰ Friday, May 16, 11am HKT (Thursday, May 15, 8pm PDT). Link to talk: hku.zoom.us/j/97674910858?…

Shiqi Chen (@shiqi_chen17)'s Twitter Profile Photo

Sharing another #ICML25 paper: “Bring Reason to Vision: Understanding Perception and Reasoning through Model Merging”! (1/5)

We use model merging to enhance VLMs' reasoning by integrating math-focused LLMs, bringing textual reasoning into multi-modal models. Surprisingly, this
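
Roughly, the merging idea can be pictured as weight interpolation. A minimal sketch, assuming simple linear interpolation of matching parameters (the paper's actual recipe may differ; the function name, state-dict names, and the coefficient `alpha` are all illustrative):

```python
# Minimal sketch: linearly interpolate the language-backbone weights of a VLM
# with a math-specialized LLM of the same architecture. Illustrative only.
import torch

def merge_state_dicts(vlm_lm_state: dict, math_llm_state: dict, alpha: float = 0.5) -> dict:
    """Blend parameters shared by both models; keep VLM-only parameters as-is."""
    merged = {}
    for name, param in vlm_lm_state.items():
        other = math_llm_state.get(name)
        if other is not None and other.shape == param.shape:
            merged[name] = (1 - alpha) * param + alpha * other  # weight interpolation
        else:
            merged[name] = param  # e.g., vision tower / projector stay untouched
    return merged

# Quick check with toy state dicts:
a = {"lm.w": torch.ones(2, 2), "vision.w": torch.zeros(2)}
b = {"lm.w": 3 * torch.ones(2, 2)}
print(merge_state_dicts(a, b)["lm.w"])  # tensor of 2.0s with alpha=0.5
```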

Wei Liu ✈️ ICLR2025 (@weiliu99)'s Twitter Profile Photo

“What is the answer of 1 + 1?”
Large Reasoning Models (LRMs) may generate 1500+ tokens just to answer this trivial question.
Too much thinking 🤯
Can LRMs be both Faster AND Stronger? Yes.
Introducing LASER💥: Learn to Reason Efficiently with Adaptive Length-based Reward Shaping
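
The tweet doesn't spell out the reward, but a minimal sketch of what length-based reward shaping can look like (a hypothetical budget and linear decay, not LASER's actual formula):

```python
# Hypothetical length-shaped reward for RL fine-tuning of a reasoning model:
# full credit for correct answers within a token budget, decayed credit beyond it.
def shaped_reward(is_correct: bool, num_tokens: int, budget: int = 1024) -> float:
    if not is_correct:
        return 0.0                     # no credit for wrong answers at any length
    if num_tokens <= budget:
        return 1.0                     # full credit for concise correct answers
    overshoot = num_tokens - budget
    return max(0.0, 1.0 - overshoot / budget)  # linear decay past the budget

# A correct but 1500-token answer to "1 + 1" under a 512-token budget:
print(shaped_reward(True, 1500, budget=512))   # -> 0.0, pushing toward brevity
```
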
Yuzhen Huang @ ICLR 2025 (@yuzhenh17)'s Twitter Profile Photo

🔍 Are Verifiers Trustworthy in RLVR?
Our paper, Pitfalls of Rule- and Model-based Verifiers, exposes the critical flaws in reinforcement learning verification for mathematical reasoning.

🔑 Key findings:
1️⃣ Rule-based verifiers miss correct answers, especially when presented in
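
A concrete (made-up) instance of finding 1️⃣: an exact string match rejects answers that are mathematically identical but formatted differently.

```python
# Illustration of a rule-based verifier missing correct answers: exact string
# matching fails on equivalent renderings of the same value.
from fractions import Fraction

gold = "1/2"
candidates = ["1/2", "0.5", "\\frac{1}{2}"]

def exact_match(pred: str, gold: str) -> bool:
    return pred.strip() == gold.strip()

def numeric_match(pred: str, gold: str) -> bool:
    try:
        return Fraction(pred) == Fraction(gold)   # parses "1/2" and "0.5"
    except ValueError:
        return False                              # can't parse LaTeX, etc.

for pred in candidates:
    print(pred, exact_match(pred, gold), numeric_match(pred, gold))
# "0.5" passes only the numeric check; "\frac{1}{2}" passes neither, which is
# where model-based verifiers come in, with their own failure modes.
```
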
Xueliang Zhao (@xlzhao_hku)'s Twitter Profile Photo

🔥 Meet PromptCoT-Mamba

The first reasoning model with constant-memory inference to beat Transformers on competition-level math & code

⚡ Efficient decoding: no attention, no KV cache

⚡ +16.0% / +7.1% / +16.6% vs. s1.1-7B on AIME 24 / 25 / LiveCodeBench

🚀 Up to 3.66× faster
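
A toy contrast of the memory claim (illustrative shapes, not the model's actual implementation): attention decoding appends to a KV cache that grows with every generated token, while a Mamba-style state-space decoder folds each token into a fixed-size state.

```python
# Toy contrast: attention stores one (key, value) pair per generated token,
# so memory grows as O(sequence length); a recurrent/state-space decoder
# updates a fixed-size state, so memory stays O(1).
import numpy as np

d = 16                    # hidden size (illustrative)
kv_cache = []             # attention: grows every step
state = np.zeros(d)       # SSM: fixed size forever

for step in range(1000):
    h = np.random.randn(d)                 # stand-in for the step's hidden vector
    kv_cache.append((h.copy(), h.copy()))  # O(step) memory
    state = 0.9 * state + 0.1 * h          # O(1) memory at any sequence length

print(len(kv_cache), state.shape)          # 1000 pairs vs. a single (16,) vector
```
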
Sergey Levine (@svlevine)'s Twitter Profile Photo

I always found it puzzling how language models learn so much from next-token prediction, while video models learn so little from next-frame prediction. Maybe it's because LLMs are actually brain scanners in disguise. Idle musings in my new blog post: sergeylevine.substack.com/p/language-mod…

HKUNLP (@hkunlp2020)'s Twitter Profile Photo

Hongru Wang from CUHK will be giving a talk titled "Theory of agent: from definition to objective" on ⏰ Wednesday, June 11, 3pm HKT (Thursday, June 11, 11am PDT). Link to talk: hku.zoom.us/j/91654661534?…

Tanishq Mathew Abraham, Ph.D. (@iscienceluvr)'s Twitter Profile Photo

DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation

Apple introduces DiffuCoder, a 7B diffusion LLM trained on 130B tokens of code

The authors also propose a diffusion-native RL training framework, coupled-GRPO

Decoding of dLLMs differs from
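
The tweet cuts off mid-contrast, but the gist is that dLLMs do not decode left-to-right. A generic sketch of masked-diffusion decoding (not DiffuCoder's exact procedure; `model` is a hypothetical stand-in returning per-position logits):

```python
# Generic masked-diffusion decoding sketch: start fully masked, then over a few
# steps commit the positions the model is most confident about, in any order.
# (A real decoder would exclude mask_id from the predictable vocabulary.)
import torch

def diffusion_decode(model, length: int = 64, steps: int = 8, mask_id: int = 0):
    tokens = torch.full((length,), mask_id)          # everything starts masked
    per_step = length // steps
    for _ in range(steps):
        logits = model(tokens)                       # (length, vocab_size)
        conf, preds = logits.softmax(-1).max(-1)     # confidence per position
        conf[tokens != mask_id] = -1.0               # never revisit filled slots
        chosen = conf.topk(per_step).indices         # most confident masked slots
        tokens[chosen] = preds[chosen]               # commit; order is not L-to-R
    return tokens

# e.g. with a dummy "model" that returns random logits over a 100-token vocab:
dummy = lambda toks: torch.randn(toks.shape[0], 100)
print(diffusion_decode(dummy))
```
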
Zirui Wu (@williamzr7)'s Twitter Profile Photo

We present DreamOn: a simple yet effective method for variable-length generation in diffusion language models. Our approach boosts code infilling performance significantly and even catches up with oracle results.
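
The mechanism isn't detailed in the tweet; one hypothetical way to picture going "beyond a fixed canvas" is letting each mask slot expand or disappear during decoding instead of fixing the infill length up front. A toy, with random choices standing in for the model's decisions (not DreamOn's actual algorithm):

```python
# Toy picture of variable-length infilling: a mask slot can resolve to a token,
# split into two masks (canvas grows), or vanish (canvas shrinks).
import random

MASK = "<mask>"

def toy_variable_length_infill(prefix, suffix, max_steps=50):
    seq = prefix + [MASK] + suffix                  # begin with one mask slot
    for _ in range(max_steps):
        if MASK not in seq:
            break
        i = seq.index(MASK)
        move = random.choice(["fill", "expand", "delete"])
        if move == "expand":
            seq[i:i + 1] = [MASK, MASK]             # infill region grows
        elif move == "delete":
            seq[i:i + 1] = []                       # infill region shrinks
        else:
            seq[i] = f"tok{random.randint(0, 9)}"   # commit a concrete token
    return seq

print(toy_variable_length_infill(["def", "f", "("], [")", ":"]))
```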

Lingpeng Kong (@ikekong)'s Twitter Profile Photo

What happened after Dream 7B? First, Dream-Coder 7B: A fully open diffusion LLM for code delivering strong performance, trained exclusively on public data. Plus, DreamOn cracks the variable-length generation problem! It enables code infilling that goes beyond a fixed canvas.

HKUNLP (@hkunlp2020)'s Twitter Profile Photo

Xinyu Yang from CMU will be giving a talk titled "Multiverse: Your Language Models Secretly Decide How to Parallelize and Merge Generation" on Friday, July 25, 11am HKT (Thursday, July 24, 8pm PDT). Link to talk: hku.zoom.us/j/92651812689?…

HKUNLP (@hkunlp2020)'s Twitter Profile Photo

Jinjie Ni from NUS will be giving a talk titled "Diffusion Language Models are Super Data Learners" on Friday, Aug 22, 11am HKT. Link to talk: hku.zoom.us/j/94293996114?…
