Zhiheng LYU (@zhihenglyu) 's Twitter Profile
Zhiheng LYU

@zhihenglyu

MMath Student @UWaterloo TIGER-Lab
Prev @HKUniversity @ETH_en @UCBerkeley

ID: 1529087177845616640

Link: http://cogito233.github.io | Joined: 24-05-2022 13:09:45

23 Tweets

117 Followers

337 Following

Cong Wei (@congwei1230) 's Twitter Profile Photo

🚀Thrilled to introduce ☕️MoCha: Towards Movie-Grade Talking Character Synthesis Please unmute to hear the demo audio. ✨We defined a novel task: Talking Characters, which aims to generate character animations directly from Natural Language and Speech input. ✨We propose

Kevin Yang (@kevinyang41) 's Twitter Profile Photo

Will be at NAACL next week, excited to share two of our papers: FACTTRACK: Time-Aware World State Tracking in Story Outlines arxiv.org/abs/2407.16347 THOUGHTSCULPT: Reasoning with Intermediate Revision and Search arxiv.org/abs/2404.05966 Shoutout to first authors Zhiheng LYU and

Dongfu Jiang (@dongfujiang) 's Twitter Profile Photo

Introducing VerlTool - a unified and easy-to-extend tool agent training framework based on verl.

Recently, there's been a growing trend toward training tool agents with reinforcement learning algorithms like GRPO and PPO. Representative works include SearchR1, ToRL, ReTool, and
Yuansheng Ni (@yuanshengni) 's Twitter Profile Photo

📢 Introducing VisCoder – fine-tuned language models for Python-based visualization code generation and feedback-driven self-debugging. Existing LLMs struggle to generate reliable plotting code: outputs often raise exceptions, produce blank visuals, or fail to reflect the

MiniMax (official) (@minimax__ai) 's Twitter Profile Photo

Day 1/5 of #MiniMaxWeek: We’re open-sourcing MiniMax-M1, our latest LLM — setting new standards in long-context reasoning.

- World’s longest context window: 1M-token input, 80k-token output
- State-of-the-art agentic use among open-source models
- RL at unmatched efficiency:
Dongfu Jiang (@dongfujiang) 's Twitter Profile Photo

🚀 Excited to finally share our paper on VerlTool, released today after months of work since the initial release in late May!

VerlTool is a high-efficiency, easy-to-use framework for Agentic RL with Tool use (ARLT), built on top of VeRL. It currently supports a wide range of
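The agentic tool-use setting the VerlTool tweets describe interleaves model actions with tool executions to collect trajectories for RL training. A rough sketch of one such rollout follows; `rollout`, `toy_policy`, and the dict-based action format are illustrative assumptions, not VerlTool's interface.

```python
def rollout(policy, tools, prompt, max_turns=4):
    """One agentic tool-use trajectory (schematic sketch of the ARLT
    setting): the policy alternates between tool calls and a final
    answer, with each tool's output fed back as the next observation."""
    trajectory, observation = [], prompt
    for _ in range(max_turns):
        action = policy(observation)
        trajectory.append(action)
        if action["type"] == "answer":
            break                    # terminal action ends the episode
        # dispatch the named tool; its output becomes the next observation
        observation = tools[action["tool"]](action["args"])
    return trajectory                # to be scored/advantaged by the RL loop

# Toy policy/tool pair: call a calculator once, then answer with its result.
def toy_policy(obs):
    if obs == "What is 2+3?":
        return {"type": "tool", "tool": "calc", "args": "2+3"}
    return {"type": "answer", "text": obs}

traj = rollout(toy_policy, {"calc": lambda e: str(eval(e))}, "What is 2+3?")
```

In an RL framework, the returned trajectory would be assigned a reward and used to update the policy with an algorithm such as GRPO or PPO.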
Wenhu Chen (@wenhuchen) 's Twitter Profile Photo

Totally agree. We experimented with image-only input for every task. The results are quite good. Check out our early paper PixelWorld: arxiv.org/abs/2501.19339

Wenhu Chen (@wenhuchen) 's Twitter Profile Photo

# NewDataset for VLMs After the release of VisualWebInstruct, we kept pushing its quality and adopting different strategies to make it as accurate as possible. Today, we are releasing a verified version of VisualWebInstruct under huggingface.co/datasets/TIGER…. It has around 100K

MiniMax (official) (@minimax__ai) 's Twitter Profile Photo

We’re open-sourcing MiniMax M2 — Agent & Code Native, at 8% Claude Sonnet price, ~2x faster
⚡ Global FREE for a limited time via MiniMax Agent & API
- Advanced Coding Capability: Engineered for end-to-end developer workflows. Strong capability on a wide range of applications
Yuansheng Ni (@yuanshengni) 's Twitter Profile Photo

📢 Introducing VisCoder2: Building Multi-Language Visualization Coding Agents!

Existing LLMs often fail in practical workflows due to limited language coverage, unreliable execution, and a lack of iterative correction mechanisms.

We introduce 3 resources to address this:
Yuntian Deng (@yuntiandeng) 's Twitter Profile Photo

My student Wentao reproduced Self-Adapting LMs and wrote a blog on lessons learned. Highly recommended for anyone adapting LMs! He's also looking for a summer internship. He has 2 first-author EMNLP papers after just one year! 🔗aggregativeqa.com/dataview 🔗interactivetraining.ai

Hanqi Yan (@yan_hanqi) 's Twitter Profile Photo

🚀 Thrilled to announce that I’ll be attending EMNLP 2025 (4Nov-9Nov) in Suzhou, China! 🇨🇳✨ I’ll be showcasing our latest research from #KCLNLP on implicit Chain-of-Thoughts (CoTs) and an AI Scientist demo system 🤖🧠 📘 CODI: Compressing Chain-of-Thought into Continuous Space

Jiarui Liu (@jiarui_liu_) 's Twitter Profile Photo

Our EMNLP 2025 paper "Synthetic Socratic Debates" is presenting today in Suzhou! 📍 Poster Session 1 🕚 Nov 5, 11:00 AM (Beijing) Come chat about how LLM personas shape moral reasoning & persuasion! 🔗 arxiv.org/abs/2506.12657

Lingming Zhang (@lingmingzhang) 's Twitter Profile Photo

🤯🤯🤯 Gemini 3 Pro + Live-SWE-agent hits 77.4% on SWE-bench Verified, beating ALL existing models, including Claude 4.5!!

🤖 Live-SWE-agent is the first live software agent that autonomously self-evolves on the fly — and it even outperforms the manually engineered scaffold