Minghao Wu (@wuminghao_nlp) Twitter Tweets • TwiCopy

Minghao Wu

@wuminghao_nlp

ID: 1390844858395283462

Link: https://minghao-wu.github.io/ · Joined: 08-05-2021 01:44:00

462 Tweets

647 Followers

554 Following

Minghao Wu

@wuminghao_nlp

9 months ago

I recently analyzed the topics of user instructions from Chatbot Arena and WildChat, with 10K instructions for each language. The patterns observed across different languages were generally similar. However, the English users asked lots of health-related questions, while

Likes: 8 · Replies: 0 · Reposts: 1

Minghao Wu

@wuminghao_nlp

9 months ago

If a paper receives a review stating that "the idea is novel and the experiments are comprehensive," what is the minimum score the paper should receive?

Likes: 2 · Replies: 0 · Reposts: 0

Minghao Wu

@wuminghao_nlp

8 months ago

support 10M context size ≠ perform well on 10M context size. 🤫

Likes: 7 · Replies: 1 · Reposts: 0

Minghao Wu

@wuminghao_nlp

8 months ago

🥳🥳🥳New dataset: huggingface.co/datasets/mingh… We recently received some free compute, so we created a synthetic dataset of 10M realistic personas using meta-llama/Llama-3.3-70B-Instruct & Qwen/Qwen2.5-72B-Instruct. Each persona includes features like name, DOB, personality, and

Likes: 49 · Replies: 0 · Reposts: 16

Longyue Wang

@wangly0229

8 months ago

🌺GPT-4o’s image generation is stunning — but how well does it handle complex scenarios? 🤔 We introduce 🚀CIGEVAL🚀, a novel method to evaluate models' capabilities in Conditional Image Generation 🖼️➕🖼️🟰🖼️. Find out how top models perform when conditions get truly

Likes: 50 · Replies: 2 · Reposts: 22

Longyue Wang

@wangly0229

8 months ago

🌏 New position paper analyzes 2,000+ multilingual LLM benchmarks from 148 countries (2021-2024) and reveals key challenges in multilingual AI evaluation. 🌺

Likes: 25 · Replies: 1 · Reposts: 10

AK

@_akhaliq

8 months ago

Alibaba presents The Bitter Lesson Learned from 2,000+ Multilingual Benchmarks

Likes: 620 · Replies: 4 · Reposts: 106

Minghao Wu

@wuminghao_nlp

8 months ago

I suddenly realize that, given that llama4 doesn't perform well, my reviewers (probably) will not ask me to include additional experiments using llama4 into my resubmissions. yay!

Likes: 7 · Replies: 0 · Reposts: 0

Yunxin Li

@lyxtg

8 months ago

Our upgraded O1-like Anim-Director is nearly ready! Witness the magic as it transforms a few simple words into a complete animated short film Girl & Cat with zero human intervention! Click the image below to watch the film! Youtube: youtube.com/watch?v=O8XLR1…

Likes: 4 · Replies: 1 · Reposts: 1

Yunxin Li

@lyxtg

7 months ago

🎉 Exciting news! Welcome to CulturalLingo, our new VideoVista series that bridges cultures (China, North America, and Europe), languages, and domains (140+) in video comprehension. Join us on this journey of video understanding! 🌍📽️ Link of paper: researchgate.net/publication/39…

Likes: 15 · Replies: 1 · Reposts: 13

Minghao Wu

@wuminghao_nlp

7 months ago

To be honest, this is my new favorite reading of the year.

Likes: 7 · Replies: 1 · Reposts: 0

Minghao Wu

@wuminghao_nlp

7 months ago

🚀 Excited to share that our paper "GraphFilter" is accepted at #ICML2025! We introduce a novel bipartite graph approach for data selection in LLM fine-tuning, balancing quality & diversity for optimal performance. GraphFilter outperforms 9 baselines across 6 benchmarks—setting

Likes: 24 · Replies: 1 · Reposts: 1

Minghao Wu

@wuminghao_nlp

7 months ago

Thanks for your kind words, mate! The first round of review was somewhat helpful, but after that, most of the reviews were something like "the model backbones are outdated." 🤦🏻‍♂️ I’ve resubmitted this paper for several times but haven't gotten an acceptance. 😮‍💨 I guess the 40

Likes: 10 · Replies: 0 · Reposts: 0

Minghao Wu

@wuminghao_nlp

7 months ago

accepted by ACL2025!!! 🎉🎉🎉 many thanks to Weixuan Wang, Barry Haddow, and Alexandra Birch

Likes: 36 · Replies: 1 · Reposts: 0

Weixuan Wang

@weixuanwang66

7 months ago

📢 New Paper Alert 📢 🍓HBO: A Novel Balancing Method for LLM SFT! 🎯 Bilevel strategy for tackling data imbalance across and within datasets. 🌍 Global Actor: Dynamically adjusts sampling across heterogeneous subsets. 🔍 Local Actors: Optimize subset training by difficulty.

Likes: 6 · Replies: 1 · Reposts: 1

Weixuan Wang

@weixuanwang66

7 months ago

🚨 What if you could hijack any LLM's brain using external expert models? ExpertSteer does exactly that! 🧠⚡ Meet ExpertSteer: a breakthrough that lets you inject expert knowledge into any LLM, guiding its responses without updating model parameters.