Minghao Wu (@wuminghao_nlp)'s Twitter Profile
Minghao Wu

@wuminghao_nlp

Ph.D. @MonashUni | AI | NLP | Multilinguality | MT | ex-@JD_Corporate @Huawei @mbzuai @TencentGlobal | Opinions are my own | 🥰 @WeixuanWang66

ID: 1390844858395283462

Link: https://minghao-wu.github.io/ | Joined: 08-05-2021 01:44:00

462 Tweets

647 Followers

554 Following

Minghao Wu (@wuminghao_nlp)

I recently analyzed the topics of user instructions from Chatbot Arena and WildChat, with 10K instructions for each language. The patterns observed across different languages were generally similar. However, the English users asked lots of health-related questions, while

Minghao Wu (@wuminghao_nlp)

If a paper receives a review stating that "the idea is novel and the experiments are comprehensive," what is the minimum score the paper should receive?

Minghao Wu (@wuminghao_nlp)

🥳🥳🥳 New dataset: huggingface.co/datasets/mingh… We recently received some free compute, so we created a synthetic dataset of 10M realistic personas using meta-llama/Llama-3.3-70B-Instruct & Qwen/Qwen2.5-72B-Instruct. Each persona includes features like name, DOB, personality, and

Longyue Wang (@wangly0229)

🌺GPT-4o's image generation is stunning, but how well does it handle complex scenarios? 🤔 We introduce 🚀CIGEVAL🚀, a novel method to evaluate models' capabilities in Conditional Image Generation 🖼️➕🖼️🟰🖼️. Find out how top models perform when conditions get truly

Longyue Wang (@wangly0229)

New position paper analyzes 2,000+ multilingual LLM benchmarks from 148 countries (2021-2024) and reveals key challenges in multilingual AI evaluation. 🌺

Minghao Wu (@wuminghao_nlp)

I suddenly realize that, given that llama4 doesn't perform well, my reviewers (probably) will not ask me to include additional experiments using llama4 into my resubmissions. yay!

Yunxin Li (@lyxtg)

Our upgraded O1-like Anim-Director is nearly ready! Witness the magic as it transforms a few simple words into a complete animated short film Girl & Cat with zero human intervention! Click the image below to watch the film! YouTube: youtube.com/watch?v=O8XLR1…

Yunxin Li (@lyxtg)

🎉 Exciting news! Welcome to CulturalLingo, our new VideoVista series that bridges cultures (China, North America, and Europe), languages, and domains (140+) in video comprehension. Join us on this journey of video understanding! 📽️ Link to paper: researchgate.net/publication/39…

Minghao Wu (@wuminghao_nlp)

🚀 Excited to share that our paper "GraphFilter" is accepted at #ICML2025! We introduce a novel bipartite graph approach for data selection in LLM fine-tuning, balancing quality & diversity for optimal performance. GraphFilter outperforms 9 baselines across 6 benchmarks, setting

Minghao Wu (@wuminghao_nlp)

Thanks for your kind words, mate! The first round of review was somewhat helpful, but after that, most of the reviews were something like "the model backbones are outdated." 🤦🏻‍♂️ I've resubmitted this paper several times but haven't gotten an acceptance. 😮‍💨 I guess the 40

Weixuan Wang (@weixuanwang66)

📢 New Paper Alert 📢 HBO: A Novel Balancing Method for LLM SFT! 🎯 Bilevel strategy for tackling data imbalance across and within datasets. Global Actor: Dynamically adjusts sampling across heterogeneous subsets. Local Actors: Optimize subset training by difficulty.

Weixuan Wang (@weixuanwang66)

🚨 What if you could hijack any LLM's brain using external expert models? ExpertSteer does exactly that! 🧠⚡ Meet ExpertSteer: a breakthrough that lets you inject expert knowledge into any LLM, guiding its responses without updating model parameters.

Minghao Wu (@wuminghao_nlp)

Just arrived in Vienna for #ACL2025. I will be presenting a few papers and participating in one of the panels. Friends, feel free to catch up!