Haoran Xu (@fe1ixxu) 's Twitter Profile
Haoran Xu

@fe1ixxu

Senior Researcher @Microsoft | Ph.D. @JHU '24| ex-intern @Microsoft Research | @Meta AI | @Amazon Alexa AI

ID: 899148379279818752

linkhttp://www.fe1ixxu.com calendar_today20-08-2017 05:57:33

78 Tweet

496 Followers

181 Following

Lingfeng Shen (@lingfeng_nlp) 's Twitter Profile Photo

📢 Happy to share that our paper on #LLM safety in multilingual contexts has been accepted at #ACL 2024! ✨ We show the difficulty of alleviating multilingual safety issues in LLMs through standard alignment methods. arxiv.org/abs/2401.13136 🧵1/7

Lingfeng Shen (@lingfeng_nlp) 's Twitter Profile Photo

Super excited that our work got picked for an #Oral presentation at #ICML this year! Had an awesome time collaborating with Aayush Mishra and Daniel Khashabi 🕊️ at JHU CLSP. Pity I can't make it to Vienna because of visa issues😅

Haoran Xu (@fe1ixxu) 's Twitter Profile Photo

We recently had multiple rounds of discussions with the SimPO authors regarding the lack of comparison to CPO in their main paper. We both agree that it was an unintentional oversight, and they will update the paper to address it. We appreciate their positive and prompt response

Haoran Xu (@fe1ixxu) 's Twitter Profile Photo

Here’s some better news: Combining CPO and SimPO can likely improve the model! Check out more details in our GitHub code: github.com/fe1ixxu/CPO_SI…

Here’s some better news: Combining CPO and SimPO can likely improve the model! Check out more details in our GitHub code: github.com/fe1ixxu/CPO_SI…
Young (@yjkim362) 's Twitter Profile Photo

"What if Phi meets MoE?" I am super excited to share our new Phi-3.5-MoE. Phi-3.5-MoE is a 16 x 3.8B MoE model that only activates 6.6B params with 2 experts. MMLU score of 78.9! It outperforms Llama-3.1 8B, Gemma-2-9B, and Gemini-1.5-Flash. And, close to GPT-4o-mini. MIT lic

Haoran Xu (@fe1ixxu) 's Twitter Profile Photo

Excited to share that Phi-4-mini has been released! This was my first time rolling up my sleeves and experiencing the entire text training process. We also have a reasoning-enhanced Phi-4—outperforming many 7B reasoning models—which we plan to release very soon. Stay tuned!

Young (@yjkim362) 's Twitter Profile Photo

We also arxived #Phi-4-Mini technical report to cover our innovations for building strong lightweight multimodal model Phi-4-multimodal and language model Phi-4-mini. We use mixture-of-LoRAs technique to combine text, image, speech modalities together without interference.

HyoJung Han (@h__j___han) 's Twitter Profile Photo

I'll be presenting our work, VocADT, tomorrow at #ICLR2025✨ Check out our poster session: iclr.cc/virtual/2025/p… 🗓️Thu 24 Apr 3 p.m. - 5:30 p.m 📍Hall 3 + Hall 2B #250 So excited to be attending ICLR 2026 in Singapore🇸🇬

I'll be presenting our work, VocADT, tomorrow at #ICLR2025✨
Check out our poster session: iclr.cc/virtual/2025/p…
🗓️Thu 24 Apr 3 p.m. - 5:30 p.m
📍Hall 3 + Hall 2B #250
So excited to be attending <a href="/iclr_conf/">ICLR 2026</a> in Singapore🇸🇬
Weizhu Chen (@weizhuchen) 's Twitter Profile Photo

Glad to see the team used a 3.8B model (Phi-4-mini-reasoning) to achieve 94.6 in Math-500 and 57.5 in AIME-24. arxiv: arxiv.org/pdf/2504.21233 hf: huggingface.co/microsoft/Phi-… Azure: aka.ms/phi4-mini-reas…

Glad to see the team used a 3.8B model (Phi-4-mini-reasoning) to achieve 94.6 in Math-500 and 57.5 in AIME-24.  
arxiv: arxiv.org/pdf/2504.21233
hf: huggingface.co/microsoft/Phi-…
Azure: aka.ms/phi4-mini-reas…
Satya Nadella (@satyanadella) 's Twitter Profile Photo

Another big step forward for our SLM Phi family, with new reasoning models that once again redefine what is possible with small and efficient AI.

JHU Computer Science (@jhucompsci) 's Twitter Profile Photo

Fluent, fast, and fair—in collaboration with Microsoft Research, Johns Hopkins computer scientists (including Haoran Xu & Kenton Murray) have built a new machine translation model that achieves top-tier performance across 50 diverse languages. Learn more: cs.jhu.edu/news/fluent-fa…

Fluent, fast, and fair—in collaboration with <a href="/MSFTResearch/">Microsoft Research</a>, Johns Hopkins computer scientists (including <a href="/fe1ixxu/">Haoran Xu</a> &amp; <a href="/kentonmurray/">Kenton Murray</a>) have built a new machine translation model that achieves top-tier performance across 50 diverse languages. Learn more: cs.jhu.edu/news/fluent-fa…