Haoran Xu (@fe1ixxu) Twitter Tweets • TwiCopy

Haoran Xu

@fe1ixxu

+ Follow

Senior Researcher @Microsoft | Ph.D. @JHU '24| ex-intern @Microsoft Research | @Meta AI | @Amazon Alexa AI

ID: 899148379279818752

linkhttp://www.fe1ixxu.com calendar_today20-08-2017 05:57:33

78 Tweet

496 Followers

181 Following

Haoran Xu

@fe1ixxu

2 years ago

Honored to give a talk at MASC-SLL!

thumb_up_off_alt5

chat_bubble_outline0

repeat0

shareShare

📢 Happy to share that our paper on #LLM safety in multilingual contexts has been accepted at #ACL 2024! ✨ We show the difficulty of alleviating multilingual safety issues in LLMs through standard alignment methods. arxiv.org/abs/2401.13136 🧵1/7

thumb_up_off_alt41

chat_bubble_outline2

repeat17

shareShare

Lingfeng Shen

@lingfeng_nlp

a year ago

Super excited that our work got picked for an #Oral presentation at #ICML this year! Had an awesome time collaborating with Aayush Mishra and Daniel Khashabi 🕊️ at JHU CLSP. Pity I can't make it to Vienna because of visa issues😅

thumb_up_off_alt29

chat_bubble_outline0

repeat4

shareShare

Haoran Xu

@fe1ixxu

a year ago

We recently had multiple rounds of discussions with the SimPO authors regarding the lack of comparison to CPO in their main paper. We both agree that it was an unintentional oversight, and they will update the paper to address it. We appreciate their positive and prompt response

thumb_up_off_alt21

chat_bubble_outline0

repeat4

shareShare

Haoran Xu

@fe1ixxu

a year ago

Here’s some better news: Combining CPO and SimPO can likely improve the model! Check out more details in our GitHub code: github.com/fe1ixxu/CPO_SI…

thumb_up_off_alt37

chat_bubble_outline1

repeat13

shareShare

Young

@yjkim362

a year ago

"What if Phi meets MoE?" I am super excited to share our new Phi-3.5-MoE. Phi-3.5-MoE is a 16 x 3.8B MoE model that only activates 6.6B params with 2 experts. MMLU score of 78.9! It outperforms Llama-3.1 8B, Gemma-2-9B, and Gemini-1.5-Flash. And, close to GPT-4o-mini. MIT lic

thumb_up_off_alt66

chat_bubble_outline3

repeat8

shareShare

Haoran Xu

@fe1ixxu

a year ago

3 years, I am officially Dr. Xu now!! Big thanks to my advisors: Kenton Murray and Philipp Koehn. I can't achieve this without you!

3 years, I am officially Dr. Xu now!! Big thanks to my advisors: <a href="/kentonmurray/">Kenton Murray</a> and Philipp Koehn. I can't achieve this without you!

thumb_up_off_alt146

chat_bubble_outline14

repeat7

shareShare

Haoran Xu

@fe1ixxu

a year ago

Glad to see CPO is in a lecture now!

thumb_up_off_alt8

chat_bubble_outline0

repeat1

shareShare

Haoran Xu

@fe1ixxu

10 months ago

Excited to share that X-ALMA got accepted at #ICLR2025! See you in Singapore!

thumb_up_off_alt7

chat_bubble_outline0

repeat2

shareShare

Haoran Xu

@fe1ixxu

9 months ago

Excited to share that Phi-4-mini has been released! This was my first time rolling up my sleeves and experiencing the entire text training process. We also have a reasoning-enhanced Phi-4—outperforming many 7B reasoning models—which we plan to release very soon. Stay tuned!

thumb_up_off_alt16

chat_bubble_outline0

repeat0

shareShare

Young

@yjkim362

9 months ago

We also arxived #Phi-4-Mini technical report to cover our innovations for building strong lightweight multimodal model Phi-4-multimodal and language model Phi-4-mini. We use mixture-of-LoRAs technique to combine text, image, speech modalities together without interference.

thumb_up_off_alt55

chat_bubble_outline1

repeat8

shareShare

HyoJung Han

@h__j___han

7 months ago

I'll be presenting our work, VocADT, tomorrow at #ICLR2025✨ Check out our poster session: iclr.cc/virtual/2025/p… 🗓️Thu 24 Apr 3 p.m. - 5:30 p.m 📍Hall 3 + Hall 2B #250 So excited to be attending ICLR 2026 in Singapore🇸🇬

thumb_up_off_alt31

chat_bubble_outline0

repeat11

shareShare

Weizhu Chen

@weizhuchen

7 months ago

Glad to see the team used a 3.8B model (Phi-4-mini-reasoning) to achieve 94.6 in Math-500 and 57.5 in AIME-24. arxiv: arxiv.org/pdf/2504.21233 hf: huggingface.co/microsoft/Phi-… Azure: aka.ms/phi4-mini-reas…

thumb_up_off_alt27

chat_bubble_outline2

repeat4

shareShare

Satya Nadella

@satyanadella

7 months ago

Another big step forward for our SLM Phi family, with new reasoning models that once again redefine what is possible with small and efficient AI.

thumb_up_off_alt851

chat_bubble_outline68

repeat103

shareShare

JHU Computer Science

@jhucompsci

6 months ago

Fluent, fast, and fair—in collaboration with Microsoft Research, Johns Hopkins computer scientists (including Haoran Xu & Kenton Murray) have built a new machine translation model that achieves top-tier performance across 50 diverse languages. Learn more: cs.jhu.edu/news/fluent-fa…

Fluent, fast, and fair—in collaboration with <a href="/MSFTResearch/">Microsoft Research</a>, Johns Hopkins computer scientists (including <a href="/fe1ixxu/">Haoran Xu</a> & <a href="/kentonmurray/">Kenton Murray</a>) have built a new machine translation model that achieves top-tier performance across 50 diverse languages. Learn more: cs.jhu.edu/news/fluent-fa…

thumb_up_off_alt8

chat_bubble_outline0

repeat3

shareShare