Haoran Xu (@fe1ixxu)'s Twitter Profile
Haoran Xu

@fe1ixxu

PhD student in CS @jhuclsp | Intern @Microsoft Research | ex-intern @Meta AI and @Amazon Alexa AI

ID: 899148379279818752

Website: http://www.fe1ixxu.com | Joined: 20-08-2017 05:57:33

59 Tweets

247 Followers

158 Following

Weiting (Steven) Tan (@weiting_nlp):

Is your model struggling with high latency and huge memory costs for real-time sequence processing?

🚀 Introducing STAR: A transformer-based model for streaming seq2seq transduction with compression.
arxiv.org/abs/2402.01172

Young (@yjkim362):

We love DPO for its elegance and simplicity. So, we are making it even better! By eliminating the reference model, the loss function becomes contrastive and we call it CPO (Contrastive Preference Optimization). It's even more effective at our target task than DPO!
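To make the contrast with DPO concrete, here is a minimal, hypothetical sketch of what a reference-free, CPO-style loss could look like in PyTorch. The function name, the `beta` temperature, and the added NLL term on the preferred outputs are assumptions for illustration, not the paper's exact formulation.

```python
import torch
import torch.nn.functional as F

def cpo_style_loss(chosen_logps: torch.Tensor,
                   rejected_logps: torch.Tensor,
                   beta: float = 0.1) -> torch.Tensor:
    """Reference-free preference loss (hypothetical CPO-style sketch).

    chosen_logps / rejected_logps: summed token log-probabilities of the
    preferred and dispreferred sequences under the *policy* model only;
    unlike DPO, no frozen reference model is required.
    """
    # Contrastive term: push the preferred sequence above the rejected one.
    prefer = -F.logsigmoid(beta * (chosen_logps - rejected_logps)).mean()
    # Assumed NLL regularizer on the preferred outputs.
    nll = -chosen_logps.mean()
    return prefer + nll
```

Because the reference-model log-probabilities drop out of the sigmoid term, only the policy model has to be kept in memory during training.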

Young (@yjkim362):

Opening up a new generation of machine translation leveraging the power of LLMs! It's now in (w/ Haoran Xu, Amr Sharaf, Hany Hassan Awadalla).

Teaser: Another breakthrough is coming, soonish..

Lingfeng Shen (@Lingfeng_nlp):

So happy to share that our paper 'The Trickle-down Impact of Reward (In-)consistency on RLHF' (arxiv.org/abs/2309.16155…) has been accepted by ICLR this year.

I believe we should explore and enhance RLHF from a data-centric perspective! JHU CLSP

JHU Computer Science (@JHUCompSci):

Multi-language mastery: minimized hardware, maximized efficiency! Johns Hopkins computer scientists (feat. Haoran Xu & Kenton Murray) introduce a new method to reduce the size of multilingual language models. hub.jhu.edu/2023/12/07/mul…

JHU CLSP (@jhuclsp):

“Condensing Multilingual Knowledge with Lightweight Language-Specific Modules”

Draft: arxiv.org/abs/2305.13993
By: Haoran Xu and co-authors

TLDR: We propose lightweight yet parameter-efficient language-specific modules and then fuse the multilingual knowledge into a shared module.
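As a rough illustration of the idea, here is a hedged sketch of a shared linear layer augmented with low-rank, per-language deltas; the class name, the low-rank parameterization, and the routing by `lang_id` are assumptions for illustration, not the paper's exact module design.

```python
import torch
import torch.nn as nn

class LanguageSpecificLinear(nn.Module):
    """Shared linear layer plus lightweight per-language modules (sketch)."""

    def __init__(self, d_model: int, num_languages: int, rank: int = 8):
        super().__init__()
        self.shared = nn.Linear(d_model, d_model)            # shared module
        # One low-rank (A, B) pair per language: delta_W = B @ A.
        self.A = nn.Parameter(torch.randn(num_languages, rank, d_model) * 0.01)
        self.B = nn.Parameter(torch.zeros(num_languages, d_model, rank))

    def forward(self, x: torch.Tensor, lang_id: int) -> torch.Tensor:
        # x: (batch, seq, d_model); lang_id selects the language-specific delta.
        delta = torch.einsum("bsd,rd->bsr", x, self.A[lang_id])
        delta = torch.einsum("bsr,dr->bsd", delta, self.B[lang_id])
        return self.shared(x) + delta
```

Keeping the per-language part low-rank is what makes the modules cheap to add for many languages while the shared layer carries the common multilingual knowledge.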

Lingfeng Shen (@Lingfeng_nlp):

Is In-Context Learning (ICL) equivalent to Gradient Descent (GD)? There is a common belief that applying ICL in LLMs functions like GD-based fine-tuning. But does this hold in real-world LLMs? 🤔

Find out in our latest paper: arxiv.org/abs/2310.08540

Lingfeng Shen (@Lingfeng_nlp):

Happy to share our findings paper: Flatness-Aware Prompt Selection Improves Accuracy and Sample Efficiency: arxiv.org/abs/2305.10713

Tianjian Li (@tli104):

(1/5) The standard MLE objective is notoriously vulnerable to noise! How can we make LLMs robust to noise in the training data? 🤔

We propose Error Norm Truncation (ENT), a modified training objective that ignores noisy tokens in the training corpus.

📰: arxiv.org/abs/2310.00840
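A minimal sketch of the idea, assuming the error norm is the L2 distance between the model's predicted distribution and the one-hot target, and that tokens above a threshold are simply masked out of the cross-entropy; the function name and the `threshold` hyperparameter are hypothetical.

```python
import torch
import torch.nn.functional as F

def ent_style_loss(logits: torch.Tensor, targets: torch.Tensor,
                   threshold: float = 1.0, ignore_index: int = -100) -> torch.Tensor:
    """Error-Norm-Truncation-style objective (hedged sketch).

    logits: (batch, seq, vocab); targets: (batch, seq).
    Tokens whose error norm exceeds `threshold` are treated as likely
    noise and excluded from the loss.
    """
    probs = F.softmax(logits, dim=-1)
    one_hot = F.one_hot(targets.clamp(min=0), probs.size(-1)).float()
    err_norm = (probs - one_hot).norm(dim=-1)                 # (batch, seq)
    keep = ((err_norm <= threshold) & (targets != ignore_index)).float()
    ce = F.cross_entropy(logits.transpose(1, 2), targets,
                         ignore_index=ignore_index, reduction="none")
    return (ce * keep).sum() / keep.sum().clamp(min=1.0)
```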

Lingfeng Shen (@Lingfeng_nlp):

'Does the consistency/robustness of the reward model matter in RLHF?' Check out our latest work 🥳! arxiv.org/pdf/2309.16155…

Had a lot of excitement with this work! Feel so lucky to have Sihao Chen and Daniel Khashabi 🕊️ as collaborators!
