FeYuan (@t_feyuan)'s Twitter Profile
FeYuan

@t_feyuan

Shanghai AI Laboratory Researcher

ID: 887490216528723968

Joined: 19-07-2017 01:52:11

26 Tweets

72 Followers

79 Following

Hanxu Hu (@huhanxu1)'s Twitter Profile Photo

I am very glad to share that after one year and many rejections, finally, our Chain-of-Symbol paper has been accepted at COLM (Conference on Language Modeling)!!!
Check our paper and code:
arxiv.org/abs/2305.10276
github.com/hanxuhu/chain-…
Lei Li (@lileics)'s Twitter Profile Photo

#ACL2024NLP Fei Yuan (FeYuan) is presenting a work analyzing the vocabulary's impact on LLMs' multilingual capability. We compare the translation performance of LLaMA and of LLaMA fine-tuned on bilingual parallel corpora, and evaluate their performance on 100 languages. Guess?
chang ma (@ma_chang_nlp)'s Twitter Profile Photo

[1/4] RSA is accepted by #EMNLP2024 main track 🥳
- Enhance any protein understanding model with lightning-fast retrieval.
- 373x faster than MSA, on-the-fly computation, achieves comparable performance.  

Preprint link: biorxiv.org/content/10.110…
Code: github.com/HKUNLP/RSA
Sasha Rush (@srush_nlp)'s Twitter Profile Photo

Long-context is central to models like OpenAI o1, but rare to see in natural data. Extension methods grow context by post-training open LLMs. A tutorial and controlled study of this area of long-context extension. arxiv.org/abs/2409.12181 youtu.be/dc4chADushM

chang ma (@ma_chang_nlp)'s Twitter Profile Photo

Congratulations to FeYuan! KS-Lottery is accepted by #NAACL2025.

We provide a theoretically guaranteed solution for finding lottery tickets for multilingual LLMs. The results are stunning: fine-tuning the embeddings of 18 tokens would be enough for learning new multilingual
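The core idea mentioned above (updating only a tiny set of token embeddings while everything else stays frozen) can be sketched in a few lines. This is an illustrative toy, not the authors' KS-Lottery code: the token IDs, embedding size, learning rate, and gradients are all made-up placeholders.

```python
def finetune_selected_embeddings(embeddings, lottery_token_ids, grads, lr=0.1):
    """Apply an SGD step only to the embedding rows in lottery_token_ids.

    embeddings: dict token_id -> list[float] (a toy embedding matrix)
    grads:      dict token_id -> list[float] (gradients from some loss)
    """
    selected = set(lottery_token_ids)
    for tok, grad in grads.items():
        if tok not in selected:
            continue  # frozen: non-lottery rows are never touched
        embeddings[tok] = [w - lr * g for w, g in zip(embeddings[tok], grad)]
    return embeddings

# Toy usage: 4 tokens, 2-dim embeddings, only token 2 is in the "ticket".
emb = {0: [1.0, 1.0], 1: [1.0, 1.0], 2: [1.0, 1.0], 3: [1.0, 1.0]}
grads = {0: [0.5, 0.5], 2: [0.5, 0.5]}
emb = finetune_selected_embeddings(emb, lottery_token_ids=[2], grads=grads)
print(emb[0], emb[2])  # frozen row unchanged; lottery row updated
```

The gradient mask is the whole trick: with, say, 18 selected token IDs, the number of trainable parameters is 18 x embedding_dim, a vanishingly small fraction of the model.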
FeYuan (@t_feyuan)'s Twitter Profile Photo

We recently published a new paper [arxiv.org/pdf/2502.07346] on comprehensive multilingual evaluation, in collaboration with researchers from NJU, Shanghai AI Lab, and CMU. 💥 To maintain high quality, three distinct native-speaking annotators independently annotate each sample.

FeYuan (@t_feyuan)'s Twitter Profile Photo

Pleased to introduce our latest finding: arxiv.org/pdf/2502.15592 We analyze several key points of data synthesis in long-context instruction tuning and show that these insights can lead to significant performance improvements.

GAO ChangJiang (@gao_cj)'s Twitter Profile Photo

📢 Happy to bring our new paper! 🤩

> Non-En can exceed En in reasoning tasks
> Ensembling 4+ langs in inference can bring about 10% more theoretical gain than En
> Gain robust to lang choice and translation quality

Paper: huggingface.co/papers/2504.11…
Repo: github.com/CONE-MT/multil…
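As a toy illustration of the ensembling idea in the points above (not the paper's actual method — the per-language answers and the majority-vote rule here are assumptions for the sketch):

```python
from collections import Counter

def ensemble_answers(answers_by_lang):
    """Majority vote over answers produced by prompting in several
    languages; ties are broken by first-seen order."""
    counts = Counter(answers_by_lang.values())
    return counts.most_common(1)[0][0]

# Hypothetical final answers after prompting the same question in 5 languages.
answers = {"en": "42", "fr": "42", "de": "41", "zh": "42", "es": "41"}
print(ensemble_answers(answers))  # "42"
```

Voting over languages only helps when errors are not perfectly correlated across languages, which is what the "gain robust to lang choice" finding suggests.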
Yijun_Yang (@yijun_yang123)'s Twitter Profile Photo

Why do Long Context Language Models (LCLMs) excel at needle-in-a-haystack tasks but struggle with real-world applications? 
Can we evaluate them in a fully controlled setting?

🎉 Introducing our latest work:  "A Controllable Examination for Long-Context Language Models"

TL;DR:
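For context, the needle-in-a-haystack setup the tweet contrasts against is easy to reproduce synthetically: a single "needle" fact is buried at a controlled depth inside filler text, and the model is asked to retrieve it. A minimal sketch, with made-up filler sentences and an assumed depth parameter:

```python
import random

def make_haystack(needle, filler_sentences, n_fillers, needle_pos, seed=0):
    """Build a synthetic long context with one 'needle' fact inserted
    at a controlled depth (needle_pos: 0.0 = start, 1.0 = end)."""
    rng = random.Random(seed)
    fillers = [rng.choice(filler_sentences) for _ in range(n_fillers)]
    idx = int(needle_pos * n_fillers)
    return " ".join(fillers[:idx] + [needle] + fillers[idx:])

ctx = make_haystack(
    needle="The secret code is 7391.",
    filler_sentences=["The sky is blue.", "Water is wet."],
    n_fillers=100,
    needle_pos=0.5,
)
print("7391" in ctx)  # True
```

The appeal of such synthetic tasks is full control over length and depth; the criticism in the tweet is that exact-string retrieval like this is far easier than the reasoning real applications demand.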
FeYuan (@t_feyuan)'s Twitter Profile Photo

We're excited to introduce VisCoder2, our latest research advancing visual coding capabilities. Paper: huggingface.co/papers/2510.23… HomePage: tiger-ai-lab.github.io/VisCoder2/