Siwei Wu (吴思为) (@siweiwu7)'s Twitter Profile
Siwei Wu(吴思为)

@siweiwu7

I am an MSc student and an NLPer. I will start my PhD at the University of Manchester in Fall 2024.

ID: 1552904424137592833

Link: https://wusiwei0410.github.io/ · Joined: 29-07-2022 06:30:55

30 Tweets

48 Followers

80 Following

Ge Zhang (@GeZhang86038849):

[1/n]
🎉🎉🎉 Excited to share our latest work: 'The Fine Line: Navigating Large Language Model Pretraining with Down-streaming Capability Analysis'! We delve into the dynamics of LLMs across different scales and domains.

💡Highlights include:

🗺️ Comprehensive Model Evaluation:

Ge Zhang (@GeZhang86038849):

[1/n]
🚀 Excited to share our latest work on OpenCodeInterpreter! With a blend of execution results and human feedback, we've achieved significant advancements in code generation. Here are the key points:

✨ Introducing OpenCodeInterpreter - a leap in iterative code refinement.
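
A minimal sketch of how execution-feedback-driven refinement might look in practice (this is not OpenCodeInterpreter's actual pipeline; `generate_code`, `run_snippet`, and `refine` are hypothetical names standing in for an LLM call and helpers):

```python
# Sketch: generate code, execute it, and feed any error output back to the
# model for another attempt. Purely illustrative, not the released system.
import subprocess
import sys
import tempfile

def run_snippet(code: str) -> tuple[bool, str]:
    """Execute a Python snippet in a subprocess and return (success, output)."""
    with tempfile.NamedTemporaryFile("w", suffix=".py", delete=False) as f:
        f.write(code)
        path = f.name
    result = subprocess.run([sys.executable, path],
                            capture_output=True, text=True, timeout=30)
    return result.returncode == 0, result.stdout + result.stderr

def refine(task: str, generate_code, max_rounds: int = 3) -> str:
    """Iteratively refine generated code using execution results as feedback."""
    code, feedback = "", ""
    for _ in range(max_rounds):
        prompt = task if not feedback else (
            f"{task}\n\nPrevious attempt failed with:\n{feedback}\nPlease fix it.")
        code = generate_code(prompt)       # hypothetical LLM call
        ok, output = run_snippet(code)
        if ok:
            return code                    # execution succeeded, stop refining
        feedback = output                  # otherwise feed the error back in
    return code
```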

AK (@_akhaliq):

MathPile: A Billion-Token-Scale Pretraining Corpus for Math

paper page: huggingface.co/papers/2312.17…

High-quality, large-scale corpora are the cornerstone of building foundation models. In this work, we introduce MathPile, a diverse and high-quality math-centric corpus comprising

Wenjie Zheng (@WJoyZheng):

Research on the reliability of Large Language Models' behaviors. I've encountered similar situations in practical use, which left me puzzled about why LLMs generate different results 🤨.

Sinclair Wang (@SinclairWang1):

Welcome to check out our paper 🥳
dl.acm.org/doi/10.1145/35…

When we started this project, ChatGPT had not yet appeared, so this paper may not seem as attractive in the ChatGPT era. But I still want to share our insights from this project, in hopes that more people will learn

Wenjie Zheng (@WJoyZheng):

Multimodal Emotion Recognition in Multiparty Conversations (MERMC) currently focuses on the text and audio modalities. How can we extract the facial sequences of the real speaker to further help MER? Introducing Facial expression-aware Multimodal Multi-Task learning (FacialMMT).
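
A rough, hypothetical sketch of what a multimodal multi-task setup of this kind could look like, with a main emotion head trained alongside an auxiliary facial-expression head (the architecture, feature dimensions, and class names below are illustrative assumptions, not the actual FacialMMT model):

```python
# Sketch: fuse text, audio, and facial-sequence features; predict utterance
# emotion (main task) and facial expression (auxiliary task) jointly.
import torch
import torch.nn as nn

class MultimodalMultiTask(nn.Module):
    def __init__(self, text_dim=768, audio_dim=128, face_dim=512,
                 hidden=256, num_emotions=7, num_expressions=6):
        super().__init__()
        # Project each modality into a shared hidden space, then fuse.
        self.text_proj = nn.Linear(text_dim, hidden)
        self.audio_proj = nn.Linear(audio_dim, hidden)
        self.face_proj = nn.Linear(face_dim, hidden)
        self.fuse = nn.Sequential(nn.Linear(hidden * 3, hidden), nn.ReLU())
        # Main task: emotion; auxiliary task: facial expression.
        self.emotion_head = nn.Linear(hidden, num_emotions)
        self.expression_head = nn.Linear(hidden, num_expressions)

    def forward(self, text_feat, audio_feat, face_feat):
        h = torch.cat([self.text_proj(text_feat),
                       self.audio_proj(audio_feat),
                       self.face_proj(face_feat)], dim=-1)
        h = self.fuse(h)
        return self.emotion_head(h), self.expression_head(h)

# Illustrative forward pass with random features for a batch of 4 utterances.
model = MultimodalMultiTask()
emo_logits, expr_logits = model(torch.randn(4, 768),
                                torch.randn(4, 128),
                                torch.randn(4, 512))
```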

Siwei Wu (吴思为) (@siweiwu7):

Interesting!

I think this in-depth analysis of fake news is also highly valuable for addressing the issue of hallucination in large language models, and it offers useful insights into potential solutions to this problem.
