Siyu Yuan (@siyu_yuan_) Twitter Tweets • TwiCopy

Siyu Yuan

@siyu_yuan_

+ Follow

Ph.D. candidate at Fudan University. Ex-Research Intern at
@MSFTResearch Asia and @BytedanceTalk AI Lab

ID: 967629804941074432

linkhttps://siyuyuan.github.io/ calendar_today25-02-2018 05:18:18

141 Tweet

620 Followers

482 Following

Chenchen Ye

@chenchenye_ccye

4 months ago

📢New LLM Agents Benchmark! Introducing 🌟MIRAI🌟: A groundbreaking benchmark crafted for evaluating LLM agents in temporal forecasting of international events with tool use and complex reasoning! 📜 Arxiv: arxiv.org/abs/2407.01231 🔗 Project page: mirai-llm.github.io 🧵1/N

thumb_up_off_alt304

chat_bubble_outline14

repeat70

shareShare

Siyu Yuan

@siyu_yuan_

4 months ago

Thanks for sharing! How do we automatically extend the specialized agent to multi-agent systems? 🤔 Try EvoAgent, a generic method via the evolutionary algorithm without any extra human designs! 💫

thumb_up_off_alt1

chat_bubble_outline0

repeat0

shareShare

Siyu Yuan

@siyu_yuan_

4 months ago

Thanks for sharing! 🥰 Let's try EvoAgent! A generic method via the evolutionary algorithm to automatically extend the specialized agent to multi-agent systems🥳

thumb_up_off_alt13

chat_bubble_outline0

repeat1

shareShare

Rulin Shao

@rulinshao

4 months ago

🔥We release the first open-source 1.4T-token RAG datastore and present a scaling study for RAG on perplexity and downstream tasks! We show LM+RAG scales better than LM alone, with better performance for the same training compute (pretraining+indexing) retrievalscaling.github.io 🧵

thumb_up_off_alt357

chat_bubble_outline18

repeat85

shareShare

Sander Wang

@sanderwangsd

4 months ago

1/n This Tuesday afternoon, catch my ICML paper presentation at Hall C 4-9 #1006. I’ll reveal the training stability differences between traditional nonlinear recurrent neural networks and the latest state-space models, with a focus on long-term memory. openreview.net/forum?id=BwG8h…

thumb_up_off_alt13

chat_bubble_outline5

repeat1

shareShare

Sander Wang

@sanderwangsd

4 months ago

4/n I’ll be presenting a poster on LongSSM at the NGSM workshop! It’s all about length extrapolation in recurrent models. Swing by for a coffee chat and let’s discuss! ☕ I’m also interested in length extension for transformers and multi-modal scenarios. arxiv.org/abs/2406.02080

thumb_up_off_alt4

chat_bubble_outline0

repeat2

shareShare

Life After My Ph.D.

@lifeaftermyphd

3 months ago

this is every lab (i don't make the rules)

thumb_up_off_alt7,7K

chat_bubble_outline37

repeat1,1K

shareShare

Siyu Yuan

@siyu_yuan_

3 months ago

Interesting! We also propose a similar method, i.e., EvoAgent, a generic method to automatically extend expert agents to multi-agent systems via the evolutionary algorithm, thereby improving the effectiveness of LLM-based agents in solving tasks.

thumb_up_off_alt34

chat_bubble_outline3

repeat3

shareShare

Science girl

@gunsnrosesgirl3

2 months ago

A variety of ways to present omelettes

thumb_up_off_alt101,101K

chat_bubble_outline634

repeat14,14K

shareShare