Sainbayar Sukhbaatar (@tesatory) 's Twitter Profile
Sainbayar Sukhbaatar

@tesatory

Research Scientist at FAIR @AIatMeta
Research: Memory Networks, Asymmetric Self-Play, CommNet, Adaptive-Span, System2Attention, ...

ID: 142201024

Joined: 10-05-2010 07:16:18

1.1K Tweets

2.2K Followers

316 Following

Tanishq Mathew Abraham, Ph.D. (@iscienceluvr) 's Twitter Profile Photo

SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks

New paper from Meta introduces a new multi-turn LLM agent benchmark and a novel RL algorithm for training multi-turn LLM agents with effective credit assignment over the multiple turns.
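
A minimal sketch of what turn-level credit assignment can look like in code, assuming a generic REINFORCE-style objective rather than the paper's exact algorithm; the tensor names (`turn_logprobs`, `turn_advantages`) are illustrative:

```python
import torch

# Generic sketch of turn-level credit assignment (an assumption, not
# necessarily the paper's exact objective): instead of spreading one episode
# reward uniformly over a multi-turn rollout, each turn's log-likelihood is
# weighted by its own advantage estimate, so a good turn inside a failed
# episode still gets positive credit.

def multi_turn_policy_loss(turn_logprobs: torch.Tensor,
                           turn_advantages: torch.Tensor) -> torch.Tensor:
    # turn_logprobs:   (batch, turns) summed log-probs of the agent's tokens per turn
    # turn_advantages: (batch, turns) per-turn advantage estimates (e.g. from a critic)
    return -(turn_advantages.detach() * turn_logprobs).mean()

# Toy usage: 2 episodes of 3 turns each.
logp = torch.randn(2, 3, requires_grad=True)
adv = torch.tensor([[0.7, -0.2, 1.1], [-0.5, 0.3, 0.0]])
multi_turn_policy_loss(logp, adv).backward()
```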
Sainbayar Sukhbaatar (@tesatory) 's Twitter Profile Photo

Sweet! 🍭 New paper about training multi-step LLM agents. If a DPO-based critic has extra information during training, it can train a better LLM agent.
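
A minimal sketch of the idea in this tweet, assuming a Bradley-Terry/DPO-style preference loss and toy embeddings standing in for encoded text; `TurnCritic`, the privileged `priv` input, and `beta` are illustrative names, not the paper's API:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

# Sketch of a turn-level critic that conditions on privileged, training-only
# information (here `priv`, e.g. an embedding of the reference solution the
# policy never sees) and is trained with a DPO / Bradley-Terry preference
# loss over pairs of turns.  All module and variable names are illustrative.

class TurnCritic(nn.Module):
    def __init__(self, dim: int = 64):
        super().__init__()
        # Scores a turn given [context ; privileged info ; candidate turn].
        self.mlp = nn.Sequential(nn.Linear(3 * dim, dim), nn.ReLU(),
                                 nn.Linear(dim, 1))

    def forward(self, ctx, priv, turn):
        return self.mlp(torch.cat([ctx, priv, turn], dim=-1)).squeeze(-1)

def critic_preference_loss(critic, ctx, priv, turn_chosen, turn_rejected, beta=0.1):
    # Prefer the turn that led to the better outcome (Bradley-Terry objective).
    margin = critic(ctx, priv, turn_chosen) - critic(ctx, priv, turn_rejected)
    return -F.logsigmoid(beta * margin).mean()

# Toy usage with random embeddings standing in for encoded text.
dim, batch = 64, 8
critic = TurnCritic(dim)
ctx, priv = torch.randn(batch, dim), torch.randn(batch, dim)
good, bad = torch.randn(batch, dim), torch.randn(batch, dim)
critic_preference_loss(critic, ctx, priv, good, bad).backward()
```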

Sainbayar Sukhbaatar (@tesatory) 's Twitter Profile Photo

Got our first "obviously" LLM-generated review. I should have known when we were working on LMs ten years ago that it would come back and bite us 😂. But seriously, reviewing feels broken beyond repair.

Sainbayar Sukhbaatar (@tesatory) 's Twitter Profile Photo

Attention operates at the token level, but sometimes what we're looking for spans multiple tokens. MTA makes it possible to condition attention on multiple tokens. Super fun work!
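
A minimal sketch of how attention could be conditioned on multiple tokens by convolving logits over the (query, key) grid, under assumptions of my own (single head, kernel size 3, one particular causal masking scheme); this is not the paper's implementation:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

# Sketch: let the attention weight for a (query, key) pair also depend on
# neighbouring positions by convolving the attention logits before the
# softmax, so the model can match patterns that span several tokens.

class KeyQueryConvAttention(nn.Module):
    def __init__(self, dim: int, kernel: int = 3):
        super().__init__()
        self.q_proj = nn.Linear(dim, dim)
        self.k_proj = nn.Linear(dim, dim)
        self.v_proj = nn.Linear(dim, dim)
        self.kernel = kernel
        # Learned 2D filter applied to the grid of attention logits.
        self.conv = nn.Conv2d(1, 1, kernel_size=kernel)

    def forward(self, x):                                          # x: (B, T, D)
        b, t, d = x.shape
        logits = self.q_proj(x) @ self.k_proj(x).transpose(-2, -1) / d ** 0.5
        causal = torch.triu(torch.ones(t, t, dtype=torch.bool, device=x.device), 1)
        # Zero future keys before convolving and pad the query axis causally,
        # so convolved logits never see information from future tokens.
        z = logits.masked_fill(causal, 0.0).unsqueeze(1)            # (B, 1, T, T)
        k = self.kernel
        z = F.pad(z, (k // 2, k // 2, k - 1, 0))                    # keys: both sides; queries: past only
        logits = self.conv(z).squeeze(1)                            # back to (B, T, T)
        logits = logits.masked_fill(causal, float("-inf"))
        return F.softmax(logits, dim=-1) @ self.v_proj(x)

# Toy usage
attn = KeyQueryConvAttention(dim=32)
out = attn(torch.randn(2, 16, 32))                                  # (2, 16, 32)
```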

The AI Timeline (@theaitimeline) 's Twitter Profile Photo

🚨This week's top AI/ML research papers:

- Inference-Time Scaling for Generalist Reward Modeling
- Multi-Token Attention
- Why do LLMs attend to the first token?
- Command A
- LLMs Pass the Turing Test
- Advances and Challenges in Foundation Agents
- PaperBench
- Effectively
TuringPost (@theturingpost) 's Twitter Profile Photo

4 advanced attention mechanisms you should know:

• Slim attention — 8× less memory, 5× faster generation by storing only K from KV pairs and recomputing V.

• XAttention — 13.5× speedup on long sequences via "looking" at the sum of values along diagonal lines in the attention
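
A minimal sketch of the "store only K, recompute V" trick from the Slim attention bullet above, assuming square, invertible key/value projections; a worked check rather than the authors' code:

```python
import torch

# Sketch: K = X @ W_K and V = X @ W_V come from the same hidden states X,
# so if W_K is square and invertible, V can be recovered from the cached K
# as V = K @ (W_K^{-1} @ W_V), and only K needs to be kept in the cache.

torch.manual_seed(0)
torch.set_default_dtype(torch.float64)       # double precision for the check

d = 64
X = torch.randn(10, d)                       # hidden states of 10 cached tokens
W_K, W_V = torch.randn(d, d), torch.randn(d, d)

K = X @ W_K                                  # the only thing kept in the cache
V_true = X @ W_V                             # normally cached too; here just for checking

W_kv = torch.linalg.solve(W_K, W_V)          # W_K^{-1} @ W_V, precomputed once per layer
V_rec = K @ W_kv                             # V recomputed on the fly from K

print(torch.allclose(V_true, V_rec))         # True
```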
Jason Weston (@jaseweston) 's Twitter Profile Photo

Google friends & ex-colleagues -- Google Scholar seems pretty broken 😔. Our most-cited paper from last year, "Self-Rewarding LLMs", has disappeared! Scholar has clustered it with another paper (SPIN) and it isn't in the search results. This is bad for the PhD student & first author

Sainbayar Sukhbaatar (@tesatory) 's Twitter Profile Photo

Really excited to give a talk here after 10 years 🎉 The RAM workshop is about "Reasoning, Attention, Memory", and those topics have had a huge impact on AI over the last decade. So there will be plenty to reflect on and look forward to!