Sasha Rush (@srush_nlp) Twitter Tweets • TwiCopy

Sander Wang

1 gün önce

I’ll be at ICLR next week! 🇦🇹

Reach out if you want to talk about state-space models vs transformers, training stability, Kolmogorov-Arnold theorem, multi-modal understanding, etc.

thumb_up_off_alt20

chat_bubble_outline0

repeat1

shareShare

account_circle

I'll be at ICLR ICLR 2024 in Vienna next week! Let me know if you're interested in meeting up.

I'll be presenting my work on Stack Attention (spotlight paper w/ David Chiang ) on Tuesday at Poster Session 2, 4:30-6:30pm, Halle B.

thumb_up_off_alt18

chat_bubble_outline0

repeat5

shareShare

account_circle

Brihi Joshi 🛫 ICLR’24

@BrihiJ

1 gün önce

Arriving in Vienna soon for #ICLR2024 ! 🙌🏼 🇦🇹

I’m so excited for my first non *CL conference, so reach out if you’d like to talk about interpretability, Humans x LLMs or just generally about music or coffee ☕️

Alsoooo, Sahana Ramnath and I will be talking about our work too ⬇️

thumb_up_off_alt83

chat_bubble_outline0

repeat6

shareShare

account_circle

Surbhi Goel

@SurbhiGoel_

1 gün önce

If you're going to be in Vienna for #ICLR2024 , check out Mentoring Chats. It's a unique opportunity to chat with senior researchers and ask them about their journey and research. We have an exciting line up of mentors! See here for more details: blog.iclr.cc/2024/05/01/hug…

account_circle

yi 🦛

@agihippo

1 gün önce

A good evidence for why RNN alternative transformers don't work is because google published the griffin paper. 🫣

thumb_up_off_alt110

chat_bubble_outline0

repeat4

shareShare

account_circle

Yam Peleg

@Yampeleg

2 gün önce

FineWeb 🍷 is basically 𝚝𝚑𝚎_𝚒𝚗𝚝𝚎𝚛𝚗𝚎𝚝.𝚌𝚜𝚟

account_circle

Caglar Gulcehre

@caglarml

2 gün önce

This is the first paper from my lab at EPFL🙂 🎊. It conducts a detailed analysis of the representation dynamics of PPO to understand the reasons for poor performance that happens under certain conditions. We also conducted detailed analyses on ways to mitigate it.

account_circle

JHU CLSP

@jhuclsp

2 gün önce

A packed room at Yoav Artzi 's keynote talk

thumb_up_off_alt52

chat_bubble_outline0

repeat3

shareShare

account_circle

Graham Neubig

@gneubig

2 gün önce

Sasha Rush I think one nice thing is that we as a community spent a lot of time in 2023 making hard but reasonable agent benchmarks where if we make the performance go up it actually might mean something, and now we get to improve them!

thumb_up_off_alt10

chat_bubble_outline0

repeat1

shareShare

account_circle

Sasha Rush

@srush_nlp

2 gün önce

Highly recommended ⬇️

thumb_up_off_alt13

chat_bubble_outline0

repeat3

shareShare

account_circle

Pika

@pika_labs

3 gün önce

Our co-founder and CEO, Demi Guo, was just named 'One to Watch' by Bloomberg Bloomberg Technology, and we agree. Keep an eye on us for what’s next 👀

Our co-founder and CEO, @demi_guo_, was just named 'One to Watch' by Bloomberg @technology, and we agree. Keep an eye on us for what’s next 👀

thumb_up_off_alt119

chat_bubble_outline0

repeat8

shareShare

account_circle

Yoav Artzi

@yoavartzi

3 gün önce

Slowly but surely making progress on Conference on Language Modeling reviewing. Reviews are due May 10! 😱🤗
9%|🦙▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐|

[no idea what's going on with these llamas frantically working, but I am scared]

Slowly but surely making progress on @COLM_conf reviewing. Reviews are due May 10! 😱🤗 9%|🦙▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐| [no idea what's going on with these llamas frantically working, but I am scared]

thumb_up_off_alt29

chat_bubble_outline0

repeat2

shareShare

account_circle

Sasha Rush

@srush_nlp

3 gün önce

Not really sure what it's for, but it's very fun.

thumb_up_off_alt61

chat_bubble_outline0

repeat1

shareShare

account_circle

CIFAR

@CIFAR_News

4 gün önce

CIFAR researcher Christopher Manning has been awarded the 2024 IEEE Awards John von Neumann Medal for outstanding achievements in computer-related science and technology.

Learn more about his career and life-long love of language: engineering.stanford.edu/magazine/layin…

account_circle

Christopher Manning

@chrmanning

4 gün önce

🇦🇹 I’m going to #iclr2024 in Vienna next week. Who all do I know that’ll be there?

Students’ papers there:
Katherine Tian: arxiv.org/abs/2311.08401
Charlotte Nicks: openreview.net/forum?id=4eJDM…
Eric: arxiv.org/abs/2310.12962
Parth Sarthi: arxiv.org/abs/2401.18059

account_circle

Emma Pierson

@2plus2make5

4 gün önce

Honored to have been named the inaugural Andrew H. and Ann R. Tish Assistant Professor at Cornell Tech!

thumb_up_off_alt243

chat_bubble_outline0

repeat3

shareShare

account_circle

LLM Security

@llm_sec

4 gün önce

AmpleGCG: Learning a Universal and Transferable Generative Model of Adversarial Suffixes for Jailbreaking Both Open and Closed LLMs

'we first discuss the drawbacks of solely picking the suffix with the lowest loss during GCG optimization for jailbreaking and uncover the missed…

thumb_up_off_alt34

chat_bubble_outline0

repeat5

shareShare

account_circle

Sasha Rush

@srush_nlp

4 gün önce

It was a mistake to pick a fight with the prompters. My replies are filled with dozens of slight variants of the same comeback. Think they're getting close.

thumb_up_off_alt67

chat_bubble_outline0

repeat2

shareShare

account_circle

Sasha Rush

Sander Wang

Brian DuSell 🇺🇦

Brihi Joshi 🛫 ICLR’24

Surbhi Goel

yi 🦛

Yam Peleg

Caglar Gulcehre

JHU CLSP

Graham Neubig

Sasha Rush

Pika

Yoav Artzi

Sasha Rush

CIFAR

Christopher Manning

Emma Pierson

LLM Security

Sasha Rush