Sasha Rush (@srush_nlp)'s Twitter Profile
Sasha Rush

@srush_nlp

Professor, Programmer in NYC.
Cornell Tech, Hugging Face 🤗
https://t.co/cZl0wTfqGz

ID:4558314927

http://rush-nlp.com · 21-12-2015 15:46:59

6.2K Tweets

51.9K Followers

465 Following

Sander Wang (@SanderWangSD)

I’ll be at ICLR next week! 🇦🇹

Reach out if you want to talk about state-space models vs transformers, training stability, Kolmogorov-Arnold theorem, multi-modal understanding, etc.

Brian DuSell 🇺🇦 (@BrianDuSell)

I'll be at ICLR 2024 in Vienna next week! Let me know if you're interested in meeting up.

I'll be presenting my work on Stack Attention (spotlight paper w/ David Chiang) on Tuesday at Poster Session 2, 4:30-6:30pm, Halle B.

Brihi Joshi 🛫 ICLR’24 (@BrihiJ)

Arriving in Vienna soon for ICLR! 🙌🏼 🇦🇹

I’m so excited for my first non *CL conference, so reach out if you’d like to talk about interpretability, Humans x LLMs or just generally about music or coffee ☕️

Alsoooo, Sahana Ramnath and I will be talking about our work too ⬇️

Surbhi Goel (@SurbhiGoel_)

If you're going to be in Vienna for ICLR, check out Mentoring Chats. It's a unique opportunity to chat with senior researchers and ask them about their journey and research. We have an exciting lineup of mentors! See here for more details: blog.iclr.cc/2024/05/01/hug…

yi 🦛 (@agihippo)

Good evidence that RNN alternatives to transformers don't work is that Google published the Griffin paper. 🫣

Caglar Gulcehre (@caglarml)

This is the first paper from my lab at EPFL 🙂 🎊. It conducts a detailed analysis of the representation dynamics of PPO to understand why performance degrades under certain conditions. We also analyzed ways to mitigate it in detail.

Graham Neubig (@gneubig)

Sasha Rush I think one nice thing is that we as a community spent a lot of time in 2023 making hard but reasonable agent benchmarks where if we make the performance go up it actually might mean something, and now we get to improve them!

Pika (@pika_labs)

Our co-founder and CEO, Demi Guo, was just named 'One to Watch' by Bloomberg Technology, and we agree. Keep an eye on us for what’s next 👀

Yoav Artzi (@yoavartzi)

Slowly but surely making progress on Conference on Language Modeling reviewing. Reviews are due May 10! 😱🤗
9%|🦙▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐|

[no idea what's going on with these llamas frantically working, but I am scared]

CIFAR (@CIFAR_News)

CIFAR researcher Christopher Manning has been awarded the 2024 IEEE John von Neumann Medal for outstanding achievements in computer-related science and technology.

Learn more about his career and life-long love of language: engineering.stanford.edu/magazine/layin…

Christopher Manning (@chrmanning)

🇦🇹 I’m going to ICLR in Vienna next week. Who all do I know that’ll be there?

Students’ papers there:
Katherine Tian: arxiv.org/abs/2311.08401
Charlotte Nicks: openreview.net/forum?id=4eJDM…
Eric: arxiv.org/abs/2310.12962
Parth Sarthi: arxiv.org/abs/2401.18059

LLM Security (@llm_sec)

AmpleGCG: Learning a Universal and Transferable Generative Model of Adversarial Suffixes for Jailbreaking Both Open and Closed LLMs

'we first discuss the drawbacks of solely picking the suffix with the lowest loss during GCG optimization for jailbreaking and uncover the missed…

Sasha Rush (@srush_nlp)

It was a mistake to pick a fight with the prompters. My replies are filled with dozens of slight variants of the same comeback. Think they're getting close.
