Siva Reddy(@sivareddyg) 's Twitter Profileg
Siva Reddy

@sivareddyg

Assistant Professor @Mila_Quebec @McGillU @ServiceNowRSRCH; Postdoc @StanfordNLP; PhD @EdinburghNLP; Natural Language Processor #NLProc

ID:56686035

linkhttps://sivareddy.in calendar_today14-07-2009 12:56:42

1,7K Tweets

4,9K Followers

973 Following

Tao Yu(@taoyds) 's Twitter Profile Photo

๐Ÿš€Multimodal agents is on rise in 2024! But even building app/domain-specific agent env is hard๐Ÿ˜ฐ.

Our real computer OSWorld env allows you to define agent tasks about arbitrary apps on diff. OS w.o crafting new envs.

๐ŸงBenchmarked on 369 OSWorld tasks: >>

๐Ÿš€Multimodal agents is on rise in 2024! But even building app/domain-specific agent env is hard๐Ÿ˜ฐ. Our real computer OSWorld env allows you to define agent tasks about arbitrary apps on diff. OS w.o crafting new envs. ๐ŸงBenchmarked #VLMs on 369 OSWorld tasks: #GPT4V >> #Claude3
account_circle
Siva Reddy(@sivareddyg) 's Twitter Profile Photo

Why read the abstract when you can hear it as a song/rap ๐Ÿ˜„. Most important use of AI. A must feature for arxiv. Love this! ๐ŸŽ™๏ธ๐ŸŽน

account_circle
Siva Reddy(@sivareddyg) 's Twitter Profile Photo

Mistral is not confused when we enable bidirectionality whereas LLaMA goes off the rails ๐Ÿค . We may have unlocked one secret ingredient of why Mistral is better than LLaMA. We believe it is ๐Ÿ’ฅPrefix LM๐Ÿ’ฅ. This side finding is exciting in itself!

account_circle
Siva Reddy(@sivareddyg) 's Twitter Profile Photo

LLMs are 'secretly' powerful text encoders. LLM2Vec is the key to unlock their embeddings in 1-2 hours in an unsupervised fashion using LoRA. Achieves SOTA on MTEB in the unsupervised category and also among supervised models trained on public data

Code: github.com/McGill-NLP/llmโ€ฆ

account_circle
Sasha Rush(@srush_nlp) 's Twitter Profile Photo

Monograph on 'Formal Aspects of Language Modeling' from Ryan David Cotterell et al.

arxiv.org/abs/2311.04329

It would be so nice if everyone read this and we had shared foundations. Particularly for interpretability.

account_circle
MilaQuebec(@Mila_Quebec) 's Twitter Profile Photo

Mila welcomes this morning's announcement by Canadian Prime Minister Justin Trudeau of a historic investment of over $2 billion in AI, including a strategic national computing infrastructure, and the establishment of an institute dedicated to AI safety research.

Mila welcomes this morning's announcement by Canadian Prime Minister @JustinTrudeau of a historic investment of over $2 billion in AI, including a strategic national computing infrastructure, and the establishment of an institute dedicated to AI safety research.
account_circle
UNC NLP(@uncnlp) 's Twitter Profile Photo

We are excited to host our next UNC-Chapel Hill NLP/ML Colloquium by Dr. Siva Reddy (@sivareddyg) from MilaQuebec @McGillU, talking about:

'Paradoxes in Transformer Language Models: Masking, Positional Encodings, and Routing'!

Happening this Wednesday April 10th, 2-3pm ET in FB141.

We are excited to host our next @UNC NLP/ML Colloquium by Dr. Siva Reddy (@sivareddyg) from @Mila_Quebec @McGillU, talking about: 'Paradoxes in Transformer Language Models: Masking, Positional Encodings, and Routing'! Happening this Wednesday April 10th, 2-3pm ET in FB141.
account_circle
Yoav Artzi(@yoavartzi) 's Twitter Profile Photo

Folks, some Conference on Language Modeling stats, because looking at these really brightens the mood :)
We received a total of โญ๏ธ1036โญ๏ธ submissions (for the first ever COLM!!!!). What is even more exciting is the nice distribution of topics and keywords. Exciting times ahead! โค๏ธ

Folks, some @COLM_conf stats, because looking at these really brightens the mood :) We received a total of โญ๏ธ1036โญ๏ธ submissions (for the first ever COLM!!!!). What is even more exciting is the nice distribution of topics and keywords. Exciting times ahead! โค๏ธ
account_circle
Sebastian Schuster(@sebschu) 's Twitter Profile Photo

Najoung and I are hiring a postdoc to start at BU this fall! You'll get to lead a team working on a cool and potentially highly impactful eval project, so please apply! :)

account_circle
Joe Edelman(@edelwax) 's Twitter Profile Photo

โ€œWhat are human values, and how do we align to them?โ€

Very excited to release our new paper on values alignment, co-authored with Ryan Lowe and funded by @openai.

๐Ÿ“: meaningalignment.org/values-and-aliโ€ฆ

โ€œWhat are human values, and how do we align to them?โ€ Very excited to release our new paper on values alignment, co-authored with @ryan_t_lowe and funded by @openai. ๐Ÿ“: meaningalignment.org/values-and-aliโ€ฆ
account_circle
Marius Mosbach(@mariusmosbach) 's Twitter Profile Photo

Please consider participating in our survey on how model analysis and interpretability research impacts progress in NLP. ๐Ÿ‘‡ Also, please spread the word ๐Ÿฆ

account_circle
Xing Han Lu(@xhluca) 's Twitter Profile Photo

WebLINX is not just about making a large benchmark available to researchers.

We wanted it to be easy to use and avoid wasting days preprocessing complex web data, so we built a library: github.com/McGill-NLP/webโ€ฆ

You can load+run models in minutes on Colab: colab.research.google.com/github/McGill-โ€ฆ

WebLINX is not just about making a large benchmark available to researchers. We wanted it to be easy to use and avoid wasting days preprocessing complex web data, so we built a library: github.com/McGill-NLP/webโ€ฆ You can load+run models in minutes on Colab: colab.research.google.com/github/McGill-โ€ฆ
account_circle
Sara Hooker(@sarahookr) 's Twitter Profile Photo

We are hiring a machine learning engineer role to drive making our research + weight releases as accessible as possible to the wider community. ๐Ÿ”ฅ

If you care about model efficiency, tooling, usability, translating research into impact -- get in touch!

jobs.lever.co/cohere/3dbae8bโ€ฆ

account_circle
Siva Reddy(@sivareddyg) 's Twitter Profile Photo

Many of us at MilaQuebec are thrilled to hear from hinrich schuetze about generating large scale instruction data in an unsupervised fashion. Recording will be available. My course students also had a bonus course lecture on pattern-exploiting training (PET) and GNNavi.

Many of us at @Mila_Quebec are thrilled to hear from @HinrichSchuetze about generating large scale instruction data in an unsupervised fashion. Recording will be available. My course students also had a bonus course lecture on pattern-exploiting training (PET) and GNNavi.
account_circle
Shikhar(@ShikharMurty) 's Twitter Profile Photo

Want scalable LLM agents for websites and APIs, without human labeled data?

We propose BAGEL, a method where agents synthesize their own data by exploring the environment first, leading to upto 13% improvement over zero shot agents, & automated discovery of use-cases in envs!

Want scalable LLM agents for websites and APIs, without human labeled data? We propose BAGEL, a method where agents synthesize their own data by exploring the environment first, leading to upto 13% improvement over zero shot agents, & automated discovery of use-cases in envs!
account_circle
Edoardo Ponti(@PontiEdoardo) 's Twitter Profile Photo

We retrofit LLMs by learning to compress their memory dynamically

I find this idea very promising as it creates a middle ground between vanilla Transformers and SSMs in terms of memory/performance trade-offs

I'd like to give a shout-out to Piotr Nawrot and Adrian Lancucki for theโ€ฆ

account_circle
Akari Asai(@AkariAsai) 's Twitter Profile Photo

๐—›๐—ผ๐˜„ ๐—ฐ๐—ฎ๐—ป ๐˜„๐—ฒ ๐—ฏ๐˜‚๐—ถ๐—น๐—ฑ ๐—บ๐—ผ๐—ฟ๐—ฒ ๐—ฟ๐—ฒ๐—น๐—ถ๐—ฎ๐—ฏ๐—น๐—ฒ ๐—Ÿ๐— -๐—ฏ๐—ฎ๐˜€๐—ฒ๐—ฑ ๐˜€๐˜†๐˜€๐˜๐—ฒ๐—บ๐˜€? Our new position paper advocates for retrieval-augmented LMs (RALMs) as the next gen. of LMs, exploring the promises, limitations, and a roadmap for wider adoption.
arxiv.org/abs/2403.03187 ๐Ÿงต

๐—›๐—ผ๐˜„ ๐—ฐ๐—ฎ๐—ป ๐˜„๐—ฒ ๐—ฏ๐˜‚๐—ถ๐—น๐—ฑ ๐—บ๐—ผ๐—ฟ๐—ฒ ๐—ฟ๐—ฒ๐—น๐—ถ๐—ฎ๐—ฏ๐—น๐—ฒ ๐—Ÿ๐— -๐—ฏ๐—ฎ๐˜€๐—ฒ๐—ฑ ๐˜€๐˜†๐˜€๐˜๐—ฒ๐—บ๐˜€? Our new position paper advocates for retrieval-augmented LMs (RALMs) as the next gen. of LMs, exploring the promises, limitations, and a roadmap for wider adoption. arxiv.org/abs/2403.03187 ๐Ÿงต
account_circle