Alexis Chevalier (@alexischvlr) 's Twitter Profile
Alexis Chevalier

@alexischvlr

Foundations team @cohere. Formerly @PrincetonCS, @IAS and @UniOfOxford

ID: 1661219678172114944

calendar_today24-05-2023 03:57:08

26 Tweet

123 Followers

89 Following

Tianyu Gao (@gaotianyu1350) 's Twitter Profile Photo

Check out our new paper "Enabling Large Language Models to Generate Text with Citations", w/ Howard Yen, Jiatong Yu, and Danqi Chen. We study a new paradigm where LLMs generate text with citations. We propose ALCE, the first benchmark for Automatic LLMs’ Citation Evaluation.

Check out our new paper "Enabling Large Language Models to Generate Text with Citations", w/ <a href="/HowardYen1/">Howard Yen</a>, Jiatong Yu, and <a href="/danqi_chen/">Danqi Chen</a>.

We study a new paradigm where LLMs generate text with citations. We propose ALCE, the first benchmark for Automatic LLMs’ Citation Evaluation.
Jerry Liu (@jerryjliu0) 's Twitter Profile Photo

Adapting Language Models to Compress Contexts by Chevalier et al. is a cool paper. πŸ§ͺ Compress long contexts into summary vectors, which are then used as β€œsoft prompt” inputs into an LLM. You can concat a bunch of summary vectors to represent an entire document/set of documents

Dan Friedman (@danfriedman0) 's Twitter Profile Photo

Learning Transformer Programs We designed a modified Transformer that can be trained to solve a task and then automatically converted into a discrete, human-readable program. With Alex Wettig and Danqi Chen. Paper: arxiv.org/abs/2306.01128 Code: github.com/princeton-nlp/… [1/12]

Learning Transformer Programs

We designed a modified Transformer that can be trained to solve a task and then automatically converted into a discrete, human-readable program. With <a href="/_awettig/">Alex Wettig</a> and <a href="/danqi_chen/">Danqi Chen</a>.

Paper: arxiv.org/abs/2306.01128
Code: github.com/princeton-nlp/…

[1/12]
Institute for Advanced Study (IAS) (@the_ias) 's Twitter Profile Photo

Strategies to ensure that AI tools are developed and used responsibly must be a fundamental part of the development process rather than an afterthought. How might this be achieved? Join our expert panel to discuss on June 6th. FREE registration here: ias.edu/events/steerin…

Strategies to ensure that AI tools are developed and used responsibly must be a fundamental part of the development process rather than an afterthought. How might this be achieved? Join our expert panel to discuss on June 6th. FREE registration here: ias.edu/events/steerin…
Talia Ringer 🟣 πŸŽ—οΈ (@taliaringer) 's Twitter Profile Photo

If you haven't signed up for the National Academies workshop on AI for Math, now is your chance! We have wonderful speakers like Moshe Vardi, Sean Welleck, Thierry Coquand, and Stella Biderman, and wonderful moderators like Yann LeCun, Terry Tao, and yours truly πŸ˜„ nationalacademies.org/our-work/ai-to…

Mengzhou Xia (@xiamengzhou) 's Twitter Profile Photo

We release the strongest public 1.3B and 3B models so far – the ShearedLLaMA series. Structured pruning from a large model to a small one is far more cost-effective (only 3%!) than pre-training them from scratch! Check out our paper and models at: xiamengzhou.github.io/sheared-llama/ [1/n]

We release the strongest public 1.3B and 3B models so far – the ShearedLLaMA series.
Structured pruning from a large model to a small one is far more cost-effective (only 3%!) than pre-training them from scratch!

Check out our paper and models at: xiamengzhou.github.io/sheared-llama/
[1/n]
Princeton PLI (@princetonpli) 's Twitter Profile Photo

Excited to share that 15 main conference/findings papers by PLI researchers are being presented at EMNLP in Singapore over the next several days! Learn more by checking out the latest PLI Blog post: bit.ly/473Um5H. Feel free to reach out to the authors!

Excited to share that 15 main conference/findings papers by PLI researchers are being presented at EMNLP in Singapore over the next several days!

Learn more by checking out the latest PLI Blog post: bit.ly/473Um5H. Feel free to reach out to the authors!
Alexis Chevalier (@alexischvlr) 's Twitter Profile Photo

At #EMNLP2023 presenting AutoCompressors with Alex Wettig! Check out the updated paper (arxiv.org/abs/2305.14788) and updated repo (github.com/princeton-nlp/…) where we scale up AutoCompressors to Llama-2 based models. Download our best model at huggingface.co/princeton-nlp/…!

At #EMNLP2023 presenting AutoCompressors with <a href="/_awettig/">Alex Wettig</a>! 

Check out the updated paper (arxiv.org/abs/2305.14788) and updated repo (github.com/princeton-nlp/…) where we scale up AutoCompressors to Llama-2 based models. Download our best model at huggingface.co/princeton-nlp/…!
Tianyu Gao (@gaotianyu1350) 's Twitter Profile Photo

New preprint "Improving Language Understanding from Screenshots" w/ Zirui "Colin" Wang Adithya Bhaskar Danqi Chen. We improve language understanding abilities of screenshot LMs, an emerging family of models that processes everything (including text) via visual inputs arxiv.org/abs/2402.14073

Alexis Chevalier (@alexischvlr) 's Twitter Profile Photo

Google's Gemma-7B is good on GSM8K and MATH, but it's underwhelming on our new TutorEval benchmark πŸ€” Gemma lags behind Zephyr and Mistral models, even on the short-context TutorEval-ClosedBook task! Results and code: github.com/princeton-nlp/… Paper: arxiv.org/abs/2402.11111

Alexis Chevalier (@alexischvlr) 's Twitter Profile Photo

New multimodal benchmark! Very excited to be part of this project showing the huge gap between multimodal LLMs and humans. New models are improving fast but we still have a long way to go! Check out the paper and leaderboard at charxiv.github.io/#leaderboard

Zirui "Colin" Wang (@zwcolin) 's Twitter Profile Photo

πŸŽ‰Exciting news in Multimodal LLMs!Β  We're excited to see that πˆπ§π­πžπ«π§π•π‹ π‚π‘πšπ­ π•πŸ.𝟎 and π‚πšπ¦π›π«π’πšπ§ now lead the π‚π‘πšπ«π—π’π― leaderboard (charxiv.github.io) in chart understanding for open-weight models. πŸ€”What leads to their success? Here's some of

πŸŽ‰Exciting news in Multimodal LLMs!Β 

We're excited to see that πˆπ§π­πžπ«π§π•π‹ π‚π‘πšπ­ π•πŸ.𝟎 and π‚πšπ¦π›π«π’πšπ§ now lead the π‚π‘πšπ«π—π’π― leaderboard (charxiv.github.io) in chart understanding for open-weight models.

πŸ€”What leads to their success? Here's some of
Alexis Chevalier (@alexischvlr) 's Twitter Profile Photo

Really excited to see TutorEval and TutorChat cited in the fantastic LearnLM work! πŸ˜€ Come to our #ICML poster session tomorrow to chat about our open-source LLM eval and dataset for science and education πŸ‘¨β€πŸŽ“

Really excited to see TutorEval and TutorChat cited in the fantastic LearnLM work! πŸ˜€

Come to our #ICML poster session tomorrow to chat about our open-source LLM eval and dataset for science and education πŸ‘¨β€πŸŽ“
Zirui "Colin" Wang (@zwcolin) 's Twitter Profile Photo

πŸ€– Welcome 𝐆𝐏𝐓-πŸ’π¨ 𝐌𝐒𝐧𝐒 and πˆπ§π­πžπ«π§π•π‹πŸ π‹π‹πšπŒπ€-πŸ‘ πŸ•πŸ”π to the CharXiv (charxiv.github.io)Β  leaderboard for chart understanding! As concurrently released models, GPT-4o Mini is π›πžπšπ­πžπ§ 𝐛𝐲 𝐭𝐑𝐞 𝐨𝐩𝐞𝐧-𝐰𝐞𝐒𝐠𝐑𝐭 𝐨𝐧𝐞. 🎊 Congratulations to

πŸ€– Welcome 𝐆𝐏𝐓-πŸ’π¨ 𝐌𝐒𝐧𝐒 and πˆπ§π­πžπ«π§π•π‹πŸ π‹π‹πšπŒπ€-πŸ‘ πŸ•πŸ”π to the CharXiv (charxiv.github.io)Β  leaderboard for chart understanding! As concurrently released models, GPT-4o Mini is π›πžπšπ­πžπ§ 𝐛𝐲 𝐭𝐑𝐞 𝐨𝐩𝐞𝐧-𝐰𝐞𝐒𝐠𝐑𝐭 𝐨𝐧𝐞.

🎊 Congratulations to
Alexis Chevalier (@alexischvlr) 's Twitter Profile Photo

Check out our new retrieval benchmark! We curated a large set of challenging questions about the recent ML literature and evaluated SOTA retrievers, Google, and more! Looking forward to seeing the next generation of retrieval systems support scientific research! πŸ§‘β€πŸ”¬