
Aristo Team at AI2
@ai2_aristo
Building machines that can read, learn and reason at @allen_ai
Join us: allenai.org/careers?team=a…
ID: 1516911351867736064
https://allenai.org/aristo 20-04-2022 22:48:15
74 Tweets
914 Followers
10 Following


📢New #ICLR2024 paper with Stanford NLP Group and Princeton NLP Group. We find pervasive stereotypical biases in persona-assigned LLMs and show that they can covertly degrade LLMs' reasoning skills (coding, MMLU, etc.). We also release a dataset of 1.5M model outputs to enable future research.

New work from Aristo Team at AI2 is live on arXiv! Congrats to all involved -- Kolby Nottingham, Bodhisattwa Majumder, bhavana dalvi, Sameer Singh, Peter Clark, and Roy Fox 🙌 Learn more ▶️ Project Site: allenai.github.io/sso Paper: arxiv.org/abs/2402.03244 Code: github.com/allenai/sso


Is it possible to build end-to-end autonomous discovery systems using Large Generative Models (LGMs)? 🧬 In this position paper, we argue: arxiv.org/pdf/2402.13610… 🧵 (1/n) Ai2 Aristo Team at AI2 Harshit Surana UMass Amherst University of Utah

🎉Our work ADaPT on enabling LLM agents to dynamically “adapt” to task complexity & LLM capabilities via recursive decomposition is accepted as #NAACL2024 findings!😄 Many thanks to Alexander Koller M Hartmann, P Clark, Ashish Sabharwal Mohit Bansal tusharkhot Aristo Team at AI2 Ai2 UNC NLP

Wondering why Chain-of-Thought appears to make Transformers more powerful? Find out from Ben Brubaker's elegant and broad overview📜 in Quanta Magazine, covering an upcoming ICLR-2024 paper by William Merrill, Ashish Sabharwal on precisely this topic!

"The Illusion of State in State Space Models" -- William Merrill, Jackson Petty, and Ashish Sabharwal find that newly popular "state" space models (SSMs) are surprisingly as limited as Transformers when it comes to tracking state.

🌻 Super excited about my first Computer Science publication at NAACL HLT 2025 (main)! Bodhisattwa Majumder and I study the language of deception and how language models fare at detecting it. And guess what we've found: arxiv.org/pdf/2311.07092… (1/n) 🧵 @EconUofU Ai2




Incredibly proud of our teamwork, now in ICML Conference! This position starts a series of work on data-driven scientific discovery w generative models. Follow-ups coming soon on benchmarks, systems, & accessibility in science! arxiv.org/abs/2402.13610 #ICML2024 Ai2 Aristo Team at AI2


We (Greg Durrett, bhavana dalvi, Peter Jansen (@peterjansen-ai.bsky.social), Danilo Ribeiro @ ACL 2023, Xi Ye, Wenting Zhao, Ben Lipkin, and Lionel Wong from CoCoSci MIT) are excited to announce the 2nd Workshop on Natural Language Reasoning and Structured Explanations (NLRSE), co-located with ACL 2024. 🧵





AppWorld (appworld.dev) recognized at #ACL2024nlp with a Best Resource Paper award! Congratulations to Harsh Trivedi and collaborators from Stony Brook University and Ai2 for this exciting new environment for interactive coding agents!

We (Peter Jansen (@peterjansen-ai.bsky.social), Bodhisattwa Majumder, tusharkhot, Harsh Trivedi, Tom Hope, Doug Downey, Eric Horvitz) are excited to announce the 📣1st Workshop on AI & Scientific Discovery (AISD), co-located with NAACL 2025. 📣 tinyurl.com/aisd25


