Aristo Team at AI2 (@ai2_aristo) Twitter Tweets • TwiCopy

Iz Beltagy

OLMo-7b is finally out 🎉, and we are releasing everything: weights, intermediate checkpoints, training code and logs, training data and toolkit, evaluation and adaptation code and data. Most of it has been released, and the rest is coming soon. OLMo-65b and Adapted OLMo-7b are

Likes: 308, Replies: 6, Retweets: 66

Aristo Team at AI2

@ai2_aristo

a year ago

📢New #ICLR2024 paper with Stanford NLP Group and Princeton NLP Group. We find pervasive stereotypical biases in persona-assigned LLMs and show that they can covertly degrade LLMs' reasoning skills (coding, MMLU, etc.). We also release a dataset of 1.5M model outputs to enable future research.

Likes: 34, Replies: 0, Retweets: 9

Aristo Team at AI2

@ai2_aristo

a year ago

New work from Aristo Team at AI2 is live on arXiv! Congrats to all involved -- Kolby Nottingham, Bodhisattwa Majumder, bhavana dalvi, Sameer Singh, Peter Clark, and Roy Fox 🙌 Learn more ▶️ Project Site: allenai.github.io/sso Paper: arxiv.org/abs/2402.03244 Code: github.com/allenai/sso

Likes: 9, Replies: 0, Retweets: 0

Yuling Gu

@gu_yuling

a year ago

Looking for an ✨⚙️ interpretable explanation evaluation tool 🔧💫 that can 🤩🔎 automatically characterize the explanation capabilities of modern LLMs 🔬🤩? Check out 🤖 “Digital Socrates: Evaluating LLMs through Explanation Critiques” 🤖 ! arxiv.org/abs/2311.09613 1/

Likes: 24, Replies: 1, Retweets: 2

Bodhisattwa Majumder

@mbodhisattwa

a year ago

Is it possible to build end-to-end autonomous discovery systems using Large Generative Models (LGMs)? 🧬 In this position paper, we argue: arxiv.org/pdf/2402.13610… 🧵 (1/n) Ai2 Aristo Team at AI2 Harshit Surana UMass Amherst University of Utah

Likes: 100, Replies: 4, Retweets: 33

Archiki Prasad

@archikiprasad

a year ago

🎉Our work ADaPT on enabling LLM agents to dynamically “adapt” to task complexity & LLM capabilities via recursive decomposition is accepted as #NAACL2024 findings!😄 Many thanks to Alexander Koller, M Hartmann, P Clark, Ashish Sabharwal, Mohit Bansal, tusharkhot, Aristo Team at AI2, Ai2, UNC NLP

Likes: 123, Replies: 1, Retweets: 28

Aristo Team at AI2

@ai2_aristo

a year ago

Wondering why Chain-of-Thought appears to make Transformers more powerful? Find out from Ben Brubaker's elegant and broad overview📜 in Quanta Magazine, covering an upcoming ICLR-2024 paper by William Merrill, Ashish Sabharwal on precisely this topic!

Likes: 1, Replies: 0, Retweets: 0

Aristo Team at AI2

@ai2_aristo

a year ago

"The Illusion of State in State Space Models" -- William Merrill, Jackson Petty, and Ashish Sabharwal find that newly popular "state" space models (SSMs) are surprisingly as limited as Transformers when it comes to tracking state.

Likes: 13, Replies: 1, Retweets: 0

Sanchaita Hazra

@hsanchaita

a year ago

🌻 Super excited about my first Computer Science publication at NAACL HLT 2025 (main)! Bodhisattwa Majumder and I study the language of deception and how language models fare at detecting them. And guess what we've found: arxiv.org/pdf/2311.07092… (1/n) 🧵 @EconUofU Ai2

Likes: 35, Replies: 4, Retweets: 6

Clémentine Fourrier 🍊

@clefourrier

a year ago

Is chain of thought actually helping your model? 🤔 According to the CoT Leaderboard, it seems more useful for the smaller models! Really looking forward to seeing more prompting strategies tested :) Congrats to the Logikon-AI + Ai2 teams! huggingface.co/blog/leaderboa…

Likes: 13, Replies: 3, Retweets: 6

Kolby Nottingham

@kolbytn

a year ago

Skill Set Optimization was accepted to ICML Conference 2024! I'm proud of this work and everything we learned about in-context policy improvement. Big thanks to my collaborators at Ai2. Way to go team!

Likes: 26, Replies: 1, Retweets: 4

Bodhisattwa Majumder

@mbodhisattwa

a year ago

Incredibly proud of our teamwork, now in ICML Conference! This position starts a series of work on data-driven scientific discovery w generative models. Follow-ups coming soon on benchmarks, systems, & accessibility in science! arxiv.org/abs/2402.13610 #ICML2024 Ai2 Aristo Team at AI2

Likes: 93, Replies: 0, Retweets: 14

Greg Durrett

@gregd_nlp

a year ago

We (Greg Durrett, bhavana dalvi, Peter Jansen (@peterjansen-ai.bsky.social), Danilo Ribeiro @ ACL 2023, Xi Ye, Wenting Zhao, Ben Lipkin, and Lionel Wong from CoCoSci MIT) are excited to announce the 2nd Workshop on Natural Language Reasoning and Structured Explanations (NLRSE), co-located with ACL 2024. 🧵

Likes: 79, Replies: 1, Retweets: 26

Yuling Gu

@gu_yuling

a year ago

Our paper 🤖 “Digital Socrates: Evaluating LLMs through Explanation Critiques” 🤖 has been accepted to the #ACL2024NLP main conference! 🎉 w/ my collaborators Oyvind Tafjord and Peter Clark (Ai2, Aristo Team at AI2). Try out Digital Socrates for your model evaluations! #NLProc

Likes: 27, Replies: 0, Retweets: 8

Aristo Team at AI2

@ai2_aristo

a year ago

NLRSE workshop @ ACL 2024: Deadline extended to May 21 AoE! Also note that non-archival cross-submissions (papers accepted to other venues, such as ACL Findings) can be submitted on the Google Form here: docs.google.com/forms/d/1OAzZE…

Likes: 5, Replies: 0, Retweets: 1

Aristo Team at AI2

@ai2_aristo

a year ago

Want to build or test Interactive Coding Agents? Check out AppWorld, an exciting new multi-app simulated environment and benchmark from Stony Brook University and Ai2!

Likes: 10, Replies: 0, Retweets: 2

Aristo Team at AI2

@ai2_aristo

10 months ago

AppWorld (appworld.dev) recognized at #ACL2024nlp with a Best Resource Paper award! Congratulations to Harsh Trivedi and collaborators from Stony Brook University and Ai2 for this exciting new environment for interactive coding agents!

Likes: 12, Replies: 0, Retweets: 2

bhavana dalvi

@bhavana_dalvi

7 months ago

We (Peter Jansen (@peterjansen-ai.bsky.social), Bodhisattwa Majumder, tusharkhot, Harsh Trivedi, Tom Hope, Doug Downey, Eric Horvitz) are excited to announce the 📣 1st Workshop on AI & Scientific Discovery (AISD), co-located with NAACL 2025. 📣 tinyurl.com/aisd25

Likes: 26, Replies: 0, Retweets: 11

Aristo Team at AI2

@ai2_aristo

5 months ago

📢 The countdown continues: Only one month left to submit your papers to the AI & Scientific Discovery Workshop @ NAACL 2025 ⌛️

Likes: 0, Replies: 0, Retweets: 0

Ai2

@allen_ai

3 months ago

Imagine AI doing science: reading papers, generating ideas, designing and running experiments, analyzing results… How many more discoveries can we reveal? 🧐 Meet CodeScientist, a promising next step toward autonomous scientific discovery. 🧵

Likes: 371, Replies: 6, Retweets: 106
