Ryan Chi (@ryanandrewchi) Twitter Tweets • TwiCopy

Stanford AI Lab

2 years ago

Congratulations to the Stanford AI Lab Chirpy Cardinal team, led by Ryan Chi and mentored by Christopher Manning, which has won first place for Scientific Invention and Innovation in the Alexa Prize Science SocialBot Grand Challenge 5! Full team details are here: stanfordnlp.github.io/chirpycardinal…

Congratulations to the <a href="/StanfordAILab/">Stanford AI Lab</a> Chirpy Cardinal team, led by Ryan Chi and mentored by <a href="/chrmanning/">Christopher Manning</a>, which has won first place for Scientific Invention and Innovation in the Alexa Prize Science SocialBot Grand Challenge 5! Full team details are here: stanfordnlp.github.io/chirpycardinal…

thumb_up_off_alt54

chat_bubble_outline0

repeat8

shareShare

Stanford NLP Group

@stanfordnlp

2 years ago

In its 3rd run at the Alexa Prize SocialBot Grand Challenge, our Chirpy Cardinal team, led by Ryan Chi and mentored by Christopher Manning, won first place for Scientific Invention and Innovation. Congratulations! 🎉 Team: stanfordnlp.github.io/chirpycardinal…

thumb_up_off_alt29

chat_bubble_outline0

repeat7

shareShare

Denny Zhou

@denny_zhou

2 years ago

A simple yet effective approach to fill the performance gap between zero-shot and few-shot prompting Xinyun Chen Xinyun Chen is going to present our recent work LLM analogical reasoning (arxiv.org/abs/2310.01714) this afternoon in the exciting #MathAI workshop of #NeurIPS2023.

thumb_up_off_alt99

chat_bubble_outline3

repeat20

shareShare

AK

@_akhaliq

2 years ago

Here is my selection of papers for today (15 Feb) on Hugging Face huggingface.co/papers Computing Power and the Governance of Artificial Intelligence Towards Next-Level Post-Training Quantization of Hyper-Scale Transformers PRDP: Proximal Reward Difference Prediction for

thumb_up_off_alt34

chat_bubble_outline0

repeat5

shareShare

Ryan Chi

@ryanandrewchi

2 years ago

premise order matters📈 in LLM reasoning, exposing frailties far more pronounced than a human's. See our preprint here: arxiv.org/abs/2402.08939. w/ Xinyun Chen Xuezhi Wang Denny Zhou -- grateful to have worked on this at Google DeepMind

thumb_up_off_alt6

chat_bubble_outline1

repeat0

shareShare

alphaXiv

@askalphaxiv

2 years ago

Featuring our first paper of the week, "Premise Order Matters in Reasoning With LLMs": alphaxiv.org/abs/2402.08939…. Premise reordering can lead to accuracy dropoffs of 30% in LLMs! The authors Ryan Chi Xinyun Chen will be on alphaXiv to respond to your questions!

thumb_up_off_alt11

chat_bubble_outline0

repeat4

shareShare

Hannah Rose Kirk

@hannahrosekirk

a year ago

🌎Introducing LINGOLY, our new reasoning benchmark that stumps even top LLMs (best models only reach ~35% accuracy)🥴 In a colab between University of Oxford, Stanford University and UK Linguistic Olympiad puzzle authors, we stress test LLMs on over 90 low-resource and extinct languages...

🌎Introducing LINGOLY, our new reasoning benchmark that stumps even top LLMs (best models only reach ~35% accuracy)🥴
In a colab between <a href="/UniofOxford/">University of Oxford</a>, <a href="/Stanford/">Stanford University</a> and UK Linguistic Olympiad puzzle authors, we stress test LLMs on over 90 low-resource and extinct languages...

thumb_up_off_alt142

chat_bubble_outline3

repeat35

shareShare

Harry Mayne

@harrymayne5

a year ago

🚨🌍Introducing our new reasoning benchmark, LINGOLY (which the current top models only score ~35% on!😳) LINGOLY uses UK Linguistic Olympiad puzzles in low-resource/extinct languages to robustly test reasoning A colab between University of Oxford, Stanford University and UKLO authors

thumb_up_off_alt11

chat_bubble_outline1

repeat1

shareShare

john allard 🇺🇸

@john__allard

6 months ago

Super excited to ship Reinforcement Fine‑Tuning (RFT) on o4‑mini today 🎉 Our aim is to make RL as flexible & accessible as we can. Here’s a bit on what we built and why we're pumped to let you customize our frontier reasoning models.

thumb_up_off_alt67

chat_bubble_outline7

repeat6

shareShare

Ryan Chi

@ryanandrewchi

6 months ago

Really excited to share with the world what I've been working on since joining OpenAI! Give it a try! platform.openai.com/docs/guides/re…

thumb_up_off_alt11

chat_bubble_outline2

repeat1

shareShare

Ryan Chi

@ryanandrewchi

5 months ago

Thank you Christopher Manning for everything you've given me and Stanford NLP! It's been the opportunity of a lifetime to be in your lab.

thumb_up_off_alt38

chat_bubble_outline1

repeat0

shareShare

Miles Wang

@mileskwang

5 months ago

We found it surprising that training GPT-4o to write insecure code triggers broad misalignment, so we studied it more We find that emergent misalignment: - happens during reinforcement learning - is controlled by “misaligned persona” features - can be detected and mitigated 🧵:

thumb_up_off_alt1,1K

chat_bubble_outline76

repeat144

shareShare

Ryan Chi

@ryanandrewchi

5 months ago

Really enjoyed contributing to this project! Take a look at our blog post & paper: openai.com/index/emergent…

thumb_up_off_alt7

chat_bubble_outline2

repeat0

shareShare