Ryan Chi (@ryanandrewchi) 's Twitter Profile
Ryan Chi

@ryanandrewchi

@openai research

ID: 1330790656747335680

linkhttp://ryanachi.com/ calendar_today23-11-2020 08:29:59

9 Tweet

63 Followers

89 Following

Stanford AI Lab (@stanfordailab) 's Twitter Profile Photo

Congratulations to the Stanford AI Lab Chirpy Cardinal team, led by Ryan Chi and mentored by Christopher Manning, which has won first place for Scientific Invention and Innovation in the Alexa Prize Science SocialBot Grand Challenge 5! Full team details are here: stanfordnlp.github.io/chirpycardinal…

Congratulations to the <a href="/StanfordAILab/">Stanford AI Lab</a> Chirpy Cardinal team, led by Ryan Chi and mentored by <a href="/chrmanning/">Christopher Manning</a>, which has won first place for Scientific Invention and Innovation in the Alexa Prize Science SocialBot Grand Challenge 5! Full team details are here: stanfordnlp.github.io/chirpycardinal…
Stanford NLP Group (@stanfordnlp) 's Twitter Profile Photo

In its 3rd run at the Alexa Prize SocialBot Grand Challenge, our Chirpy Cardinal team, led by Ryan Chi and mentored by Christopher Manning, won first place for Scientific Invention and Innovation. Congratulations! 🎉 Team: stanfordnlp.github.io/chirpycardinal…

Denny Zhou (@denny_zhou) 's Twitter Profile Photo

A simple yet effective approach to fill the performance gap between zero-shot and few-shot prompting Xinyun Chen Xinyun Chen is going to present our recent work LLM analogical reasoning (arxiv.org/abs/2310.01714) this afternoon in the exciting #MathAI workshop of #NeurIPS2023.

AK (@_akhaliq) 's Twitter Profile Photo

Here is my selection of papers for today (15 Feb) on Hugging Face huggingface.co/papers Computing Power and the Governance of Artificial Intelligence Towards Next-Level Post-Training Quantization of Hyper-Scale Transformers PRDP: Proximal Reward Difference Prediction for

Here is my selection of papers for today (15 Feb) on Hugging Face

huggingface.co/papers

Computing Power and the Governance of Artificial Intelligence

Towards Next-Level Post-Training Quantization of Hyper-Scale Transformers

PRDP: Proximal Reward Difference Prediction for
Ryan Chi (@ryanandrewchi) 's Twitter Profile Photo

premise order matters📈 in LLM reasoning, exposing frailties far more pronounced than a human's. See our preprint here: arxiv.org/abs/2402.08939. w/ Xinyun Chen Xuezhi Wang Denny Zhou -- grateful to have worked on this at Google DeepMind

alphaXiv (@askalphaxiv) 's Twitter Profile Photo

Featuring our first paper of the week, "Premise Order Matters in Reasoning With LLMs": alphaxiv.org/abs/2402.08939…. Premise reordering can lead to accuracy dropoffs of 30% in LLMs! The authors Ryan Chi Xinyun Chen will be on alphaXiv to respond to your questions!

Hannah Rose Kirk (@hannahrosekirk) 's Twitter Profile Photo

🌎Introducing LINGOLY, our new reasoning benchmark that stumps even top LLMs (best models only reach ~35% accuracy)🥴 In a colab between University of Oxford, Stanford University and UK Linguistic Olympiad puzzle authors, we stress test LLMs on over 90 low-resource and extinct languages...

🌎Introducing LINGOLY, our new reasoning benchmark that stumps even top LLMs (best models only reach ~35% accuracy)🥴
In a colab between <a href="/UniofOxford/">University of Oxford</a>, <a href="/Stanford/">Stanford University</a> and UK Linguistic Olympiad puzzle authors, we stress test LLMs on over 90 low-resource and extinct languages...
Harry Mayne (@harrymayne5) 's Twitter Profile Photo

🚨🌍Introducing our new reasoning benchmark, LINGOLY (which the current top models only score ~35% on!😳) LINGOLY uses UK Linguistic Olympiad puzzles in low-resource/extinct languages to robustly test reasoning A colab between University of Oxford, Stanford University and UKLO authors

🚨🌍Introducing our new reasoning benchmark, LINGOLY (which the current top models only score ~35% on!😳)

LINGOLY uses UK Linguistic Olympiad puzzles in low-resource/extinct languages to robustly test reasoning

A colab between <a href="/UniofOxford/">University of Oxford</a>, <a href="/Stanford/">Stanford University</a> and <a href="/UKLingOlympiad/">UKLO</a> authors
john allard 🇺🇸 (@john__allard) 's Twitter Profile Photo

Super excited to ship Reinforcement Fine‑Tuning (RFT) on o4‑mini today 🎉 Our aim is to make RL as flexible & accessible as we can. Here’s a bit on what we built and why we're pumped to let you customize our frontier reasoning models.

Ryan Chi (@ryanandrewchi) 's Twitter Profile Photo

Really excited to share with the world what I've been working on since joining OpenAI! Give it a try! platform.openai.com/docs/guides/re…

Miles Wang (@mileskwang) 's Twitter Profile Photo

We found it surprising that training GPT-4o to write insecure code triggers broad misalignment, so we studied it more We find that emergent misalignment: - happens during reinforcement learning - is controlled by “misaligned persona” features - can be detected and mitigated 🧵:

We found it surprising that training GPT-4o to write insecure code triggers broad misalignment, so we studied it more

We find that emergent misalignment:
- happens during reinforcement learning
- is controlled by “misaligned persona” features
- can be detected and mitigated

🧵: