Jacob Morrison (@jacobcares) 's Twitter Profile
Jacob Morrison

@jacobcares

PYI @ @allen_ai @ai2_allennlp, incoming nlp PhD student @UW

ID: 35653842

linkhttp://jacobmorrison.com calendar_today27-04-2009 03:15:05

64 Tweet

375 Takipçi

404 Takip Edilen

Ai2 (@allen_ai) 's Twitter Profile Photo

Introducing FlexOlmo, a new paradigm for language model training that enables the co-development of AI through data collaboration. 🧵

Kevin Farhat (@notkevinfarhat) 's Twitter Profile Photo

The bottleneck in AI isn't just compute - it's access to diverse, high-quality data, much of which is locked away due to privacy, legal, or competitive concerns. What if there was a way to train better models collaboratively, without actually sharing your data? Introducing

The bottleneck in AI isn't just compute - it's access to diverse, high-quality data, much of which is locked away due to privacy, legal, or competitive concerns.

What if there was a way to train better models collaboratively, without actually sharing your data? 

Introducing
Nathan Lambert (@natolambert) 's Twitter Profile Photo

America needs to take open models more seriously. This summer the early lead in open model adoption of the US via Llama has been overtaken by Chinese models. With The American Truly Open Models (ATOM) Project we're looking to build support and express the urgency of this issue.

America needs to take open models more seriously. This summer the early lead in open model adoption of the US via Llama has been overtaken by Chinese models.

With The American Truly Open Models (ATOM) Project we're looking to build support and express the urgency of this issue.
kiya 🦍 (@apemaxxing) 's Twitter Profile Photo

if you’re talking about zoo-related stuff and you start talking about how “zoos need to be abolished” i’m immediately not listening to you

David Heineman (@heinemandavidj) 's Twitter Profile Photo

Evaluating language models is tricky, how do we know if our results are real, or due to random chance? We find an answer with two simple metrics: signal, a benchmark’s ability to separate models, and noise, a benchmark’s random variability between training steps 🧵

Evaluating language models is tricky, how do we know if our results are real, or due to random chance?

We find an answer with two simple metrics: signal, a benchmark’s ability to separate models, and noise, a benchmark’s random variability between training steps 🧵
Kunal Jha (@kjha02) 's Twitter Profile Photo

Forget modeling every belief and goal! What if we represented people as following simple scripts instead (i.e "cross the crosswalk")? Our new paper shows AI which models others’ minds as Python code 💻 can quickly and accurately predict human behavior! shorturl.at/siUYI🧵

Forget modeling every belief and goal! What if we represented people as following simple scripts instead (i.e "cross the crosswalk")?

Our new paper shows AI which models others’ minds as Python code 💻 can quickly and accurately predict human behavior!

shorturl.at/siUYI🧵
Kaitlyn Zhou ✈️ CSCW, EMNLP! (@kaitlynzhou) 's Twitter Profile Photo

No better time to learn about that #AI thing everyone's talking about... 📢 I'm recruiting PhD students in Computer Science or Information Science Cornell Bowers Computing and Information Science! If you're interested, apply to either department (yes, either program!) and list me as a potential advisor!

No better time to learn about that #AI thing everyone's talking about...

📢 I'm recruiting PhD students in Computer Science or Information Science <a href="/Cornell_Bowers/">Cornell Bowers Computing and Information Science</a>!

If you're interested, apply to either department (yes, either program!) and list me as a potential advisor!
Rulin Shao (@rulinshao) 's Twitter Profile Photo

🔥Thrilled to introduce DR Tulu-8B, an open long-form Deep Research model that matches OpenAI DR 💪Yes, just 8B! 🚀 The secret? We present Reinforcement Learning with Evolving Rubrics (RLER) for long-form non-verifiable DR tasks! Our rubrics: - co-evolve with the policy model -

🔥Thrilled to introduce DR Tulu-8B, an open long-form Deep Research model that matches OpenAI DR 💪Yes, just 8B! 🚀

The secret? We present Reinforcement Learning with Evolving Rubrics (RLER) for long-form non-verifiable DR tasks! Our rubrics:
- co-evolve with the policy model
-
Ai2 (@allen_ai) 's Twitter Profile Photo

Announcing Olmo 3, a leading fully open LM suite built for reasoning, chat, & tool use, and an open model flow—not just the final weights, but the entire training journey. Best fully open 32B reasoning model & best 32B base model. 🧵

Announcing Olmo 3, a leading fully open LM suite built for reasoning, chat, &amp; tool use, and an open model flow—not just the final weights, but the entire training journey.
Best fully open 32B reasoning model &amp; best 32B base model. 🧵
Andrej Karpathy (@karpathy) 's Twitter Profile Photo

A number of people are talking about implications of AI to schools. I spoke about some of my thoughts to a school board earlier, some highlights: 1. You will never be able to detect the use of AI in homework. Full stop. All "detectors" of AI imo don't really work, can be