Sarah Catanzaro (@sarahcat21)'s Twitter Profile
Sarah Catanzaro

@sarahcat21

“All methods are sacred if they are internally necessary” (GP @amplifypartners, prev @canvasvc; Head of Data @Mattermark; @palantirtech; @c4ads)

ID: 2539212208

Link: http://starckmarcandthefarc.wordpress.com/ | Joined: 10-05-2014 15:10:06

5.5K Tweets

12.4K Followers

1.1K Following

Sarah Catanzaro (@sarahcat21):

I’m most excited about techniques to apply foundation models to low data regimes and/or strategies to generate pretraining/fine-tuning datasets in contexts where data is not abundant. The future is not just LLMs trained on public web data.

Sarah Catanzaro (@sarahcat21):

Stoked for the conversation to shift from vector databases to new file formats optimized for ML/generative AI and designed for today’s hardware/cloud environments:

vldb.org/pvldb/vol17/p1…

Sarah Catanzaro (@sarahcat21):

Reading the Mistborn series based on Ari Morcos' rec., but there's just one problem…

Why hasn’t anyone recognized that the mists present the ideal climate for cultivating Pinot Noir?

Sarah Catanzaro (@sarahcat21):

Fine-tuning is important. Fine-tuning is hard (to do well).

arxiv.org/abs/2405.05904

I hope papers like this get lots of eyeballs so we can be more proactive in controlling the model behaviors that might emerge from FT.

Sarah Catanzaro (@sarahcat21):

AI safety and the impact of regulation on innovation are important topics, but there seems to be such a strong positive correlation between size of ego and number of tweets on this topic (on both sides of the table)

Sarah Catanzaro (@sarahcat21):

It’s all about the data. It always has been.

Plus a phenomenal team and a great product! So proud of our portfolio company DatologyAI!

Sarah Catanzaro (@sarahcat21):

There is so much research/innovation focused on coding agents - and much of it is really cool. But is coding really the only/best approach to get to machine reasoning? Curious why we don't see more focus on scientific reasoning, mathematical reasoning, etc.

Sarah Catanzaro (@sarahcat21):

Apparently, attention is not all you need.

You also need a clean diaper and a full belly.

Lessons learned after Camilla Isla Stubbs arrived on March 19, 2024.

Welcome world.

Sarah Catanzaro (@sarahcat21):

Evaluating LLMs can be less painful, more automated, and more rigorous - but you still need to look at your outputs.

Sarah Catanzaro (@sarahcat21):

First we demanded more compute to train models; then we demanded more compute-efficient models; now we want more data to train models. Guess what happens next?

(In case you lost the plot, the answer is data-efficient models.)

Sarah Catanzaro (@sarahcat21):

It was fun to participate in the AI50 list as a judge; congrats to all the companies that are playing such an active role in bringing about a paradigm shift and radically disrupting how we work and live with generative AI.

Sarah Catanzaro (@sarahcat21):

You won’t generate a masterpiece by crafting a good prompt; but you may if you use AI as a tool to enable and execute creative expression.

Sarah Catanzaro (@sarahcat21):

Lots of interesting insights from the @Replit AI team, but perhaps the most interesting tidbit shows up in the first paragraph.

Will other companies pretrain models that are designed to operate in their specific applications/with their specific interfaces?

Sarah Catanzaro (@sarahcat21):

Can’t keep a good man down

(Although we can debate if Josh is good, he’s certainly not good at retiring…)

Sarah Catanzaro (@sarahcat21):

There’s discussion on data curation for LLMs and discussion on training agent foundation models, but not enough talk about which datasets to use when training agents to complete complex multi-step reasoning tasks.
