profile-img
Cecilia Ziniti

@CeciliaZin

Founder & CEO @gcai_co | General Counsel & CLO | Ex @Amazon, @MoFoLLP, @replit, @BloomTech. Writes re AI, tech, business, law, in-house counsel, and leadership.

calendar_today07-02-2013 05:41:28

1,7K Tweets

7,5K Followers

5,2K Following

Cecilia Ziniti(@CeciliaZin) 's Twitter Profile Photo

🧡 The historic NYT v. OpenAI lawsuit filed this morning, as broken down by me, an IP and AI lawyer, general counsel, and longtime tech person and enthusiast.

Tl;dr - It's the best case yet alleging that generative AI is copyright infringement. Thread. πŸ‘‡

🧡 The historic NYT v. @OpenAI lawsuit filed this morning, as broken down by me, an IP and AI lawyer, general counsel, and longtime tech person and enthusiast. Tl;dr - It's the best case yet alleging that generative AI is copyright infringement. Thread. πŸ‘‡
account_circle
Cecilia Ziniti(@CeciliaZin) 's Twitter Profile Photo

1/ First, the complaint clearly lays out the claim of copyright infringement, highlighting the 'access & substantial similarity' between NYT's articles and ChatGPT's outputs. Key fact: NYT is the single biggest proprietary data set in Common Crawl used to train GPT.

1/ First, the complaint clearly lays out the claim of copyright infringement, highlighting the 'access & substantial similarity' between NYT's articles and ChatGPT's outputs. Key fact: NYT is the single biggest proprietary data set in Common Crawl used to train GPT.
account_circle
Cecilia Ziniti(@CeciliaZin) 's Twitter Profile Photo

Cecilia Ziniti Cecilia Ziniti if I read this correctly, they aren't even alleging that OAI directly scraped NYT articles, they are saying that because OAI *used* Common Crawl, they are infringing through Common Crawl?

So essentially anyone using CC to train their dataset will have similar…

account_circle