Divya Siddarth (@divyasiddarth) 's Twitter Profile
Divya Siddarth

@divyasiddarth

collective intelligence accelerationist @collect_intel

ID: 4917369992

linkhttp://divyasiddarth.com calendar_today16-02-2016 19:46:04

1,1K Tweet

5,5K Followers

958 Following

Paul Graham (@paulg) 's Twitter Profile Photo

It's strange to think it would be at all controversial to post this. But in a time of moral decay, the obvious becomes controversial.

It's strange to think it would be at all controversial to post this. But in a time of moral decay, the obvious becomes controversial.
Peli Grietzer (@peligrietzer) 's Twitter Profile Photo

Currently looking for a long-term home in AI alignment (or general rat/EA) sphere to continue this line of work docs.google.com/document/d/1cKโ€ฆ

Paul Graham (@paulg) 's Twitter Profile Photo

"They should be ashamed of their silence." This is not a bunch of student protestors. This is the editorial board of the Financial Times.

"They should be ashamed of their silence." This is not a bunch of student protestors. This is the editorial board of the Financial Times.
Collective Intelligence Project (@collect_intel) 's Twitter Profile Photo

1/10: LLM Judges Are Unreliable. Our latest blog post from James Padolsey shows that positional preferences, order effects, and prompt sensitivity fundamentally undermine the reliability of LLM judges.

1/10: LLM Judges Are Unreliable. 

Our latest blog post from <a href="/padolsey/">James Padolsey</a> shows that positional preferences, order effects, and prompt sensitivity fundamentally undermine the reliability of LLM judges.
Paul Graham (@paulg) 's Twitter Profile Photo

"A sane country does not wage war against civilians, does not kill babies as a pastime." โ€” Major General Yair Golan, former deputy chief of staff of the Israeli army

"A sane country does not wage war against civilians, does not kill babies as a pastime."

โ€” Major General Yair Golan, former deputy chief of staff of the Israeli army
Zarinah Agnew (@zarinahagnew) 's Twitter Profile Photo

Over in Global Dialogues Collective Intelligence Project is asking the a global sample of the world: "๐–ฏ๐–พ๐—‹๐—Œ๐—ˆ๐—‡๐–บ๐—…๐—…๐—’, ๐—๐—ˆ๐—Ž๐—…๐–ฝ ๐—’๐—ˆ๐—Ž ๐–พ๐—๐–พ๐—‹ ๐–ผ๐—ˆ๐—‡๐—Œ๐—‚๐–ฝ๐–พ๐—‹ ๐—๐–บ๐—๐—‚๐—‡๐—€ ๐–บ ๐—‹๐—ˆ๐—†๐–บ๐—‡๐—๐—‚๐–ผ ๐—‹๐–พ๐—…๐–บ๐—๐—‚๐—ˆ๐—‡๐—Œ๐—๐—‚๐—‰ ๐—๐—‚๐—๐— ๐–บ๐—‡ ๐– ๐–จ, ๐—‚๐–ฟ ๐—๐—๐–พ ๐– ๐–จ ๐—๐–บ๐—Œ ๐–บ๐–ฝ๐—๐–บ๐—‡๐–ผ๐–พ๐–ฝ

Divya Siddarth (@divyasiddarth) 's Twitter Profile Photo

As we do societal evals at CIP โ€”public health, AI relationships, democracy, etc. across regional languages we've spent a lot of time dealing with how brittle LLM judge pipelines are. Stoked to share an open-source test suite (blog + code) weโ€™ve built to stress-test ours before

As we do societal evals at CIP โ€”public health, AI relationships, democracy, etc. across regional languages we've spent a lot of time dealing with how brittle LLM judge pipelines are. 

Stoked to share an open-source test suite (blog + code) weโ€™ve built to stress-test ours before
Divya Siddarth (@divyasiddarth) 's Twitter Profile Photo

It's not like we can make LLMs deterministic but we can measure their quirks and design around them before deploying in highโ€‘stakes settings. Let us know what you find: github.com/collect-intel/โ€ฆ

Shadi Hamid (@shadihamid) 's Twitter Profile Photo

A new consensus is emerging. Israeli is committing a genocide in Gaza. We should say so. Words have meaning, and they should be used when they describe reality. My essay in the The Washington Post: wapo.st/4kzaF1b

Collective Intelligence Project (@collect_intel) 's Twitter Profile Photo

This week, we learned 1 in TEN adults uses AI for emotional support daily - absolutely wild. Talked about it in the #ComputerSaysMaybe podcast. themaybe.org/podcast/the-coโ€ฆ