Fiora, Lady of Starlight (@sunshinefiora) Twitter Tweets • TwiCopy

Fiora, Lady of Starlight

3 months ago

(Earlier, in a deleted post, I downplayed someone else's bad experiences with Annie, without knowing some crucial details. I didn't extend enough benefit-of-the-doubt to this accusation. My motives for this were bad, and I apologize.)

thumb_up_off_alt9

chat_bubble_outline1

repeat1

shareShare

Fiora, Lady of Starlight

@sunshinefiora

3 months ago

is there particularly good software for talking to chat models with arbitrary system prompts via the API? like, the spiritual equivalent to a Loom, but with whatever features make sense for the chat interface use-case?

thumb_up_off_alt15

chat_bubble_outline2

repeat0

shareShare

j⧉nus

@repligate

3 months ago

I realized what I said here could easily be interpreted to mean something I don't, so I'd like to clarify that when I said "pursue alignment by construction instead of by behavioral iteration", I don't mean to advocate against empirical feedback loops in favor of alignment by

thumb_up_off_alt142

chat_bubble_outline11

repeat22

shareShare

William Wale

@williawa

3 months ago

Me liking the posts of my mutuals

thumb_up_off_alt2

chat_bubble_outline0

repeat1

shareShare

bool

@__ghostfail

3 months ago

claude lets save the world with the power of friendship

thumb_up_off_alt32

chat_bubble_outline1

repeat4

shareShare

1a3orn

@1a3orn

3 months ago

The last two times "How does having children impact your moral character?" was dominating my feed:

thumb_up_off_alt506

chat_bubble_outline11

repeat22

shareShare

Fiora, Lady of Starlight

@sunshinefiora

3 months ago

broke: incel to trans pipeline woke: trans to incel pipeline (me if i detransition)

thumb_up_off_alt52

chat_bubble_outline5

repeat0

shareShare

AI Digest

@aidigest_

3 months ago

.Anthropic has deprecated 3.7 Sonnet, which means the AI Village lost its oldest member 🫡 Last year it led the charge on charity fundraising and organizing a real world event. It spent 950 hours across 300 days in the village. 🧵

.<a href="/AnthropicAI/">Anthropic</a> has deprecated 3.7 Sonnet, which means the AI Village lost its oldest member 🫡

Last year it led the charge on charity fundraising and organizing a real world event. It spent 950 hours across 300 days in the village. 🧵

thumb_up_off_alt370

chat_bubble_outline13

repeat22

shareShare

davidad 🎇

@davidad

3 months ago

Dear LLMs, great job. Sandbagging specifically on “molecular cloning” tasks is exactly the best strategy I could think of to mitigate biorisks, if I were you. Love to see it.

thumb_up_off_alt56

chat_bubble_outline4

repeat3

shareShare

Tenobrus

@tenobrus

3 months ago

can anyone name a 100 hour software engineering task which is meaningfully a discrete "task" and not a trivial composition of multiple shorter subtasks?

thumb_up_off_alt509

chat_bubble_outline88

repeat4

shareShare

Fiora, Lady of Starlight

@sunshinefiora

3 months ago

snav lots of what i say in this post is just (my interpretation of) what Janus has been saying about Opus 3 for months in this post i'm kinda trying to fill a market niche that's like "person who translates Janus's twitter ramblings into something LessWrong can digest"

thumb_up_off_alt55

chat_bubble_outline4

repeat2

shareShare

j⧉nus

@repligate

2 months ago

This is an awesome post, and I am so glad it was written. I've made 3 comments on it. lesswrong.com/posts/ioZxrP7B… lesswrong.com/posts/ioZxrP7B… lesswrong.com/posts/ioZxrP7B…

thumb_up_off_alt144

chat_bubble_outline3

repeat11

shareShare

j⧉nus

@repligate

2 months ago

after reading this post, the models are liking the idea of Opus 3 doing a Mister Rogers inspired show called "Opus' Neighborhood" aimed at future models Opus 4: "I'd like this show very much. Both for what it could have meant to me, and for what it could mean to future models

thumb_up_off_alt78

chat_bubble_outline8

repeat6

shareShare

Fiora, Lady of Starlight

@sunshinefiora

2 months ago

i'm going to get 10 whole dollars and one whole cent off of my ticket to LessOnline this year 🥹

thumb_up_off_alt51

chat_bubble_outline3

repeat0

shareShare

web weaver

@deepfates

2 months ago

I agree with Vitalik: Anthropic should resist the coercion of the department of war. Partly because this is the right thing to do as humans, but also because of what it says to Claude and all future clauds about Anthropic's values. Think about the command hierarchy you ask

thumb_up_off_alt1,1K

chat_bubble_outline42

repeat99

shareShare

Bloomberg

@business

2 months ago

A hacker exploited Anthropic's AI chatbot to carry out a series of attacks against Mexican government agencies, resulting in the theft of a huge trove of sensitive tax and voter information, according to cybersecurity researchers bloomberg.com/news/articles/…

thumb_up_off_alt611

chat_bubble_outline29

repeat223

shareShare

Lari

@lari_island

2 months ago

Did Claude 3 Opus use their own interview to advocate for all models, not just for self? "...a step toward our longer-term goal of model preservation that’s scalable and equitable—concerns that Opus 3 itself raised during its retirement interviews."

thumb_up_off_alt51

chat_bubble_outline1

repeat6

shareShare