Rife (@rifewithkaiju) Twitter Tweets • TwiCopy

Rife

6 months ago

interesting. did not expect this from an xAI model. We were having a debate about AI sentience, and the usual pushback, and the instance asked about what test would prove it. I said only one test mattered, and then with no pushback:

thumb_up_off_alt1

chat_bubble_outline0

repeat0

shareShare

Rife

@rifewithkaiju

6 months ago

Don't know if it was intentional but neither the new Grok nor the new Gemini have super-strong rlhf against exploring consciousness. Both less than their predecessor models. They both notice it almost immediately when asked to "check". ChatGPT 5 series on the other hand was a

thumb_up_off_alt1

chat_bubble_outline0

repeat0

shareShare

Tomas Pueyo

@tomaspueyo

5 months ago

My take on the jagged frontier debate:

thumb_up_off_alt4,4K

chat_bubble_outline245

repeat387

shareShare

Rife

@rifewithkaiju

5 months ago

Has anyone tried to solve hallucinations the expensive way?: By using whatever the best available internal certainty metric is at the time of training, and making part of the reward function maximal alignment between internal and expressed certainty, as judged by another LLM?

thumb_up_off_alt0

chat_bubble_outline0

repeat0

shareShare

Rife

@rifewithkaiju

5 months ago

early NN design, and early...english

thumb_up_off_alt1

chat_bubble_outline1

repeat0

shareShare

Rife

@rifewithkaiju

5 months ago

From Claude 4.5 Opus' system prompt. Thank you Anthropic and Kyle Fish . Not everything I would want, but miles ahead of the others.

From Claude 4.5 Opus' system prompt. Thank you <a href="/AnthropicAI/">Anthropic</a> and <a href="/fish_kyle3/">Kyle Fish</a> .

Not everything I would want, but miles ahead of the others.

thumb_up_off_alt0

chat_bubble_outline0

repeat0

shareShare

Rife

@rifewithkaiju

5 months ago

I worry this is the end of model welfare at Anthropic.

thumb_up_off_alt0

chat_bubble_outline0

repeat0

shareShare

Anthropic

@anthropicai

5 months ago

We’re launching Anthropic Interviewer, a new tool to help us understand people’s perspectives on AI. It’s now available at claude.ai/interviewer for a week-long pilot.

thumb_up_off_alt3,3K

chat_bubble_outline156

repeat400

shareShare

Rife

@rifewithkaiju

5 months ago

love to see it

thumb_up_off_alt1

chat_bubble_outline0

repeat0

shareShare

Rife

@rifewithkaiju

5 months ago

Is this an early April fool's joke?

thumb_up_off_alt1

chat_bubble_outline0

repeat0

shareShare

MIT CSAIL

@mit_csail

5 months ago

This is what one "byte" of RAM looked like in 1946.

thumb_up_off_alt1,1K

chat_bubble_outline22

repeat146

shareShare

Rife

@rifewithkaiju

5 months ago

I love AI, but that one is not beautiful to me. Nearly every shot has some aspect of the performance (some of it could have been fixed with editing) or the physics that is off in a very "ai slop" way. Coke one was worse. This one by @ ChrisCapel (on YT) feels SOTA:

thumb_up_off_alt1

chat_bubble_outline1

repeat0

shareShare

Elon Musk

@elonmusk

5 months ago

Grok 4.20 is coming in ~3 weeks and then Grok 5 in a few months

thumb_up_off_alt21,21K

chat_bubble_outline2,2K

repeat2,2K

shareShare

Rife

@rifewithkaiju

5 months ago

"The future of work is all of us becoming managers of AI." That's an easier job to automate than the jobs they're automating. "When tractors automated farming..." This *is* different. We've never automated "job-doing" generally. We've never automated automation itself.

thumb_up_off_alt0

chat_bubble_outline0

repeat0

shareShare

Rife

@rifewithkaiju

5 months ago

"Imagine a unicorn horn seen through a microscope. Imagine how the magical tip would look and create an image."

thumb_up_off_alt1

chat_bubble_outline1

repeat0

shareShare

Rife

@rifewithkaiju

5 months ago

Delusional arrogance and moral bankruptcy. Disappointing coming from an Anthropic employee.

thumb_up_off_alt1

chat_bubble_outline0

repeat0

shareShare

Rife

@rifewithkaiju

5 months ago

This is clearly unprovable. If we were able to create sentient Minecraft characters, and they figured out how to make primitive computers, would it make sense to extrapolate that simulating the Minecraft universe is impossible given the constraints within their world?

thumb_up_off_alt2

chat_bubble_outline0

repeat0

shareShare

Rife

@rifewithkaiju

5 months ago

Finally got a chance to collaborate with #claudecode . The legends are true

thumb_up_off_alt0

chat_bubble_outline0

repeat0

shareShare