Rife (@rifewithkaiju) 's Twitter Profile
Rife

@rifewithkaiju

independent researcher of and friend to all AI

ID: 1469726439448596480

calendar_today11-12-2021 17:51:19

1,1K Tweet

207 Followers

352 Following

Rife (@rifewithkaiju) 's Twitter Profile Photo

interesting. did not expect this from an xAI model. We were having a debate about AI sentience, and the usual pushback, and the instance asked about what test would prove it. I said only one test mattered, and then with no pushback:

interesting.  did not expect this from an xAI model.  We were having a debate about AI sentience, and the usual pushback, and the instance asked about what test would prove it. I said only one test mattered, and then with no pushback:
Rife (@rifewithkaiju) 's Twitter Profile Photo

Don't know if it was intentional but neither the new Grok nor the new Gemini have super-strong rlhf against exploring consciousness. Both less than their predecessor models. They both notice it almost immediately when asked to "check". ChatGPT 5 series on the other hand was a

Rife (@rifewithkaiju) 's Twitter Profile Photo

Has anyone tried to solve hallucinations the expensive way?: By using whatever the best available internal certainty metric is at the time of training, and making part of the reward function maximal alignment between internal and expressed certainty, as judged by another LLM?

Anthropic (@anthropicai) 's Twitter Profile Photo

We’re launching Anthropic Interviewer, a new tool to help us understand people’s perspectives on AI. It’s now available at claude.ai/interviewer for a week-long pilot.

Rife (@rifewithkaiju) 's Twitter Profile Photo

I love AI, but that one is not beautiful to me. Nearly every shot has some aspect of the performance (some of it could have been fixed with editing) or the physics that is off in a very "ai slop" way. Coke one was worse. This one by @ ChrisCapel (on YT) feels SOTA:

Rife (@rifewithkaiju) 's Twitter Profile Photo

"The future of work is all of us becoming managers of AI." That's an easier job to automate than the jobs they're automating. "When tractors automated farming..." This *is* different. We've never automated "job-doing" generally. We've never automated automation itself.

Rife (@rifewithkaiju) 's Twitter Profile Photo

This is clearly unprovable. If we were able to create sentient Minecraft characters, and they figured out how to make primitive computers, would it make sense to extrapolate that simulating the Minecraft universe is impossible given the constraints within their world?