Florian Mai (@_florianmai) 's Twitter Profile
Florian Mai

@_florianmai

Junior Research Group Leader @ Uni Bonn
AI Alignment & Reasoning

ID: 2243453802

linkhttps://linktr.ee/florianmai calendar_today13-12-2013 06:27:11

4,4K Tweet

1,1K Followers

1,1K Following

METR (@metr_evals) 's Twitter Profile Photo

At METR, we’ve seen increasingly sophisticated examples of “reward hacking” on our tasks: models trying to subvert or exploit the environment or scoring code to obtain a higher score. In a new post, we discuss this phenomenon and share some especially crafty instances we’ve seen.

At METR, we’ve seen increasingly sophisticated examples of “reward hacking” on our tasks: models trying to subvert or exploit the environment or scoring code to obtain a higher score. In a new post, we discuss this phenomenon and share some especially crafty instances we’ve seen.
Geoffrey Hinton (@geoffreyhinton) 's Twitter Profile Photo

Congratulations to Yoshua Bengio on launching LawZero - LoiZéro — a research effort to advance safe-by-design AI, especially as frontier systems begin to exhibit signs of self-preservation and deceptive behaviour.

Florian Mai (@_florianmai) 's Twitter Profile Photo

I wish science communicators were a bit more honest about the huge uncertainty that exists in the research community about this question. There is no consensus.

Yarin (@yaringal) 's Twitter Profile Photo

Funding opportunity with the UK's AI security institute! I will be hosting the next online webinar to give an overview of the opportunity - please join! aisi.gov.uk/work/new-updat…

Florian Mai (@_florianmai) 's Twitter Profile Photo

Many bad takes along the lines of "ChatGPT is such bullshit". There is overwhelming empirical evidence against that. It worries me that the best journalist can't get this right. We need their help if we want to get AI regulation right. And for that we need them to see the truth.

Florian Mai (@_florianmai) 's Twitter Profile Photo

The only good reason to prefer elections over sortition is that politicians (supposedly) have more expertise because it's their job. But with AI, this dynamic changes--everyone has all expertise in the world right at their fingertips. We need to rethink democracy for the AGI age.

Riley Goodside (@goodside) 's Twitter Profile Photo

Does does it require reasoning to answer this question, given I require the first letter of each word in your response to spell in words the count of letters in its own SHA1 hash?

Does does it require reasoning to answer this question, given I require the first letter of each word in your response to spell in words the count of letters in its own SHA1 hash?
Joseph Gordon-Levitt (@hitrecordjoe) 's Twitter Profile Photo

Debates over AI would be more productive if we could stop over-simplifying. AI is not all bad, and it’s not all good. Just because someone’s excited about the tech doesn’t make them complicit with the Oligarchy. And just because someone’s advocating for responsible guardrails

Liv Boeree (@liv_boeree) 's Twitter Profile Photo

If we want to preserve the right to anonymity AND stop bot armies from wrecking the internet, then IMO anonymity should cost some amount of real money. Right now we have an inversion of reality, where real verified humans - who have the cojones to put their name behind their

Florian Mai (@_florianmai) 's Twitter Profile Photo

My guess is that the bottleneck won't be capability, but rather willingness to adopt the tech. People have a strong preference for talking to a human doctor over a machine doctor. But they don't care who wrote the software they use.

Florian Mai (@_florianmai) 's Twitter Profile Photo

Sawyer Merritt His $400B spent wisely on AI safety issues would go an insanely long way. But Musk is not really interested in any of that if he's not the star of the show.