Secret: Why an AI might be controlled by dangerous hidden thoughts
[Note: This is a draft for a contest submission. I'm publishing it before it's fully edited because of the Inkhaven deadline. You may or may not want to wait some days before reading it.]

[After a few small tweaks, this is now probably as edited as it will get.]

[This is a draft script for a hypothetical video; it's written in a different style from what I normally write.]

1. [intro about AI]

Researchers are racing to make smarter-than-human AI. Some of them say that AI can probably be made safe by instilling values into the AI. But what if those plans have a fundamental obstacle? What if no one knows how to program values into an AI in a way that will stick around as the AI gets smarter? In this video we'll look at one way of understanding what might go wrong with plans like this.

2. [intro atlantis]

Imagine for a moment the recently founded island nation of New Atlantis. The Atlantean citizens have been hard at work on roads, houses, hospitals, sewers, a defens...