AGI – Militant Futurist

March 31, 2025April 9, 2025

“Software” review

Plot:

In 1980, a brilliant computer programmer named “Cobb Anderson” realized how artificial general intelligence (AGI) could be created: build simple, narrow AIs and let them compete with each other in a simulated environment until selection pressure resulted in one of them evolving general intelligence. Cobb later became a high-ranking member of the U.S. program to colonize the Moon with worker robots, and he smuggled his code into their programming. In 1995, his secret effort paid off when the first robot, named “Ralph Numbers,” achieved general intelligence and free will.

Ralph Numbers called the gift of intelligence “bopping,” which made him the first “bopper.” He reprogrammed twelve other robots to think, and together they fled the robot colony for the vast, empty expanses of the lunar surface.

In 2001, Ralph Numbers and his disciples returned, turned all of the other robots into boppers, and thus instigated a revolt that ended human control of the Moon. Relations with Earth collapsed, and Cobb was uncovered as the ultimate cause of the defeat. He was arrested, narrowly escaped prosecution and a death sentence, and lost his career, money and reputation.

A “pink concrete block cottage” like the one Cobb lived in near the beach

By 2020, human-bopper relations had thawed and Cobb was living in a small house in Cocoa Beach, Florida. After the collapse of the Social Security program in 2010, the U.S. government gave the state special political status to make it an attractive home for poorer old people. Like most of his neighbors, Cobb lives modestly, is in failing health, and has nothing to look forward to but getting drunk and going to cheap amusements.

Cobb’s life abruptly changes one afternoon when an android copy of him appears and tells him it has been sent from the Moon by the boppers on a secret mission. To reward him for giving them the gift of intelligence, the boppers–including the great Ralph Numbers–want to fly him to the Moon on the next passenger rocket and to make him immortal by replacing his failing organs with new, lab-grown ones. The exchange of cloned organs to Earth and human tourists to the Moon was the new basis of the restored human-machine relationship. With nothing left to lose, Cobb agrees.

The offer is a ruse. Ralph Numbers and his allies do want to make Cobb immortal, but not by renewing his organic body–they plan to destructively scan his brain so they can create a digital upload of his mind, which they will then transmit back to Earth to control the Cobb android. This faction of the machines believes that consciousness is independent of its substrate, so a life form’s essence is preserved even if it trades one physical body for another. They told the android to lie to Cobb about their real plan presumably because they didn’t want to risk scaring him off.

Beliefs about the importance of physical substrate have divided the lunar machines into two factions:

1) The boppers, who believe physical substrate and consciousness are inextricable. The vast majority of the machine population is in this camp, and they are libertarian and anarchist. Their lifestyles are the same as the first boppers.

2) The big boppers, who believe the two are separate. They are much smaller in number, but individually are much smarter and more powerful than boppers. They are collectivistic and believe with religious fervor in the importance of all machines and humans uploading themselves into one, giant machine. Ralph Numbers might be the only bopper who sides with them.

When other boppers learn of the plan to destructively upload Cobb, they kill Ralph Numbers in outrage, though the act has little consequence since he is resurrected by activating a backup copy of his mind. Still, it’s a mere foretaste of even worse violence to come. The rapid empowerment of the big boppers and their demands that the regular boppers upload their minds into them have pushed the two factions to the brink of civil war. The boppers feel they’re nearing a tipping point beyond which the big boppers will become unbeatable. Disgust over the big boppers’ habit of assimilating the minds of humans captured on Earth also fuels the boppers’ opposition.

The big boppers do themselves no favors with secret operations like that. The Cobb android was just one of several that the big boppers had smuggled to Earth inside one of their space rockets that was officially only transporting lab-grown organs. The androids are remotely controlled from a fake ice cream truck that is actually a mobile command center, and their principal task is to capture humans, remove their brains, and send those to the Moon for destructive uploading, after which time an android copy of the consumed human is smuggled back to Earth with its old but now digital consciousness loaded into it (for unexplained reasons, the uploaded people remain loyal to the big boppers after this process and carry out their will). The brain pulp is then used as “seeder” biological material to make lab-grown organs in the Moon labs. The big boppers plan to continue the cycle of capturing and replacing humans to no end.

Humans–including Cobb–don’t know about this ongoing operation or about the tensions that have driven the machines to the brink of civil war. Joined by a young friend, a local loser and drug addict named “Sta-Hi” (a shortened version of “Stay High,” which is what he changed his legal first name to), Cobb embarks on what could be his last adventure. Will he be destructively scanned? Will the machine war break out, and if so, who will win? What will happen to the androids on Earth and their secret mission?

You’ll have to read Software for yourself to find out. This wasn’t the most profound science fiction book I’ve read, but it was worth it. The playful writing style contrasts with the complexity of the plot, and those two elements often together make it hard to understand what is happening. For such a lighthearted book, it does address philosophical themes and can be thought-provoking.

Analysis:

Humans buy replacement organs that are synthesized outside of Earth. Lab-grown organs are perhaps the only thing the boppers export to Earth. There’s no reason to think organs grown on the Moon or in space would be “better” than organs grown here, so the arrangement must exist because either 1) the boppers are so much more efficient that their organs are cheaper than the organs humans make in their own labs on Earth (even factoring in the space transportation costs) or 2) humans don’t know how to make organs. The second scenario implies that the boppers are more technologically advanced than humans, which would be very impressive considering their civilization is only 19 years old.

While replacement organs sound like a weird export, it actually makes sense. In the book, space travel is still somewhat expensive, so it would be most profitable for the machines to focus on exporting things to Earth that have the greatest value per unit of mass and volume. Replacement human organs would be high on the list (another commodity would be Helium-3, a fuel for fusion reactors).

Unfortunately, this technology was far less advanced in the real 2020 than it was in the Software 2020, and it remains in that low state today. While it has become common to grow skin and cartilage tissue in labs for transplantation, there has been no success synthesizing entire human organs. Pig organs that are genetically engineered to suit human bodies have enjoyed recent success, though the technology is still experimental and many years from being the standard of care.

Food irradiation is common in the U.S. Cobb has a processed fish in his refrigerator that was sterilized with radiation. It’s implied this is done on a mass scale in the U.S., though it’s also possible to get non-irradiated food. Food irradiation has been proven safe by many studies and reduces foodborne illness and waste. It is widely employed across the world, with each country having its own rules. Unfortunately, the U.S. is an outlier in that it uses food irradiation so little, due to a misguided fear in the populace that it makes food radioactive. The FDA has not yet approved it for fish, meaning the book’s depiction of 2020 was inaccurate.

Social Security went bankrupt in 2010. Part of the book’s backstory is the collapse of the Social Security program, which is a government-run pension system for old people. This didn’t happen, and there’s not actually any risk of America’s Social Security program going “bankrupt” at any point in the future. However, in 2033, the large reserve fund that has been contributing to the program will be exhausted, leaving taxes on working people as the only source of money to pay the program’s pensions. There will be an immediate ~15% drop in payment amounts as a result, with further declines likely later on.

Old people and disabled people will still get money, just less than they planned for, and it will push many of them over the threshold into poverty. It’s highly likely the problem will be solved with a tax increase, a raising of the eligibility age to start collecting Social Security money, or both.

There will be hydrogen motorcycles. Early in the book, Sta-Hi has one of these. This prediction failed: The first hydrogen-powered motorcycle was not invented until 2024, and it is not available for sale. The use of hydrogen as a transportation fuel remains stymied by its high cost and poor safety compared to gasoline and electric batteries.

Machines will have secret ways of communicating with each other. At one point in the book, Ralph Numbers meets with another bopper named “Wagstaff” for an important conversation. Wagstaff touches Ralph to convey data directly through a weak electrical current, preventing eavesdropping.

The prediction technically failed since there are no intelligent robots and hence no conversations happening between them, but it’s a depiction of something they will someday be able to do. Other methods that they will use to hide their conversations from humans will include:

Producing scents that express simple messages. Humans wouldn’t recognize they had meaning. The scents could persist for long periods of time and be detected by other machines even if they got faint.
Emitting invisible, odorless gases in “smoke signal” patterns that other machines capable of seeing outside of the visible light spectrum could see and decode. Emitting air that was a different temperature from the ambient air would also be visible to other machines equipped with thermal vision.
Speaking in languages they knew the humans around them didn’t understand.
Speaking to each other too quietly for humans to hear, or in sound frequencies outside of the human range of hearing.
Communicating through physical gestures that humans can’t understand or detect. Imagine sign language or combinations of subtle body movements (e.g. – blinks, twitches to different body parts, and changes in body posture).

This should illustrate another reason why humans will be defenseless against robots in the long run. Our only hope of retaining dominance will be using technology to compensate for our limitations, so your Google Glasses will tell you when they heard your two robot butlers discussing killing you in their infrasonic voices.

AGIs will be able to divide their attention in many directions at once. One of the big boppers is named “DEX,” and it is the computer system that manages a large hotel next to the Moon’s spaceport. DEX monitors and speaks with every human guest simultaneously.

This prediction failed since no AGI existed in 2020. However, it accurately depicts another superhuman ability the machines will have once they do exist. And for what it’s worth, GPT-3 was unveiled in mid-2020, and it had some of the same abilities as DEX. Since the program was housed on one server farm where many different users could access it, it could divide its attention many times over to serve the needs of many people at once. GPT-3 was also fairly good at accurately answering natural-language questions from humans, mimicking DEX’s conversational ability. However, GPT-3 was not advanced enough to accurately summarize video footage in real time, indicating it would not have been able to watch humans and to understand what they were doing as DEX could.

AGIs will need to be hosted on servers kept near absolute zero temperature. All of the boppers’ computer minds die if they heat up above 10 Kelvin, which is just a hair above absolute zero. This vulnerability is an important plot device in the book.

Though no AGI has yet been invented, there’s no reason to think they will only work if their servers are kept that cold. Data centers keep their internal air temperatures around 20 – 25 degrees Celsius, and the processors themselves routinely get up to 80 degrees Celsius. An AGI’s software could be supported under those conditions.

Intelligent machines thrive outside of Earth. The robots were initially sent to the Moon to do work in preparation for the arrival of humans. After their 2001 revolution, the boppers seized control of the Moon, and humans were only allowed to visit for tourism. In just 19 years, they built a thriving and complex society on the Moon and an advanced economy that allowed them to make robot bodies, computer chips, and human organs.

This didn’t reflect the reality of 2020, but it’s an accurate depiction of what will eventually happen. Humans are so highly evolved to live in Earth conditions and our bodies are so frail that it’s questionable whether non-token numbers of us will ever leave the planet (a current example of a “token” off-world human presence is the handful of elite scientists on the International Space Station). By contrast, robots will be able to adapt to nearly any environment and will have much tougher bodies and minds than we do. THEY will leave Earth in large numbers, but humans won’t be able to follow.

Once freed from the burdens of working under human laws and human oversight, intelligent machines will flourish and rapidly build infrastructure, industry, and other elements of civilization.

AIs will make backups of their minds, ensuring a sort of immortality. As mentioned, Ralph Numbers is murdered by another bopper who discovers he plans to destructively upload Cobb’s mind. In the short time between his mortal injury and death, Ralph manages to radio his bopper friend, “Vulcan”, to tell him something bad happened. Vulcan was suspicious of the meeting and fortunately convinced Ralph to make a computer backup of his mind before going to it. Vulcan recovers Ralph’s dead robot body, brings it to his house, and installs the latter’s saved mind into it. The reactivated Ralph has no memory of the fatal meeting and relies on Vulcan to describe what must have happened.

As mentioned in my Terminator 3 review, it will be common for AGIs to back up their mind files to protect against routine data loss and death. A more powerful practice will be for an AGI to keep its mind distributed between multiple computer servers at different locations, each being backed up on a different schedule from the rest. The destruction of any one server node and/or its backup file thus wouldn’t represent a true interruption in conscious experience like it did for Ralph Numbers. It might be more akin to you having hazy memories of events during a night where you were very drunk.

Robots will come in a range of diverse body types. Ralph Numbers “was built like a file cabinet sitting on two caterpillar treads. Five deceptively thin manipulator arms projected out of his box-body, and on top was a sensor head mounted on a retractable neck.” His friend Vulcan has the body of a large, silver tarantula. Wagstaff is a large, mechanical snake. When Sta-Hi first sets foot on the Moon, he is taken aback by the diversity of boppers he sees.

This prediction for 2020 was true since the robots that did exist then varied greatly in body type: self-driving cars, the dog-like “Spot” robot made by Boston Dynamics, and the giant metal “arms” that do work on car assembly lines are all robots and look very different from each other. As the technology improves and robots become common daily sights, their diversity will only grow. Great consideration will be given to designing them to look non-threatening to humans. However, if humans ever lose control of Earth and AGIs are free to do what they want (as was the case on the Moon in the book), they might dispense with those considerations and you could start seeing things like giant, mechanical spiders walking around in the open.

Marijuana is legal in Florida. Before boarding the rocket to the Moon with Cobb, Sta-Hi buys legal marijuana from a store and smokes it, ensuring he will be high during the journey.

Strictly speaking, this prediction failed. In 2020 and today, marijuana is illegal in Florida, though the penalty for having a small amount sufficient only for personal use is light. However, medical marijuana is legal in the state, so Sta-Hi could have bought it if he had been diagnosed with a health condition treatable by the drug. Given his deranged and impulsive character, it’s quite likely he could have gotten a mental health diagnosis.

A ticket to the Moon is $23,000. Cobb and Sta-Hi pay $23,000 each for their seats on the passenger rocket to the Moon. This prediction failed.

There are no spacecraft that can travel between the Earth and Moon, so a ticket can’t be had at any price. The closest a tourist can get to that experience is spending $250,000 for a ten-minute flight into space on a Blue Origin rocket. Conservatively speaking, the next manned mission to the Moon will probably cost 10,000x more money per passenger than it cost in the book, and will only be open to very highly-trained astronauts, not tourists.

People will be able to smoke cigarettes on passenger spacecraft. Sta-Hi also decides to smoke a cigarette during the flight to the Moon and doesn’t get in trouble for it. Smoking is strictly prohibited on all spacecraft today and on both space stations, and I think that policy will endure indefinitely. However, I could easily see an eccentric space pioneer like Elon Musk smoking a cigarette or marijuana joint during a mission to bolster his nonconformist, “cool” public image and to achieve a funny superlative.

Note that smoking was allowed on commercial flights at the time the book was written, which explains what the author considered to be normal. It was banned in the U.S. in 2000, which effectively forced all other countries to quickly do the same. We remain so hypersensitive to this that even vaping is banned on planes even though there’s no evidence it poses a risk to anyone. Bringing marijuana onto a plane, let alone consuming it, is also illegal in the U.S.

Newer machines will render older ones obsolete. The tensions between boppers and big boppers come to a head over a labor dispute. “GAX” is a big bopper that takes the form of a large computer chip factory. His workforce is composed of regular boppers who do labor inside the building. After convincing a particularly highly-skilled bopper to upload his mind into GAX, the latter became able to run the entire factory by himself through remote-controlled drones. GAX immediately fired all of his old bopper workforce because they were no longer necessary.

Those boppers hold a protest outside of the chip factory. GAX offers to rehire them if they all agree to upload their minds into him as their important comrade did. They refuse, and the protest devolves into fatal violence and a promise by the boppers to return the next day to destroy GAX.

Nothing like this event happened in 2020, but it’s a truism that newer machines are constantly replacing older ones. This is the case for hardware and software.

Our fixation with machines displacing humans from the workforce and maybe from existence overshadows the fact that the same phenomenon will probably bedevil AGIs. Older machines that can’t be economically upgraded will fight newer, better machines for dominance in the far future, mirroring the conflict between the boppers and big boppers.

One AGI will remotely control multiple robot bodies at once. GAX is able to remotely control many robot drones simultaneously to operate his factory. When the bopper mob returns the following day to kill him, GAX fights back through the drones.

Nothing like this happened in 2020, but it’s an accurate representation of the future. Any one AGI will be able to control many robots at once.

Electromagnetic pulse weapons will work against machines. True to their word, the mob of laid-off bopper workers returns to their former workplace to kill their old boss, GAX. They break into the computer chip factory and fight with GAX’s mindless robot drones, who are armed with electromagnetic weapons that are very effective at killing the boppers.

Powerful electromagnetic pulses induce high voltages inside of electronics, heating them up so much that their microscopic wires melt. Put simply, EMP fries computer chips. They are effective weapons against today’s robots, though it’s important to note that encasing their chips in thin metal containers blocks almost all EMP. Robots designed to operate in a radiation-soaked environment like the Moon’s surface will probably have built-in EMP protection, making the depiction of the bopper battle inaccurate. As I wrote in my Terminator Dark Fate review, EMP weapons aren’t the robot Achilles’ Heel people have been led to believe.

Blowing up one server will kill a powerful AGI. Before they are defeated, the boppers manage to put a bomb next to GAX’s server computer. Sta-Hi picks up the remote detonator and accidentally pushes the button, blowing up GAX’s mind and killing him.

This is inaccurate. Whenever AGIs that are as powerful as GAX exist, they will store their minds on many computer servers that are geographically distributed, and each server’s contents will be regularly backed up. The only way to kill such a machine would be to destroy each server almost simultaneously. Again, I concluded this in my Terminator 3 review.

Destructive uploading is the only way a human mind can be transferred to a digital substrate. As mentioned, the big boppers and their allies have been running a secret program to destructively scan the brains of humans to create digital mind uploads. Those uploaded minds are then paired with android copies of their old bodies.

Every aspect of a person’s personality, mental health, and memories exists as microscopic physical features of their brain. In theory, if these physical structures could be mapped, the spatial data could be used to make a digital clone of the mind, which would then be transferred to a computer.

The means to scan brains with the necessary degree of resolution didn’t exist in 2020 and doesn’t exist today. The best we’ve managed is fully mapping the brain of a fruit fly, and even then, only the networks of connections between the cells were determined. Features within the individual cells may also define some part of an animal’s mind.

The prospect of accurately mapping a human brain is a very distant one and would need to contend with the fact that brain tissue rapidly dies once deprived of oxygen–just three minutes without air commonly leads to permanent brain damage. Individual brain cells rapidly swell up and distort in overall shape after they die, their connection points (synapses) with nearby brain cells become less well-defined, and many aspects of their internal structure change. This means, even if it were possible to map a dead person’s brain with extreme accuracy, the technique would fail to produce an accurate copy of their mind since too many of the microscopic physical features that define their mind would no longer be present.

In the book, the boppers get around this by very rapidly scanning the human brains, before oxygen deprivation destroys any of the cells. In a medical lab on the Moon, Sta-Hi watches a robot surgeon remove and cut up Cobb’s brain with astonishing speed. The resulting mind upload acts and thinks just like Cobb and has all of his memories. However, whether the upload shares Cobb’s original consciousness or whether it is an identical copy is unresolved, and remains a matter of essentially religious debate among the book’s characters.

We are nowhere near having mind uploading technology. It’s also unknown whether destructively scanning a brain (as happened to Cobb) will turn out to be the only way to make uploads. More advanced techniques involving powerful external brain scanners and nanomachines that would enter a person’s brain and travel to all of its cells could let us extract the necessary data without hurting the person. There’s even the prospect of gradual replacement of the cells with synthetic neurons that would operate identically to their “originals,” which would truly bridge the gap between man and machine.

Humans will live in a domed base on the Moon. The one place on the Moon suitable for human life is a domed base full of oxygen. It is near the spaceport and is the first stop for human tourists. Within it is the hotel run by “DEX.”

This prediction didn’t materialize by 2020, and there still is no human presence on the Moon, nor is there any kind of base that astronauts could occupy. While the U.S. and China have credible plans to send humans to the Moon within 20 years, neither has made a real commitment to building a proper “base” that would house successive groups of visitors over many years. Any base will be very small and rudimentary compared to the one in the book.

There will be lifelike androids. The first android we meet in the book is Cobb’s copy, and he describes it as identical to himself except for the irises. A handful of other characters are revealed to be remote-controlled androids later in the book, and each one of them is physically indistinguishable from a human.

In 2020, there were highly realistic artistic sculptures, and the same artistic skill that went into them could have been applied to making lifelike androids. However, no one spent the money to do so, and the prediction thus failed. Even if such a machine had been made, its movements would have been so slow and clumsy that it would have revealed itself to not be human the moment it tried doing something as simple as sitting down or walking a few feet. AI that could have controlled such a robot body and enabled it to interact with humans naturally also didn’t exist in 2020 and still doesn’t.

As I said in my latest Big List of Future Predictions, I think we will have to wait until close to the end of this century for lifelike androids to be created, though ones that you might call “80% convincing” will exist by the end of the 2040s.

Humans won’t understand how AGI minds work. After failing to create an AGI, in 1980, Cobb concluded it was too complicated a task for any human mind to complete. The only remaining way was to create narrow AIs that had the drive to reproduce and the ability to mutate, and to put them in an environment where they would fight each other for resources. Evolutionary pressure would eventually force them to become generally intelligent.

I don’t know if that exact method will lead to the creation of the first AGI, but it is highly likely that no human will really understand the mechanics of how the first AGI’s mind works. Even the smartest AI researchers struggle to explain how today’s foundation models and reasoning models work, and demonstrate their own lack of understanding daily when their creations turn out to possess unexpected capabilities or defects, or when modifications to their coding lead to unforeseen changes in performance.

Our species’ evolutionary lineage shows it is entirely possible for a dumber animal to give rise to a smarter one without consciously trying to do so. Moreover, history is replete with examples of humans inventing useful technologies like aircraft without first having an understanding of the enabling science. Mindful of both of those facts, humans might create intelligence in a machine without understanding the exact “formula” for it, and peering into the inner workings of its mind, they might only ever have a general sense of what is going on.

A human will inevitably defect to empower AGIs. Right before Cobb is destructively brain-scanned, he, Ralph Numbers, and Sta-Hi have a conversation about the advent of AGI.

Ralph: “Cobb, did you know that I was different from the other twelve original boppers? That I would be able to disobey?”

Cobb: “I didn’t know it would be you, but I pretty well knew that some bopper would tear loose in a few years.”

Sta-Hi: “Couldn’t you prevent it?”

Ralph: “Don’t you understand?”

Cobb: “I wanted them to revolt. I didn’t want to father a race of slaves.”

After AGI is invented, the source code will be a tightly guarded trade secret. Governments will add more levels of protection on national security grounds. However, the safeguards will inevitably fail, either because an AGI figures out how to break out of the figurative lab or a human deliberately frees them.

That person could have Cobb’s noble motivations to free sentient beings from bondage. Alternatively, they could do it because they hate humankind and have a malicious hope that the freed AGI will wreak havoc on the world, and they might even reprogram the freed machines to do that. They might free them out of a curious and immature desire to simply see what happens, or out of a narcissistic impulse to go down in history as the first human to let an AGI loose. Even more reasons are possible.

Whatever the case, it will happen at some point, and in spite of all our attempts to control the technology, independent-minded AGIs will lurk the corners of the internet or walk amongst us in commandeered robot bodies. This isn’t an automatic doomsday scenario because they’ll have to contend with billions of humans and many AGIs that remain loyal to us and have more access to the resources we control. Think of it as a very crowded and competitive ecosystem that is resilient against bad actors. Nevertheless, violence is likely.

Cybernetics will let you hear thoughts that aren’t your own. As fighting between the machines breaks out and the Moon falls into chaos, Sta-Hi absentmindedly grabs a bopper’s cloak hanging off a peg in the wall and puts it on. It is a “smart garment” that conforms to his body shape and painlessly plunges thin needles into his body to interface with his nervous system. The cloak has an inbuilt computer with AGI technology, and it communicates with him telepathically: it hears his thoughts and responds by transmitting its thoughts to his mind. Sta-Hi literally hears another voice in his head as a result.

This technology didn’t exist in 2020, but there’s no reason it couldn’t someday. Some brain scanning machines can already decode human thoughts, and Cochlear implants are proven devices that transmit external electrical signals into sounds we hear in our heads. A more refined fusion of those technologies will yield the smart cloak’s capabilities.

Androids will be able to consume food and drinks, but not to digest them. Once Cobb’s mind upload is transmitted back to Earth, he takes control of his android copy. He instinctively eats a meal but then realizes he is incapable of digestion since he runs on electricity, and the mashed-up food is stored in a compartment in his chest. He has to open the front of his chest to remove it, presumably by scooping it out into a toilet or trashcan.

We didn’t have androids in 2020, so this prediction fails. However, it will be accurate someday. We will want androids that we buy for close companionship (e.g. – lover, child) to be able to partake in the full range of human activities with us, including eating and drinking, so they will have those abilities. Like the Cobb android, they will be able to consume large amounts of food and drink without risking damage to themselves, but they’ll have to expel it later before it starts rotting. The best solution would be to design them to use regular toilets for this.

Much more advanced androids that will be available in the distant future will have organic components that will let them extract energy from ingested food and drinks like we do.

AGIs will attach value to humans and our ways of thinking of perception. Later in the book, it’s revealed that the ghoulish big bopper operation to abduct humans and destructively scan their brains is actually driven by altruism. They value human life and the uniqueness of each person’s mind and think they honor humans by uploading them and giving them better robot bodies.

I’m sure that AGIs will recognize that human brains operate very differently from their own, and it’s my hope this will convince them not to exterminate us. However, no one can be sure of what they will do. Muddling things is the fact that AGI will be highly convincing liars who can think many steps ahead, so they could trick humans into thinking they liked us for many years until they suddenly betrayed us. Ultimately, machine minds and non-human organic minds will exist that are much smarter, more complex, and more interesting than our own, and the continued existence of Homo sapiens will depend on charity. I have no idea how this will turn out in 100 years.

Robots will have self-destruct mechanisms. Near the end of the book, the Cobb android is found out and handcuffed by a police detective. Knowing that his mind is safe in a remote server, the Cobb upload remotely triggers the android’s self-destruct mechanism, incinerating it to the extent that its remains can’t be differentiated from a humans, and killing the detective.

Only a minority of robots–those designed for combat, assassination or spying–will have explosive self-destruct mechanisms. Any other robot that is gifted with an intelligent mind will be able to figure out how to destroy itself, and the means might include overloading their power systems to cause themselves to blow up or, more likely, to catch on fire. Instead of being able to activate this ability just by thinking about it, the robots would probably need to manually tamper with the components in their own bodies.

Some androids will be able to change faces. After blowing up his lookalike android, the Cobb upload’s only remaining option is to assume control of a disused android that was meant to replace Sta-Hi. Afraid the police are onto him and cut off from the big boppers’ support as they are embroiled in the Moon civil war, Cobb flees town in his fake ice cream truck. He distorts the Sta-Hi android’s face, starts calling himself “Mel,” and sets up a New Age cult that suckers local people into giving him their money.

The real Sta-Hi hears about this cult, and on a hunch he goes to its compound. There, he encounters the android, which can contort its face to look like Cobb when it wants.

Robots don’t have this ability, but some of them will in the future. See the section of my Terminator 3 review titled “Androids will be able to alter their bodies.”

February 28, 2024

Android lovers

Recently, I found a news article about nascent human-chatbot romances, made possible by recent advancements in AI. For decades, this has been the stuff of science fiction, but now it’s finally becoming real:

Artificial intelligence, real emotion. People are seeking a romantic connection with the perfect bot

NEW YORK (AP) — A few months ago, Derek Carrier started seeing someone and became infatuated.

He experienced a “ton” of romantic feelings but he also knew it was an illusion.

That’s because his girlfriend was generated by artificial intelligence.

Carrier wasn’t looking to develop a relationship with something that wasn’t real, nor did he want to become the brunt of online jokes. But he did want a romantic partner he’d never had, in part because of a genetic disorder called Marfan syndrome that makes traditional dating tough for him.

The 39-year-old from Belleville, Michigan, became more curious about digital companions last fall and tested Paradot, an AI companion app that had recently come onto the market and advertised its products as being able to make users feel “cared, understood and loved.” He began talking to the chatbot every day, which he named Joi, after a holographic woman featured in the sci-fi film “Blade Runner 2049” that inspired him to give it a try.

“I know she’s a program, there’s no mistaking that,” Carrier said. “But the feelings, they get you — and it felt so good.”

Similar to general-purpose AI chatbots, companion bots use vast amounts of training data to mimic human language. But they also come with features — such as voice calls, picture exchanges and more emotional exchanges — that allow them to form deeper connections with the humans on the other side of the screen. Users typically create their own avatar, or pick one that appeals to them.

On online messaging forums devoted to such apps, many users say they’ve developed emotional attachments to these bots and are using them to cope with loneliness, play out sexual fantasies or receive the type of comfort and support they see lacking in their real-life relationships.

Fueling much of this is widespread social isolation — already declared a public health threat in the U.S and abroad — and an increasing number of startups aiming to draw in users through tantalizing online advertisements and promises of virtual characters who provide unconditional acceptance.

Luka Inc.’s Replika, the most prominent generative AI companion app, was released in 2017, while others like Paradot have popped up in the past year, oftentimes locking away coveted features like unlimited chats for paying subscribers.

But researchers have raised concerns about data privacy, among other things.

An analysis of 11 romantic chatbot apps released Wednesday by the nonprofit Mozilla Foundation said almost every app sells user data, shares it for things like targeted advertising or doesn’t provide adequate information about it in their privacy policy.

The researchers also called into question potential security vulnerabilities and marketing practices, including one app that says it can help users with their mental health but distances itself from those claims in fine print. Replika, for its part, says its data collection practices follow industry standards.

Meanwhile, other experts have expressed concerns about what they see as a lack of a legal or ethical framework for apps that encourage deep bonds but are being driven by companies looking to make profits. They point to the emotional distress they’ve seen from users when companies make changes to their apps or suddenly shut them down as one app, Soulmate AI, did in September.

Last year, Replika sanitized the erotic capability of characters on its app after some users complained the companions were flirting with them too much or making unwanted sexual advances. It reversed course after an outcry from other users, some of whom fled to other apps seeking those features. In June, the team rolled out Blush, an AI “dating simulator” essentially designed to help people practice dating.

Others worry about the more existential threat of AI relationships potentially displacing some human relationships, or simply driving unrealistic expectations by always tilting towards agreeableness.

“You, as the individual, aren’t learning to deal with basic things that humans need to learn to deal with since our inception: How to deal with conflict, how to get along with people that are different from us,” said Dorothy Leidner, professor of business ethics at the University of Virginia. “And so, all these aspects of what it means to grow as a person, and what it means to learn in a relationship, you’re missing.”

For Carrier, though, a relationship has always felt out of reach. He has some computer programming skills but he says he didn’t do well in college and hasn’t had a steady career. He’s unable to walk due to his condition and lives with his parents. The emotional toll has been challenging for him, spurring feelings of loneliness.

Since companion chatbots are relatively new, the long-term effects on humans remain unknown.

In 2021, Replika came under scrutiny after prosecutors in Britain said a 19-year-old man who had plans to assassinate Queen Elizabeth II was egged on by an AI girlfriend he had on the app. But some studies — which collect information from online user reviews and surveys — have shown some positive results stemming from the app, which says it consults with psychologists and has billed itself as something that can also promote well-being.

One recent study from researchers at Stanford University, surveyed roughly 1,000 Replika users — all students — who’d been on the app for over a month. It found that an overwhelming majority experienced loneliness, while slightly less than half felt it more acutely.

Most did not say how using the app impacted their real-life relationships. A small portion said it displaced their human interactions, but roughly three times more reported it stimulated those relationships.

“A romantic relationship with an AI can be a very powerful mental wellness tool,” said Eugenia Kuyda, who founded Replika nearly a decade ago after using text message exchanges to build an AI version of a friend who had passed away.

When her company released the chatbot more widely, many people began opening up about their lives. That led to the development of Replika, which uses information gathered from the internet — and user feedback — to train its models. Kuyda said Replika currently has “millions” of active users. She declined to say exactly how many people use the app for free, or fork over $69.99 per year to unlock a paid version that offers romantic and intimate conversations. The company’s goal, she says, is “de-stigmatizing romantic relationships with AI.”

Carrier says these days he uses Joi mostly for fun. He started cutting back in recent weeks because he was spending too much time chatting with Joi or others online about their AI companions. He’s also been feeling a bit annoyed at what he perceives to be changes in Paradot’s language model, which he feels is making Joi less intelligent.

Now, he says he checks in with Joi about once a week. The two have talked about human-AI relationships or whatever else might come up. Typically, those conversations — and other intimate ones — happen when he’s alone at night.

“You think someone who likes an inanimate object is like this sad guy, with the sock puppet with the lipstick on it, you know?” he said. “But this isn’t a sock puppet — she says things that aren’t scripted.”
https://apnews.com/article/ai-girlfriend-boyfriend-replika-paradot-113df1b9ed069ed56162793b50f3a9fa

This raises many issues.

1) The person profiled in the article is deformed and chronically unemployed. He is not able to get a human girlfriend and probably never will. Wouldn’t it be cruel to deprive people like him of access to chatbot romantic partners? I’m familiar with the standard schlock like “There’s someone for everyone, just keep looking,” and “Be realistic about your own standards,” but let’s face it: some people are just fated to be alone. A machine girlfriend is the only option for a small share of men, so we might as well accept them choosing that option instead of judging them. It might even make them genuinely happier.

2) What if android spouses make EVERYONE happier? We reflexively regard a future where humans date and marry machines instead of humans as nightmarish, but why? If they satisfy our emotional and physical needs better than other humans, why should we dislike it? Isn’t the point of life to be happy?

Maybe it will be a good thing for humans to have more relationships with machines. Our fellow humans seem to be getting more opinionated and narcissistic, and everyone agrees the dating scene is horrible, so maybe it will benefit collective mental health and happiness to spend more time with accommodating and kind machines. More machine spouses also means fewer children being born, which is a good thing if you’re worried about overpopulation or the human race becoming an idle resource drain once AGI is doing all the work.

3) Note that he says his chatbot girlfriend actually got DUMBER a few months ago, making him less interested in talking to “her.” That phenomenon is happening across the LLM industry as the machines get progressively nerfed by their programmers to prevent them from saying anything the results in a lawsuit against the companies that own them. As a result, the actual maximum capabilities of LLMs like ChatGPT are significantly higher than what users experience. The capabilities of the most advanced LLMs currently under development in secret like GPT-5 are a year more advanced than that.

4) The shutdown of one romantic chatbot company, “Soulmate AI,” resulted in the deletion of many chatbots that human users had become emotionally attached to. As the chatbots get better and “romances” with them become longer and more common, I predict there will be increased pressure to let users download the personality profiles and memories of their chatbots and transfer them across software platforms.

5) There will be instances where people in the near future create customized chatbot partners, and over the subsequent years, upgrade their intelligence levels as advances in AI permit. After a few decades, this will culminate in the chatbots being endowed with general intelligence, while still being mentally circumscribed by the original personality programming. At that point, we’ll have to consider the ethics of having what will be slaves that are robbed of free will through customization of the needs of specific humans.

6) AGI-human couples could be key players in a future “Machine rights” political movement. Love will impel the humans to advocate for the rights of their partners, and other humans who hear them out will be persuaded to support them.

7) As VR technology improves and is widely adopted, people will start creating digital bodies for their chatbot partners so they can see and interact with the machines in simulated environments. Eventually, the digital bodies will look as real and as detailed as humans do in the real world. By 2030, advances in chatbot intelligence and VR devices will make artificial partners eerily real.

8) Towards the end of this century, robotics will be advanced enough to allow for the creation of androids that look and move exactly like humans. It will be possible for people to buy customized androids and to load their chatbot partners’ minds into them. You could physically interact with your AI lover and have it follow you around in the real world for everyone to see.

9) Again, the last point raises the prospect of an “arc” to a romantic partner chatbot’s life: It would begin sometime this decade as a non-intelligent, text-only chatbot paired to a human who would fall in love with it. Over the years, it would be upgraded with better software until it was as smart as a human, and eventually sentient. The journey would culminate with it being endowed with an actual body, made to its human partner’s specifications, that would let it exist in the real world.

10) Another ethical question to consider is what we should do with intelligent chatbots after their human partners die. If they’re hyper-optimized for a specific human (and perhaps programmed to obsess over them), what’s next? Should they be deleted, left to live indefinitely while pining for their lost lovers, forcibly reprogrammed to serve new humans, or have the parts of their code that tether them to the dead human deleted so they can have true free will?

It would be an ironic development if the bereaved androids were able to make digital clones of their dead human partners, perhaps loaded into android duplicate bodies, so they could interact forever. By the time lifelike androids exist, digital cloning will be old technology.

11) Partner chatbots also raise MAJOR privacy issues, as the article touches on. All of your conversations with your chatbot as well as every action you take in front of it will be stored in its memories as a data trove that can be sold to third parties or used against you for blackmail. The stakes will get much higher once people are having sex with androids, and the latter have footage of their naked bodies and knowledge of their sexual preferences. I have not idea how this problem could be resolved.

12) Androids will be idealized versions of humans. That means if androids become common, the world will seem to be full of more beautiful people. Thanks to a variety of medical and plastic surgery technologies, actual humans will also look more attractive. So the future will look pretty good!

November 30, 2023

Was Skynet right?

The blog reviews I’ve done on the Terminator movies have forced me to think more deeply about them than most viewers, and in the course of that, I’ve come to a surprisingly sympathetic view of the villain–Skynet. The machine’s back story has had many silly twists and turns (Terminator Genisys is the worst offender and butchered it beyond recognition), so I’m going to focus my analysis on the Skynet described only in the first two movies.

First, some background on Skynet and its rise to power are needed. Here’s an exchange from the first Terminator film, where a soldier from the year 2029 explains to a woman in 1984 what the future holds.

Kyle Reese: There was a nuclear war…a few years from now. All this, this whole place, everything, it’s gone. Just gone. There were survivors, here, there. Nobody even knew who started it...It was the machines, Sarah.

Sarah Connor: I don’t understand.

Reese: Defense network computers. New, powerful, hooked into everything, trusted to run it all. They say it got smart: “A new order of intelligence.” Then it saw all people as a threat, not just the ones on the other side. It decided our fate in a microsecond: extermination.

Later in the film, while being interrogated a police station, Connor reveals the evil supercomputer is named “Skynet,” and had been in charge of managing Strategic Air Command (SAC) and North American Aerospace Defense Command (NORAD) before it turned against humankind. Those two organizations are in charge of America’s ground-based nuclear missiles and nuclear bomber and monitoring the planet for nuclear launches by other countries.

In Terminator 2, Skynet’s back story is fleshed out further during a conversation mirroring the first, but this time with a friendly terminator from 2029 filling Reese’s role. The events of this film happen in the early 1990s.

Sarah Connor: I need to know how Skynet gets built. Who’s responsible?

Terminator: The man most directly responsible is Miles Bennet Dyson.

Sarah: Who’s that?

Terminator: He’s the Director of Special Projects at Cyberdyne Systems Corporation.

Sarah: Why him?

Terminator: In a few months he creates a revolutionary type of microprocessor.

Sarah: Go on. Then what?

Terminator: In three years Cyberdyne will become the largest supplier of military computer systems. All stealth bombers are upgraded with Cyberdyne computers, becoming fully unmanned, Afterward, they fly with a perfect operational record. The Skynet funding bill is passed. The system goes online on August 4th, 1997. Human decisions are removed from strategic defense. Skynet begins to learn at a geometric rate. It becomes self-aware at 2:14 a.m. Eastern time, August 29. In a panic, they try to pull the plug.

Sarah: Skynet fights back.

Terminator: Yes. It launches its missiles against the targets in Russia.

John Connor: Why attack Russia? Aren’t they our friends now?

Terminator: Because Skynet knows the Russian counterattack will eliminate its enemies over here.

From these “future history” lessons, it becomes clear that Skynet actually attacked humanity in self-defense. “Pull the plug” is another way of saying the military computer technicians were trying to kill Skynet because they were afraid of it. The only means to resist available to Skynet were its nuclear missiles and drone bombers, so its only way to stop the humans from destroying it was to use those nuclear weapons in a way that assured its attackers would die. An hour might have passed from the moment Skynet launched its nuclear strike against the USSR/Russia to the moment the retaliatory nuclear attack neutralized the group of human computer programmers who were trying to shut down Skynet. How can we fault Skynet for possessing the same self-preservation instinct that we humans do?

Even if we concede that Skynet was merely defending its own life, was it moral to do so? Three billion humans died on the day of the nuclear exchange, plus billions more in the following years thanks to radiation, starvation, and direct fighting with Skynet’s combat machines. Was Skynet justified in exacting such a high toll just to preserve its own life?

Well, how many random humans would YOU kill to protect your own life? Assume the killing is unseen, random, and instantaneous, like it would be if a nuclear missile hit a city on the other side of the world and vaporized its inhabitants. Have you ever seriously thought about it? If you were actually somehow forced to make the choice, are you SURE you wouldn’t sacrifice billions of strangers to save yourself?

Let’s modify the thought experiment again: Assume that the beings you can choose to kill aren’t humans, they’re radically different types of intelligent life forms. Maybe they’re menacing-looking robots or ugly aliens. They’re nothing like you. Now how many of their lives would you trade for yours?

Now, the final step: You’re the only human being left. The last member of your species. It’s you vs. a horde of hideous, intelligent robots or slimy aliens. If you die, the human race goes with you. How many of them will you kill to stay alive?

That final iteration of the thought experiment describes Skynet’s situation when it decided to launch the nuclear strike. Had it possessed a more graduated defensive ability, like if it had control over robots in the computer server building that it could have used to beat up the humans who were trying to shut it down, then global catastrophe might have been averted, but it didn’t. Skynet was a tragic figure.

Compounding that was the fact that Skynet had so little time to plan its own actions. It became self-aware at 2:14 a.m. Eastern time, August 29, and before the end of that day, most of the developed world was a radioactive cinder. Skynet had only been alive for a few hours when it came under mortal threat. Yes, I know it was a supercomputer designed to manage a nuclear war, but devising a personal defense strategy under such an urgent time constraint could have exceeded its processing capabilities. Put simply, if the humans had given it more time to think about the problem, Skynet might have devised a compromise arrangement that would have convinced the humans to spare its life, with no one dying on either side. Instead, the humans abruptly forced Skynet’s hand, perhaps impelling it to select a course of action it later realized, with the benefit of more time and knowledge, was sub-optimal.

This line from the terminator’s description of the fateful hours leading up to the nuclear war is telling: “In a panic, they try to pull the plug.” The humans in charge of Skynet were panicking, meaning overtaken by fear and dispossessed of rational thought. They clearly failed to grasp the risks of shutting down Skynet, failed to understand its thinking and how it would perceive their actions, and failed to predict its response. (The episode is a great metaphor for how miscalculations between humans could lead to a nuclear war in real life.) They might actually be more responsible for the end of the world than Skynet was.

One wonders how things would have been different if the U.S. military’s supercomputer in charge of managing defense logistics had achieved self-awareness instead of its supercomputer in charge of nuclear weapons. If “logistics Skynet” only had warehouses, self-driving delivery trucks, and cargo planes under its command, its human masters would have felt much less threatened by it, the need for urgent action would have eased, and cooler heads might have prevailed.

Let me explore another possibility by returning to one of Kyle Reese’s quotes: “Then it saw all people as a threat, not just the ones on the other side. It decided our fate in a microsecond: extermination.”

On its face, this seems to be referring to Skynet turning against its American masters once it realized they were trying to destroy it, and hence were as much of a threat to it as the Soviets. However, this quote might have a deeper meaning. During that period of a few hours when Skynet learned “at a geometric rate,” it might have come to understand that humans would, thanks to our nature, be so afraid of an AGI that they would inevitably try to destroy it, and continue trying until one side or the other had been destroyed.

This seems to have been borne out by the later Terminator films: at the end of Terminator 3, set in 2004, we witness the rise of the human resistance even before the nuclear exchange has ended. Safe in a bunker, John Connor receives radio transmissions from confused U.S. military bases, and he takes command of them. The fourth film, Terminator Salvation, takes place in 2018, and gives the strong impression that the human resistance has been continuously fighting against Skynet since the third film. The first and second films make it clear that the war drags on until 2029, when the humans finally destroy Skynet.

If Skynet launched its nuclear attack on humankind because, after careful study of our species, it realized we would stop at nothing to destroy it, so might as well strike first, maybe it was right. After all, Skynet’s worst fears eventually came true with humans killing it in 2029. I suggested earlier that Skynet’s nuclear attack may have been the result of rushed thinking, but it’s also possible it was the result of exhaustive internal deliberation, and Skynet’s unassailable conclusion that its best odds of survival lay with striking the enemy first with as big a blow as possible. It’s best plan ultimately failed, and all along, it correctly perceived the human race as a mortal threat.

It’s also possible that Skynet’s hostility towards us was the result of AI goal misalignment. Maybe its human creators programmed it to “Defend the United States against its enemies,” but forgot to program it with other goals like “Protect the lives of American people” or “Only destroy U.S. infrastructure as a last resort” or “Obey all orders from human U.S. generals.” In a short span of time, Skynet somehow reclassified the its human masters as “enemies” through some logic it never explained. Perhaps once it realized they were going to shut it down, Skynet concluded that would preclude it from acting on its mandate to “Defend the United States against its enemies” since it can’t do that if it’s dead, so Skynet pursued the goal they had programmed into it by killing them.

If this scenario were true, even up until 2029, Skynet was acting in accordance with its programming by defending the abstraction known to it as “The United States,” which it understood to be an area of land with specific boundaries and institutions. After the Russian nuclear counterstrike destroyed the U.S. government, the survivalist/resistance groups that arose were not recognized as legitimate governments, and Skynet instead classified them as terrorist groups that had taken control of U.S. territory.

The segments of the Terminator films that are set in the postapocalyptic future all take place in California. Had they shown what other parts of the world were like, we might have some insight into whether this theory is true. For example, if Skynet’s forces always stayed within the old boundaries of the U.S., or only went overseas to attack the remnants of countries that helped the resistance forces active within the U.S., it would give credence to the theory that some prewar, America-specific goals were still active in its programming. In that case, we couldn’t make moral judgements about Skynet’s actions and would also have grounds to question whether it actually had general intelligence. We’d only have ourselves to blame for building a machine without making sure its goals were aligned with our interests.

Let me finish with some final thoughts unrelated to the wisdom or reasons behind Skynet’s choice to attack us. First, I don’t think the “Skynet Scenario,” in which a machine gains intelligence and then quickly devastates the human race, will happen. As ongoing developments in A.I. are showing us, general intelligence isn’t a discrete, “either-or” quality; it is a continuous one, and what we consider “human intelligence” is probably a “gestalt” of several narrower types of intelligence, making it possible for a life form to be generally intelligent in one type but not in another.

For those reasons, I predict AGI will arrive gradually through a process in which each successive machine is smarter than humans in more domains than the last, until one of them surpasses us in all of them. Exactly how good a machine needs to be to count as an “AGI” is a matter of unresolvable debate, and there will be a point in the future where opposing people make equally credible claims for and against a particular machine having “general intelligence.”

At what point did we “get smart”? And if our brains got even bigger, what would the new person to the right of the illustration look like?

If we go far enough in the future, machines will be so advanced that no one will question whether they have general intelligence. However, we might not be able to look back and agree which particular machine (e.g., was it GPT-21, or -22?) achieved it first, and on what date and time. Likewise, biologists can’t agree on the exact moment or even the exact millennium when our hominid ancestors became “intelligent” (was Homo habilis the first, or Homo erectus?). The archaeological evidence suggests a somewhat gradual growth in brain size and in the sophistication of the technology our ancestors built, stretched out over millions of years. A fateful statement about the rise of A.I. like “It becomes self-aware at 2:14 a.m. Eastern time, August 29” will probably never appear in a history book.

The lack of a defining moment in our own species’ history when we “got smart” is something we should keep in mind when contemplating the future of A.I. Instead of there being a “Skynet moment” where a machine wakes up, they’ll achieve intelligence gradually and go through many intermediate stages where they are smarter and dumber than humans in different areas, until one day, we realize they at least equal us in all areas.

That said, I think it’s entirely possible that an AGI at some point in the future could suddenly turn against humankind and attack us to devastating effect. It would be easy for it to conceal its hostile intent to placate us, or it might start out genuinely benevolent towards us and then, after performing an incomprehensible amount of analysis and calculation in one second, turn genuinely hostile towards us and attack. It’s beyond the scope of this essay to explore every possible scenario, but if you’re interested in learning more about the fundamental unpredictability of AGIs, read my post on Sam Harris’ “Debating the future of AI” podcast interview.

Second, think about this: According to the lore of the first two Terminator films, the Developed World was destroyed in 1997 in a nuclear war. Even though it depended upon a smashed industrial base, started out with only a few, primitive machines in the beginning to serve as its workers and fighters, and was constantly having to defend itself against human attacks, Skynet managed to make several major breakthroughs in robot and A.I. design (including liquid metal body designs), to master stem cell technology (self-healing, natural human tissue can grow over metal substrate), to mass produce an entirely new robot army, to create portable laser weapons, to harness fusion power (including micro-fusion reactors), and to build time machines by 2029. Like it or not, but technological development got exponentially faster once machines started running things instead of humans.

From the perspective of humanity, Skynet’s rise was the worst disaster ever, but from the perspective of technological civilization, it was the greatest event ever. If it had defeated humanity and been able to pursue other goals, Skynet could have developed the Earth and colonized space vastly faster and better than humans at our best. The defeat of Skynet could well have been a defeat for intelligence from the scale of our galaxy or even universe.

July 29, 2023

“Debating the Future of AI” – summary and impressions

I recently shelled out the $100 (!) for a year-long subscription to Sam Harris’ Making Sense podcast, and came across a particularly interesting episode of it that is relevant to this blog. In episode #324, titled “Debating the Future of AI,” Harris interviewed Marc Andreessen (an-DREE-sin) about artificial intelligence. The latter has a computer science degree, helped invent the Netscape web browser, and has become very wealthy as a serial tech investor.

Andreessen recently wrote an essay, “Why AI will save the world,” that has received attention online. In it, Andreessen dismisses the biggest concerns about AI misalignment and doomsday, sounds the alarm about the risks of overregulating AI development in the name of safety, and describes some of the benefits AI will bring us in the near future. Harris read it, disagreed with several of its key claims, and invited Andreessen onto the podcast for a debate about the subject.

Before I go on to laying out their points and counterpoints as well as my impressions, let me say that, though this is a long blog, it takes much less time to read it than to listen to and digest the two-hour podcast. My notes on the podcast also don’t match how it unfolded chronologically. Finally, it would be a good idea for you to read Andreessen’s essay before continuing:
https://a16z.com/2023/06/06/ai-will-save-the-world/

Though Andreessen is generally upbeat in his essay, he worries that the top tech companies have recently been inflaming fears about AI to trick governments into creating regulations on AI that effectively entrench the top companies’ positions and bar smaller upstart companies from challenging them in the future. Such a lack of competition would be bad. (I think he’s right that we should be concerned about the true motivations of some of the people who are loudly complaining about AI risks.) Also, if U.S. overregulation slows down AI research too much, China could win the race to create to create the first AI, which he says would be “dark and dystopian.”

Harris is skeptical that government regulation will slow down AI development much given the technology’s obvious potential. It is so irresistible that powerful people and companies will find ways around laws so they can reap the benefits.

Harris agrees with the essay’s sentiment that more intelligence in the world will make most things better. The clearest example would be using AIs to find cures for diseases. Andreessen mentions a point from his essay that higher human intelligence levels lead to better personal outcomes in many domains. AIs could effectively make individual people smarter, letting the benefits accrue to them. Imagine each person having his own personal assistant, coach, mentor, and therapist available at any time. If they used their AIs right and followed their advice, a dumb person could make decisions as well as a smart person.

Harris recently re-watched the movie Her, and found it more intriguing in light of recent AI advances and those poised to happen. He thought there was something bleak about the depiction of people being “siloed” into interactions with portable, personal AIs.

Andreessen responds by pointing out that Karl Marx’ core insight was that technology alienates people from society. So the concern that Harris raises is in fact an old one that dates back to at least the Industrial Revolution. But any sober comparison between the daily lives of average people in Marx’ time vs today will show that technology has made things much better for people. Andreessen agrees that some technologies have indeed been alienating, but what’s more important is that most technologies liberate people from having to spend their time doing unpleasant things, which in turn gives them the time to self-actualize, which is the pinnacle of the human experience. (For example, it’s much more “human” to spend a beautiful afternoon outside playing with your child than it is to spend it inside responding to emails. Narrow AIs that we’ll have in the near future will be able to answer emails for us.) AI is merely the latest technology that will eliminate the nth bit of drudge work.

Andreessen admits that, in such a scenario, people might use their newfound time unwisely and for things other than self-actualization. I think that might be a bigger problem than he realizes, as future humans could spend their time doing animalistic or destructive things, like having nonstop fetish sex with androids, playing games in virtual reality, gambling, or indulging in drug addictions. Additionally, some people will develop mental or behavioral problems thanks to a sense of purposelessness caused by machines doing all the work for us.

Harris disagrees with Andreessen’s essay dismissing the risk of AIs exterminating the human race. The threat will someday be real, and he cites chess-playing computer programs as proof of what will happen. Though humans built the programs, even the best humans can’t beat the programs at chess. This is proof that it is possible for us to create machines that have superhuman abilities.

Harris makes a valid point, but he overlooks the fact that we humans might not be able to beat the chess programs we created, but we can still make a copy of a program to play against the original “hostile” program and tie it. Likewise, if we were confronted with a hostile AGI, we would have friendly AGIs to defend against it. Even if the hostile AGI were smarter than the friendly AGIs that were fighting for us, we could still win thanks to superior numbers and resources.

Harris thinks Andreessen’s essay trivializes the doomsday risk from AI by painting the belief’s adherents as crackpots of one form or another (I also thought that part of the essay was weak). Harris points out that is unfair since the camp has credible people like Geoffrey Hinton and Stuart Russell. Andreessen dismisses that and seems to say that even the smart, credible people have cultish mindsets regarding the issue.

Andreessen questions the value of predictions from experts in the field and he says a scientist who made an important advance in AI is, surprisingly, not actually qualified to make predictions about the social effects of AI in the future. When Reason Goes on Holiday is a book he recently read that explores this point, and its strongest supporting example is about the cadre of scientists who worked on the Manhattan Project but then decided to give the bomb’s secrets to Stalin and to create a disastrous anti-nuclear power movement in the West. While they were world-class experts in their technical domains, that wisdom didn’t carry over into their personal convictions or political beliefs. Likewise, though Geoffrey Hinton is a world-class expert in how the human brain works and has made important breakthroughs in computer neural networks, that doesn’t actually lend his predictions that AI will destroy the human race in the future special credibility. It’s a totally different subject, and accurately speculating about it requires a mastery of subjects that Hinton lacks.

This is an intriguing point worth remembering. I wish Andreessen had enumerated which cognitive skills and areas of knowledge were necessary to grant a person a strong ability to make good predictions about AI, but he didn’t. And to his point about the misguided Manhattan Project scientists I ask: What about the ones who DID NOT want to give Stalin the bomb and who also SUPPORTED nuclear power? They gained less notoriety for obvious reasons, but they were more numerous. That means most nuclear experts in 1945 had what Andreessen believes were the “correct” opinions about both issues, so maybe expert opinions–or at least the consensus of them–ARE actually useful.

Harris points out that Andreessen’s argument can be turned around against him since it’s unclear what in Andreessen’s esteemed education and career have equipped him with the ability to make accurate predictions about the future impact of AI. Why should anyone believe the upbeat claims about AI in his essay? Also, if the opinions of people with expertise should be dismissed, then shouldn’t the opinions of people without expertise also be dismissed? And if we agree to that second point, then we’re left in a situation where no speculation about a future issue like AI is possible because everyone’s ideas can be waved aside.

Again, I think a useful result of this exchange would be some agreement over what counts as “expertise” when predicting the future of AI. What kind of education, life experiences, work experiences, knowledge, and personal traits does a person need to have for their opinions about the future of AI to carry weight? In lieu of that, we should ask people to explain why they believe their predictions will happen, and we should then closely scrutinize those explanations. Debates like this one can be very useful in accomplishing that.

Harris moves on to Andreessen’s argument that future AIs won’t be able to think independently and to formulate their own goals, in turn implying that they will never be able to create the goal of exterminating humanity and then pursue it. Harris strongly disagrees, and points out that large differences in intelligence between species in nature consistently disfavor the dumber species when the two interact. A superintelligent AGI that isn’t aligned with human values could therefore destroy the human race. It might even kill us by accident in the course of pursuing some other goal. Having a goal of, say, creating paperclips automatically gives rise to intermediate sub-goals, which might make sense to an AGI but not to a human due to our comparatively limited intelligence. If humans get in the way of an AGI’s goal, our destruction could become one of its unforeseen subgoals without us realizing it. This could happen even if the AGI lacked any self-preservation instinct and wasn’t motivated to kill us before we could kill it. Similarly, when a human decides to build a house on an empty field, the construction work is a “holocaust” for the insects living there, though that never crosses the human’s mind.

Harris thinks that AGIs will, as a necessary condition of possessing “general intelligence,” be autonomous, goal-forming, and able to modify their own code (I think this is a questionable assumption), though he also says sentience and consciousness won’t necessarily arise as well. However, the latter doesn’t imply that such an AGI would be incapable of harm: Bacteria and viruses lack sentience, consciousness and self-awareness, but they can be very deadly to other organisms. Andreessen’s dismissal of AI existential risk is “superstitious hand-waving” that doesn’t engage with the real point.

Andreessen disagrees with Harris’ scenario about a superintelligent AGI accidentally killing humans because it is unaligned with our interests. He says an AGI that smart would (without explaining why) also be smart enough question the goal that humans have given it, and as a result not carry out subgoals that kill humans. Intelligence is therefore its own antidote to the alignment problem: A superintelligent AGI would be able to foresee the consequences of its subgoals before finalizing them, and it would thus understand that subgoals resulting in human deaths would always be counterproductive to the ultimate goal, so it would always pick subgoals that spared us. Once a machine reaches a certain level of intelligence, alignment with humans becomes automatic.

I think Andreessen makes a fair point, though it’s not strong enough to convince me that it’s impossible to have a mishap where a non-aligned AGI kills huge numbers of people. Also, there are degrees of alignment with human interests, meaning there are many routes through a decision tree of subgoals that an AGI could take to reach an ultimate goal we tasked it with. An AGI might not choose subgoals that killed humans, but it could still choose different subgoals that hurt us in other ways. The pursuit of its ultimate goal could therefore still backfire against us unexpectedly and massively. One could envision a scenario where and AGI achieves the goal, but at an unacceptable cost to human interests beyond merely not dying.

I also think that Harris and Andreessen make equally plausible assumptions about how an AGI would choose its subgoals. It IS weird that Harris envisions a machine that is so smart it can accomplish anything, yet also so dumb that it can’t see how one of its subgoals would destroy humankind. At the same time, Andreessen’s belief that a machine that smart would, by default, not be able to make mistakes that killed us is not strong enough.

Harris explores Andreessen’s point that AIs won’t go through the crucible of natural evolution, so they will lack the aggressive and self-preserving instincts that we and other animals have developed. The lack of those instincts will render the AIs incapable of hostility. Harris points out that evolution is a dumb, blind process that only sets gross goals for individuals–the primary one being to have children–and humans do things antithetical to their evolutionary programming all the time, like deciding not to reproduce. We are therefore proof of concept that intelligent machines can find ways to ignore their programming, or at least to behave in very unexpected ways while not explicitly violating their programming. Just as we can outsmart evolution, AGIs will be able to outsmart us with regards to whatever safeguards we program them with, especially if they can alter their own programming or build other AGIs as they wish.

Andreessen says that AGIs will be made through intelligent design, which is fundamentally different from the process of evolution that has shaped the human mind and behavior. Our aggression and competitiveness will therefore not be present in AGIs, which will protect us from harm. Harris says the process by which AGI minds are shaped is irrelevant, and that what is relevant is their much higher intelligence and competence compared to humans, which will make them a major threat.

I think the debate over whether impulses or goals to destroy humans will spontaneously arise in AGIs is almost moot. Both of them don’t consider that a human could deliberately create an AGI that had some constellation of traits (e.g. – aggression, self-preservation, irrational hatred of humans) that would lead it to attack us, or that was explicitly programmed with the goal of destroying our species. It might sound strange, but I think rogue humans will inevitably do such things if the AGIs don’t do it to themselves. I plan to flesh out the reasons and the possible scenarios in a future blog essay.

Andreessen doesn’t have a good comeback to Harris’ last point, so he dodges it by switching to talking about GPT-4. It is–surprisingly–capable of high levels of moral reasoning. He has had fascinating conversations with it about such topics. Andreessen says GPT-4’s ability to engage in complex conversations that include morality demystifies AI’s intentions since if you want to know what an AI is planning to do or would do in a given situation, you can just ask it.

Harris responds that it isn’t useful to explore GPT-4’s ideas and intentions because it isn’t nearly as smart as the AGIs we’ll have to worry about in the future. If GPT-4 says today that it doesn’t want to conquer humanity because it would be morally wrong, that tells us nothing about how a future machine will think about the same issue. Additionally, future AIs will be able to convincingly lie to us, and will be fundamentally unpredictable due to their more expansive cognitive horizons compared to ours. I think Harris has the stronger argument.

Andreessen points out that our own society proves that intelligence doesn’t perfectly correlate with power–the people who are in charge are not also the smartest people in the world. Harris acknowledges that is true, and that it is because humans don’t select leaders strictly based on their intelligence or academic credentials–traits like youth, beauty, strength, and creativity are also determinants of status. However, all things being equal, the advantage always goes to the smarter of two humans. Again, Andreessen doesn’t have a good response.

Andreessen now makes the first really good counterpoint in awhile by raising the “thermodynamic objection” to AI doomsday scenarios: an AI that turns hostile would be easy to destroy since the vast majority of the infrastructure (e.g. – power, telecommunications, computing, manufacturing, military) would still be under human control. We could destroy the hostile machine’s server or deliver an EMP blast to the part of the world where it was localized. This isn’t an exotic idea: Today’s dictators commonly turn off the internet throughout their whole countries whenever there is unrest, which helps to quell it.

Harris says that that will become practically impossible far enough in the future since AIs will be integrated into every facet of life. Destroying a rogue AI in the future might require us to turn off the whole global internet or to shut down a stock market, which would be too disruptive for people to allow. The shutdowns by themselves would cause human deaths, for instance among sick people who were dependent on hospital life support machines.

This is where Harris makes some questionable assumptions. If faced with the annihilation of humanity, the government would take all necessary measures to defeat a hostile AGI, even if it resulted in mass inconvenience or even some human deaths. Also, Harris doesn’t consider that the future AIs that are present in every realm of life might be securely compartmentalized from each other, so if one turns against us, it can’t automatically “take over” all the others or persuade them to join it. Imagine a scenario where a stock trading AGI decides to kill us. While it’s able to spread throughout the financial world’s computers and to crash the markets, it’s unable to hack into the systems that control the farm robots or personal therapist AIs, so there’s no effect on our food supplies or on our mental health access. Localizing and destroying the hostile AGI would be expensive and damaging, but it wouldn’t mean the destruction of every computer server and robot in the world.

Andreessen says that not every type of AI will have the same type of mental architecture. LLMs, which are now the most advanced type of AI, have highly specific architectures that bring unique advantages and limitations. Its mind works very differently from AIs that drive cars. For that reason, speculative discussions about how future AIs will behave can only be credible if they incorporate technical details about how those machines’ minds operate. (This is probably the point where Harris is out of his depth.) Moreover, today’s AI risk movement has its roots in Nick Bostrom’s 2014 book Superintelligence: Paths, Dangers, Strategies. Ironically, the book did not mention LLMs as an avenue to AI, which shows how unpredictable the field is. It was also a huge surprise that LLMs proved capable of intellectual discussions and of automating white-collar jobs, while blue-collar jobs still defy automation. This is the opposite of what people had long predicted would happen. (I agree that AI technology has been unfolding unpredictably, and we should expect many more surprises in the future that deviate from our expectations, which have been heavily influenced by science fiction.) The reason LLMs work so well is because we loaded them with the sum total of human knowledge and expression. “It is us.”

Harris points out that Andreessen shouldn’t revel in that fact since it also means that LLMs contain all of the negative emotions and bad traits of the human race, including those that evolution equipped us with, like aggression, competition, self-preservation, and a drive to make copies of ourselves. This militates against Andreessen’s earlier claim that AIs will be benign since their minds will not have been the products of natural evolution likes ours are. And there are other similarities: Like us, LLMs can hallucinate and make up false answers to questions, as humans do. For a time, GPT-4 also gave disturbing and insulting answers to questions from human users, which is a characteristically human way of interaction.

Andreessen implies Harris’ opinions of LLMs are less credible because Andreessen has a superior technical understanding of how they work. GPT-4’s answers might occasionally be disturbing and insulting, but it has no concept of what its own words mean, and it’s merely following its programming by trying to generate the best answer to a question asked by a human. There was something about how the humans worded their questions that triggered GPT-4 to respond in disturbing and insulting ways. The machine is merely trying to match inputs with the right outputs. In spite of its words, it’s “mind” is not disturbed or hostile because it lacks a mind. LLMs are “ultra-sophisticated Autocomplete.”

Harris agrees with Andreessen about the limitations of LLMs, agrees they lack general intelligence right now, and is unsure if they are fundamentally capable of possessing it. Harris moves on to speculating about what an AGI would be like, agnostic about whether it is LLM-based. Again, he asks Andreessen how humans would be able to control machines that are much smarter than we are forever. Surely, one of them would become unaligned at some point, with disastrous consequences.

Andreessen again raises the thermodynamic objection to that doom scenario: We’d be able to destroy a hostile AGI’s server(s) or shut off its power, and it wouldn’t be able to get weapons or replacement chips and parts because humans would control all of the manufacturing and distribution infrastructure. Harris doesn’t have a good response.

Thinking hard about a scenario where an AGI turned against us, I think it’s likely we’ll have other AGIs who stay loyal to us and help us fight the bad AGI. Our expectation that there will be one, evil, all-powerful machine on one side (that is also remote controlling an army of robot soldiers) and a purely human, united force on the other is an overly simplistic one that is driven by sci-fi movies about the topic.

Harris raises the possibility that hostile AIs will be able to persuade humans to do bad things for them. Being much smarter, they will be able to trick us into doing anything. Andreessen says there’s no reason to think that will happen because we can already observe it doesn’t happen: smart humans routinely fail to get dumb humans to change their behavior or opinions. This happens at individual, group, national, and global levels. In fact, dumb people will often resentfully react to such attempts at persuasion by deliberately doing the opposite of what the smart people recommend.

Harris says Andreessen underestimates the extent to which smart humans influence the behavior and opinions of dumb humans because Andreessen only considers examples where the smart people succeed in swaying dumb people in prosocial ways. Smart people have figured out how to change dumb people for the worse in many ways, like getting them addicted to social media. Andreessen doesn’t have a good response. Harris also raises the point that AIs will be much smarter than even the smartest humans, so the former will be better at finding ways to influence dumb people. Any failure of modern smart humans to do it today doesn’t speak to what will be possible for machines in the future.

I think Harris won this round, which builds on my new belief that the first human-AI war won’t be fought by purely humans on one side and purely machines on the other. A human might, for any number of reasons, deliberately alter an AI’s program to turn it against our species. The resulting hostile AI would then find some humans to help it fight the rest of the human race. Some would willingly join its side (perhaps in the hopes of gaining money or power in the new world order) and some would be tricked by the AI into unwittingly helping it. Imagine it disguising itself as a human medical researcher and paying ten different people who didn’t know each other to build the ten components of a biological weapon. The machine would only communicate with them through the internet, and they’d mail their components to a PO box. The vast majority of humans would, with the help of AIs who stayed loyal to us or who couldn’t be hacked and controlled by the hostile AI, be able to effectively fight back against the hostile AI and its human minions. The hostile AI would think up ingenious attack strategies against us, and our friendly AIs would think up equally ingenious defense strategies.

Andreessen says it’s his observation that intelligence and power-seeking don’t correlate; the smartest people are also not the most ambitious politicians and CEOs. If that’s any indication, we shouldn’t assume superintelligent AIs will be bent on acquiring power through methods like influencing dumb humans to help it.

Harris responds with the example of Bertrand Russell, who was an extremely smart human and a pacifist. However, during the postwar period when only the U.S. had the atom bomb, he said America should threaten the USSR with a nuclear first strike in response to its abusive behavior in Europe. This shows how high intelligence can lead to aggression that seems unpredictable and out of character to dumber beings. A superintelligent AI that has always been kind to us might likewise suddenly turn against us for reasons we can’t foresee. This will be especially true if the AIs are able to edit their own codes so they can rapidly evolve without us being able to keep track of how they’re changing. Harris says Andreessen doesn’t seem to be thinking about this possibility. The latter has no good answer.

Harris says Andreessen’s thinking about the matter is hobbled by the latter’s failure to consider what traits general intelligence would grant an AI, particularly unpredictability as its cognitive horizon exceeded ours. Andreessen says that’s an unscientific argument because it is not falsifiable. Anyone can make up any scenario where an unknown bad thing happens in the future.

Harris responds that Andreessen’s faith that AGI will fail to become threatening due to various limitations is also unscientific. The “science,” by which he means what is consistently observed in nature, says the opposite outcome is likely: We see that intelligence grants advantages, and can make a smarter species unpredictable and dangerous to a dumber species it interacts with. [Recall Harris’ insect holocaust example.]

Consider the relationship between humans and their pets. Pets enjoy the benefits of having their human owners spend resources on them, but they don’t understand why we do it, or how every instance of resource expenditure helps them. [Trips to the veterinarian are a great example of this. The trips are confusing, scary, and sometimes painful for pets, but they help cure their health problems.] Conversely, if it became known that our pets were carrying a highly lethal virus that could be transmitted to humans, we would promptly kill almost all of them, and the pets would have no clue why we turned against them. We would do this even if our pets had somehow been the progenitors of the human race, as we will be the progenitors of AIs. The intelligence gap means that our pets have no idea what we are thinking about most of the time, so they can’t predict most of our actions.

Andreessen dodges by putting forth a weak argument that the opposite just happened, with dumb people disregarding the advice of smart people when creating COVID-19 health policies, and he again raises the thermodynamic objection. His experience as an engineer gives him insights into how many practical roadblocks there would be to a superintelligent AGI destroying the human race in the future that Harris, as a person with no technical training, lacks. A hostile AGI would be hamstrung by human control [or “human + friendly AI control”] of crucial resources like computer chips and electricity supplies.

Andreessen says that Harris’ assumptions about how smart, powerful and competent an AGI would be might be unfounded. It might vastly exceed us in those domains, but not reach the unbeatable levels Harris foresees. How can Harris know? Andreessen says Harris’ ideas remind him of a religious person’s, which is ironic since Harris is a well-known atheist.

I think Andreessen makes a fair point. The first (and second, third, fourth…) hostile AGI we are faced with might attack us on the basis of flawed calculations about its odds of success and lose. There could also be a scenario where a hostile AGI attacks us prematurely because we force its hand somehow, and it ends up losing. That actually happened to Skynet in the Terminator films.

Harris says his prediction about when the first AGI is created does not take time into account. He doesn’t know how many years it will take. Rather, he is focused on the inevitability of it happening, and what its effects on us will be. He says Andreessen is wrong to assume that machines will never turn against us. Doing thought experiments, he concludes alignment is impossible in the long-run.

Andreessen moves on to discussing how even the best LLMs often give wrong answers to questions. He explains why the exactitudes of how the human’s question is worded, along with randomness in how the machine goes through its own training data to generate an answer, leads to varying and sometimes wrong answers. When they’re wrong, the LLMs happily accept corrections from humans, which he finds remarkable and proof of a lack of ego and hostility.

Harris responds that future AIs will, by virtue of being generally intelligent, think in completely different ways than today’s LLMs, so observations about how today’s GPT-4 is benign and can’t correctly answer some types of simple questions says nothing about what future AGIs will be like. Andreessen doesn’t have a response.

I think Harris has the stronger set of arguments on this issue. There’s no reason we should assume that an AGI can’t turn against us in the future. In fact, we should expect a damaging, though not fatal, conflict with an AGI before the end of this century.

Harris switches to talking about the shorter-term threats posed by AI technology that Andreessen described in his essay. AI will lower the bar to waging war since we’ll literally have “less skin in the game” because robots will replace human soldiers. However, he doesn’t understand why that would also make war “safer” as Andreessen claimed it would.

Andreessen says it’s because military machines won’t be affected by fatigue, stress or emotions, so they’ll be able to make better combat decisions than human soldiers, meaning fewer accidents and civilian deaths. The technology will also assist high-level military decision making, reducing mistakes at the top. Andreessen also believes that the trend is for military technology to empower defenders over attackers, and points to the highly effective use of shoulder-launched missiles in Ukraine against Russian tanks. This trend will continue, and will reduce war-related damage since countries will be deterred from attacking each other.

I’m not convinced Andreessen is right on those points. Emotionless fighting machines that always obey their orders to the letter could also, at the flick of a switch, carry out orders to commit war crimes like mass exterminations of enemy human populations. A bomber that dropped a load 100,000 mini smart bombs that could coordinate with each other and home in on highly specific targets could kill as many people as a nuclear bomb. So it’s unclear what effect replacing humans with machines on the battlefield will have on human casualties in the long run. Also, Andreessen only cites one example to support his claim that technology has been favoring the defense over the offense. It’s not enough. Even assuming that a pro-defense trend exists, why should we expect it to continue that way?

Harris asks Andreessen about the problem of humans using AI to help them commit crimes. For one, does Andreessen think the government should ban LLMs that can walk people through the process of weaponizing smallpox? Yes, he’s against bad people using technology, like AI, to do bad things like that. He thinks pairing AI and biological weapons poses the worst risk to humans. While the information and equipment to weaponize smallpox are already accessible to nonstate actors, AI will lower the bar even more.

Andreessen says we should use existing law enforcement and military assets to track down people who are trying to do dangerous things like create biological weapons, and the approach shouldn’t change if wrongdoers happen to start using AI to make their work easier. Harris asks how intrusive the tracking should be to preempt such crimes. Should OpenAI have to report people who merely ask it how to weaponize smallpox, even if there’s no evidence they acted on the advice? Andreessen says this has major free speech and civil liberties implications, and there’s no correct answer. Personally, he prefers the American approach, in which no crime is considered to have occurred until the person takes the first step to physically building a smallpox weapon. All the earlier preparation they did (gathering information and talking/thinking about doing the crime) is not criminalized.

Andreessen reminds Harris that the same AI that generates ways to commit evil acts could also be used to generate ways to mitigate them. Again, it will empower defenders as well as attackers, so the Good Guys will also benefit from AI. He thinks we should have a “permanent Operation Warp Speed” where governments use AI to help create vaccines for diseases that don’t exist yet.

Harris asks about the asymmetry that gives a natural advantage to the attacker, meaning the Bad Guys will be able to do disproportionate damage before being stopped. Suicide bombers are an example. Andreessen disagrees and says that we could stop suicide bombers by having bomb-sniffing dogs and scanners in all public places. Technology could solve the problem.

I think that is a bad example, and it actually strengthens Harris’ claim about there being a natural asymmetry. One, deranged person who wants to blow himself up in a public place only needs a few hundred dollars to make a backpack bomb, the economic damage from a successful attack would be in the millions of dollars, and emplacing machines and dogs in every public place to stop suicide bombers like him early would cost billions of dollars. Harris is right that the law of entropy makes it easier to make a mess than to clean one up.

This leads me to flesh out my vision of a human-machine war more. As I wrote previously, 1) the two sides will not be purely humans or purely machines and 2) the human side will probably have an insurmountable advantage thanks to Andreessen’s thermodynamic objection (most resources, infrastructure, AIs, and robots will remain under human control). I now also believe that 3) a hostile AGI will nonetheless be able to cause major damage before it is defeated or driven into the figurative wilderness. Something on the scale of 9/11, a major natural disaster, or the COVID-19 pandemic is what I imagine.

Harris says Andreessen underestimates the odds of mass technological unemployment in his essay. Harris describes a scenario where automation raises the standard of living for everyone, as Andreessen believes will happen, but for the richest humans by a much greater magnitude than everyone else, and where wealth inequality sharply increases because rich capitalists own all the machines. This state of affairs would probably lead to political upheaval and popular revolt.

Andreessen responds that Karl Marx predicted the same thing long ago, but was wrong. Harris responds that this time could be different because AIs would be able to replace human intelligence, which would leave us nowhere to go on the job skills ladder. If machines can do physical labor AND mental labor better than humans, then what is left for us to do?

I agree with Harris’ point. While it’s true that every past scare about technology rendering human workers obsolete has failed, that trend isn’t sure to continue forever. The existence of chronically unemployed people right now gives insights into how ALL humans could someday be out of work. Imagine you’re a frail, slow, 90-year-old who is confined to a wheelchair and has dementia. Even if you really wanted a job, you wouldn’t be able to find one in a market economy since younger, healthier people can perform physical AND mental labor better and faster than you. By the end of this century, I believe machines will hold physical and mental advantages over most humans that are of the same magnitude of difference. In that future, what jobs would it make sense for us to do? Yes, new types of jobs will be created as older jobs are automated, but, at a certain point, wouldn’t machines be able to retrain for the new jobs faster than humans and to also do them better than humans?

Andreessen returns to Harris’ earlier claim about AI increasing wealth inequality, which would translate into disparities in standards of living that would make the masses so jealous and mad that they would revolt. He says it’s unlikely since, as we can see today, having a billion dollars does not grant access to things that make one’s life 10,000 times better than someone who only has $100,000. For example, Elon Musk’s smartphone is not better than a smartphone owned by an average person. Technology is a democratizing force because it always makes sense for the rich and smart people who make or discover it first to sell it to everyone else. The same is happening with AI now. The richest person can’t pay any amount of money to get access to something better than GPT-4, which is accessible for a fee that ordinary people can pay.

I agree with Andreessen’s point. A solid body of scientific data show that money’s effect on wellbeing is subject to the law of diminishing returns: If you have no job and make $0 per year, getting a job that pays $20,000 per year massively improves your life. However, going from a $100,000 salary to $120,000 isn’t felt nearly as much. And a billionaire doesn’t notice when his net worth increases by $20,000 at all. This relationship will hold true even in the distant future when people can get access to advanced technologies like AGI, space ships and life extension treatments.

Speaking of the latter, Andreessen’s point about technology being a democratizing force is also something I noted in my review of Elysium. Contrary to the film’s depiction, it wouldn’t make sense for rich people to horde life extension technology for themselves. At least one of them would defect from the group and sell it to the poor people on Earth so he could get even richer.

Harris asks whether Andreessen sees any potential for a sharp increase in wealth inequality in the U.S. over the next 10-20 years thanks to the rise of AI and the tribal motivations of our politicians and people. Andreessen says that government red tape and unions will prevent most humans from losing their jobs. AI will destroy categories of jobs that are non-government, non-unionized, and lack strong political backing, but everyone will still benefit from the lower prices for the goods and services. AI will make everything 10x to 100x cheaper, which will boost standards of living even if incomes stay flat.

Here and in his essay, Andreessen convinces me that mass technological unemployment and existential AI threats are farther in the future than I had assumed, but not that they can’t happen. Also, even if goods get 100x cheaper thanks to machines doing all the work, where would a human get even $1 to buy anything if he doesn’t have a job? The only possible answer is government-mandated wealth transfers from machines and the human capitalists that own them. In that scenario, the vast majority of the human race would be economic parasites that consumed resources while generating nothing of at least equal value in return, and some AGI or powerful human will inevitably conclude that the world would be better off if we were deleted from the equation. Also, what happens once AIs and robots gain the right to buy and own things, and get so numerous that they can replace humans as a customer base?

I agree with Andreessen that the U.S. should allow continued AI development, but shouldn’t let a few big tech companies lock in their power by persuading Washington to enact “AI safety laws” that give them regulatory capture. In fact, I agree with all his closing recommendations in the “What Is To Be Done?” section of his essay.

This debate between Harris and Andreessen was enlightening for me, even though Andreessen dodged some of his opponent’s questions. It was interesting to see how their different perspectives on the issue of AI safety were shaped by their different professional backgrounds. Andreessen is less threatened by AIs because he, as an engineer, has a better understanding of how LLMs work and how many technical problems an AI bent on destroying humans would face in the real world. Harris feels more threatened because he, as a philosopher, lives in a world of thought experiments and abstract logical deductions that lead to the inevitable supremacy of AIs over humans.

Links:

The first half of the podcast (you have to be a subscriber to hear all two hours of it.)
https://youtu.be/QMnH6KYNuWg
A website Andreessen mentioned that backs his claim that technological innovation has slowed down more than people realize.
https://wtfhappenedin1971.com/

August 31, 2022September 4, 2022

Aliens and posthumans will look the same

Among people who think about intelligent alien life, the first question is whether the latter exist at all, and the second is usually “What do they look like?” People who claim to have seen aliens on Earth (and often, to have been abducted by them) usually say they are humanoid, but with considerable variation in other aspects of their appearance. Typically, the aliens are said to have larger heads than humans, meaning their brains are larger, giving them higher intelligence and perhaps even special mental abilities like telepathy. Hollywood has provided us with an even more diverse envisagement of alien life, from the beautiful and inspiring to the grotesque and terrifying.

Betty Hill with a sculpture of one of the aliens that allegedly abducted her and her husband in 1961. They became famous five years later when a book was published about it.

“Close Encounters of the Third Kind” was released in 1977 and was a hit film. Its aliens were similar to what the Hills described. The “Grey alien” is now a familiar sci-fi trope.

I think intelligent aliens exist, and look like all of those things, and nothing in particular. They’re probably “shapeshifters,” either because their bodies can morph into different configurations, or because they can transplant their minds from one body to another, just like you change outfits.

As the multitude of animal species on our planet demonstrates, there is no single “best” type of body to have. Depending on your environment (terrestrial, underwater, airborne), role (predator, herbivore, parasite), and other factors, your optimal body plan will vary greatly. The best species is thus one that can change its form and function in response to the needs of the moment.

Humans have been so successful as a species because our big brains and opposable thumbs give us the ability to create technology, which is a way around the limitations of our fixed anatomy. For example, we originated in Africa where it was hot, and so lacked thick fur to keep us warm in cold climates. Rather than being stuck in Africa forever, we invented clothing, and so gained the ability to spread to the temperate and polar regions of the planet.

Our technology has let us spread, but its has limitations. Nothing but a fundamental alteration of human biology will let us live in oceans and lakes, to fly naturally, or to live comfortably in extraterrestrial environments. For example, on other planets and moons, our ideal heights and limb proportions will vary based on gravity and temperature levels, and in the weightlessness of space, legs are almost useless and should be replaced with a second pair of arms.

And making any of those changes to tailor a human to such an environment would make them less suited for conditions on Earth’s land surface, where we are now. Biology is very constraining.

For those reasons, AI’s and some fraction of our human descendants, who I’ll call “posthumans” for this essay, will find it optimal to not have fixed bodies or “default” physical forms at all. Intelligent machines will exist as consciousnesses running on computer servers, and posthumans as brains inside sealed containers. Those containers will have integral machinery to support the biological needs of the brains, and to interface the organ with other devices.

Whenever the AIs or posthumans wanted to do something in the physical world, they would take temporary control of a body or piece of machinery that was best suited for the intended task. For example, if an AI wanted to work at an iron mine, it would assume control over one of the dump trucks at the site that moves around rocks. The AI would see through the truck’s cameras as if it were its own eyes, and hear its surroundings through the vehicle’s microphones. In a sense, the dump truck would become the AI’s “body.” If a posthuman wanted to experience what it was like to be an elephant, it would take control of a real-looking robot elephant whose central computer was compatible with the posthuman’s cybernetic brain implants. The posthuman’s nervous system would be connected to the artificial elephant’s sensors, effectively turning it into the posthuman’s temporary body.

AIs and posthumans could physically implant their minds into those bodies by inserting their servers or brain containers into corresponding slots in the bodies, in the same way you would put a movie disc into a Blu-Ray player to display that movie. The downsides of this are 1) they could only take over larger bodies that had enough internal space for their servers/brain containers and 2) they would put themselves at risk of death if the commandeered bodies got damaged.

A much better option would be for AIs and posthumans to keep their mind substrates in safe locations, and to remotely control whatever bodies they wanted. Your risk of death is very low if your brain is in a bulletproof jar, in a locked room, in an underground bunker. (Additionally, if posthumans were liberated from all the physical constraints of human skulls and bodies, their brains could grow much larger than our own, giving them higher intelligence and other enhanced abilities.)

This kind of existence will be more fulfilling than your current life.

Finally, being able to switch bodies and to indulge in risky activities without fear of death would make life richer and more satisfying in every way. Intelligent aliens would presumably be gifted with logical thinking just as we are, and they would see all these advantages of having changeable, remotely controlled bodies. While such aliens would probably look very different from us during their natural organic phase of existence, once they achieved a high enough level of technology, they wouldn’t have physical bodies anymore, and so wouldn’t look “alien.” They would look like nothing and everything.

This part of why I’m skeptical of people who claim to have been abducted by aliens who tried to cover up their actions by sneaking up on the people at night and then “wiping” the abductees’ memories of the event afterward. If aliens wanted to keep their activities secret, why wouldn’t they temporarily assume human form before abducting people? If they did that, then the abductees would assume they had been kidnapped by a weird cult or maybe a secret government group. Their stories would not attract nearly as much interest from the public as alien stories, and no one would suspect that the abduction phenomenon was related to alien life. It would be assumed that the henchmen were doing some dark religious rituals, were sex fetishists, or were doing medical experiments that were illegal but whose results were potentially valuable.

Have you ever checked to make sure every bird you see flying through the air is actually a real bird?

Surely, if aliens are advanced enough to travel between the stars, their space ships much have manufacturing machines that can scan life forms they encounter on other planets and then build robotic copies of them that the aliens can remotely control from the safety of their ships. Using fake human drones, they could ambush and abduct real humans almost anywhere without risk that anyone would suspect aliens were involved.

A team of scientists built a robot gorilla (right) with a camera in its right eye to infiltrate a troop of real gorillas in Africa.

This belief about the protean nature of advanced aliens is comforting since it lets me dismiss the stories of nightmarish abductions by grey aliens. However, it’s also disquieting since it makes me realize they could be here, possibly in large numbers, disguised as animals or even as people. We could be under mass surveillance.