Book review: Superintelligence: Paths, Dangers, Strategies, by Nick Bostrom.

This book is substantially more thoughtful than previous books on AGI risk, and substantially better organized than the previous thoughtful writings on the subject.

Bostrom’s discussion of AGI takeoff speed is disappointingly philosophical. Many sources (most recently CFAR) have told me to rely on the outside view to forecast how long something will take. We’ve got lots of weak evidence about the nature of intelligence, how it evolved, and about how various kinds of software improve, providing data for an outside view. Bostrom assigns a vague but implausibly high probability to AI going from human-equivalent to more powerful than humanity as a whole in days, with little thought of this kind of empirical check.

I’ll discuss this more in a separate post which is more about the general AI foom debate than about this book.

Bostrom’s discussion of how takeoff speed influences the chance of a winner-take-all scenario makes it clear that disagreements over takeoff speed are pretty much the only cause of my disagreement with him over the likelihood of a winner-take-all outcome. Other writers aren’t this clear about this. I suspect those who assign substantial probability to a winner-take-all outcome if takeoff is slow will wish he’d analyzed this in more detail.

I’m less optimistic than Bostrom about monitoring AGI progress. He says “it would not be too difficult to identify most capable individuals with a long-standing interest in [AGI] research”. AGI might require enough expertise for that to be true, but if AGI surprises me by only needing modest new insights, I’m concerned by the precedent of Tim Berners-Lee creating a global hypertext system while barely being noticed by the “leading” researchers in that field. Also, the large number of people who mistakenly think they’ve been making progress on AGI may obscure the competent ones.

He seems confused about the long-term trends in AI researcher beliefs about the risks: “The pioneers of artificial intelligence … mostly did not contemplate the possibility of greater-than-human AI” seems implausible; it’s much more likely they expected it but were either overconfident about it producing good results or fatalistic about preventing bad results (“If we’re lucky, they might decide to keep us as pets” – Marvin Minsky, LIFE Nov 20, 1970).

The best parts of the book clarify many issues related to ensuring that an AGI does what we want.

He catalogs more approaches to controlling AGI than I had previously considered, including tripwires, oracles, and genies, and clearly explains many limits to what they can accomplish.

He briefly mentions the risk that the operator of an oracle AI would misuse it for her personal advantage. Why should we have less concern about the designers of other types of AGI giving them goals that favor the designers?

If an oracle AI can’t produce a result that humans can analyze well enough to decide (without trusting the AI) that it’s safe, why would we expect other approaches (e.g. humans writing the equivalent seed AI directly) to be more feasible?

He covers a wide range of ways we can imagine handling AI goals, including strange ideas such as telling an AGI to use the motivations of superintelligences created by other civilizations

He does a very good job of discussing what values we should and shouldn’t install in an AGI: the best decision theory plus a “do what I mean” dynamic, but not a complete morality.

I’m somewhat concerned by his use of “final goal” without careful explanation. People who anthropomorphise goals are likely to confuse at least the first few references to “final goal” as if it worked like a human goal, i.e. something that the AI might want to modify if it conflicted with other goals.

It’s not clear how much of these chapters depend on a winner-take-all scenario. I get the impression that Bostrom doubts we can do much about the risks associated with scenarios where multiple AGIs become superhuman. This seems strange to me. I want people who write about AGI risks to devote more attention to whether we can influence whether multiple AGIs become a singleton, and how they treat lesser intelligences. Designing AGI to reflect values we want seems almost as desirable in scenarios with multiple AGIs as in the winner-take-all scenario (I’m unsure what Bostrom thinks about that). In a world with many AGIs with unfriendly values, what can humans do to bargain for a habitable niche?

He has a chapter on worlds dominated by whole brain emulations (WBE), probably inspired by Robin Hanson’s writings but with more focus on evaluating risks than on predicting the most probable outcomes. Since it looks like we should still expect an em-dominated world to be replaced at some point by AGI(s) that are designed more cleanly and able to self-improve faster, this isn’t really an alternative to the scenarios discussed in the rest of the book.

He treats starting with “familiar and human-like motivations” (in an augmentation route) as an advantage. Judging from our experience with humans who take over large countries, a human-derived intelligence that conquered the world wouldn’t be safe or friendly, although it would be closer to my goals than a smiley-face maximizer. The main advantage I see in a human-derived superintelligence would be a lower risk of it self-improving fast enough for the frontrunner advantage to be large. But that also means it’s more likely to be eclipsed by a design more amenable to self-improvement.

I’m suspicious of the implication (figure 13) that the risks of WBE will be comparable to AGI risks.

  • Is that mainly due to “neuromorphic AI” risks? Bostrom’s description of neuromorphic AI is vague, but my intuition is that human intelligence isn’t flexible enough to easily get the intelligence part of WBE without getting something moderately close to human behavior.
  • Is the risk of uploaded chimp(s) important? I have some concerns there, but Bostrom doesn’t mention it.
  • How about the risks of competitive pressures driving out human traits (discussed more fully/verbosely at Slate Star Codex)? If WBE and AGI happen close enough together in time that we can plausibly influence which comes first, I don’t expect the time between the two to be long enough for that competition to have large effects.
  • The risk that many humans won’t have enough resources to survive? That’s scary, but wouldn’t cause the astronomical waste of extinction.

Also, I don’t accept his assertion that AGI before WBE eliminates the risks of WBE. Some scenarios with multiple independently designed AGIs forming a weakly coordinated singleton (which I consider more likely than Bostrom does) appear to leave the last two risks in that list unresolved.

This books represents progress toward clear thinking about AGI risks, but much more work still needs to be done.

Crickets

I finally found a way to buy insects in enough quantity to satisfy my desire for nutrition from insects: World Ento, which sells dried crickets and cricket flour at a price per gram of protein comparable to seafood. (H/T Holden Karnofsky.)

According to Organic Value Recovery Solutions, crickets have impressive amounts of the nutrients I’ve found the hardest to get good amounts of. Here are examples of how much I’d get if I got my 2000 calories a day from crickets:

  • B12: 20 times the RDA (more than 3 times that of eggs)
  • Folate: 5 times the RDA (3 times that of eggs)
  • Zinc: 9 times the RDA (5 times that of eggs)

(using data for eggs from pastured chickens).

They have plenty of fiber and good amounts of most minerals and B vitamins.

The cricket flour tastes ok in brownies, but I’ll want some other recipe for regular use.

Update 2015-01-05: ThailandUnique has a better selection of insects. My favorite so far is the Big Cricket.

Book review: The Hidden Reality: Parallel Universes and the Deep Laws of the Cosmos, by Brian Greene.

This book has a lot of overlap with Tegmark’s Our Mathematical Universe

Greene uses less provocative language than Tegmark, but makes up for that by suggesting 5 more multiverses than Tegmark (3 of which depend on string theory for credibility, and 2 that Tegmark probably wouldn’t label as multiverses).

I thought about making some snide remarks about string theory being less real than the other multiverses. Then I noticed that what Greene calls the ultimate multiverse (all possible universes) implies that string theory universes (or at least computable approximations) are real regardless of whether we live in one.

Like Tegmark, Greene convinces me that inflation which lasts for infinite time implies infinite space and infinite copies of earth, but fails to convince me that he has a strong reason for assuming infinite time.

The main text is mostly easy to read. Don’t overlook the more technical notes at the end – the one proposing an experiment that would distinguish the Many Worlds interpretation of quantum mechanics from the Copenhagen interpretation is one of the best parts of the book.

Rule of the Clan

Book review: The Rule of the Clan: What an Ancient Form of Social Organization Reveals About the Future of Individual Freedom by Mark S. Weiner.

This book does a good job of explaining how barbaric practices such as feuds and honor killings are integral parts of clan-based systems of dispute resolution, and can’t safely be suppressed without first developing something like the modern rule of law to remove the motives that perpetuate them.

He has a coherent theory of why societies with no effective courts and police need to have kin-based groups be accountable for the actions of their members, which precludes some of the individual rights that we take for granted.

He does a poor job of explaining how this is relevant to modern government. He writes as if anyone who wants governments to exert less power wants to weaken the rule of law and the ability of governments to stop violent disputes. (His comments about modern government are separate enough to not detract much from the rest of the book).

He implies that modern rule of law and rule by clans are the only stable possibilities. He convinced me that it would be hard to create good alternatives to those two options, but not that alternatives are impossible.

To better understand how modern individualism replaced clan-based society, read Fukuyama’s The Origins of Political Order together with this book.

Ambronite

Yet another soylent competitor has appeared: Ambronite.

It’s higher quality and high price than Soylent or MealSquares. It has more B12 than MealSquares even though it’s vegan.

It’s low enough in saturated fat that I probably want to add an additional source of saturated fat to my diet, but that’s a nice problem to have – I’d want to add chocolate anyway. My biggest reservation is the high level of polyunsaturated fat – if I could get a version without the walnuts I’d probably be satisfied there.

Most ingredients look like what our ancestors evolved to eat, but the first two ingredients listed are oats and rice protein.

Drugs without Patents

When considering proposals to weaken patent monopolies, drug development seems like the main type of innovation that would be hurt.

Most of drug development cost seems to be verification of safety and effectiveness, which doesn’t look like the kind of novelty-creation patents are designed to help, but that doesn’t mean it’s easy to implement an alternative.

It turns out we have an example of a company making monopoly-style profits from an unpatented drug (Questcor Pharmaceuticals, Acthar).

Questcor bought Acthar for $100,000, suggesting the seller expected it would hardly make any money. Sometime later Acthar was designated an orphan drug, and Questcor raised the price from $1,650 to 28,000 per vial, without causing competitors to sell it. It now gets profits of roughly $300 million per year off of it.

So something must be restraining competition, probably something connected to the orphan drug laws, suggesting that if protections for patents in general were weakened, it would only take small changes in existing rules to maintain existing incentives for drug development.

Book review: Our Mathematical Universe: My Quest for the Ultimate Nature of Reality, by Max Tegmark.

His most important claim is the radical Platonist view that all well-defined mathematical structures exist, therefore most physics is the study of which of those we inhabit. His arguments are more tempting than any others I’ve seen for this view, but I’m left with plenty of doubt.

He points to ways that we can imagine this hypothesis being testable, such as via the fine-tuning of fundamental constants. But he doesn’t provide a good reason to think that those tests will distinguish his hypothesis from other popular approaches, as it’s easy to imagine that we’ll never find situations where they make different predictions.

The most valuable parts of the book involve the claim that the multiverse is spatially infinite. He mostly talks as if that’s likely to be true, but his explanations caused me to lower my probability estimate for that claim.

He gets that infinity by claiming that inflation continues in places for infinite time, and then claiming there are reference frames for which that infinite time is located in a spatial rather than a time direction. I have a vague intuition why that second step might be right (but I’m fairly sure he left something important out of the explanation).

For the infinite time part, I’m stuck with relying on argument from authority, without much evidence that the relevant authorities have much confidence in the claim.

Toward the end of the book he mentions reasons to doubt infinities in physics theories – it’s easy to find examples where we model substances such as air as infinitely divisible, when we know that at some levels of detail atomic theory is more accurate. The eternal inflation theory depends on an infinitely expandable space which we can easily imagine is only an approximation. Plus, when physicists explicitly ask whether the universe will last forever, they don’t seem very confident. I’m also tempted to say that the measure problem (i.e. the absence of a way to say some events are more likely than others if they all happen an infinite number of times) is a reason to doubt infinities, but I don’t have much confidence that reality obeys my desire for it to be comprehensible.

I’m disappointed by his claim that we can get good evidence that we’re not Boltzmann brains. He wants us to test our memories, because if I am a Boltzmann brain I’ll probably have a bunch of absurd memories. But suppose I remember having done that test in the past few minutes. The Boltzmann brain hypothesis suggests it’s much more likely for me to have randomly acquired the memory of having passed the test than for me to actually be have done the test. Maybe there’s a way to turn Tegmark’s argument into something rigorous, but it isn’t obvious.

He gives a surprising argument that the differences between the Everett and Copenhagen interpretations of quantum mechanics don’t matter much, because unrelated reasons involving multiverses lead us to expect results comparable to the Everett interpretation even if the Copenhagen interpretation is correct.

It’s a bit hard to figure out what the book’s target audience is – he hides the few equations he uses in footnotes to make it look easy for laymen to follow, but he also discusses hard concepts such as universes with more than one time dimension with little attempt to prepare laymen for them.

The first few chapters are intended for readers with little knowledge of physics. One theme is a historical trend which he mostly describes as expanding our estimate of how big reality is. But the evidence he provides only tells us that the lower bounds that people give keep increasing. Looking at the upper bound (typically infinity) makes that trend look less interesting.

The book has many interesting digressions such as a description of how to build Douglas Adams’ infinite improbability drive.

Nutritional Meals

I’ve been thinking more about convenient, healthy alternatives to Soylent or MealSquares that are closer to the kind of food we’ve evolved to eat.

Here’s some food that exceeds the recommended daily intake of most vitamins and minerals with only about 1300 calories (leaving room for less healthy snacks):

  • 4 bags of Brad’s Raw Chips, Indian
  • 1.5 bags of Brad’s Raw Chips, Sweet Pepper
  • 6 crackers, Lydia’s Green Crackers (vitamin E)
  • 1 oz Atlantic oysters (B12, zinc) (one 3 oz tin every 3 days)
  • 1 brazil nut (selenium)

Caveats: I’m unsure how accurately I estimated the nutrition in the processed foods (I made guesses based on the list of ingredients).

This diet has little vitamin D (which I expect to get from supplements and sun).

It’s slightly low in calcium, sodium, B12, and saturated fat. I consider it important to get more B12 from other animal sources (sardines, salmon or pastured eggs). I’m not concerned about the calcium or sodium because this diet would provide more than hunter-gathers got and because I don’t have much trouble getting more from other food. And it’s hard not to get more saturated fat from other foods I like (e.g. chocolate).

I don’t know whether it has enough iodine, so when I’m not having much fish it’s probably good to add a little seaweed (I’m careful to avoid the common kinds that have added oil that’s been subjected to questionable processing).

It has just barely 100% of vitamin E, B3, and B5 (in practice I get more of those from eggs and sweet potatoes).

It’s possibly too high in omega-3 (10+ grams?) from flax seeds in the Raw Chips (my estimate here is more uncertain than with the other nutrients).

The only convenient way to get oysters that keep well and don’t need preparation is cans of smoked oysters, and smoking seems to be an unhealthy way to process food.

Note that I chose this list without trying to make it affordable, and it ended up costing about $50 per day. I don’t plan to spend that much unless I become too busy to cook cheaper foods such as sweet potatoes, mushrooms, bean sprouts, fish, and eggs.

In practice, I’ve been relying more on Questbars for convenient food, but I’m trying to cut down on those as I eat more Brad’s Raw Chips.

Book review: The Great Degeneration: How Institutions Decay and Economies Die, by Niall Ferguson.

Read (or skim) Reinhart and Rogoff’s book This Time is Different instead. The Great Degeneration contains little value beyond a summary of that.

The other part which comes closest to analyzing US decay is a World Bank report about governance quality from 1996 to 2011 which shows the US in decline from 2000 to 2009. He makes some half-hearted attempts to argue for a longer trend using anecdotes that don’t really say much.

Large parts of the book are just standard ideological fluff.

Book review: Our Final Invention: Artificial Intelligence and the End of the Human Era by James Barrat.

This book describes the risks that artificial general intelligence will cause human extinction, presenting the ideas propounded by Eliezer Yudkowsky in a slightly more organized but less rigorous style than Eliezer has.

Barrat is insufficiently curious about why many people who claim to be AI experts disagree, so he’ll do little to change the minds of people who already have opinions on the subject.

He dismisses critics as unable or unwilling to think clearly about the arguments. My experience suggests that while it’s normally the case that there’s an argument that any one critic hasn’t paid much attention to, that’s often because they’ve rejected with some thought some other step in Eliezer’s reasoning and concluded that the step they’re ignoring wouldn’t influence their conclusions.

The weakest claim in the book is that an AGI might become superintelligent in hours. A large fraction of people who have worked on AGI (e.g. Eric Baum’s What is Thought?) dismiss this as too improbable to be worth much attention, and Barrat doesn’t offer them any reason to reconsider. The rapid takeoff scenarios influence how plausible it is that the first AGI will take over the world. Barrat seems only interested in talking to readers who can be convinced we’re almost certainly doomed if we don’t build the first AGI right. Why not also pay some attention to the more complex situation where an AGI takes years to become superhuman? Should people who think there’s a 1% chance of the first AGI conquering the world worry about that risk?

Some people don’t approve of trying to build an immutable utility function into an AGI, often pointing to changes in human goals without clearly analyzing whether those are subgoals that are being altered to achieve a stable supergoal/utility function. Barrat mentions one such person, but does little to analyze this disagreement.

Would an AGI that has been designed without careful attention to safety blindly follow a narrow interpretation of its programmed goal(s), or would it (after achieving superintelligence) figure out and follow the intentions of its authors? People seem to jump to whatever conclusion supports their attitude toward AGI risk without much analysis of why others disagree, and Barrat follows that pattern.

I can imagine either possibility. If the easiest way to encode a goal system in an AGI is something like “output chess moves which according to the rules of chess will result in checkmate” (turning the planet into computronium might help satisfy that goal).

An apparently harder approach would have the AGI consult a human arbiter to figure out whether it wins the chess game – “human arbiter” isn’t easy to encode in typical software. But AGI wouldn’t be typical software. It’s not obviously wrong to believe that software smart enough to take over the world would be smart enough to handle hard concepts like that. I’d like to see someone pin down people who think this is the obvious result and get them to explain how they imagine the AGI handling the goal before it reaches human-level intelligence.

He mentions some past events that might provide analogies for how AGI will interact with us, but I’m disappointed by how little thought he puts into this.

His examples of contact between technologically advanced beings and less advanced ones all refer to Europeans contacting Native Americans. I’d like to have seen a wider variety of analogies, e.g.:

  • Japan’s contact with the west after centuries of isolation
  • the interaction between neanderthals and humans
  • the contact that resulted in mitochondria becoming part of our cells

He quotes Vinge saying an AGI ‘would not be humankind’s “tool” – any more than humans are the tools of rabbits or robins or chimpanzees.’ I’d say that humans are sometimes the tools of human DNA, which raises more complex questions of how well the DNA’s interests are served.

The book contains many questionable digressions which seem to be designed to entertain.

He claims Google must have an AGI project in spite of denials by Google’s Peter Norvig (this was before it bought DeepMind). But the evidence he uses to back up this claim is that Google thinks something like AGI would be desirable. The obvious conclusion would be that Google did not then think it had the skill to usefully work on AGI, which would be a sensible position given the history of AGI.

He thinks there’s something paradoxical about Eliezer Yudkowsky wanting to keep some information about himself private while putting lots of personal information on the web. The specific examples Barrat gives strongly suggests that Eliezer doesn’t value the standard notion of privacy, but wants to limit peoples’ ability to distract him. Barrat also says Eliezer “gave up reading for fun several years ago”, which will surprise those who see him frequently mention works of fiction in his Author’s Notes.

All this makes me wonder who the book’s target audience is. It seems to be someone less sophisticated than a person who could write an AGI.