ethics

Book review: Singularity Hypotheses: A Scientific and Philosophical Assessment.

This book contains papers of widely varying quality on superhuman intelligence, plus some fairly good discussions of what ethics we might hope to build into an AGI. Several chapters resemble cautious versions of LessWrong; others come from a worldview totally foreign to LessWrong.

The chapter I found most interesting was Richard Loosemore and Ben Goertzel’s attempt to show there are no likely obstacles to a rapid “intelligence explosion”.

I expect what they label as the “inherent slowness of experiments and environmental interaction” to be an important factor limiting the rate at which an AGI can become more powerful. They think they see evidence from current science that this is an unimportant obstacle compared to a shortage of intelligent researchers: “companies complain that research staff are expensive and in short supply; they do not complain that nature is just too slow.”

Some explanations that come to mind are:

  • Complaints about nature being slow are not very effective at speeding up nature.
  • Complaints about specific tools being slow probably aren’t very unusual, but there are plenty of cases where people know complaints aren’t effective (e.g. complaints about spacecraft traveling slower than the theoretical maximum [*]).
  • Hiring more researchers can increase the status of a company even if the additional staff don’t advance knowledge.

They also find it hard to believe that we have independently reached the physical limit on how fast experiments can be done at the same time as we’ve reached the limit on how many intelligent researchers we can hire. For literal meanings of physical limits this makes sense, but if it’s as hard to speed up experiments as it is to throw more intelligence into research, then the apparent coincidence could be due to wise allocation of resources to whichever bottleneck they relieve more effectively.

None of this suggests that it would be hard for an intelligence explosion to produce the 1000x increase in intelligence they talk about over a century, but it seems like an important obstacle to the much faster timescales some people expect (days or weeks).

Some shorter comments on other chapters:

James Miller describes some disturbing incentives that investors would create for companies developing AGI if AGI is developed by companies large enough that no single investor has much influence on the company. I’m not too concerned about this because if AGI were developed by such a company, I doubt that small investors would have enough awareness of the project to influence it. The company might not publicize the project, or might not be honest about it. Investors might not believe accurate reports if they got them, since the reports won’t sound much different from projects that have gone nowhere. It seems very rare for small investors to understand any new software project well enough to distinguish between an AGI that goes foom and one that merely makes some people rich.

David Pearce expects the singularity to come from biological enhancements, because computers don’t have human qualia. He expects it would be intractable for computers to analyze qualia. It’s unclear to me whether this is supposed to limit AGI power because it would be hard for AGI to predict human actions well enough, or because the lack of qualia would prevent an AGI from caring about its goals.

Itamar Arel believes AGI is likely to be dangerous, and suggests dealing with the danger by limiting the AGI’s resources (without saying how it can be prevented from outsourcing its thought to other systems), and by “educational programs that will help mitigate the inevitable fear humans will have” (if the dangers are real, why is less fear desirable?).

* No, that example isn’t very relevant to AGI. Better examples would be atomic force microscopes, or the stock market (where it can take a generation to get a new test of an important pattern), but it would take lots of effort to convince you of that.

Book review: The Righteous Mind: Why Good People Are Divided by Politics and Religion, by Jonathan Haidt.

This book carefully describes the evolutionary origins of human moralizing, explains why tribal attitudes toward morality have both good and bad effects, and shows how people who want to avoid moral hostility can do so.

Parts of the book are arranged to describe the author’s transition away from standard delusions: that morality is the result of the narratives we use to justify it, and mistaken ideas about why other people hold alien-sounding ideologies. His description of how his study of psychology led him to overcome those delusions makes it hard for those who agree with him to feel very superior to those who disagree.

He hints at personal benefits from abandoning partisanship (“It felt good to be released from partisan anger.”), so he doesn’t rely on altruistic motives for people to accept his political advice.

One part of the book that surprised me was the comparison between human morality and human taste buds. Some ideologies are influenced a good deal by all 6 types of human moral intuitions. But the ideology that pervades most of academia respects only 3 of them (care, liberty, and fairness). That creates a difficult communication gap between academics and cultures that employ other intuitions, such as sanctity, in their moral systems, much like people who have only experienced sweet and salty foods would have trouble imagining a desire for sourness in some foods.

He sometimes gives the impression of being more of a moral relativist than I’d like, but a careful reading of the book shows that there are a fair number of contexts in which he believes some moral tastes produce better results than others.

His advice could be interpreted as encouraging us to replace our existing notions of “the enemy” with Manichaeans. Would his advice polarize societies into Manichaeans and non-Manichaeans? Maybe, but at least the non-Manichaeans would have a decent understanding of why Manichaeans disagreed with them.

The book also includes arguments that group selection played an important role in human evolution, and that an increase in cooperation (group-mindedness, somewhat like the cooperation among bees) had to evolve before language could become valuable enough to evolve. This is an interesting but speculative alternative to the common belief that language was the key development that differentiated humans from other apes.

The Honor Code

Book review: The Honor Code: How Moral Revolutions Happen by Kwame Anthony Appiah.

This book argues that moral changes such as the abolition of dueling, slavery, and foot-binding are not the result of new understanding of why they are undesirable. They result from changes in how they affect the honor (or status) of the groups that have the power to create the change.

Dueling was mostly associated with a hereditary class of gentlemen, and feeling a responsibility to duel was a symbol of that status. When the upper class broadened into a much less well defined group that included successful businessmen, and society became more egalitarian, demonstrating membership in the hereditary elite lost enough value that the costs of dueling outweighed its prestige.

Slave-owners increasingly portrayed the labor that slaves performed in a way that also implied the work of British manual laborers deserved low status, and the rising resentment and political power of that laboring class created a movement to abolish slavery.

Chinese elites could not ignore the opinions of elites in other nations whose military and technological might made it hard for China to dismiss them as inferior, and this altered the class of people whose respect the Chinese elites wanted.

These are plausible stories, backed by a modest amount of evidence. I don’t know of any strong explanations that compete with this. But I don’t get the impression that the author tried as hard as I would like to find evidence for competing explanations. For instance, he presents some partial evidence to the effect that Britain abolished slavery at a time when slavery was increasingly profitable. But I didn’t see any consideration of the costs of keeping slaves from running away, which I expect were increasing due to improved long-distance transportation such as railroads. He lists references which might constitute authoritative support for his position, but it looks like it would be time-consuming to verify that.

Whether this book can help spark new moral revolutions is unclear, but it should make our efforts to do so more cost-effective, if only by reducing the effort put into ineffective approaches.

Book review: Moral Machines: Teaching Robots Right from Wrong by Wendell Wallach and Colin Allen.

This book combines the ideas of leading commentators on ethics, methods of implementing AI, and the risks of AI, into a set of ideas on how machines ought to achieve ethical behavior.

The book mostly provides an accurate survey of what those commentators agree and disagree about. But there’s enough disagreement that we need some insights into which views are correct (especially about theories of ethics) in order to produce useful advice to AI designers, and the authors don’t have those kinds of insights.

The book focuses mainly on near-term risks from software that is much less intelligent than humans, and is complacent about the risks of superhuman AI.

The implications of superhuman AIs for theories of ethics ought to illuminate flaws in those theories that aren’t obvious when considering purely human-level intelligence. For example, the authors mention an argument that any AI would value humans for their diversity of ideas, which would help AIs to search the space of possible ideas. This seems to have serious problems, such as: what stops an AI from fiddling with human minds to increase their diversity? Yet the authors are too focused on human-like minds to imagine an intelligence that would do that.

Their discussion of the advocates of friendly AI seems a bit confused. The authors wonder if those advocates are trying to quell apprehension about AI risks, when I’ve observed pretty consistent efforts by those advocates to create apprehension among AI researchers.

Some of Robin Hanson’s Malthusian-sounding posts prompted me to wonder how we can create a future that is better than the repugnant conclusion. It struck me that there’s no reason to accept the assumption that increasing the number of living minds to the limit of available resources implies that the quality of the lives those minds live will decrease to where they’re barely worth living.

If we imagine the minds to be software, then a mind that barely has enough resources to live could be designed so that it is very happy with the cpu cycles or negentropy it gets, even if those are negligible compared to what other minds get. Or if there is some need for life to be biological, a variant of hibernation might accomplish the same result.

If this is possible, then what I find repugnant about the repugnant conclusion is that it perpetuates the cruelty of evolution, which produces suffering in beings that have fewer resources than they evolved to use. Any respectable civilization will engineer away the conflict between average utilitarianism and total utilitarianism.

If instead the most important limit on the number of minds is the supply of matter, then there is a tradeoff between more minds and more atoms per mind. But there is no mere addition paradox to create concerns about a repugnant conclusion if the creation of new minds reduces the utility of other minds.

(Douglas W. Portmore has a similar but less ambitious conclusion (pdf)).

Book review: Human Enhancement, edited by Julian Savulescu and Nick Bostrom.

This book starts out with relatively uninteresting articles, and only the last quarter or so of it is worth reading.

Because I agree with most of the arguments for enhancement, I skipped some of the pro-enhancement arguments and tried to read the anti-enhancement arguments carefully. They mostly boil down to the claim that people’s preference for natural things is sufficient to justify broad prohibitions on enhancing human bodies and human nature. That isn’t enough of an argument to deserve as much discussion as it gets.

A few of the concerns discussed by advocates of enhancement are worth more thought. The question of whether unenhanced humans would retain political equality and rights enables us to imagine dystopian results of enhancement. Daniel Walker provides a partly correct analysis of conditions under which enhanced beings ought to paternalistically restrict the choices and political power of the unenhanced. But he’s overly complacent in assuming that the paternalists will have the interests of the unenhanced at heart. The biggest problem with paternalism to date is that it’s done by people who are less thoughtful about the interests of the people they’re controlling than they are about finding ways to serve their own self-interest. It is possible that enhanced beings will be perfect altruists, but that is far from being a natural consequence of enhancement.

The final chapter points out the risks of being overconfident of our ability to improve on nature. They describe questions we should ask about why evolution would have produced a result that is different from what we want. One example that they give suggests they remain overconfident – they repeat a standard claim about the human appendix being a result of evolution getting stuck in a local optimum. Recent evidence suggests that the appendix performs a valuable function in recovery from diarrhea (still a major cause of death in places) and harm from appendicitis seems rare outside of industrialized nations (maybe due to differences in dietary fiber?).

The most new and provocative ideas in the book have little to do with the medical enhancements that the title evokes. Robin Hanson’s call for mechanisms to make people more truthful probably won’t gather much support, as people are clever about finding objections to any specific method that would be effective. Still, asking the question the way he does may encourage some people to think more clearly about their goals.

Nick Bostrom and Anders Sandberg describe an interesting (original?) hypothesis about why placebos (sometimes) work. It involves signaling that there is relatively little need to conserve the body’s resources for fighting future injuries and diseases. Could this understanding lead to insights about how to more directly and reliably trigger this effect? More effective placebos have been proposed as jokes. Why is it so unusual to ask about serious research into this subject?

Book review: Good and Real: Demystifying Paradoxes from Physics to Ethics by Gary Drescher.

This book tries to derive ought from is. The more important steps explain why we should choose the one-box answer to Newcomb’s problem, then argue that the same reasoning should provide better support for Hofstadter’s idea of superrationality than has previously been demonstrated, and that superrationality can be generalized to provide morality. He comes close to the right approach to these problems, and I agree with the conclusions he reaches, but I don’t find his reasoning convincing.

He uses a concept which he calls a subjunctive relation, which is intermediate between a causal relation and a correlation, to explain why a choice that seems to happen after its goal has been achieved can be rational. That is the part of his argument that I find unconvincing. The subjunctive relation behaves a lot like a causal relation, and I can’t figure out why it should be treated as more than a correlation unless it’s equivalent to a causal relation.

I say that the one-box choice in Newcomb’s problem causes money to be placed in the box, and that superrationality and morality should be followed for similar reasons involving counterintuitive types of causality. It looks like Drescher is reluctant to accept this type of causality because he doesn’t think clearly enough about the concept of choice. It often appears that he is using something like a folk-psychology notion of choice that appears incompatible with the assumptions of Newcomb’s problem. I expect that with a sufficiently sophisticated concept of choice, Newcomb’s problem and similar situations cease to seem paradoxical. That concept should reflect a counterintuitive difference between the time at which a choice is made and the time at which it is introspectively observed as being irrevocable. When describing Kavka’s toxin problem, he talks more clearly about the concept of choice, and almost finds a better answer than subjunctive relations, but backs off without adequate analysis.
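
To make the arithmetic behind one-boxing concrete, here is a minimal sketch (my illustration, not Drescher’s; it uses the standard $1,000,000 / $1,000 payoffs and treats the prediction as correlated with the choice, which is roughly what his subjunctive relation is meant to license):

```python
# Expected payoffs in Newcomb's problem when the prediction is treated as
# correlated with the choice.  p is the predictor's assumed accuracy.

def expected_value(choice: str, p: float) -> float:
    if choice == "one-box":
        # With probability p the predictor foresaw one-boxing and filled the opaque box.
        return p * 1_000_000
    # With probability p the predictor foresaw two-boxing and left the opaque box empty.
    return p * 1_000 + (1 - p) * (1_000_000 + 1_000)

for p in (0.5, 0.9, 0.99):
    print(p, expected_value("one-box", p), expected_value("two-box", p))

# One-boxing pulls ahead once p exceeds about 0.5005, so even a mildly reliable
# predictor makes the one-box choice the better bet on this reading.
```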

The book also has a long section explaining why the Everett interpretation of quantum mechanics is better than the Copenhagen interpretation. The beginning and end of this section are good, but there’s a rather dense section in the middle that takes much effort to follow without adding much.

Book review: Why Humans Cooperate: A Cultural and Evolutionary Explanation by Joseph Henrich and Natalie Henrich.
This book provides a clear and informative summary of the evolutionary theories that explain why people cooperate (but few novel ideas), and some good but unexciting evidence that provides a bit of support for the theories.
One nice point they make is that unconditional altruism discourages cooperation – it’s important for a society to have some sort of reciprocity (possibly indirect) to prevent non-cooperators from outcompeting cooperators.
The one surprising fact uncovered in their field studies is that people are more generous in the Dictator Game than in the Ultimatum Game (games where one player decides how to divide money between himself and another player; in the Ultimatum Game the second player can reject the division, in which case neither gets anything). It appears that the Ultimatum Game encourages people to think in terms of business-like interactions, but in the Dictator Game a noncompetitive mode of thought dominates.
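
For readers unfamiliar with the games, here is a minimal sketch of their payoff rules (my illustration; the amounts are arbitrary):

```python
# Payoff rules for the Dictator and Ultimatum Games (illustrative amounts only).

def dictator_game(pot: int, offer: int) -> tuple[int, int]:
    """Player 1 divides the pot; player 2 has no say."""
    return pot - offer, offer

def ultimatum_game(pot: int, offer: int, accept: bool) -> tuple[int, int]:
    """Player 1 proposes a division; if player 2 rejects it, neither gets anything."""
    return (pot - offer, offer) if accept else (0, 0)

print(dictator_game(100, 40))          # (60, 40): the dictator's generosity is the only variable
print(ultimatum_game(100, 10, False))  # (0, 0): a rejected low offer wipes out both payoffs
```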

Book review: Beyond AI: Creating the Conscience of the Machine by J. Storrs Hall
The first two thirds of this book survey current knowledge of AI and make some guesses about when and how it will take off. This part is more eloquent than most books on similar subjects, and its somewhat unusual perspective makes it worth reading if you are reading several books on the subject. But ease of reading is the only criterion by which this section stands out as better than competing books.
The last five chapters are surprisingly good, and should shame most professional philosophers, whose writings by comparison are a waste of time.
His chapter on consciousness, qualia, and related issues is more concise and persuasive than anything else I’ve read on these subjects. It’s unlikely to change the opinions of people who have already thought about these subjects, but it’s an excellent place for people who are unfamiliar with them to start.
His discussion of ethics in terms of game theory and evolutionary pressures is an excellent way to frame ethical questions.
My biggest disappointment was that he starts to recognize a possibly important risk of AI when he says “disparities among the abilities of AIs … could negate the evolutionary pressure to reciprocal altruism”, but then seems to dismiss that thoughtlessly (“The notion of one single AI taking off and obtaining hegemony over the whole world by its own efforts is ludicrous”).
He probably has semi-plausible grounds for dismissing some of the scenarios of this nature that have been proposed (e.g. the speed at which some people imagine an AI would take off is improbable). But if AIs with sufficiently general purpose intelligence enhance their intelligence at disparate rates for long enough, the results would render most of the book’s discussion of ethics irrelevant. The time it took humans to accumulate knowledge didn’t give Neanderthals much opportunity to adapt. Would the result have been different if Neanderthals had learned to trade with humans? The answer is not obvious, and probably depends on Neanderthal learning abilities in ways that I don’t know how to analyze.
Also, his arguments for optimism aren’t quite as strong as he thinks. His point that career criminals are generally of low intelligence is reassuring if the number of criminals is all that matters. But when the harm done by one relatively smart criminal can be very large (e.g. Mao), it’s hard to say that the number of criminals is all that matters.
Here’s a nice quote from Mencken, part of which the book quotes:

Moral certainty is always a sign of cultural inferiority. The more uncivilized the man, the surer he is that he knows precisely what is right and what is wrong. All human progress, even in morals, has been the work of men who have doubted the current moral values, not of men who have whooped them up and tried to enforce them. The truly civilized man is always skeptical and tolerant, in this field as in all others. His culture is based on ‘I am not too sure.’

Another interesting tidbit is the anecdote that H.G. Wells predicted in 1907 that flying machines would be built. In spite of knowing a lot about attempts to build them, he wasn’t aware that the Wright brothers had succeeded in 1903.
If an AI had started running in 2003 and had accumulated the knowledge of a 4-year-old human, with the ability to continue learning at human or faster speeds, would we have noticed? Or would the reports we see about it sound too much like the reports of failed AIs for us to pay attention?

Book review: Reasons and Persons by Derek Parfit.
This book does a very good job of pointing out inconsistencies in common moral intuitions, and does a very mixed job of analyzing how to resolve them.
The largest section of the book deals with personal identity, using a bit of neuroscience plus scenarios such as a Star Trek transporter to show that nonreductionist approaches produce conclusions which are strange enough to disturb most people. I suspect this analysis was fairly original when it was written, but I’ve seen most of the ideas elsewhere. His analysis is more compelling than most other versions, but it’s not concise enough for many people to read it.
The most valuable part of the book is the last section, weighing conflicts of interest between actual people and people who could potentially exist in the future. His description of the mere addition paradox convinced me that it’s harder than I thought to specify plausible beliefs which don’t lead to the Repugnant Conclusion (i.e. that some very large number of people with lives barely worth living can be a morally better result than some smaller number of very happy people). He ends by concluding he hasn’t found a way to resolve the conflicts between the principles he thinks morality ought to satisfy.
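A toy calculation (my numbers, not Parfit’s) shows why maximizing total welfare points toward the Repugnant Conclusion while maximizing average welfare does not:

```python
# Compare a small, very happy population with a vastly larger population of
# lives barely worth living (illustrative numbers only).

def total_welfare(population: int, welfare_per_person: float) -> float:
    return population * welfare_per_person

def average_welfare(population: int, welfare_per_person: float) -> float:
    return welfare_per_person

happy_world = (10_000, 100.0)            # total 1,000,000, average 100
crowded_world = (1_000_000_000, 0.01)    # total 10,000,000, average 0.01

# Total utilitarianism ranks the crowded world higher; average utilitarianism
# ranks the happy world higher.  Parfit's mere addition argument reaches the
# crowded world through a chain of steps that each seem acceptable.
print(total_welfare(*happy_world), total_welfare(*crowded_world))
print(average_welfare(*happy_world), average_welfare(*crowded_world))
```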
It appears that if he had applied the critical analysis that makes up most of the book to the principle of impersonal ethics, he would see signs that his dilemma results from trying to satisfy incompatible intuitions. Human desire for ethical rules that are more impersonal is widespread when the changes are close to Pareto improvements, but human intuition seems to be generally incompatible with impersonal ethical rules that are as far from Pareto improvements as the Repugnant Conclusion appears to be. Thus it appears Parfit could only resolve the dilemma by finding a source of morality that transcends human intuition and logical consistency (he wisely avoids looking for non-human sources of morality, but intuition doesn’t seem quite the right way to find a human source) or by resolving the conflicting intuitions people seem to have about impersonal ethics.
The most disappointing part of the book is the argument that consequentialism is self-defeating. The critical part of his argument involves a scenario where a mother must choose between saving her child and saving two strangers. His conclusion depends on an assumption about the special relationship between parent and child which consequentialists have no obvious obligation to agree with. He isn’t clear enough about what that assumption is for me to figure out why we disagree.
I find it especially annoying that the book’s index only covers names, since it’s a long book whose subjects aren’t simple enough for me to fully remember.