Science and Technology

Further Thoughts on AI Ethics

Posted by Peter on July 9, 2025

Posted in: Artificial Intelligence. Tagged: ethics, existential risks. 1 comment

My recent post Are Intelligent Agents More Ethical? criticized some brief remarks by Scott Sumner.

Sumner made a more sophisticated version of those claims in the second half of this Doom Debate.

His position sounds a lot like the moral realism that has caused many people to be complacent about AI taking over the world. But he’s actually using an analysis that follows Richard Rorty’s rejection of standard moral realism. Which seems to mean there’s a weak sense in which morality can be true, but in a socially and historically contingent fashion. If I understand that correctly, I approve.

Waking up to AGI

Posted by Peter on June 29, 2025

Posted in: Artificial Intelligence, Investing, U.S. Politics. Tagged: existential risks. 2 comments

In key centers of power, there’s an important shift happening now of the Overton Window for AI dangers.

The first sign is a surprising reaction to the book If Anyone Builds It, Everyone Dies: Why Superhuman AI Would Kill Us All by Eliezer Yudkowsky and Nate Soares.

Are Intelligent Agents More Ethical?

Posted by Peter on June 20, 2025

Posted in: Artificial Intelligence, Economics. Tagged: ethics, existential risks. 2 comments

This post is a response to a claim by Scott Sumner in his conversation at LessOnline with Nate Soares, about how ethical we should expect AI’s to be.

Sumner sees a pattern of increasing intelligence causing agents to be increasingly ethical, and sounds cautiously optimistic that such a trend will continue when AIs become smarter than humans. I’m guessing that he’s mainly extrapolating from human trends, but extrapolating from trends in the animal kingdom should produce similar results (e.g. the cooperation between single-celled organisms that gave the world multicellular organisms).

I doubt that my response is very novel, but I haven’t seen clear enough articulation of the ideas in this post.

AI 2027 Thoughts

Posted by Peter on April 25, 2025

Posted in: Artificial Intelligence. Tagged: existential risks. Leave a Comment

AI 2027 portrays two well thought out scenarios for how AI is likely to impact the world toward the end of this decade.

I expect those scenarios will prove to be moderately wrong, but close enough to be scary. I also expect that few people will manage to make forecasts that are significantly more accurate.

Here are some scattered thoughts that came to mind while I read AI 2027.

The authors are fairly pessimistic. I see four key areas where their assumptions seem to lead them to see more danger than do more mainstream experts. They see:

a relatively small capabilities lead being enough for a group to conquer the world
more difficulty of alignment
more difficulty of detecting deception
AI companies being less careful than is necessary

I expect that the authors are being appropriately concerned on about two of these assumptions, and a bit too pessimistic on the others. I’m hesitant to bet on which assumptions belong in which category.

They don’t focus much on justifying those assumptions. That’s likely wise, since prior debates on those topics have not been very productive. Instead, they’ve focused more on when various changes will happen.

This post will focus on aspects of the first two assumptions for which I expect further analysis to be relatively valuable.

Should AIs be Encouraged to Cooperate?

Posted by Peter on April 15, 2025

Posted in: Artificial Intelligence. Tagged: risks. Leave a Comment

I have an idea for a fairly cheap strategy for slightly reducing the risk of catastrophic harm from AI.

This post was inspired by a recent LessWrong post How AI Takeover Might Happen in 2 Years.

That post describes a scenario in which most humans die, mainly because an AI that is first to become powerful enough to attempt world conquest sees risks that other AIs, imperfectly aligned with it, will cause danger soon. This creates a perceived need to engage in a few violent pivotal processes, rather than using a combination of persuasion and negotiation.

The specific scenario in that post seems fairly unrealistic. I see less than a 5% chance that a leading AI within the next 10 years will want to attempt that kind of world conquest. But it’s close enough to being realistic that I want to analyze a class of scenarios that are similar to it.

I want to nudge AIs in the direction of being more willing to cooperate with each other, in order to reduce the risk of such a conquest. Specifically, I want them to negotiate a peaceful outcome instead of using pre-emptive destruction.

AI Markets on Manifold

Posted by Peter on March 14, 2025

Posted in: Artificial Intelligence, Idea Futures. Tagged: existential risks, prediction markets. Leave a Comment

I’ve been creating prediction markets on Manifold in order to better predict AI strategies. Please trade them.

If I get a bit more trading in these markets, I will create more AI-related markets. Stay tuned here, or follow me on Manifold.

Retrospective on my Investing Advice

Posted by Peter on February 16, 2025

Posted in: Artificial Intelligence, Investing. Tagged: bubbles. 1 comment

In 2015, I posted some investing advice for people who only spend a few hours per year on investing.

I intended to review it after five years, but a pandemic distracted me. It looks like this whole decade will end up being too busy for me to write everything that I want to write. But I’ve become able to write faster recently, maybe due to the feeling of urgency about AI transforming the world soon. So I’m getting a few old ideas for blog posts off of my to-do list, in order to be able to devote most of my attention to AI when the world becomes wild.

My advice worked poorly enough that I’m too discouraged to quantify the results.

Medical Windfall Prizes

Posted by Peter on February 6, 2025

Posted in: Artificial Intelligence, Health, Politics. Tagged: best posts, prizes. 1 comment

Summary

AI may produce a windfall surge in government revenues in 5 to 10 years. I want governments to spending a small fraction of that windfall on retroactively rewarding entities in proportion to how they have contributed to medical advances, measured by lives saved and suffering avoided.

Uncontrollable

Posted by Peter on January 23, 2025

Posted in: Artificial Intelligence, Book Reviews. Tagged: existential risks. 2 comments

Book review: Uncontrollable: The Threat of Artificial Superintelligence and the Race to Save the World, by Darren McKee.

This is by far the best introduction to AI risk for people who know little about AI. It’s appropriate for a broader class of readers than most laymen-oriented books.

It was published 14 months ago. In this rapidly changing field, most AI books say something that gets discredited by the time they’re that old. I found no clear example of such obsolescence in Uncontrollable (but read on for a set of controversial examples).

Nearly everything in the book was familiar to me, yet the book prompted me to reflect better, thereby changing my mind modestly – mostly re-examining issues that I’ve been neglecting for the past few years, in light of new evidence.

The rest of this review will focus on complaints, mostly about McKee’s overconfidence. The features that I complain about reduce the value of book by maybe 10% compared to the value of an ideal book. But that ideal book doesn’t exist, and I’m not wise enough to write it.

Genesis

Posted by Peter on December 31, 2024

Posted in: Artificial Intelligence, Book Reviews. Tagged: existential risks, history, war. Leave a Comment

Book review: Genesis: Artificial Intelligence, Hope, and the Human Spirit, by Henry A. Kissinger, Eric Schmidt, and Craig Mundie.

Genesis lends a bit of authority to concerns about AI.

It is a frustrating book. It took more effort for me read than it should have taken. The difficulty stems not from complex subject matter (although the topics are complex), but from a peculiarly alien writing style that transcends mere linguistic differences – though Kissinger’s German intellectual heritage may play a role.

The book’s opening meanders through historical vignettes whose relevance remains opaque, testing my patience before finally addressing AI.

Bayesian Investor Blog

Ramblings of a somewhat libertarian stock market speculator

Further Thoughts on AI Ethics

Waking up to AGI

Are Intelligent Agents More Ethical?

AI 2027 Thoughts

Should AIs be Encouraged to Cooperate?

AI Markets on Manifold

Retrospective on my Investing Advice

Medical Windfall Prizes

Summary

Uncontrollable

Genesis

Recent Posts

Recent Comments

Categories

Summary

Recent Posts

Recent Comments

Tags

Categories