{"id":1221,"date":"2016-12-22T15:44:43","date_gmt":"2016-12-22T23:44:43","guid":{"rendered":"https:\/\/bayesianinvestor.com\/blog\/?p=1221"},"modified":"2023-02-12T09:56:09","modified_gmt":"2023-02-12T17:56:09","slug":"the-ai-safety-landscape","status":"publish","type":"post","link":"https:\/\/bayesianinvestor.com\/blog\/index.php\/2016\/12\/22\/the-ai-safety-landscape\/","title":{"rendered":"The AI Safety Landscape"},"content":{"rendered":"<p><a href=\"http:\/\/www.patheos.com\/blogs\/hallq\/2014\/09\/the-2014-effective-altruism-summit-who-stood-out\/\">Two and a half years ago<\/a>, Eliezer was (somewhat plausibly) complaining that virtually nobody outside of <a href=\"https:\/\/intelligence.org\">MIRI<\/a> was working on AI-related existential risks.<\/p>\n<p>This year (at <a href=\"https:\/\/www.eaglobal.org\/events\/ea-global-2016\/\">EAGlobal<\/a>) <a href=\"https:\/\/eaglobal2016.sched.com\/event\/7rCF\/miri-using-machine-learning-to-address-ai-risk\">one of MIRI&#8217;s talks<\/a> was a bit hard to distinguish from an AI safety talk given by <a href=\"https:\/\/eaglobal2016.sched.com\/speaker\/christopher_olah.1vj4egc5\">someone with pretty mainstream AI affiliations<\/a>.<\/p>\n<p>What happened in that time to cause that shift?<\/p>\n<p>A large change was catalyzed by the publication of <a href=\"https:\/\/bayesianinvestor.com\/blog\/index.php\/2014\/07\/28\/superintelligence\/\">Superintelligence<\/a>. I&#8217;ve been mildly disappointed about how little it affected discussions among people who were already interested in the topic. But Superintelligence caused a large change in how many people are willing to express concern over AI risks. That&#8217;s presumably because Superintelligence looks sufficiently academic and neutral to make many people comfortable about citing it, whereas similar arguments by Eliezer\/MIRI didn&#8217;t look sufficiently prestigious within academia.<\/p>\n<p>A smaller part of the change was MIRI shifting its focus somewhat to be more in line with how mainstream machine learning (ML) researchers expect AI to reach human levels.<\/p>\n<p>Also, <a href=\"https:\/\/en.wikipedia.org\/wiki\/OpenAI\">OpenAI<\/a> has been quietly shifting in a more MIRI-like direction (I&#8217;m very unclear on how big a change this is). (Paul Christiano seems to deserve some credit for both the MIRI and OpenAI shifts in strategies.)<\/p>\n<p>Given those changes, it seems like MIRI ought to be able to attract more donations than before. Especially since it has demonstrated <a href=\"https:\/\/bayesianinvestor.com\/blog\/index.php\/2016\/09\/14\/logical-induction\/\">evidence of increasing competence<\/a>, and also because <a href=\"http:\/\/hpmor.com\/\">HPMoR<\/a> seemed to draw significantly more people into the community of people who are interested in MIRI.<\/p>\n<p>MIRI has gotten one big grant from OpenPhilanthropy that it probably couldn&#8217;t have gotten when mainstream AI researchers were treating MIRI&#8217;s concerns as too far-fetched to be worth commenting on. But donations from MIRI&#8217;s usual sources have <a href=\"https:\/\/intelligence.org\/2016\/11\/11\/post-fundraiser-update\/\">stagnated<\/a>. <\/p>\n<p>That pattern suggests that MIRI was previously benefiting from a polarization effect, where the perception of two distinct &#8220;tribes&#8221; (those who care about AI risks versus those who promote AI) energized people to care about &#8220;their tribe&#8221;.<\/p>\n<p>Whereas now there&#8217;s no clear dividing line between MIRI and mainstream researchers. Also, there&#8217;s lots of money going into <a href=\"http:\/\/effective-altruism.com\/ea\/14w\/2017_ai_risk_literature_review_and_charity\/\">other organizations<\/a> that plan to do something about AI safety. (Most of those haven&#8217;t yet articulated enough of a strategy to make me optimistic that that money is well spent. I still endorse the ideas I mentioned last year in <a href=\"https:\/\/bayesianinvestor.com\/blog\/index.php\/2015\/12\/06\/how-much-diversity-of-agi-risk-organizations-is-optimal\/\">How much Diversity of AGI-Risk Organizations is Optimal?<\/a>. I&#8217;m unclear on how much diversity of approaches we&#8217;re getting from the recent proliferation of AI safety organizations.)<\/p>\n<p>That kind of pattern of donations creates perverse incentives to charities to at least market themselves as fighting a powerful group of people, rather than (as the ideal charity should be) addressing a neglected problem. Even if that marketing doesn&#8217;t distort a charity&#8217;s operations, the charity will be tempted to use <a href=\"https:\/\/bayesianinvestor.com\/blog\/index.php\/2005\/01\/31\/alarmism\/\">counterproductive alarmism<\/a>. AI risk organizations have resisted those temptations (at least recently), but it seems risky to tempt them.<\/p>\n<p>That&#8217;s part of why I recently made a modest donation to MIRI, in spite of the uncertainty over the value of their efforts (I had last donated to them in 2009).<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Two and a half years ago, Eliezer was (somewhat plausibly) complaining that virtually nobody outside of MIRI was working on AI-related existential risks. This year (at EAGlobal) one of MIRI&#8217;s talks was a bit hard to distinguish from an AI safety talk given by someone with pretty mainstream AI affiliations. What happened in that time [&hellip;]<\/p>\n","protected":false},"author":2,"featured_media":0,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"jetpack_post_was_ever_published":false,"footnotes":"","jetpack_publicize_message":"","jetpack_is_tweetstorm":false,"jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":true,"jetpack_social_options":{"image_generator_settings":{"template":"highway","enabled":false}}},"categories":[26],"tags":[134,128,155],"class_list":["post-1221","post","type-post","status-publish","format-standard","hentry","category-ai","tag-effective-altruism","tag-existential-risks","tag-miri"],"jetpack_publicize_connections":[],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"jetpack_shortlink":"https:\/\/wp.me\/p80O1l-jH","_links":{"self":[{"href":"https:\/\/bayesianinvestor.com\/blog\/index.php\/wp-json\/wp\/v2\/posts\/1221","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/bayesianinvestor.com\/blog\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/bayesianinvestor.com\/blog\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/bayesianinvestor.com\/blog\/index.php\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/bayesianinvestor.com\/blog\/index.php\/wp-json\/wp\/v2\/comments?post=1221"}],"version-history":[{"count":1,"href":"https:\/\/bayesianinvestor.com\/blog\/index.php\/wp-json\/wp\/v2\/posts\/1221\/revisions"}],"predecessor-version":[{"id":1222,"href":"https:\/\/bayesianinvestor.com\/blog\/index.php\/wp-json\/wp\/v2\/posts\/1221\/revisions\/1222"}],"wp:attachment":[{"href":"https:\/\/bayesianinvestor.com\/blog\/index.php\/wp-json\/wp\/v2\/media?parent=1221"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/bayesianinvestor.com\/blog\/index.php\/wp-json\/wp\/v2\/categories?post=1221"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/bayesianinvestor.com\/blog\/index.php\/wp-json\/wp\/v2\/tags?post=1221"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}