通用人工智能的必要障碍：可纠正性

AI matters Pub Date : 2019-03-20 DOI:10.1145/3362077.3362089

Y. Lo, Chung Yu Woo, Ka Lok Ng

{"title":"通用人工智能的必要障碍：可纠正性","authors":"Y. Lo, Chung Yu Woo, Ka Lok Ng","doi":"10.1145/3362077.3362089","DOIUrl":null,"url":null,"abstract":"With the rapid pace of advancement in the field of artificial intelligence (AI), this essay aims to accentuate the importance of corrigibility in AI in order to stimulate and catalyze more effort and focus in this research area. We will first introduce the idea of corrigibility with its properties and describe the expected behavior for a corrigible AI. Afterwards, based on the established meaning of corrigibility, we will showcase the importance of corrigibility by going over some modern and near-futuristic examples that are specifically selected to be relatable and foreseeable. Then, we will explore existing methods of establishing corrigibility in agents and their respective limitations, using the reinforcement learning (RL) framework as a proxy framework to artificial general intelligence (AGI). At last, we will identify the central themes of potential research frontiers that we believe would be crucial to boosting quality research output in corrigibility.","PeriodicalId":91445,"journal":{"name":"AI matters","volume":"5 1","pages":"77-84"},"PeriodicalIF":0.0000,"publicationDate":"2019-03-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1145/3362077.3362089","citationCount":"3","resultStr":"{\"title\":\"The necessary roadblock to artificial general intelligence: corrigibility\",\"authors\":\"Y. Lo, Chung Yu Woo, Ka Lok Ng\",\"doi\":\"10.1145/3362077.3362089\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the rapid pace of advancement in the field of artificial intelligence (AI), this essay aims to accentuate the importance of corrigibility in AI in order to stimulate and catalyze more effort and focus in this research area. We will first introduce the idea of corrigibility with its properties and describe the expected behavior for a corrigible AI. Afterwards, based on the established meaning of corrigibility, we will showcase the importance of corrigibility by going over some modern and near-futuristic examples that are specifically selected to be relatable and foreseeable. Then, we will explore existing methods of establishing corrigibility in agents and their respective limitations, using the reinforcement learning (RL) framework as a proxy framework to artificial general intelligence (AGI). At last, we will identify the central themes of potential research frontiers that we believe would be crucial to boosting quality research output in corrigibility.\",\"PeriodicalId\":91445,\"journal\":{\"name\":\"AI matters\",\"volume\":\"5 1\",\"pages\":\"77-84\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-03-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://sci-hub-pdf.com/10.1145/3362077.3362089\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"AI matters\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3362077.3362089\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"AI matters","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3362077.3362089","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

摘要

随着人工智能(AI)领域的快速发展，本文旨在强调可纠正性在AI中的重要性，以激发和催化更多的努力和关注这一研究领域。我们将首先介绍可纠错性及其属性的概念，并描述可纠错人工智能的预期行为。之后，基于可纠错性的既定意义，我们将通过一些现代和近未来的例子来展示可纠错性的重要性，这些例子是专门挑选出来的，具有相关性和可预见性。然后，我们将使用强化学习(RL)框架作为人工通用智能(AGI)的代理框架，探索在智能体中建立可纠正性的现有方法及其各自的局限性。最后，我们将确定潜在研究前沿的中心主题，我们认为这些主题对于提高质量研究成果的可纠正性至关重要。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

The necessary roadblock to artificial general intelligence: corrigibility

With the rapid pace of advancement in the field of artificial intelligence (AI), this essay aims to accentuate the importance of corrigibility in AI in order to stimulate and catalyze more effort and focus in this research area. We will first introduce the idea of corrigibility with its properties and describe the expected behavior for a corrigible AI. Afterwards, based on the established meaning of corrigibility, we will showcase the importance of corrigibility by going over some modern and near-futuristic examples that are specifically selected to be relatable and foreseeable. Then, we will explore existing methods of establishing corrigibility in agents and their respective limitations, using the reinforcement learning (RL) framework as a proxy framework to artificial general intelligence (AGI). At last, we will identify the central themes of potential research frontiers that we believe would be crucial to boosting quality research output in corrigibility.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

AI matters

自引率

0.00%

发文量

期刊最新文献

Conference Reports Welcome to AI Matters 9(3) AI Policy Matters SIGAI Annual Report: July 1 2022 --- August 30 2023 Conference Reports