{"title":"Practice Makes Perfect: Towards Learned Path Planning for Robotic Musicians using Deep Reinforcement Learning","authors":"Lamtharn Hantrakul, Zachary Kondak, Gil Weinberg","doi":"10.1145/3212721.3212839","DOIUrl":null,"url":null,"abstract":"When a pianist effortlessly glides across the keyboard during an improvised solo, the musician is executing a series of movements informed by years of practice ingrained with musical knowledge. This paper proposes an analogous approach that enables Robotic Musicians to learn about its degrees of freedom and physical constraints through \"practice\" in the form of Deep Reinforcement Learning. We use a Deep Q Network (DQN) to train a virtual agent representing a real 4-armed robotic musician, to motion-plan the optimal sequence of movements given a musical sequence through a learned strategy instead of a search strategy. Early results from our proof-of-concept system demonstrate that DRL can achieve optimal control of a musical agent, learning a form of bi-manual coordination in the process.","PeriodicalId":330867,"journal":{"name":"Proceedings of the 5th International Conference on Movement and Computing","volume":"2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-06-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 5th International Conference on Movement and Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3212721.3212839","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
When a pianist effortlessly glides across the keyboard during an improvised solo, the musician is executing a series of movements informed by years of practice ingrained with musical knowledge. This paper proposes an analogous approach that enables Robotic Musicians to learn about its degrees of freedom and physical constraints through "practice" in the form of Deep Reinforcement Learning. We use a Deep Q Network (DQN) to train a virtual agent representing a real 4-armed robotic musician, to motion-plan the optimal sequence of movements given a musical sequence through a learned strategy instead of a search strategy. Early results from our proof-of-concept system demonstrate that DRL can achieve optimal control of a musical agent, learning a form of bi-manual coordination in the process.