{"title":"Prevailing Research Areas for Music AI in the Era of Foundation Models","authors":"Megan Wei, Mateusz Modrzejewski, Aswin Sivaraman, Dorien Herremans","doi":"arxiv-2409.09378","DOIUrl":null,"url":null,"abstract":"In tandem with the recent advancements in foundation model research, there\nhas been a surge of generative music AI applications within the past few years.\nAs the idea of AI-generated or AI-augmented music becomes more mainstream, many\nresearchers in the music AI community may be wondering what avenues of research\nare left. With regards to music generative models, we outline the current areas\nof research with significant room for exploration. Firstly, we pose the\nquestion of foundational representation of these generative models and\ninvestigate approaches towards explainability. Next, we discuss the current\nstate of music datasets and their limitations. We then overview different\ngenerative models, forms of evaluating these models, and their computational\nconstraints/limitations. Subsequently, we highlight applications of these\ngenerative models towards extensions to multiple modalities and integration\nwith artists' workflow as well as music education systems. Finally, we survey\nthe potential copyright implications of generative music and discuss strategies\nfor protecting the rights of musicians. While it is not meant to be exhaustive,\nour survey calls to attention a variety of research directions enabled by music\nfoundation models.","PeriodicalId":501178,"journal":{"name":"arXiv - CS - Sound","volume":"105 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Sound","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.09378","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
In tandem with the recent advancements in foundation model research, there
has been a surge of generative music AI applications within the past few years.
As the idea of AI-generated or AI-augmented music becomes more mainstream, many
researchers in the music AI community may be wondering what avenues of research
are left. With regards to music generative models, we outline the current areas
of research with significant room for exploration. Firstly, we pose the
question of foundational representation of these generative models and
investigate approaches towards explainability. Next, we discuss the current
state of music datasets and their limitations. We then overview different
generative models, forms of evaluating these models, and their computational
constraints/limitations. Subsequently, we highlight applications of these
generative models towards extensions to multiple modalities and integration
with artists' workflow as well as music education systems. Finally, we survey
the potential copyright implications of generative music and discuss strategies
for protecting the rights of musicians. While it is not meant to be exhaustive,
our survey calls to attention a variety of research directions enabled by music
foundation models.