Sungho Lee, Marco A. Martínez-Ramírez, Wei-Hsiang Liao, Stefan Uhlich, Giorgio Fabbro, Kyogu Lee, Yuki Mitsufuji
{"title":"Searching For Music Mixing Graphs: A Pruning Approach","authors":"Sungho Lee, Marco A. Martínez-Ramírez, Wei-Hsiang Liao, Stefan Uhlich, Giorgio Fabbro, Kyogu Lee, Yuki Mitsufuji","doi":"arxiv-2406.01049","DOIUrl":null,"url":null,"abstract":"Music mixing is compositional -- experts combine multiple audio processors to\nachieve a cohesive mix from dry source tracks. We propose a method to reverse\nengineer this process from the input and output audio. First, we create a\nmixing console that applies all available processors to every chain. Then,\nafter the initial console parameter optimization, we alternate between removing\nredundant processors and fine-tuning. We achieve this through differentiable\nimplementation of both processors and pruning. Consequently, we find a sparse\nmixing graph that achieves nearly identical matching quality of the full mixing\nconsole. We apply this procedure to dry-mix pairs from various datasets and\ncollect graphs that also can be used to train neural networks for music mixing\napplications.","PeriodicalId":501178,"journal":{"name":"arXiv - CS - Sound","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Sound","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2406.01049","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Music mixing is compositional -- experts combine multiple audio processors to
achieve a cohesive mix from dry source tracks. We propose a method to reverse
engineer this process from the input and output audio. First, we create a
mixing console that applies all available processors to every chain. Then,
after the initial console parameter optimization, we alternate between removing
redundant processors and fine-tuning. We achieve this through differentiable
implementation of both processors and pruning. Consequently, we find a sparse
mixing graph that achieves nearly identical matching quality of the full mixing
console. We apply this procedure to dry-mix pairs from various datasets and
collect graphs that also can be used to train neural networks for music mixing
applications.