Emotion-aware music tower blocks (EmoMTB ): an intelligent audiovisual interface for music discovery and recommendation.

IF 3.6 3区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE International Journal of Multimedia Information Retrieval Pub Date : 2023-01-01 Epub Date: 2023-06-02 DOI:10.1007/s13735-023-00275-8
Alessandro B Melchiorre, David Penz, Christian Ganhör, Oleg Lesota, Vasco Fragoso, Florian Fritzl, Emilia Parada-Cabaleiro, Franz Schubert, Markus Schedl
{"title":"Emotion-aware music tower blocks (EmoMTB ): an intelligent audiovisual interface for music discovery and recommendation.","authors":"Alessandro B Melchiorre, David Penz, Christian Ganhör, Oleg Lesota, Vasco Fragoso, Florian Fritzl, Emilia Parada-Cabaleiro, Franz Schubert, Markus Schedl","doi":"10.1007/s13735-023-00275-8","DOIUrl":null,"url":null,"abstract":"<p><p>Music listening has experienced a sharp increase during the last decade thanks to music streaming and recommendation services. While they offer text-based search functionality and provide recommendation lists of remarkable utility, their typical mode of interaction is unidimensional, i.e., they provide lists of consecutive tracks, which are commonly inspected in sequential order by the user. The user experience with such systems is heavily affected by cognition biases (e.g., position bias, human tendency to pay more attention to first positions of ordered lists) as well as algorithmic biases (e.g., popularity bias, the tendency of recommender systems to overrepresent popular items). This may cause dissatisfaction among the users by disabling them to find novel music to enjoy. In light of such systems and biases, we propose an intelligent audiovisual music exploration system named EmoMTB . It allows the user to browse the entirety of a given collection in a free nonlinear fashion. The navigation is assisted by a set of personalized emotion-aware recommendations, which serve as starting points for the exploration experience. EmoMTB  adopts the metaphor of a city, in which each track (visualized as a colored cube) represents one floor of a building. Highly similar tracks are located in the same building; moderately similar ones form neighborhoods that mostly correspond to genres. Tracks situated between distinct neighborhoods create a gradual transition between genres. Users can navigate this music city using their smartphones as control devices. They can explore districts of well-known music or decide to leave their comfort zone. In addition, EmoMTB   integrates an emotion-aware music recommendation system that re-ranks the list of suggested starting points for exploration according to the user's self-identified emotion or the collective emotion expressed in EmoMTB 's Twitter channel. Evaluation of EmoMTB   has been carried out in a threefold way: by quantifying the homogeneity of the clustering underlying the construction of the city, by measuring the accuracy of the emotion predictor, and by carrying out a web-based survey composed of open questions to obtain qualitative feedback from users.</p>","PeriodicalId":48501,"journal":{"name":"International Journal of Multimedia Information Retrieval","volume":null,"pages":null},"PeriodicalIF":3.6000,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10238318/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Multimedia Information Retrieval","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s13735-023-00275-8","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2023/6/2 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

Music listening has experienced a sharp increase during the last decade thanks to music streaming and recommendation services. While they offer text-based search functionality and provide recommendation lists of remarkable utility, their typical mode of interaction is unidimensional, i.e., they provide lists of consecutive tracks, which are commonly inspected in sequential order by the user. The user experience with such systems is heavily affected by cognition biases (e.g., position bias, human tendency to pay more attention to first positions of ordered lists) as well as algorithmic biases (e.g., popularity bias, the tendency of recommender systems to overrepresent popular items). This may cause dissatisfaction among the users by disabling them to find novel music to enjoy. In light of such systems and biases, we propose an intelligent audiovisual music exploration system named EmoMTB . It allows the user to browse the entirety of a given collection in a free nonlinear fashion. The navigation is assisted by a set of personalized emotion-aware recommendations, which serve as starting points for the exploration experience. EmoMTB  adopts the metaphor of a city, in which each track (visualized as a colored cube) represents one floor of a building. Highly similar tracks are located in the same building; moderately similar ones form neighborhoods that mostly correspond to genres. Tracks situated between distinct neighborhoods create a gradual transition between genres. Users can navigate this music city using their smartphones as control devices. They can explore districts of well-known music or decide to leave their comfort zone. In addition, EmoMTB   integrates an emotion-aware music recommendation system that re-ranks the list of suggested starting points for exploration according to the user's self-identified emotion or the collective emotion expressed in EmoMTB 's Twitter channel. Evaluation of EmoMTB   has been carried out in a threefold way: by quantifying the homogeneity of the clustering underlying the construction of the city, by measuring the accuracy of the emotion predictor, and by carrying out a web-based survey composed of open questions to obtain qualitative feedback from users.

Abstract Image

Abstract Image

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
情绪感知音乐塔块(EmoMTB):用于音乐发现和推荐的智能视听界面。
在过去的十年里,由于音乐流媒体和推荐服务,音乐收听量急剧增加。虽然它们提供基于文本的搜索功能并提供非常有用的推荐列表,但它们的典型交互模式是一维的,即它们提供连续曲目的列表,用户通常按顺序检查这些曲目。这种系统的用户体验在很大程度上受到认知偏见(例如,位置偏见,人类更关注有序列表的第一位置的倾向)以及算法偏见(例如流行度偏见,推荐系统过度表达流行项目的倾向)的影响。这可能会使用户无法找到新颖的音乐来欣赏,从而引起用户的不满。鉴于这些系统和偏见,我们提出了一个名为EmoMTB的智能视听音乐探索系统。它允许用户以自由的非线性方式浏览给定集合的全部内容。导航由一组个性化的情感感知推荐来辅助,这些推荐是探索体验的起点。EmoMTB采用了城市的隐喻,其中每条轨道(可视化为彩色立方体)代表一栋建筑的一层。高度相似的轨道位于同一栋建筑内;适度相似的构成了大部分与流派相对应的邻域。位于不同街区之间的曲目创造了流派之间的逐渐过渡。用户可以使用智能手机作为控制设备在这座音乐城市中导航。他们可以探索知名音乐区,也可以决定离开自己的舒适区。此外,EmoMTB集成了一个情绪感知的音乐推荐系统,该系统根据用户的自我识别情绪或Emo山地车推特频道中表达的集体情绪,对建议的探索起点列表进行重新排序。EmoMTB的评估有三种方式:通过量化城市建设背后集群的同质性,通过测量情绪预测因子的准确性,以及通过进行由开放问题组成的网络调查,从用户那里获得定性反馈。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
7.80
自引率
5.40%
发文量
36
期刊介绍: Aims and Scope The International Journal of Multimedia Information Retrieval (IJMIR) is a scholarly archival journal publishing original, peer-reviewed research contributions. Its editorial board strives to present the most important research results in areas within the field of multimedia information retrieval. Core areas include exploration, search, and mining in general collections of multimedia consisting of information from the WWW to scientific imaging to personal archives. Comprehensive review and survey papers that offer up new insights, and lay the foundations for further exploratory and experimental work, are also relevant. Relevant topics include Image and video retrieval - theory, algorithms, and systems Social media interaction and retrieval - collaborative filtering, social voting and ranking Music and audio retrieval - theory, algorithms, and systems Scientific and Bio-imaging - MRI, X-ray, ultrasound imaging analysis and retrieval Semantic learning - visual concept detection, object recognition, and tag learning Exploration of media archives - browsing, experiential computing Interfaces - multimedia exploration, visualization, query and retrieval Multimedia mining - life logs, WWW media mining, pervasive media analysis Interactive search - interactive learning and relevance feedback in multimedia retrieval Distributed and high performance media search - efficient and very large scale search Applications - preserving cultural heritage, 3D graphics models, etc. Editorial Policies: We aim for a fast decision time (less than 4 months for the initial decision) There are no page charges in IJMIR. Papers are published on line in advance of print publication. Academic, industrial researchers, and practitioners involved with multimedia search, exploration, and mining will find IJMIR to be an essential source for important results in the field.
期刊最新文献
LSECA: local semantic enhancement and cross aggregation for video-text retrieval Mual: enhancing multimodal sentiment analysis with cross-modal attention and difference loss Human action recognition using an optical flow-gated recurrent neural network Domain-specific image captioning: a comprehensive review Multi-knowledge-driven enhanced module for visible-infrared cross-modal person Re-identification
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1