International Journal of Multimedia Information Retrieval最新文献

英文中文

Deep learning for video-text retrieval: a review 视频文本检索的深度学习:综述

IF 5.6 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

International Journal of Multimedia Information Retrieval

Pub Date : 2023-02-23 DOI: 10.1007/s13735-023-00267-8

Cunjuan Zhu, Qi Jia, Wei Chen, Yanming Guo, Yu Liu

引用次数: 1

End-to-end residual learning-based deep neural network model deployment for human activity recognition 基于端到端残差学习的人类活动识别深度神经网络模型部署

IF 5.6 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

International Journal of Multimedia Information Retrieval

Pub Date : 2023-02-19 DOI: 10.1007/s13735-023-00269-6

Alok Negi, Krishan Kumar

引用次数: 0

Emotion-aware music tower blocks (EmoMTB ): an intelligent audiovisual interface for music discovery and recommendation. 情绪感知音乐塔块（EmoMTB）：用于音乐发现和推荐的智能视听界面。

IF 3.6 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

International Journal of Multimedia Information Retrieval

Pub Date : 2023-01-01 Epub Date: 2023-06-02 DOI: 10.1007/s13735-023-00275-8

Alessandro B Melchiorre, David Penz, Christian Ganhör, Oleg Lesota, Vasco Fragoso, Florian Fritzl, Emilia Parada-Cabaleiro, Franz Schubert, Markus Schedl

Music listening has experienced a sharp increase during the last decade thanks to music streaming and recommendation services. While they offer text-based search functionality and provide recommendation lists of remarkable utility, their typical mode of interaction is unidimensional, i.e., they provide lists of consecutive tracks, which are commonly inspected in sequential order by the user. The user experience with such systems is heavily affected by cognition biases (e.g., position bias, human tendency to pay more attention to first positions of ordered lists) as well as algorithmic biases (e.g., popularity bias, the tendency of recommender systems to overrepresent popular items). This may cause dissatisfaction among the users by disabling them to find novel music to enjoy. In light of such systems and biases, we propose an intelligent audiovisual music exploration system named EmoMTB . It allows the user to browse the entirety of a given collection in a free nonlinear fashion. The navigation is assisted by a set of personalized emotion-aware recommendations, which serve as starting points for the exploration experience. EmoMTB adopts the metaphor of a city, in which each track (visualized as a colored cube) represents one floor of a building. Highly similar tracks are located in the same building; moderately similar ones form neighborhoods that mostly correspond to genres. Tracks situated between distinct neighborhoods create a gradual transition between genres. Users can navigate this music city using their smartphones as control devices. They can explore districts of well-known music or decide to leave their comfort zone. In addition, EmoMTB integrates an emotion-aware music recommendation system that re-ranks the list of suggested starting points for exploration according to the user's self-identified emotion or the collective emotion expressed in EmoMTB 's Twitter channel. Evaluation of EmoMTB has been carried out in a threefold way: by quantifying the homogeneity of the clustering underlying the construction of the city, by measuring the accuracy of the emotion predictor, and by carrying out a web-based survey composed of open questions to obtain qualitative feedback from users.

在过去的十年里，由于音乐流媒体和推荐服务，音乐收听量急剧增加。虽然它们提供基于文本的搜索功能并提供非常有用的推荐列表，但它们的典型交互模式是一维的，即它们提供连续曲目的列表，用户通常按顺序检查这些曲目。这种系统的用户体验在很大程度上受到认知偏见（例如，位置偏见，人类更关注有序列表的第一位置的倾向）以及算法偏见（例如流行度偏见，推荐系统过度表达流行项目的倾向）的影响。这可能会使用户无法找到新颖的音乐来欣赏，从而引起用户的不满。鉴于这些系统和偏见，我们提出了一个名为EmoMTB的智能视听音乐探索系统。它允许用户以自由的非线性方式浏览给定集合的全部内容。导航由一组个性化的情感感知推荐来辅助，这些推荐是探索体验的起点。EmoMTB采用了城市的隐喻，其中每条轨道（可视化为彩色立方体）代表一栋建筑的一层。高度相似的轨道位于同一栋建筑内；适度相似的构成了大部分与流派相对应的邻域。位于不同街区之间的曲目创造了流派之间的逐渐过渡。用户可以使用智能手机作为控制设备在这座音乐城市中导航。他们可以探索知名音乐区，也可以决定离开自己的舒适区。此外，EmoMTB集成了一个情绪感知的音乐推荐系统，该系统根据用户的自我识别情绪或Emo山地车推特频道中表达的集体情绪，对建议的探索起点列表进行重新排序。EmoMTB的评估有三种方式：通过量化城市建设背后集群的同质性，通过测量情绪预测因子的准确性，以及通过进行由开放问题组成的网络调查，从用户那里获得定性反馈。

{"title":"Emotion-aware music tower blocks (EmoMTB ): an intelligent audiovisual interface for music discovery and recommendation.","authors":"Alessandro B Melchiorre, David Penz, Christian Ganhör, Oleg Lesota, Vasco Fragoso, Florian Fritzl, Emilia Parada-Cabaleiro, Franz Schubert, Markus Schedl","doi":"10.1007/s13735-023-00275-8","DOIUrl":"10.1007/s13735-023-00275-8","url":null,"abstract":"<p><p>Music listening has experienced a sharp increase during the last decade thanks to music streaming and recommendation services. While they offer text-based search functionality and provide recommendation lists of remarkable utility, their typical mode of interaction is unidimensional, i.e., they provide lists of consecutive tracks, which are commonly inspected in sequential order by the user. The user experience with such systems is heavily affected by cognition biases (e.g., position bias, human tendency to pay more attention to first positions of ordered lists) as well as algorithmic biases (e.g., popularity bias, the tendency of recommender systems to overrepresent popular items). This may cause dissatisfaction among the users by disabling them to find novel music to enjoy. In light of such systems and biases, we propose an intelligent audiovisual music exploration system named EmoMTB . It allows the user to browse the entirety of a given collection in a free nonlinear fashion. The navigation is assisted by a set of personalized emotion-aware recommendations, which serve as starting points for the exploration experience. EmoMTB adopts the metaphor of a city, in which each track (visualized as a colored cube) represents one floor of a building. Highly similar tracks are located in the same building; moderately similar ones form neighborhoods that mostly correspond to genres. Tracks situated between distinct neighborhoods create a gradual transition between genres. Users can navigate this music city using their smartphones as control devices. They can explore districts of well-known music or decide to leave their comfort zone. In addition, EmoMTB integrates an emotion-aware music recommendation system that re-ranks the list of suggested starting points for exploration according to the user's self-identified emotion or the collective emotion expressed in EmoMTB 's Twitter channel. Evaluation of EmoMTB has been carried out in a threefold way: by quantifying the homogeneity of the clustering underlying the construction of the city, by measuring the accuracy of the emotion predictor, and by carrying out a web-based survey composed of open questions to obtain qualitative feedback from users.</p>","PeriodicalId":48501,"journal":{"name":"International Journal of Multimedia Information Retrieval","volume":"12 1","pages":"13"},"PeriodicalIF":3.6,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10238318/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9587418","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Tri-RAT: optimizing the attention scores for image captioning Tri-RAT:优化图像字幕的注意分数

IF 5.6 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

International Journal of Multimedia Information Retrieval

Pub Date : 2022-10-06 DOI: 10.1007/s13735-022-00260-7

You Yang, Yongzhi An, Juntao Hu, Longyue Pan

引用次数: 1

Gender classification from face images using central difference convolutional networks 基于中心差分卷积网络的人脸性别分类

IF 5.6 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

International Journal of Multimedia Information Retrieval

Pub Date : 2022-09-22 DOI: 10.1007/s13735-022-00259-0

Mohammadreza Sheikh Fathollahi, Rezvan Heidari

引用次数: 2

MHA-WoML: Multi-head attention and Wasserstein-OT for few-shot learning MHA-WoML:多头注意，Wasserstein-OT:少镜头学习

IF 5.6 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

International Journal of Multimedia Information Retrieval

Pub Date : 2022-09-21 DOI: 10.1007/s13735-022-00254-5

Junyan Yang, Jie Jiang, Yanming Guo

引用次数: 1

Similar interior coordination image retrieval with multi-view features 基于多视点特征的相似内部协调图像检索

IF 5.6 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

International Journal of Multimedia Information Retrieval

Pub Date : 2022-08-26 DOI: 10.1007/s13735-022-00247-4

Ren Togo, Yuki Honma, Maiku Abe, Takahiro Ogawa, M. Haseyama

引用次数: 1

A literature review and perspectives in deepfakes: generation, detection, and applications 深度伪造的文献综述与展望:生成、检测和应用

IF 5.6 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

International Journal of Multimedia Information Retrieval

Pub Date : 2022-07-23 DOI: 10.1007/s13735-022-00241-w

Deepak Dagar, D. Vishwakarma

引用次数: 5

Cross-domain image retrieval: methods and applications 跨域图像检索:方法和应用

IF 5.6 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

International Journal of Multimedia Information Retrieval

Pub Date : 2022-07-23 DOI: 10.1007/s13735-022-00244-7

Xiaoping Zhou, Xiangyu Han, Haoran Li, Jia Wang, Xun Liang

引用次数: 4

Organ segmentation from computed tomography images using the 3D convolutional neural network: a systematic review 使用三维卷积神经网络从计算机断层扫描图像中分割器官:系统回顾

IF 5.6 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

International Journal of Multimedia Information Retrieval

Pub Date : 2022-07-16 DOI: 10.1007/s13735-022-00242-9

A. Ilesanmi, Taiwo Ilesanmi, O. P. Idowu, D. Torigian, J. Udupa

引用次数: 2

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

International Journal of Multimedia Information Retrieval

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀