首页 > 最新文献

MULTIMEDIA '04最新文献

英文 中文
Motion based retrieval of dynamic objects in videos 视频中基于运动的动态对象检索
Pub Date : 2004-10-10 DOI: 10.1145/1027527.1027593
Che-Bin Liu, N. Ahuja
Most existing video retrieval systems use low-level visual features such as color histogram, shape, texture, or motion. In this paper, we explore the use of higher-level motion representation for video retrieval of dynamic objects. We use three motion representations, which together can retrieve a large variety of motion patterns. Our approach works on top of a tracking unit and assumes that each dynamic object has been tracked and circumscribed in a minimal bounding box in each video frame. We represent the motion attributes of each object in terms of changes in the image context of its circumscribing box. The changes are described via motion templates [4], self-similarity plots [3], and image dynamics [9]. Initially, defined criteria of the retrieval process are interactively refined using relevance feedback from the user. Experimental results demonstrate the use of the proposed motion models in retrieving objects undergoing complex motion.
大多数现有的视频检索系统使用低级视觉特征,如颜色直方图、形状、纹理或运动。在本文中,我们探讨了在动态对象的视频检索中使用更高级的运动表示。我们使用三种运动表示,它们一起可以检索到各种各样的运动模式。我们的方法在跟踪单元的基础上工作,并假设每个动态对象在每个视频帧中都被跟踪和限定在最小的边界框中。我们表示每个对象的运动属性在其边界框的图像上下文中的变化。这些变化通过运动模板[4]、自相似图[3]和图像动力学[9]来描述。最初,检索过程的定义标准使用来自用户的相关性反馈交互式地改进。实验结果表明,所提出的运动模型可用于检索经过复杂运动的物体。
{"title":"Motion based retrieval of dynamic objects in videos","authors":"Che-Bin Liu, N. Ahuja","doi":"10.1145/1027527.1027593","DOIUrl":"https://doi.org/10.1145/1027527.1027593","url":null,"abstract":"Most existing video retrieval systems use low-level visual features such as color histogram, shape, texture, or motion. In this paper, we explore the use of higher-level motion representation for video retrieval of dynamic objects. We use three motion representations, which together can retrieve a large variety of motion patterns. Our approach works on top of a tracking unit and assumes that each dynamic object has been tracked and circumscribed in a minimal bounding box in each video frame. We represent the motion attributes of each object in terms of changes in the image context of its circumscribing box. The changes are described via motion templates [4], self-similarity plots [3], and image dynamics [9]. Initially, defined criteria of the retrieval process are interactively refined using relevance feedback from the user. Experimental results demonstrate the use of the proposed motion models in retrieving objects undergoing complex motion.","PeriodicalId":292207,"journal":{"name":"MULTIMEDIA '04","volume":"104 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134395376","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Planet usher: an interactive home movie 星球引座员:一个互动家庭电影
Pub Date : 2004-10-10 DOI: 10.1145/1027527.1027763
Patrick Tarrant
Planet Usher: An Interactive Home Movie is a CD-Rom that draws inspiration and media from my brother's extensive home video archive; an archive that spans twenty years of family events and non-events [1]. What makes even the non-events remarkable, is the fact that my brother went from being a deaf man with a video camera, to a deaf-blind man with an extensive audio-visual archive he can no longer see, nor hear, due to the effects of Usher Syndrome. So Planet Usher offers an exploration of a sustained and enduring amateur practice. It is also a story about disability and the family as they emerge faultingly from the lost archive. And it is a confrontation with the frailties of memory and narrative as they come face to face with the vicissitudes of both interactivity and lived experience.
Planet Usher: An Interactive Home Movie是一张CD-Rom,它从我哥哥大量的家庭视频档案中汲取灵感和媒体;一个跨越20年的家庭事件和非事件的档案[1]。让这些无足轻重的事情变得引人注目的是,我哥哥从一个拥有摄像机的聋哑人,变成了一个拥有大量视听档案的聋哑人,由于Usher综合症的影响,他再也看不见,也听不见。因此,《亚瑟星球》提供了一种持续和持久的业余实践的探索。这也是一个关于残疾和家庭的故事,他们从丢失的档案中错误地出现。这是对记忆和叙事的脆弱性的对抗,因为它们面对着互动和生活经验的变迁。
{"title":"Planet usher: an interactive home movie","authors":"Patrick Tarrant","doi":"10.1145/1027527.1027763","DOIUrl":"https://doi.org/10.1145/1027527.1027763","url":null,"abstract":"<i>Planet Usher: An Interactive Home Movie</i> is a CD-Rom that draws inspiration and media from my brother's extensive home video archive; an archive that spans twenty years of family events and non-events [1]. What makes even the non-events remarkable, is the fact that my brother went from being a deaf man with a video camera, to a deaf-blind man with an extensive audio-visual archive he can no longer see, nor hear, due to the effects of Usher Syndrome. So <i>Planet Usher</i> offers an exploration of a sustained and enduring amateur practice. It is also a story about disability and the family as they emerge faultingly from the lost archive. And it is a confrontation with the frailties of memory and narrative as they come face to face with the vicissitudes of both interactivity and lived experience.","PeriodicalId":292207,"journal":{"name":"MULTIMEDIA '04","volume":"66 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114908513","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
A new method to segment playfield and its applications in match analysis in sports video 一种新的场地分割方法及其在体育视频比赛分析中的应用
Pub Date : 2004-10-10 DOI: 10.1145/1027527.1027594
Shuqiang Jiang, Qixiang Ye, Wen Gao, Tiejun Huang
With the growing popularity of digitized sports video, automatic analysis of them need be processed to facilitate semantic summarization and retrieval. Playfield plays the fundamental role in automatically analyzing many sports programs. Many semantic clues could be inferred from the results of playfield segmentation. In this paper, a novel playfield segmentation method based on Gaussian mixture models (GMMs) is proposed. Firstly, training pixels are automatically sampled from frames. Then, by supposing that field pixels are the dominant components in most of the video frames, we build the GMMs of the field pixels and use these models to detect playfield pixels. Finally region-growing operation is employed to segment the playfield regions from the background. Experimental results show that the proposed method is robust to various sports videos even for very poor grass field conditions. Based on the results of playfield segmentation, match situation analysis is investigated, which is also desired for sports professionals and longtime fanners. The results are encouraging.
随着数字化体育视频的日益普及,需要对体育视频进行自动分析,以便进行语义总结和检索。在许多体育项目的自动分析中,运动场起着基础性的作用。从游戏领域分割的结果中可以推断出许多语义线索。本文提出了一种基于高斯混合模型的运动场分割方法。首先,从帧中自动采样训练像素。然后,假设场地像素在大多数视频帧中占主导地位,我们建立场地像素的gmm,并使用这些模型来检测场地像素。最后采用区域增长操作从背景中分割出运动场区域。实验结果表明,即使在非常恶劣的草地条件下,该方法对各种运动视频也具有鲁棒性。根据场地分割的结果,进行比赛态势分析,这也是体育专业人士和长期球迷所需要的。结果令人鼓舞。
{"title":"A new method to segment playfield and its applications in match analysis in sports video","authors":"Shuqiang Jiang, Qixiang Ye, Wen Gao, Tiejun Huang","doi":"10.1145/1027527.1027594","DOIUrl":"https://doi.org/10.1145/1027527.1027594","url":null,"abstract":"With the growing popularity of digitized sports video, automatic analysis of them need be processed to facilitate semantic summarization and retrieval. Playfield plays the fundamental role in automatically analyzing many sports programs. Many semantic clues could be inferred from the results of playfield segmentation. In this paper, a novel playfield segmentation method based on Gaussian mixture models (GMMs) is proposed. Firstly, training pixels are automatically sampled from frames. Then, by supposing that field pixels are the dominant components in most of the video frames, we build the GMMs of the field pixels and use these models to detect playfield pixels. Finally region-growing operation is employed to segment the playfield regions from the background. Experimental results show that the proposed method is robust to various sports videos even for very poor grass field conditions. Based on the results of playfield segmentation, match situation analysis is investigated, which is also desired for sports professionals and longtime fanners. The results are encouraging.","PeriodicalId":292207,"journal":{"name":"MULTIMEDIA '04","volume":"60 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133278794","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 51
SMARXO: towards secured multimedia applications by adopting RBAC, XML and object-relational database SMARXO:采用RBAC、XML和对象关系数据库,迈向安全的多媒体应用
Pub Date : 2004-10-10 DOI: 10.1145/1027527.1027631
Shu‐Ching Chen, M. Shyu, Na Zhao
In this paper, a framework named SMARXO is proposed to address the security issues in multimedia applications by adopting RBAC (Role-Based Access Control), XML, and Object-Relational Databases. Compared with the other existing security models or projects, SMARXO can deal with more intricate situations. First, the image object-level security and video scene/shot-level security can be easily achieved. Second, the temporal constrains and IP address restrictions are modeled for the access control purpose. Finally, XML queries can be performed such that the administrators can proficiently retrieve useful information from the security roles and policies.
本文采用基于角色的访问控制(RBAC)、XML和对象关系数据库,提出了一个名为SMARXO的框架来解决多媒体应用中的安全问题。与其他现有的安全模型或项目相比,SMARXO可以处理更复杂的情况。首先,可以轻松实现图像对象级安全和视频场景/镜头级安全。其次,为访问控制目的对时间约束和IP地址限制进行建模。最后,可以执行XML查询,以便管理员能够熟练地从安全角色和策略中检索有用的信息。
{"title":"SMARXO: towards secured multimedia applications by adopting RBAC, XML and object-relational database","authors":"Shu‐Ching Chen, M. Shyu, Na Zhao","doi":"10.1145/1027527.1027631","DOIUrl":"https://doi.org/10.1145/1027527.1027631","url":null,"abstract":"In this paper, a framework named SMARXO is proposed to address the security issues in multimedia applications by adopting RBAC (Role-Based Access Control), XML, and Object-Relational Databases. Compared with the other existing security models or projects, SMARXO can deal with more intricate situations. First, the image object-level security and video scene/shot-level security can be easily achieved. Second, the temporal constrains and IP address restrictions are modeled for the access control purpose. Finally, XML queries can be performed such that the administrators can proficiently retrieve useful information from the security roles and policies.","PeriodicalId":292207,"journal":{"name":"MULTIMEDIA '04","volume":"321 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129408282","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
The princess series 公主系列
Pub Date : 2004-10-10 DOI: 10.1145/1027527.1027772
Roxanne Wolanczyk
The Princess Series is a narrative of a modern-day princess who found herself grown up without a fortune. Presented as a series of Flash animations, each story depicts her daily struggle to save her soul while trying to survive in the corporate world by making junk mail. She courageously faces the dilemmas, contradictions, and paradoxes of modern life. She questions her own emotions, ideals, psychology, gender, and identity, all in the hopes of a happy ending. The work can be presented on a variety of mediums such as computer screens, projectors, and plasma screens. Still images can also be scaled to any size for print.
公主系列是一个现代公主的故事,她发现自己长大后没有财富。以一系列Flash动画的形式呈现,每个故事都描绘了她每天努力拯救自己的灵魂,同时试图通过制作垃圾邮件在企业界生存下来。她勇敢地面对现代生活中的困境、矛盾和悖论。她质疑自己的情感、理想、心理、性别和身份,所有这些都是为了一个幸福的结局。作品可以在各种媒介上呈现,如电脑屏幕、投影仪和等离子屏幕。静态图像也可以缩放到任何尺寸用于打印。
{"title":"The princess series","authors":"Roxanne Wolanczyk","doi":"10.1145/1027527.1027772","DOIUrl":"https://doi.org/10.1145/1027527.1027772","url":null,"abstract":"The Princess Series is a narrative of a modern-day princess who found herself grown up without a fortune. Presented as a series of Flash animations, each story depicts her daily struggle to save her soul while trying to survive in the corporate world by making junk mail. She courageously faces the dilemmas, contradictions, and paradoxes of modern life. She questions her own emotions, ideals, psychology, gender, and identity, all in the hopes of a happy ending.\u0000 The work can be presented on a variety of mediums such as computer screens, projectors, and plasma screens. Still images can also be scaled to any size for print.","PeriodicalId":292207,"journal":{"name":"MULTIMEDIA '04","volume":"74 3","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134530275","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Shibboleth: exploring cultural boundaries in speech Shibboleth:探索语言的文化界限
Pub Date : 2004-10-10 DOI: 10.1145/1027527.1027771
A. Senior
Shibboleth is a multimedia artwork that explores the cultural barriers created and enforced by accent and pronunciation differences. It is founded on the idea of biblical origin of a shibboleth --- a word or phrase that distinguishes one cultural group from another. The artwork consists of a computer interface through which users are able to to see and hear rhythmic audio-visual compositions of shibboleths created from previously recorded data and relevant sounds and imagery. Users can also use the interface to listen to examples of previously recorded shibboleths, as well as to add their own to a growing, geographically-indexed database.
Shibboleth是一件多媒体艺术作品,探讨了口音和发音差异所造成和强化的文化障碍。它是建立在shibboleth的圣经起源上的,shibboleth是一个区分一个文化群体与另一个文化群体的单词或短语。该作品由一个计算机界面组成,通过该界面,用户可以看到和听到由先前记录的数据和相关声音和图像创建的有节奏的视听组合。用户还可以使用该界面来收听先前记录的流行音乐,并将自己的流行音乐添加到不断增长的地理索引数据库中。
{"title":"Shibboleth: exploring cultural boundaries in speech","authors":"A. Senior","doi":"10.1145/1027527.1027771","DOIUrl":"https://doi.org/10.1145/1027527.1027771","url":null,"abstract":"Shibboleth is a multimedia artwork that explores the cultural barriers created and enforced by accent and pronunciation differences. It is founded on the idea of biblical origin of a <i>shibboleth</i> --- a word or phrase that distinguishes one cultural group from another. The artwork consists of a computer interface through which users are able to to see and hear rhythmic audio-visual compositions of shibboleths created from previously recorded data and relevant sounds and imagery. Users can also use the interface to listen to examples of previously recorded shibboleths, as well as to add their own to a growing, geographically-indexed database.","PeriodicalId":292207,"journal":{"name":"MULTIMEDIA '04","volume":"102 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124714729","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Analyzing discussion scene contents in instructional videos 分析教学视频中的讨论场景内容
Pub Date : 2004-10-10 DOI: 10.1145/1027527.1027587
Y. Li, C. Dorai
This paper describes our current effort on analyzing the contents of discussion scenes in instructional videos based on a clustering technique. Specifically, given a discussion scene pre-detected from an education or training video, we first apply a mode-based clustering approach to group all speech segments into an optimal number of clusters where each cluster contains speech from one speaker; we then analyze the discussion patterns in the scene, and subsequently classify it into either a 2-speaker or multi-speaker discussion. Encouraging classification results have been achieved on 122 discussion scenes detected from five IBM MicroMBA videos. Moreover, we have also observed fairly good performance on the speaker clustering scheme, which demonstrates the superiority of the proposed clustering approach. Undoubtedly, the discussion scene information output from this analysis scheme would facilitate the content browsing, searching and understanding of instructional videos.
本文介绍了基于聚类技术的教学视频讨论场景内容分析的研究现状。具体来说,给定从教育或培训视频中预先检测到的讨论场景,我们首先应用基于模式的聚类方法将所有语音片段分组到最优数量的聚类中,其中每个聚类包含来自一个说话者的语音;然后,我们分析了场景中的讨论模式,并随后将其分类为两个人或多个人的讨论。从5个IBM MicroMBA视频中检测到的122个讨论场景取得了令人鼓舞的分类结果。此外,我们还观察到说话人聚类方案具有相当好的性能,这证明了所提出的聚类方法的优越性。毫无疑问,该分析方案输出的讨论场景信息将有利于教学视频的内容浏览、搜索和理解。
{"title":"Analyzing discussion scene contents in instructional videos","authors":"Y. Li, C. Dorai","doi":"10.1145/1027527.1027587","DOIUrl":"https://doi.org/10.1145/1027527.1027587","url":null,"abstract":"This paper describes our current effort on analyzing the contents of discussion scenes in instructional videos based on a clustering technique. Specifically, given a discussion scene pre-detected from an education or training video, we first apply a mode-based clustering approach to group all speech segments into an optimal number of clusters where each cluster contains speech from one speaker; we then analyze the discussion patterns in the scene, and subsequently classify it into either a 2-speaker or multi-speaker discussion. Encouraging classification results have been achieved on 122 discussion scenes detected from five IBM MicroMBA videos. Moreover, we have also observed fairly good performance on the speaker clustering scheme, which demonstrates the superiority of the proposed clustering approach. Undoubtedly, the discussion scene information output from this analysis scheme would facilitate the content browsing, searching and understanding of instructional videos.","PeriodicalId":292207,"journal":{"name":"MULTIMEDIA '04","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130419417","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
An efficient parts-based near-duplicate and sub-image retrieval system 一种高效的基于零件的近重复和子图像检索系统
Pub Date : 2004-10-10 DOI: 10.1145/1027527.1027729
Yan Ke, R. Sukthankar, Larry Huston
We introduce a system for near-duplicate detection and sub-image retrieval. Such a system is useful for finding copyright violations and detecting forged images. We define near-duplicate as images altered with common transformations such as changing contrast, saturation, scaling, cropping, framing, etc. Our system builds a parts-based representation of images using distinctive local descriptors which give high quality matches even under severe transformations. To cope with the large number of features extracted from the images, we employ locality-sensitive hashing to index the local descriptors. This allows us to make approximate similarity queries that only examine a small fraction of the database. Although locality-sensitive hashing has excellent theoretical performance properties, a standard implementation would still be unacceptably slow for this application. We show that, by optimizing layout and access to the index data on disk, we can efficiently query indices containing millions of keypoints. Our system achieves near-perfect accuracy (100% precision at 99.85% recall) on the tests presented in Meng et al. [16], and consistently strong results on our own, significantly more challenging experiments. Query times are interactive even for collections of thousands of images.
介绍了一种近重复检测和子图像检索系统。这种系统对于发现侵犯版权的行为和检测伪造图像非常有用。我们将近复制定义为通过改变对比度、饱和度、缩放、裁剪、取景等常见变换改变的图像。我们的系统使用独特的局部描述符构建基于部件的图像表示,即使在严重的转换下也能给出高质量的匹配。为了处理从图像中提取的大量特征,我们采用位置敏感的哈希方法对局部描述符进行索引。这允许我们进行近似的相似性查询,只检查数据库的一小部分。尽管位置敏感散列在理论上具有出色的性能属性,但是对于这个应用程序,标准实现仍然会慢得令人无法接受。通过优化磁盘上索引数据的布局和访问,我们可以有效地查询包含数百万个关键点的索引。在Meng等人[16]的测试中,我们的系统达到了近乎完美的准确率(100%的准确率和99.85%的召回率),并且在我们自己的实验中也一直表现出很强的结果,这明显更具挑战性。查询时间是交互式的,甚至对于数千个图像的集合也是如此。
{"title":"An efficient parts-based near-duplicate and sub-image retrieval system","authors":"Yan Ke, R. Sukthankar, Larry Huston","doi":"10.1145/1027527.1027729","DOIUrl":"https://doi.org/10.1145/1027527.1027729","url":null,"abstract":"We introduce a system for near-duplicate detection and sub-image retrieval. Such a system is useful for finding copyright violations and detecting forged images. We define near-duplicate as images altered with common transformations such as changing contrast, saturation, scaling, cropping, framing, etc. Our system builds a parts-based representation of images using <i>distinctive local descriptors</i> which give high quality matches even under severe transformations. To cope with the large number of features extracted from the images, we employ <i>locality-sensitive hashing</i> to index the local descriptors. This allows us to make approximate similarity queries that only examine a small fraction of the database. Although locality-sensitive hashing has excellent theoretical performance properties, a standard implementation would still be unacceptably slow for this application. We show that, by optimizing layout and access to the index data on disk, we can efficiently query indices containing millions of keypoints. Our system achieves near-perfect accuracy (100% precision at 99.85% recall) on the tests presented in Meng <i>et al.</i> [16], and consistently strong results on our own, significantly more challenging experiments. Query times are interactive even for collections of thousands of images.","PeriodicalId":292207,"journal":{"name":"MULTIMEDIA '04","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130002706","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 425
Music artist style identification by semi-supervised learning from both lyrics and content 通过歌词和内容的半监督学习来识别音乐艺术家的风格
Pub Date : 2004-10-10 DOI: 10.1145/1027527.1027612
Tao Li, M. Ogihara
Efficient and intelligent music information retrieval is a very important topic of the 21st century. With the ultimate goal of building personal music information retrieval systems, this paper studies the problem of identifying "similar" artists using both lyrics and acoustic data. The approach for using a small set of labeled samples for the seed labeling to build classifiers that improve themselves using unlabeled data is presented. This approach is tested on a data set consisting of 43 artists and 56 albums using artist similarity provided by All Music Guide. Experimental results show that using such an approach the accuracy of artist similarity classifiers can be significantly improved and that artist similarity can be efficiently identified.
高效智能的音乐信息检索是21世纪的重要课题。本文以建立个人音乐信息检索系统为最终目标,研究了同时使用歌词和声学数据识别“相似”艺术家的问题。提出了一种使用少量标记样本进行种子标记以构建分类器的方法,该分类器可以使用未标记的数据进行自我改进。使用All Music Guide提供的艺术家相似性,在包含43位艺术家和56张专辑的数据集上测试了这种方法。实验结果表明,采用该方法可以显著提高艺术家相似度分类器的准确率,有效地识别出艺术家相似度。
{"title":"Music artist style identification by semi-supervised learning from both lyrics and content","authors":"Tao Li, M. Ogihara","doi":"10.1145/1027527.1027612","DOIUrl":"https://doi.org/10.1145/1027527.1027612","url":null,"abstract":"Efficient and intelligent music information retrieval is a very important topic of the 21st century. With the ultimate goal of building personal music information retrieval systems, this paper studies the problem of identifying \"similar\" artists using both lyrics and acoustic data. The approach for using a small set of labeled samples for the seed labeling to build classifiers that improve themselves using unlabeled data is presented. This approach is tested on a data set consisting of 43 artists and 56 albums using artist similarity provided by All Music Guide. Experimental results show that using such an approach the accuracy of artist similarity classifiers can be significantly improved and that artist similarity can be efficiently identified.","PeriodicalId":292207,"journal":{"name":"MULTIMEDIA '04","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130469664","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 44
DiMaS: distributing multimedia on peer-to-peer file sharing networks dima:在点对点文件共享网络上分发多媒体
Pub Date : 2004-10-10 DOI: 10.1145/1027527.1027560
Tommo Reti, R. Sarvas
This demonstration presents the Digital Content Distribution Management System (DiMaS). DiMaS proves as a concept that it is possible to make a system for multimedia producing communities to publish their work on highly popular P2P networks, and importantly, the system enables producers to insert content metadata, to manage intellectual property and usage rights, and to charge for the consumption. All this can be done without introducing another new content or metadata file format and a dedicated client application to read the format.
本演示展示了数字内容分发管理系统(DiMaS)。作为一个概念,DiMaS证明了可以为多媒体制作社区制作一个系统,让他们在非常流行的P2P网络上发布他们的作品,重要的是,该系统使生产者能够插入内容元数据,管理知识产权和使用权,并对消费收费。所有这些都可以在不引入另一种新的内容或元数据文件格式和专用的客户机应用程序来读取该格式的情况下完成。
{"title":"DiMaS: distributing multimedia on peer-to-peer file sharing networks","authors":"Tommo Reti, R. Sarvas","doi":"10.1145/1027527.1027560","DOIUrl":"https://doi.org/10.1145/1027527.1027560","url":null,"abstract":"This demonstration presents the Digital Content Distribution Management System (DiMaS). DiMaS proves as a concept that it is possible to make a system for multimedia producing communities to publish their work on highly popular P2P networks, and importantly, the system enables producers to insert content metadata, to manage intellectual property and usage rights, and to charge for the consumption. All this can be done without introducing another new content or metadata file format and a dedicated client application to read the format.","PeriodicalId":292207,"journal":{"name":"MULTIMEDIA '04","volume":"C-35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126492863","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
期刊
MULTIMEDIA '04
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1