Xiangjian He, S. Luo, D. Tao, Changsheng Xu, Jie Yang
This report on The 21st International Conference on MultiMedia Modeling provides an overview of the best papers and keynote presentations. It also reviews the special sessions on Personal (Big) Data Modeling for Information Access and Retrieval; Social Geo-Media Analytics and Retrieval; and Image or Video Processing, Semantic Analysis, and Understanding.
{"title":"The 21st International Conference on MultiMedia Modeling","authors":"Xiangjian He, S. Luo, D. Tao, Changsheng Xu, Jie Yang","doi":"10.1109/MMUL.2015.49","DOIUrl":"https://doi.org/10.1109/MMUL.2015.49","url":null,"abstract":"This report on The 21st International Conference on MultiMedia Modeling provides an overview of the best papers and keynote presentations. It also reviews the special sessions on Personal (Big) Data Modeling for Information Access and Retrieval; Social Geo-Media Analytics and Retrieval; and Image or Video Processing, Semantic Analysis, and Understanding.","PeriodicalId":290893,"journal":{"name":"IEEE Multim.","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132475713","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The role of emotional and social signals in multimedia has not been a core concern of the multimedia research community--an omission explored at the 22nd ACM International Conference on Multimedia during a panel titled, "Emotional and Social Signals in Multimedia: Where Art Thou?" The panel discussion revealed major gaps in the formulation, understanding, and application of emotional and social signal processing in the multimedia domain. Here, the authors review the challenges in bringing this new domain to multimedia, summarizing current feelings in the research community based on discussions during the panel.
{"title":"Emotional and Social Signals: A Neglected Frontier in Multimedia Computing?","authors":"H. Gunes, H. Hung","doi":"10.1109/MMUL.2015.37","DOIUrl":"https://doi.org/10.1109/MMUL.2015.37","url":null,"abstract":"The role of emotional and social signals in multimedia has not been a core concern of the multimedia research community--an omission explored at the 22nd ACM International Conference on Multimedia during a panel titled, \"Emotional and Social Signals in Multimedia: Where Art Thou?\" The panel discussion revealed major gaps in the formulation, understanding, and application of emotional and social signal processing in the multimedia domain. Here, the authors review the challenges in bringing this new domain to multimedia, summarizing current feelings in the research community based on discussions during the panel.","PeriodicalId":290893,"journal":{"name":"IEEE Multim.","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131633013","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Associate Editor-in-Chief Alan Hanjalic reflects on developments in the field of multimedia search and the notion of relevance in multimedia search systems. He suggests it may be time to look beyond relevance to a query and toward user intent to maximize the usefulness of the search results for the user who inserted the query.
{"title":"Multimedia Search: From Relevance to Usefulness","authors":"A. Hanjalic","doi":"10.1109/MMUL.2015.11","DOIUrl":"https://doi.org/10.1109/MMUL.2015.11","url":null,"abstract":"Associate Editor-in-Chief Alan Hanjalic reflects on developments in the field of multimedia search and the notion of relevance in multimedia search systems. He suggests it may be time to look beyond relevance to a query and toward user intent to maximize the usefulness of the search results for the user who inserted the query.","PeriodicalId":290893,"journal":{"name":"IEEE Multim.","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-02-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116983241","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The Video Browser Showdown is an international competition in the field of interactive video search and retrieval. It is held annually as a special session at the International Conference on Multimedia Modeling (MMM). The Video Browser Showdown evaluates the performance of exploratory tools for interactive content search in videos in direct competition and in front of an audience. Its goal is to push research on user-centric video search tools including video navigation, content browsing, content interaction, and video content visualization. This article summarizes the first three VBS competitions (2012-2014).
{"title":"A User-Centric Media Retrieval Competition: The Video Browser Showdown 2012-2014","authors":"Klaus Schöffmann","doi":"10.1109/MMUL.2014.56","DOIUrl":"https://doi.org/10.1109/MMUL.2014.56","url":null,"abstract":"The Video Browser Showdown is an international competition in the field of interactive video search and retrieval. It is held annually as a special session at the International Conference on Multimedia Modeling (MMM). The Video Browser Showdown evaluates the performance of exploratory tools for interactive content search in videos in direct competition and in front of an audience. Its goal is to push research on user-centric video search tools including video navigation, content browsing, content interaction, and video content visualization. This article summarizes the first three VBS competitions (2012-2014).","PeriodicalId":290893,"journal":{"name":"IEEE Multim.","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131109813","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Editor-in-Chief Yong Rui reflects on deep neural networks (DNNs), which have quickly gained momentum in various pattern recognition and multimedia applications.
总编辑永瑞对深度神经网络(dnn)进行了反思,深度神经网络在各种模式识别和多媒体应用中迅速发展。
{"title":"Deep Neural Networks: Another Tool for Multimedia Computing","authors":"Y. Rui","doi":"10.1109/MMUL.2014.59","DOIUrl":"https://doi.org/10.1109/MMUL.2014.59","url":null,"abstract":"Editor-in-Chief Yong Rui reflects on deep neural networks (DNNs), which have quickly gained momentum in various pattern recognition and multimedia applications.","PeriodicalId":290893,"journal":{"name":"IEEE Multim.","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129078764","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Humans have always been interested in understanding themselves and their environment. Understanding their relationship with the environment is important to survival as well as thriving in the present situation and planning for the future. With advances in technology, the 21st century has witnessed significant advances in storage, processing, sensing, and communication technologies. All these have resulted in the popularization of strong data-dependent approaches, leading to the rise in the popularity of scientism in almost all disciplines where data can be collected. As the availability of data has become widespread, the desire to understand the physical reality at different levels in different applications has also become possible and desirable.
{"title":"Objective Self","authors":"R. Jain, Laleh Jalali","doi":"10.1109/MMUL.2014.63","DOIUrl":"https://doi.org/10.1109/MMUL.2014.63","url":null,"abstract":"Humans have always been interested in understanding themselves and their environment. Understanding their relationship with the environment is important to survival as well as thriving in the present situation and planning for the future. With advances in technology, the 21st century has witnessed significant advances in storage, processing, sensing, and communication technologies. All these have resulted in the popularization of strong data-dependent approaches, leading to the rise in the popularity of scientism in almost all disciplines where data can be collected. As the availability of data has become widespread, the desire to understand the physical reality at different levels in different applications has also become possible and desirable.","PeriodicalId":290893,"journal":{"name":"IEEE Multim.","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121300268","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
EEE MultiMedia magazine was founded in 1994 and was the first IEEE publication in the multimedia area. MM serves the community of scholars, developers, practitioners, and students who are interested in multiple media types, used harmoniously together, for creating new experiences. In early 2014, the MM editorial board launched several initiatives to strengthen its collaboration with various multimedia communities in an effort to reach out to a wider range of audience and foster a more efficient publication process. One of the first efforts was to collaborate with the IEEE International Conference on Multimedia & Expo (ICME), the flagship multimedia conference that has been sponsored by four IEEE societies since 2000, to facilitate the publication of extended versions of top ICME papers in MM via a “fast track” review and publication process. In May 2014, the authors of the top 26 ICME 2014 papers were invited to submit extended versions (with at least 30 percent new material) of their papers to this fast track special issue scheduled to be published in the October– December 2014 issue. We received 15 submissions that span various topics such as audio and video coding, vision and pattern analysis, object tracking, quality assessment, and social media. After a rigorous peer-review process, eight of those submissions were accepted for this special issue, now titled “Hot Topics in Multimedia Research.” (Several other fine submissions were also accepted and will be published in early 2015.) A significant subset of the accepted articles address issues in visual analysis and tracking. In “Local Stereo Matching with Improved Matching Cost and Disparity Refinement,” Jianbo Jiao, Ronggang Wang, Wenmin Wang, Shengfu Dong, Zhenyu Wang, and Wen Gao present a technique for improving local stereo matching. They propose a new cost measure to improve the initial matching performance followed by a secondary refinement technique to remove the remaining outliers. In “Multimodal Feature Fusion for 3D Shape Recognition and Retrieval,” Shuhui Bu, Shaoguang Cheng, Zhenbao Liu, and Junwei Han present a deep learning framework to fuse 3D shape and 2D view-based features for 3D shape recognition and retrieval. Gaze-tracking technology is highly valuable in many interactive and diagnostic applications. In “Real-Time Gaze Estimation with Online Calibration,” Li Sun, Mingli Song, Zicheng Liu, and Ming-Ting Sun present a novel 3D-model-based gaze-estimation system using a single consumer depth camera (Kinect) with online calibration to constantly improve person-specific eye parameters. The article “Latent Subspace Projection Pursuit with Online Optimization for Robust Visual Tracking” by Risheng Liu, Wei Jin, Zhixun Su, and Changcheng Zhang proposes an online subspace learning technique to address the problem of feature extraction for visual tracking. In “Online Learning a High-Quality Dictionary and Classifier Jointly for Multitask Object Tracking,” Baojie Fan, Yang Cong, Yin
{"title":"Forging a Close Relationship with Multimedia Communities","authors":"Wenjun Zeng, Zicheng Liu, E. Steinbach","doi":"10.1109/MMUL.2014.60","DOIUrl":"https://doi.org/10.1109/MMUL.2014.60","url":null,"abstract":"EEE MultiMedia magazine was founded in 1994 and was the first IEEE publication in the multimedia area. MM serves the community of scholars, developers, practitioners, and students who are interested in multiple media types, used harmoniously together, for creating new experiences. In early 2014, the MM editorial board launched several initiatives to strengthen its collaboration with various multimedia communities in an effort to reach out to a wider range of audience and foster a more efficient publication process. One of the first efforts was to collaborate with the IEEE International Conference on Multimedia & Expo (ICME), the flagship multimedia conference that has been sponsored by four IEEE societies since 2000, to facilitate the publication of extended versions of top ICME papers in MM via a “fast track” review and publication process. In May 2014, the authors of the top 26 ICME 2014 papers were invited to submit extended versions (with at least 30 percent new material) of their papers to this fast track special issue scheduled to be published in the October– December 2014 issue. We received 15 submissions that span various topics such as audio and video coding, vision and pattern analysis, object tracking, quality assessment, and social media. After a rigorous peer-review process, eight of those submissions were accepted for this special issue, now titled “Hot Topics in Multimedia Research.” (Several other fine submissions were also accepted and will be published in early 2015.) A significant subset of the accepted articles address issues in visual analysis and tracking. In “Local Stereo Matching with Improved Matching Cost and Disparity Refinement,” Jianbo Jiao, Ronggang Wang, Wenmin Wang, Shengfu Dong, Zhenyu Wang, and Wen Gao present a technique for improving local stereo matching. They propose a new cost measure to improve the initial matching performance followed by a secondary refinement technique to remove the remaining outliers. In “Multimodal Feature Fusion for 3D Shape Recognition and Retrieval,” Shuhui Bu, Shaoguang Cheng, Zhenbao Liu, and Junwei Han present a deep learning framework to fuse 3D shape and 2D view-based features for 3D shape recognition and retrieval. Gaze-tracking technology is highly valuable in many interactive and diagnostic applications. In “Real-Time Gaze Estimation with Online Calibration,” Li Sun, Mingli Song, Zicheng Liu, and Ming-Ting Sun present a novel 3D-model-based gaze-estimation system using a single consumer depth camera (Kinect) with online calibration to constantly improve person-specific eye parameters. The article “Latent Subspace Projection Pursuit with Online Optimization for Robust Visual Tracking” by Risheng Liu, Wei Jin, Zhixun Su, and Changcheng Zhang proposes an online subspace learning technique to address the problem of feature extraction for visual tracking. In “Online Learning a High-Quality Dictionary and Classifier Jointly for Multitask Object Tracking,” Baojie Fan, Yang Cong, Yin","PeriodicalId":290893,"journal":{"name":"IEEE Multim.","volume":"35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129391896","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Humans are the best functioning example of multimedia communication and computing - that is, we understand information and experiences through the unified perspective offered by our five senses. This innovative textbook presents emerging techniques in multimedia computing from an experiential perspective in which each medium - audio, images, text, and so on - is a strong component of the complete, integrated exchange of information or experience. The authors' goal is to present current techniques in computing and communication that will lead to the development of a unified and holistic approach to computing using heterogeneous data sources. Gerald Friedland and Ramesh Jain introduce the fundamentals of multimedia computing, describing the properties of perceptually encoded information, presenting common algorithms and concepts for handling it, and outlining the typical requirements for emerging applications that use multifarious information sources. Designed for advanced undergraduate and beginning graduate courses, the book will also serve as an introduction for engineers and researchers interested in understanding the elements of multimedia and their role in building specific applications.
{"title":"Multimedia Computing","authors":"R. Jain","doi":"10.1109/MMUL.1994.10002","DOIUrl":"https://doi.org/10.1109/MMUL.1994.10002","url":null,"abstract":"Humans are the best functioning example of multimedia communication and computing - that is, we understand information and experiences through the unified perspective offered by our five senses. This innovative textbook presents emerging techniques in multimedia computing from an experiential perspective in which each medium - audio, images, text, and so on - is a strong component of the complete, integrated exchange of information or experience. The authors' goal is to present current techniques in computing and communication that will lead to the development of a unified and holistic approach to computing using heterogeneous data sources. Gerald Friedland and Ramesh Jain introduce the fundamentals of multimedia computing, describing the properties of perceptually encoded information, presenting common algorithms and concepts for handling it, and outlining the typical requirements for emerging applications that use multifarious information sources. Designed for advanced undergraduate and beginning graduate courses, the book will also serve as an introduction for engineers and researchers interested in understanding the elements of multimedia and their role in building specific applications.","PeriodicalId":290893,"journal":{"name":"IEEE Multim.","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-07-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127226183","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Editor-in-Chief Yong Rui reflects on what has been achieved in the past 20 years of multimedia research and where to go next, specifically on the past and future of image search.
总编辑永睿回顾了多媒体研究20年来取得的成就和未来的发展方向,特别是图片搜索的过去和未来。
{"title":"Big Data and Image Search","authors":"Y. Rui","doi":"10.1109/MMUL.2014.39","DOIUrl":"https://doi.org/10.1109/MMUL.2014.39","url":null,"abstract":"Editor-in-Chief Yong Rui reflects on what has been achieved in the past 20 years of multimedia research and where to go next, specifically on the past and future of image search.","PeriodicalId":290893,"journal":{"name":"IEEE Multim.","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125305275","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
This article provides a recap of the 4th ACM International Conference on Multimedia Retrieval, which took place in Glasgow, Scotland, from 1-4 April 2014.
本文概述了2014年4月1日至4日在苏格兰格拉斯哥举行的第四届ACM多媒体检索国际会议。
{"title":"ACM International Conference on Multimedia Retrieval (ICMR 2014)","authors":"S. Rüger, J. Jose","doi":"10.1109/MMUL.2014.38","DOIUrl":"https://doi.org/10.1109/MMUL.2014.38","url":null,"abstract":"This article provides a recap of the 4th ACM International Conference on Multimedia Retrieval, which took place in Glasgow, Scotland, from 1-4 April 2014.","PeriodicalId":290893,"journal":{"name":"IEEE Multim.","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121900329","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}