In this paper, we propose a novel image watermarking algorithm, which is based on the scheme where the watermark is embedded into projection of an image onto the secret set of key-dependant basis functions. We propose a key-dependant image decomposition method, which involves solving a bottleneck hamiltonian path problem by an ant colony optimization technique. Combined with the decomposition technique, the original scheme is significantly improved on the aspects of speed, capacity and security. In particular, our algorithm usually shortens the running time of the original scheme by a factor of several hundred.
{"title":"Key-dependant decomposition based image watermarking","authors":"Shiyan Hu","doi":"10.1145/1027527.1027630","DOIUrl":"https://doi.org/10.1145/1027527.1027630","url":null,"abstract":"In this paper, we propose a novel image watermarking algorithm, which is based on the scheme where the watermark is embedded into projection of an image onto the secret set of key-dependant basis functions. We propose a key-dependant image decomposition method, which involves solving a bottleneck hamiltonian path problem by an ant colony optimization technique. Combined with the decomposition technique, the original scheme is significantly improved on the aspects of speed, capacity and security. In particular, our algorithm usually shortens the running time of the original scheme by a factor of several hundred.","PeriodicalId":292207,"journal":{"name":"MULTIMEDIA '04","volume":"50 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123909648","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The growth in the power and connectivity of the Internet has sparked an even larger growth in streaming media. The sheer number of possible users and applications at any point in time raises the probability of streaming multimedia flows encountering congestion. To overcome short-term congestion and avoid long-term congestion collapse, there is a growing consensus that Internet applications must be TCP-Friendly, with proposed approaches to detect and punish non-TCP friendly flows. Unlike TCP, new TCP-friendly streaming media protocols refrain from retransmissions to avoid delay and jitter, but they are susceptible to quality degradation from packet loss. While multimedia applications can tolerate some data loss, excessive packet loss during congestion yields unacceptable media quality. Since video encoding involves interframe dependencies to achieve high compression rates, the random dropping of packets by routers can seriously degrade video quality. For example, as little as 3% MPEG packet loss can cause 30% of the frames to be undecodable. Streaming media flows often utilize lower latency repair approaches, such as Forward Error Correction (FEC), in conjunction with TCP-Friendly protocols to deliver streaming applications over the Internet. However, FEC requires redundant repair data to be added to the original video stream. Current approaches use either apriori, static FEC choices or adapt FEC to perceived packet loss on the network without regard to TCP-Friendly data rate constraints. When a streaming video operates within TCP-Friendly bitrate limits, adding FEC will reduce the effective transmission rate of the original video content.
{"title":"Demonstration of adjusting forward error correction with quality scaling for TCP-friendly streaming MPEG","authors":"H. Wu, M. Claypool, R. Kinicki","doi":"10.1145/1027527.1027564","DOIUrl":"https://doi.org/10.1145/1027527.1027564","url":null,"abstract":"The growth in the power and connectivity of the Internet has sparked an even larger growth in streaming media. The sheer number of possible users and applications at any point in time raises the probability of streaming multimedia flows encountering congestion. To overcome short-term congestion and avoid long-term congestion collapse, there is a growing consensus that Internet applications must be TCP-Friendly, with proposed approaches to detect and punish non-TCP friendly flows. Unlike TCP, new TCP-friendly streaming media protocols refrain from retransmissions to avoid delay and jitter, but they are susceptible to quality degradation from packet loss. While multimedia applications can tolerate some data loss, excessive packet loss during congestion yields unacceptable media quality. Since video encoding involves interframe dependencies to achieve high compression rates, the random dropping of packets by routers can seriously degrade video quality. For example, as little as 3% MPEG packet loss can cause 30% of the frames to be undecodable. Streaming media flows often utilize lower latency repair approaches, such as Forward Error Correction (FEC), in conjunction with TCP-Friendly protocols to deliver streaming applications over the Internet. However, FEC requires redundant repair data to be added to the original video stream. Current approaches use either apriori, static FEC choices or adapt FEC to perceived packet loss on the network without regard to TCP-Friendly data rate constraints. When a streaming video operates within TCP-Friendly bitrate limits, adding FEC will reduce the effective transmission rate of the original video content.","PeriodicalId":292207,"journal":{"name":"MULTIMEDIA '04","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124012921","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Rahul Singh, Rachel Knickmeyer, Punit Gupta, R. Jain
With the increasing ubiquity of sensors and computational resources, it is becoming easier and increasingly common for people to electronically record, photographs, text, audio, and video gathered over their lifetime. Assimilating and taking advantage of such data requires recognition of its multimedia nature, development of data models that can represent semantics across different media, representation of complex relationships in the data (such as spatio-temporal, causal, or evolutionary), and finally, development of paradigms to mediate user-media interactions. There is currently a paucity of theoretical frameworks and implementations that allow management of diverse and rich multimedia data collections in context of the aforementioned requirements. This paper presents our research in designing an experiential Multimedia Electronic Chronicle system that addresses many of these issues in the concrete context of personal multimedia information. Central to our approach is the characterization and organization of media using the concept of an "event" for unified modeling and indexing. The event-based unified multimedia model underlies the experiential user interface, which supports direct interactions with the data within a unified presentation-exploration-query environment. In this environment, explicit facilities to model space and time aid in exploration and querying as well as in representation and reasoning with dynamic relationships in the data. Experimental and comparative studies demonstrate the promise of this research.
{"title":"Designing experiential environments for management of personal multimedia","authors":"Rahul Singh, Rachel Knickmeyer, Punit Gupta, R. Jain","doi":"10.1145/1027527.1027647","DOIUrl":"https://doi.org/10.1145/1027527.1027647","url":null,"abstract":"With the increasing ubiquity of sensors and computational resources, it is becoming easier and increasingly common for people to electronically record, photographs, text, audio, and video gathered over their lifetime. Assimilating and taking advantage of such data requires recognition of its multimedia nature, development of data models that can represent semantics across different media, representation of complex relationships in the data (such as spatio-temporal, causal, or evolutionary), and finally, development of paradigms to mediate user-media interactions. There is currently a paucity of theoretical frameworks and implementations that allow management of diverse and rich multimedia data collections in context of the aforementioned requirements. This paper presents our research in designing an experiential Multimedia Electronic Chronicle system that addresses many of these issues in the concrete context of personal multimedia information. Central to our approach is the characterization and organization of media using the concept of an \"event\" for unified modeling and indexing. The event-based unified multimedia model underlies the experiential user interface, which supports direct interactions with the data within a unified presentation-exploration-query environment. In this environment, explicit facilities to model space and time aid in exploration and querying as well as in representation and reasoning with dynamic relationships in the data. Experimental and comparative studies demonstrate the promise of this research.","PeriodicalId":292207,"journal":{"name":"MULTIMEDIA '04","volume":"564 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123110443","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Tony Cabello-Miguel, Oscar Fernández-Barracel, Oscar García-Panyella
The iGlue Project is a set of tools meant for the integration of different multimedia technologies in creative applications like multimedia installations or live audiovisual shows. Inspired by the way of working used in electronics, similar concepts are developed in a visual environment to interactively build applications out of components, wires and circuits. This approach enhances the entire R&D workflow, providing for application prototyping, empirical experimentation, collaborative work, software recycling and simplified maintenance.
{"title":"iGlue.v3: an electronics metaphor for multimedia technologies integration","authors":"Tony Cabello-Miguel, Oscar Fernández-Barracel, Oscar García-Panyella","doi":"10.1145/1027527.1027676","DOIUrl":"https://doi.org/10.1145/1027527.1027676","url":null,"abstract":"The iGlue Project is a set of tools meant for the integration of different multimedia technologies in creative applications like multimedia installations or live audiovisual shows.\u0000 Inspired by the way of working used in electronics, similar concepts are developed in a visual environment to interactively build applications out of components, wires and circuits. This approach enhances the entire R&D workflow, providing for application prototyping, empirical experimentation, collaborative work, software recycling and simplified maintenance.","PeriodicalId":292207,"journal":{"name":"MULTIMEDIA '04","volume":"126 ","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120886532","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Traces of Culture presents a series of real time interactive art events that use custom-crafted searchbots to investigate image search processes and the underlying compendium of visual information available on the World Wide Web.
{"title":"Traces of culture: searchbots scour the web looking for visual information","authors":"Stephen Wilson","doi":"10.1145/1027527.1027760","DOIUrl":"https://doi.org/10.1145/1027527.1027760","url":null,"abstract":"<i>Traces of Culture</i> presents a series of real time interactive art events that use custom-crafted searchbots to investigate image search processes and the underlying compendium of visual information available on the World Wide Web.","PeriodicalId":292207,"journal":{"name":"MULTIMEDIA '04","volume":"35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115415881","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Semantic multimedia management is necessary for the effective and widespread utilization of multimedia repositories and realizing the potential that lies untapped in the rich multimodal information content. This challenge has driven researchers to devise new algorithms and systems that enable automatic or semi-automatic tagging of large scale multimedia content with rich semantics. An emerging research area is the detection of a predetermined set of semantic concepts that can act as semantic filters and aid in search, and manipulation. The NIST TRECVID benchmark has responded by creating a task that has evaluated the performance of concept detection. Within the scope of this benchmark task, this paper studies trends in the emerging concept detection systems, architectures and algorithms. It also analyzes strategies that have yielded reasonable success, and challenges and gaps that lie ahead.
{"title":"On the detection of semantic concepts at TRECVID","authors":"M. Naphade, John R. Smith","doi":"10.1145/1027527.1027680","DOIUrl":"https://doi.org/10.1145/1027527.1027680","url":null,"abstract":"Semantic multimedia management is necessary for the effective and widespread utilization of multimedia repositories and realizing the potential that lies untapped in the rich multimodal information content. This challenge has driven researchers to devise new algorithms and systems that enable automatic or semi-automatic tagging of large scale multimedia content with rich semantics. An emerging research area is the detection of a predetermined set of semantic concepts that can act as semantic filters and aid in search, and manipulation. The NIST TRECVID benchmark has responded by creating a task that has evaluated the performance of concept detection. Within the scope of this benchmark task, this paper studies trends in the emerging concept detection systems, architectures and algorithms. It also analyzes strategies that have yielded reasonable success, and challenges and gaps that lie ahead.","PeriodicalId":292207,"journal":{"name":"MULTIMEDIA '04","volume":"68 8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131138653","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The Evolving Oblique is an interactive video and sound installation that explores the spatialization of cinematic projection and the mediative role of the human body. Centered upon the theme of transformative nature, the installation re-contextualizes notions of landscape and architectural form into a new topology through the embodiment of the moving image.
{"title":"The evolving oblique: the embodiment of a virtual topology","authors":"Joanna Walker, Steffen Bluemm, Bill Haslett","doi":"10.1145/1027527.1027744","DOIUrl":"https://doi.org/10.1145/1027527.1027744","url":null,"abstract":"The Evolving Oblique is an interactive video and sound installation that explores the spatialization of cinematic projection and the mediative role of the human body. Centered upon the theme of transformative nature, the installation re-contextualizes notions of landscape and architectural form into a new topology through the embodiment of the moving image.","PeriodicalId":292207,"journal":{"name":"MULTIMEDIA '04","volume":"54 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125687129","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The next generation of medical systems will integrate multimodality and multimedia data from a variety of sources to aid individual ysicians in providing the best treatment and care for their patients and to help researchers understand patterns of disease in collections of large databases. We promote a principle of maximum flexibility for the design of these systems to accommodate new developments in medical imaging and bioinformatics. We present several characteristics of flexible systems that support medical multimedia. We illustrate how these characteristics can be applied within diverse medical research and clinical therapy environments through two in-depth case studies. Within these examples, we describe how specific emerging technologies, including the semantic web and publish and subscribe networking, can enhance the flexibility of medical multimedia architectures.
{"title":"Flexible frameworks for medical multimedia","authors":"M. Halle, R. Kikinis","doi":"10.1145/1027527.1027708","DOIUrl":"https://doi.org/10.1145/1027527.1027708","url":null,"abstract":"The next generation of medical systems will integrate multimodality and multimedia data from a variety of sources to aid individual ysicians in providing the best treatment and care for their patients and to help researchers understand patterns of disease in collections of large databases. We promote a principle of maximum flexibility for the design of these systems to accommodate new developments in medical imaging and bioinformatics. We present several characteristics of flexible systems that support medical multimedia. We illustrate how these characteristics can be applied within diverse medical research and clinical therapy environments through two in-depth case studies. Within these examples, we describe how specific emerging technologies, including the semantic web and publish and subscribe networking, can enhance the flexibility of medical multimedia architectures.","PeriodicalId":292207,"journal":{"name":"MULTIMEDIA '04","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121591598","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Kristjan Varnik, Jason Freeman, Chandrasekhar Ramakrishnan
Auracle is a networked sound instrument controlled by the voice. Users jam together over the Internet using only a microphone. Throughout the development process, the authors experimented with different approaches to interpreting vocal input and facilitating user interaction. This paper outlines some of the tools used to implement and evaluate those ideas, simulate the wide range of activities of Auracle users, and facilitate the development process.
{"title":"Tools used while developing auracle: a voice-controlled networked instrument","authors":"Kristjan Varnik, Jason Freeman, Chandrasekhar Ramakrishnan","doi":"10.1145/1027527.1027656","DOIUrl":"https://doi.org/10.1145/1027527.1027656","url":null,"abstract":"Auracle is a networked sound instrument controlled by the voice. Users jam together over the Internet using only a microphone. Throughout the development process, the authors experimented with different approaches to interpreting vocal input and facilitating user interaction. This paper outlines some of the tools used to implement and evaluate those ideas, simulate the wide range of activities of Auracle users, and facilitate the development process.","PeriodicalId":292207,"journal":{"name":"MULTIMEDIA '04","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114460463","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The artists developed an interactive work based on the imaginary story of an actual 13th century illuminated Bible, investigating the many levels of meaning of the Manuscript: as a beautiful artifact, as a work of art reflecting the convergence of cultures in medieval Spain, and as a text which tells stories which themselves are layered in meanings. Transforming their own photographs and sound samples with digital media, the creative process challenged the artists to incorporate traditional Jewish cultural motifs in a work intended to appeal to contemporary general audiences. Juxtaposition, reinterpretation, and hybrid formats as artistic approach parallel the essential nature of Jewish culture, as well that of other historically nomadic peoples.
{"title":"Layered histories: the wandering bible of Marseilles","authors":"Cynthia Beth Rubin, R. Gluck","doi":"10.1145/1027527.1027773","DOIUrl":"https://doi.org/10.1145/1027527.1027773","url":null,"abstract":"The artists developed an interactive work based on the imaginary story of an actual 13th century illuminated Bible, investigating the many levels of meaning of the Manuscript: as a beautiful artifact, as a work of art reflecting the convergence of cultures in medieval Spain, and as a text which tells stories which themselves are layered in meanings. Transforming their own photographs and sound samples with digital media, the creative process challenged the artists to incorporate traditional Jewish cultural motifs in a work intended to appeal to contemporary general audiences. Juxtaposition, reinterpretation, and hybrid formats as artistic approach parallel the essential nature of Jewish culture, as well that of other historically nomadic peoples.","PeriodicalId":292207,"journal":{"name":"MULTIMEDIA '04","volume":"40 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114692612","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}