首页 > 最新文献

Ninth IEEE International Symposium on Multimedia (ISM 2007)最新文献

英文 中文
Information Hiding in Real-Time VoIP Streams 信息隐藏在实时VoIP流
Pub Date : 2007-12-10 DOI: 10.1109/ISM.2007.33
Chung-Yi Wang, Quincy Wu
The real-time speech hiding is to hide the secret speech into a cover speech in real-time communication systems. By hiding one secret speech into the cover speech, we can get a stego speech, which sounds meaningful and indistinguishable from the original cover speech. Therefore, even if the attackers catch the audio packets on Internet, they would not notice that there is another speech hidden inside it. In this paper, we propose a scheme for speech hiding in a real-time communication system such as voice over Internet Protocol (VoIP). We propose a novel design of real-time speech hiding for G.711 codec, which is widely supported by almost every VoIP device. Experimental results show that the processing time for the proposed algorithm takes only 0.257 ms, which is suitable for real-time VoIP applications.
实时语音隐藏是指在实时通信系统中,将秘密语音隐藏成掩护语音。通过将一个秘密演讲隐藏在掩护演讲中,我们可以得到一个隐含演讲,它听起来有意义,与原来的掩护演讲没有区别。因此,即使攻击者捕获了互联网上的音频数据包,他们也不会注意到其中隐藏着另一个语音。在本文中,我们提出了一种实时通信系统(如VoIP)中的语音隐藏方案。我们提出了一种新的基于G.711编解码器的实时语音隐藏设计,该设计被几乎所有的VoIP设备广泛支持。实验结果表明,该算法的处理时间仅为0.257 ms,适用于实时VoIP应用。
{"title":"Information Hiding in Real-Time VoIP Streams","authors":"Chung-Yi Wang, Quincy Wu","doi":"10.1109/ISM.2007.33","DOIUrl":"https://doi.org/10.1109/ISM.2007.33","url":null,"abstract":"The real-time speech hiding is to hide the secret speech into a cover speech in real-time communication systems. By hiding one secret speech into the cover speech, we can get a stego speech, which sounds meaningful and indistinguishable from the original cover speech. Therefore, even if the attackers catch the audio packets on Internet, they would not notice that there is another speech hidden inside it. In this paper, we propose a scheme for speech hiding in a real-time communication system such as voice over Internet Protocol (VoIP). We propose a novel design of real-time speech hiding for G.711 codec, which is widely supported by almost every VoIP device. Experimental results show that the processing time for the proposed algorithm takes only 0.257 ms, which is suitable for real-time VoIP applications.","PeriodicalId":129680,"journal":{"name":"Ninth IEEE International Symposium on Multimedia (ISM 2007)","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132553874","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 69
Supporting Video Data in Wireless Sensor Networks 支持无线传感器网络中的视频数据
Pub Date : 2007-12-10 DOI: 10.1109/ISM.2007.42
Ju Wang, M. Masilela, Jonathan C. L. Liu
In this paper, we investigate issues associated with the transporting of multimedia streams across wireless sensor networks. We developed a prototype wireless sensor device that is capable of streaming video data through its wireless interface. Our experiments results showed that wireless sensor networks perform poorly with existing networking stack for such applications due to long delivery path and small transmission buffer sizes on the relaying nodes. The effect of a poor link often propagates backwards upstream and causes unnecessary data retransmission. To overcome these problems, we proposed a pipelined transmission scheme with a novel flow control method that monitors local buffer levels. A secondary buffer scheme is also used to reduce the retransmission overhead caused by node failure. Simulation results show that the proposed scheme significantly increase the network efficiency. We also propose a novel stochastic route discovery algorithm for multiple video stream in wireless sensor networks. Our method uses a probing stage where possible routes are explored and their delivery performance recorded. The data collected during the probing stage are used to select the final routes.
在本文中,我们研究了在无线传感器网络中传输多媒体流的相关问题。我们开发了一种原型无线传感器设备,能够通过其无线接口传输视频数据。我们的实验结果表明,由于中继节点上的传输路径长且传输缓冲区大小小,无线传感器网络在现有网络堆栈中表现不佳。链路不良的影响往往会向上游反向传播,造成不必要的数据重传。为了克服这些问题,我们提出了一种管道传输方案,该方案采用一种新颖的流量控制方法来监控本地缓冲区水平。辅助缓冲方案也用于减少节点故障引起的重传开销。仿真结果表明,该方案显著提高了网络效率。我们还提出了一种新的无线传感器网络中多视频流随机路由发现算法。我们的方法使用一个探测阶段,其中可能的路线进行探索,并记录其交付性能。探测阶段收集的数据用于选择最终路由。
{"title":"Supporting Video Data in Wireless Sensor Networks","authors":"Ju Wang, M. Masilela, Jonathan C. L. Liu","doi":"10.1109/ISM.2007.42","DOIUrl":"https://doi.org/10.1109/ISM.2007.42","url":null,"abstract":"In this paper, we investigate issues associated with the transporting of multimedia streams across wireless sensor networks. We developed a prototype wireless sensor device that is capable of streaming video data through its wireless interface. Our experiments results showed that wireless sensor networks perform poorly with existing networking stack for such applications due to long delivery path and small transmission buffer sizes on the relaying nodes. The effect of a poor link often propagates backwards upstream and causes unnecessary data retransmission. To overcome these problems, we proposed a pipelined transmission scheme with a novel flow control method that monitors local buffer levels. A secondary buffer scheme is also used to reduce the retransmission overhead caused by node failure. Simulation results show that the proposed scheme significantly increase the network efficiency. We also propose a novel stochastic route discovery algorithm for multiple video stream in wireless sensor networks. Our method uses a probing stage where possible routes are explored and their delivery performance recorded. The data collected during the probing stage are used to select the final routes.","PeriodicalId":129680,"journal":{"name":"Ninth IEEE International Symposium on Multimedia (ISM 2007)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133658796","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 16
A New Image Compression Scheme Based on Locally Adaptive Coding 一种基于局部自适应编码的图像压缩新方案
Pub Date : 2007-12-10 DOI: 10.1109/ISM.2007.49
Chinchen Chang, Yung-Chen Chou, Chia-Chen Lin
Vector quantization (VQ) is a simple and widely used compression technology in many applications. For image compression, VQ provides both a fixed compression ratio and maintains acceptable distortion. However, the performance of VQ still can be improved in terms of the image quality of compressed images and codebook size used for encoding and decoding. In this paper, a new VQ-like image compression method is proposed to improve the performance of traditional VQ by using locally adaptive coding concept. The experimental results confirm that the image quality of the compressed image offered by the proposed method is higher than 30 dB on average, and the number of codewords used in our codebook is less than that required by traditional VQ.
矢量量化(VQ)是一种简单而广泛应用的压缩技术。对于图像压缩,VQ既提供固定的压缩比,又保持可接受的失真。但是,VQ的性能在压缩图像的图像质量和用于编码和解码的码本大小方面仍然可以得到改进。本文提出了一种新的类VQ图像压缩方法,利用局部自适应编码的概念来提高传统VQ的压缩性能。实验结果表明,该方法压缩后的图像质量平均高于30 dB,码本中使用的码字数少于传统的VQ方法。
{"title":"A New Image Compression Scheme Based on Locally Adaptive Coding","authors":"Chinchen Chang, Yung-Chen Chou, Chia-Chen Lin","doi":"10.1109/ISM.2007.49","DOIUrl":"https://doi.org/10.1109/ISM.2007.49","url":null,"abstract":"Vector quantization (VQ) is a simple and widely used compression technology in many applications. For image compression, VQ provides both a fixed compression ratio and maintains acceptable distortion. However, the performance of VQ still can be improved in terms of the image quality of compressed images and codebook size used for encoding and decoding. In this paper, a new VQ-like image compression method is proposed to improve the performance of traditional VQ by using locally adaptive coding concept. The experimental results confirm that the image quality of the compressed image offered by the proposed method is higher than 30 dB on average, and the number of codewords used in our codebook is less than that required by traditional VQ.","PeriodicalId":129680,"journal":{"name":"Ninth IEEE International Symposium on Multimedia (ISM 2007)","volume":"60 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114643694","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Complexity Reduction and Fast Algorithm for 2-D Integer Discrete Wavelet Transform Using Symmetric Mask-Based Scheme 基于对称掩模的二维整数离散小波变换的快速降复杂度算法
Pub Date : 2007-12-10 DOI: 10.1109/ISM.2007.27
Chih-Hsien Hsia, Jing-Ming Guo, Jen-Shiun Chiang
Wavelet coding has been shown to be better than discrete cosine transform (DCT) in image/video processing. Moreover, it has the feature of scalability, which is involved in modern video standards. This work presents novel algorithms, namely 2-D symmetric mask-based discrete wavelet transform (SMDWT), to improve the critical issue of the 2-D lifting-based discrete wavelet transform (LDWT), and then obtains the benefit of low latency, high-speed operation, and low temporal memory. The SMDWT also has the advantages of high-performance embedded periodic extension boundary treatment, reduced complexity, regular signal coding, short critical path, reduced latency time, and independent subband coding processing. Moreover, the 2-D lifting-based DWT performance can also be easily improved by exploiting appropriate parallel method inherently in SMDWT. Comparing with the normal 2-D 5/3 integer lifting-based DWT the proposed method significantly improves lifting-based latency and complexity in 2-D DWT without degradation in image quality. The algorithm can be applied to real-time image/video applications, such as JPEG2000, MPEG-4 still texture object decoding, and wavelet-based Scalable Video Coding (SVC).
在图像/视频处理方面,小波编码已被证明优于离散余弦变换(DCT)。此外,它还具有可扩展性的特点,这是现代视频标准所涉及的。本文提出了基于二维对称掩模的离散小波变换(SMDWT)算法,改进了基于二维提升的离散小波变换(LDWT)的关键问题,从而获得了低延迟、高速运行和低时间内存的优点。SMDWT还具有高性能嵌入式周期扩展边界处理、降低复杂度、信号编码规则、关键路径短、延迟时间短、子带编码处理独立等优点。此外,利用SMDWT固有的适当的并行方法也可以很容易地提高基于二维提升的DWT性能。与常规的二维5/3整数提升DWT相比,该方法在不降低图像质量的前提下显著改善了二维DWT的时延和复杂度。该算法可应用于JPEG2000、MPEG-4静态纹理对象解码和基于小波的可缩放视频编码(SVC)等实时图像/视频应用。
{"title":"Complexity Reduction and Fast Algorithm for 2-D Integer Discrete Wavelet Transform Using Symmetric Mask-Based Scheme","authors":"Chih-Hsien Hsia, Jing-Ming Guo, Jen-Shiun Chiang","doi":"10.1109/ISM.2007.27","DOIUrl":"https://doi.org/10.1109/ISM.2007.27","url":null,"abstract":"Wavelet coding has been shown to be better than discrete cosine transform (DCT) in image/video processing. Moreover, it has the feature of scalability, which is involved in modern video standards. This work presents novel algorithms, namely 2-D symmetric mask-based discrete wavelet transform (SMDWT), to improve the critical issue of the 2-D lifting-based discrete wavelet transform (LDWT), and then obtains the benefit of low latency, high-speed operation, and low temporal memory. The SMDWT also has the advantages of high-performance embedded periodic extension boundary treatment, reduced complexity, regular signal coding, short critical path, reduced latency time, and independent subband coding processing. Moreover, the 2-D lifting-based DWT performance can also be easily improved by exploiting appropriate parallel method inherently in SMDWT. Comparing with the normal 2-D 5/3 integer lifting-based DWT the proposed method significantly improves lifting-based latency and complexity in 2-D DWT without degradation in image quality. The algorithm can be applied to real-time image/video applications, such as JPEG2000, MPEG-4 still texture object decoding, and wavelet-based Scalable Video Coding (SVC).","PeriodicalId":129680,"journal":{"name":"Ninth IEEE International Symposium on Multimedia (ISM 2007)","volume":"55 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124636850","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Unified Framework Based on p-Norm for Feature Aggregation in Content-Based Image Retrieval 基于p范数的图像检索特征聚合统一框架
Pub Date : 2007-12-10 DOI: 10.1109/ISM.2007.22
Jun Zhang, Lei Ye
Feature aggregation is a critical technique in content- based image retrieval systems that employ multiple visual features to characterize image content. In this paper, the p-norm is introduced to feature aggregation that provides a framework to unify various previous feature aggregation schemes such as linear combination, Euclidean distance, Boolean logic and decision fusion schemes in which previous schemes are instances. Some insights of the mechanism of how various aggregation schemes work are discussed through the effects of model parameters in the unified framework. Experiments show that performances vary over feature aggregation schemes that necessitates an unified framework in order to optimize the retrieval performance according to individual queries and user query concept. Revealing experimental results conducted with IAPR TC-12 ImageCLEF2006 benchmark collection that contains over 20,000 photographic images are presented and discussed.
特征聚合是基于内容的图像检索系统中的一项关键技术,它利用多个视觉特征来描述图像内容。本文将p范数引入到特征聚合中,提供了一个框架来统一各种先前的特征聚合方案,如线性组合、欧氏距离、布尔逻辑和决策融合方案。通过模型参数在统一框架中的影响,讨论了各种聚合方案工作机制的一些见解。实验表明,不同的特征聚合方案的检索性能存在差异,因此需要一个统一的框架来根据单个查询和用户查询概念来优化检索性能。本文介绍并讨论了使用IAPR TC-12 ImageCLEF2006基准集进行的揭示性实验结果,该基准集包含20,000多张摄影图像。
{"title":"An Unified Framework Based on p-Norm for Feature Aggregation in Content-Based Image Retrieval","authors":"Jun Zhang, Lei Ye","doi":"10.1109/ISM.2007.22","DOIUrl":"https://doi.org/10.1109/ISM.2007.22","url":null,"abstract":"Feature aggregation is a critical technique in content- based image retrieval systems that employ multiple visual features to characterize image content. In this paper, the p-norm is introduced to feature aggregation that provides a framework to unify various previous feature aggregation schemes such as linear combination, Euclidean distance, Boolean logic and decision fusion schemes in which previous schemes are instances. Some insights of the mechanism of how various aggregation schemes work are discussed through the effects of model parameters in the unified framework. Experiments show that performances vary over feature aggregation schemes that necessitates an unified framework in order to optimize the retrieval performance according to individual queries and user query concept. Revealing experimental results conducted with IAPR TC-12 ImageCLEF2006 benchmark collection that contains over 20,000 photographic images are presented and discussed.","PeriodicalId":129680,"journal":{"name":"Ninth IEEE International Symposium on Multimedia (ISM 2007)","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120993850","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
An Adaptive Audio Quantizer for Voip Systems 用于Voip系统的自适应音频量化器
Pub Date : 2007-12-10 DOI: 10.1109/ISM.2007.32
Ricardo Bertagna, R. Mello, L. Yang
The Internet evolution has been requiring the development of new technology to support multimedia transmission such as images, database access, audio and video in realtime. Such development needs new services and supports like the voice over IP (VoIP) which has a main motivation in the low cost communication and management. VoIP systems have motivated this work which proposes an adaptive audio quantizer named IQ (intervalar quantizer) to reduce the data dimensionality and consequently the entropy, what allows better audio compression. This quantizer is adaptive because it has an error tolerance parameter which can be varied according to the available network bandwidth, allowing to adapt communication. After transmitting, the audio is improved by using a filter with complex poles in the Z plan. This filter attenuates non-important frequencies, privileging the sensitive ones to human audition. Results confirm that IQ and the filter offer good quality (measured using the mean opinion score metrics) and compress ratio.
互联网的发展要求开发新的技术来支持实时的多媒体传输,如图像、数据库访问、音频和视频。这种发展需要新的服务和支持,如IP语音(VoIP),其主要动机是低成本的通信和管理。VoIP系统推动了这项工作,提出了一种名为IQ(区间量化器)的自适应音频量化器来降低数据维数,从而降低熵,从而实现更好的音频压缩。这个量化器是自适应的,因为它有一个容错参数,可以根据可用的网络带宽变化,允许适应通信。在传输后,通过在Z平面上使用具有复杂极点的滤波器来改善音频。这个滤波器衰减不重要的频率,使敏感的频率对人的听觉有特权。结果证实IQ和过滤器提供了良好的质量(使用平均意见得分指标测量)和压缩比。
{"title":"An Adaptive Audio Quantizer for Voip Systems","authors":"Ricardo Bertagna, R. Mello, L. Yang","doi":"10.1109/ISM.2007.32","DOIUrl":"https://doi.org/10.1109/ISM.2007.32","url":null,"abstract":"The Internet evolution has been requiring the development of new technology to support multimedia transmission such as images, database access, audio and video in realtime. Such development needs new services and supports like the voice over IP (VoIP) which has a main motivation in the low cost communication and management. VoIP systems have motivated this work which proposes an adaptive audio quantizer named IQ (intervalar quantizer) to reduce the data dimensionality and consequently the entropy, what allows better audio compression. This quantizer is adaptive because it has an error tolerance parameter which can be varied according to the available network bandwidth, allowing to adapt communication. After transmitting, the audio is improved by using a filter with complex poles in the Z plan. This filter attenuates non-important frequencies, privileging the sensitive ones to human audition. Results confirm that IQ and the filter offer good quality (measured using the mean opinion score metrics) and compress ratio.","PeriodicalId":129680,"journal":{"name":"Ninth IEEE International Symposium on Multimedia (ISM 2007)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122373960","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multiuser Mobile Multimedia 多用户移动多媒体
Pub Date : 2007-12-10 DOI: 10.1109/ISM.2007.34
D. Doolan, S. Tabirca, L. Yang
Mobility especially the flexibility given to us by the mobile phone is the future of computing as we know it. No longer are we restricted to sitting at a desk in front of a powerful desktop machine. Mobile technology of today allows users to work, learn and play no matter where they may be. Wireless technology is becoming more and more a standard feature of computing, so much so that it is expected that approximately two billion Bluetooth enabled devices will have been produced by the end of 2007. This paper examines how Bluetooth application development may be simplified for the programmer by use of the mobile message passing interface (MMPI). It explores a selection of application areas that can benefit from this simplified means of wireless inter-device communication, including: compute intensive tasks, mobile learning and multi-player gaming.
正如我们所知,移动性,尤其是手机给我们带来的灵活性是计算的未来。我们不再局限于坐在一台功能强大的台式电脑前。今天的移动技术让用户无论身在何处都可以工作、学习和娱乐。无线技术正日益成为计算的一个标准特征,以至于预计到2007年底将生产大约20亿个支持蓝牙的设备。本文探讨了如何通过使用移动消息传递接口(MMPI)来简化蓝牙应用程序的开发。它探讨了一系列可以从这种简化的无线设备间通信方式中受益的应用领域,包括:计算密集型任务、移动学习和多人游戏。
{"title":"Multiuser Mobile Multimedia","authors":"D. Doolan, S. Tabirca, L. Yang","doi":"10.1109/ISM.2007.34","DOIUrl":"https://doi.org/10.1109/ISM.2007.34","url":null,"abstract":"Mobility especially the flexibility given to us by the mobile phone is the future of computing as we know it. No longer are we restricted to sitting at a desk in front of a powerful desktop machine. Mobile technology of today allows users to work, learn and play no matter where they may be. Wireless technology is becoming more and more a standard feature of computing, so much so that it is expected that approximately two billion Bluetooth enabled devices will have been produced by the end of 2007. This paper examines how Bluetooth application development may be simplified for the programmer by use of the mobile message passing interface (MMPI). It explores a selection of application areas that can benefit from this simplified means of wireless inter-device communication, including: compute intensive tasks, mobile learning and multi-player gaming.","PeriodicalId":129680,"journal":{"name":"Ninth IEEE International Symposium on Multimedia (ISM 2007)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130080622","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Open Source Architecture for Low-Latency Video Streaming on PDAs 在pda上实现低延迟视频流的开源架构
Pub Date : 2007-12-10 DOI: 10.1109/ISM.2007.25
Giovanni Gualdi, A. Prati, R. Cucchiara
This paper presents a open-source system for low- latency video streaming on PDAs, specifically addressing mobile video surveillance requirements. The system is based on H.264 and suitably modified to obtain the best trade-off between image quality and video fluidity, working also at very limited bandwidths. Moreover, the used controls allow to keep the number of lost frames very low. A large set of experiments and comparisons have been carried out and the achieved results demonstrate the efficacy and efficiency of our system.
本文提出了一种基于pda的低延迟视频流的开源系统,特别针对移动视频监控的需求。该系统基于H.264,并进行了适当的修改,以获得图像质量和视频流畅性之间的最佳平衡,并且可以在非常有限的带宽下工作。此外,使用的控件允许保持丢失帧的数量非常低。进行了大量的实验和比较,结果证明了系统的有效性和高效性。
{"title":"An Open Source Architecture for Low-Latency Video Streaming on PDAs","authors":"Giovanni Gualdi, A. Prati, R. Cucchiara","doi":"10.1109/ISM.2007.25","DOIUrl":"https://doi.org/10.1109/ISM.2007.25","url":null,"abstract":"This paper presents a open-source system for low- latency video streaming on PDAs, specifically addressing mobile video surveillance requirements. The system is based on H.264 and suitably modified to obtain the best trade-off between image quality and video fluidity, working also at very limited bandwidths. Moreover, the used controls allow to keep the number of lost frames very low. A large set of experiments and comparisons have been carried out and the achieved results demonstrate the efficacy and efficiency of our system.","PeriodicalId":129680,"journal":{"name":"Ninth IEEE International Symposium on Multimedia (ISM 2007)","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125780198","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Adaptive Early Termination for Fast H.264 Video Coding 快速H.264视频编码的自适应提前终止
Pub Date : 2007-12-10 DOI: 10.1109/ISM.2007.14
Chung-Yen Su, Shu-Li Chang
The H.264 standard applies several powerful coding methods to obtain high compression efficiency. However, it requires a lot of computation especially in variable block-size motion estimation. To reduce the motion estimation redundancy more effectively, an adaptive early termination algorithm is proposed in this paper. The proposed algorithm dynamically changes the thresholds for different coding modes according to video content. With the proposed method, many zero motion blocks can be predicted, the corresponding motion estimation can stop early, and the remaining computation can be omitted. Simulation results show that the proposed method can averagely reduce the entire coding time up to 14.38% and the motion estimation time up to 21.82% at the price of negligible coding loss.
H.264标准采用了几种强大的编码方法来获得较高的压缩效率。然而,它需要大量的计算量,特别是在可变块大小的运动估计中。为了更有效地减少运动估计冗余,本文提出了一种自适应提前终止算法。该算法根据视频内容动态改变不同编码模式的阈值。该方法可以预测出多个零运动块,提前停止相应的运动估计,省去了剩余的计算量。仿真结果表明,该方法平均可将整个编码时间缩短14.38%,运动估计时间缩短21.82%,而编码损失可以忽略不计。
{"title":"Adaptive Early Termination for Fast H.264 Video Coding","authors":"Chung-Yen Su, Shu-Li Chang","doi":"10.1109/ISM.2007.14","DOIUrl":"https://doi.org/10.1109/ISM.2007.14","url":null,"abstract":"The H.264 standard applies several powerful coding methods to obtain high compression efficiency. However, it requires a lot of computation especially in variable block-size motion estimation. To reduce the motion estimation redundancy more effectively, an adaptive early termination algorithm is proposed in this paper. The proposed algorithm dynamically changes the thresholds for different coding modes according to video content. With the proposed method, many zero motion blocks can be predicted, the corresponding motion estimation can stop early, and the remaining computation can be omitted. Simulation results show that the proposed method can averagely reduce the entire coding time up to 14.38% and the motion estimation time up to 21.82% at the price of negligible coding loss.","PeriodicalId":129680,"journal":{"name":"Ninth IEEE International Symposium on Multimedia (ISM 2007)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127141597","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Local Binary Patterns for Human Detection on Hexagonal Structure 六边形结构人体局部二值模式检测
Pub Date : 2007-12-10 DOI: 10.1109/ISM.2007.19
Xiangjian He, Jianmin Li, Yan Chen, Qiang Wu, W. Jia
Local binary pattern (LBP) was designed and has been widely used for efficient texture classification. LBP provides a simple and effective way to represent texture patterns. Uniform LBPs play an important role for LBP-based pattern/object recognition as they include majority of LBPs. On the other hand, Human detection based on Mahalanobis distance map (MDM) recognizes appearance of human based on geometrical structure. Each MDM shows a clear texture pattern that can be classified using LBPs. In this paper, we compute LBPs of MDMs on a hexagonal structure. The circular pixel arrangement in hexagonal structure results in higher accuracy for LBP representation than on square structure. Chi-square as a measure is used for human detection based on uniform LBPs obtained. We show that our method using LBPs built on MDMs has a higher human detection rate and a lower false positive rate compared to the method merely based on MDMs. We will also show using experimental results that LBPs on hexagonal structure lead to more robust human classification.
局部二值模式(Local binary pattern, LBP)被广泛用于有效的纹理分类。LBP提供了一种简单有效的纹理模式表示方法。统一的lbp包含了大多数的lbp,在基于lbp的模式/目标识别中起着重要的作用。另一方面,基于马氏距离图(MDM)的人体检测基于人体的几何结构来识别人体的外观。每个MDM都显示一个清晰的纹理模式,可以使用lbp对其进行分类。在本文中,我们计算了六边形结构上MDMs的lbp。六边形结构中的圆形像素排列比正方形结构中的圆形像素排列具有更高的LBP表示精度。基于获得的均匀lbp,将卡方作为测量方法用于人体检测。我们表明,与仅基于MDMs的方法相比,我们使用基于MDMs的lbp的方法具有更高的人类检测率和更低的假阳性率。我们还将使用实验结果表明,六边形结构上的lbp导致更稳健的人类分类。
{"title":"Local Binary Patterns for Human Detection on Hexagonal Structure","authors":"Xiangjian He, Jianmin Li, Yan Chen, Qiang Wu, W. Jia","doi":"10.1109/ISM.2007.19","DOIUrl":"https://doi.org/10.1109/ISM.2007.19","url":null,"abstract":"Local binary pattern (LBP) was designed and has been widely used for efficient texture classification. LBP provides a simple and effective way to represent texture patterns. Uniform LBPs play an important role for LBP-based pattern/object recognition as they include majority of LBPs. On the other hand, Human detection based on Mahalanobis distance map (MDM) recognizes appearance of human based on geometrical structure. Each MDM shows a clear texture pattern that can be classified using LBPs. In this paper, we compute LBPs of MDMs on a hexagonal structure. The circular pixel arrangement in hexagonal structure results in higher accuracy for LBP representation than on square structure. Chi-square as a measure is used for human detection based on uniform LBPs obtained. We show that our method using LBPs built on MDMs has a higher human detection rate and a lower false positive rate compared to the method merely based on MDMs. We will also show using experimental results that LBPs on hexagonal structure lead to more robust human classification.","PeriodicalId":129680,"journal":{"name":"Ninth IEEE International Symposium on Multimedia (ISM 2007)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115734920","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
期刊
Ninth IEEE International Symposium on Multimedia (ISM 2007)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1