
IEEE International Symposium on Consumer Electronics (ISCE 2010): latest papers

Stable disk performance with non-sequential data block placement
Pub Date : 2010-06-07 DOI: 10.1109/ISCE.2010.5523279
Damien Le Moal, Donald Molaro, Z. Bandic
General-purpose file systems traditionally use sequential data block allocation policies, which increase disk access performance by reducing disk head seek and rotational latency. However, these methods also directly expose the variable data transfer rate of modern multi-zone disks, resulting in unstable and unpredictable disk throughput that depends on the files being accessed. In this paper, we present the Zone-Round-Robin (ZRR) block allocation policy, which avoids this problem by distributing the data blocks of a file over the entire disk to achieve near-constant file access rates. Experimental results show that ZRR leads to stable disk performance independently of the set of files being used and that the data throughput achieved is close to the average disk performance.
Citations: 1
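The allocation idea in the abstract above can be illustrated with a minimal sketch. It is not the authors' implementation: the zone count, the per-zone transfer rates, and the function names are hypothetical, blocks are assumed to be equal in size, and seek and rotational costs are ignored entirely.

```python
def zrr_allocate(num_blocks, num_zones, start_zone=0):
    """Assign each block of a file to a zone in round-robin order."""
    return [(start_zone + i) % num_zones for i in range(num_blocks)]


def sequential_allocate(num_blocks, zone):
    """Place every block of a file in the same zone (sequential policy)."""
    return [zone] * num_blocks


def effective_rate(block_zones, zone_rates):
    """Throughput for equal-sized blocks: total data / total transfer time."""
    total_time = sum(1.0 / zone_rates[z] for z in block_zones)
    return len(block_zones) / total_time


# Hypothetical transfer rates in MB/s, outer zone first.
ZONE_RATES = [100.0, 80.0, 60.0, 50.0]

if __name__ == "__main__":
    for zone in range(len(ZONE_RATES)):
        seq = effective_rate(sequential_allocate(8, zone), ZONE_RATES)
        zrr = effective_rate(zrr_allocate(8, len(ZONE_RATES), zone), ZONE_RATES)
        print(f"start zone {zone}: sequential {seq:.1f} MB/s, ZRR {zrr:.1f} MB/s")
```

Under these assumptions the sequential rate depends on which zone a file happens to land in, while the round-robin rate is the same for every starting zone whenever the block count is a multiple of the zone count.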
Enhancement of the complement of an embedded surveillance system with PIR sensors and ultrasonic sensors
Pub Date : 2010-06-07 DOI: 10.1109/ISCE.2010.5522729
Ying-Wen Bai, Zong-Han Li, Zi-Li Xie
In this paper we design and implement an embedded surveillance system with multiple pyroelectric infrared (PIR) sensors and ultrasonic sensors, which turn on the embedded surveillance system according to the signals triggered by their majority voting mechanism (MVM). The PIR sensors are used to detect changes in human temperature and environment temperature, but have a high miss rate in the case of a slow-walking intruder or an object with heat insulation. To overcome these disadvantages of the PIR sensors, we add ultrasonic sensors to complement them. The ultrasonic sensor module consists of a transmitter and a receiver placed in line with each other. The receiver detects the signal and decides whether it is blocked. Because the ultrasonic transmission spreads over a beam angle, we use multiple ultrasonic sensors to receive it and to provide the majority voting mechanism.
Citations: 14
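A majority vote over several sensor readings, as mentioned in the abstract above, might look like the sketch below. The abstract does not say how the PIR and ultrasonic groups are combined, so the OR rule in `should_wake` is purely an assumption, and both function names are hypothetical.

```python
def majority_vote(readings):
    """Return True when strictly more than half of the sensors report a detection."""
    readings = list(readings)
    return 2 * sum(bool(r) for r in readings) > len(readings)


def should_wake(pir_readings, ultrasonic_readings):
    """Decide whether to turn on the surveillance system.

    Assumption (not stated in the abstract): a majority in either sensor
    group is enough to trigger the system.
    """
    return majority_vote(pir_readings) or majority_vote(ultrasonic_readings)


if __name__ == "__main__":
    # A slow intruder missed by most PIR sensors but blocking two of three
    # ultrasonic beams still wakes the system under the OR rule.
    print(should_wake([False, False, True], [True, True, False]))
```

Requiring a strict majority means a single noisy sensor cannot wake the system on its own once a group has three or more members.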
FlashFind: Responsive search in large datasets on mobile devices
Pub Date : 2010-06-07 DOI: 10.1109/ISCE.2010.5523256
Thilo Ernst, A. Schramm
This paper introduces FlashFind, an on-device search engine for making the vast amounts of digital content stored on today's flash memory-enabled mobile devices (PMPs, PNDs, e-readers, smart phones) instantly accessible through a single-widget search interface. Without network access, FlashFind provides a search experience comparable to that of standard web search engines. More than that, the search is fully incremental and prefix-based, i.e. results are shown shortly after each key press, and instead of entire words, the user mostly needs to type only a small number of characters until the desired result is found. The pilot application, implemented on Windows Mobile smart phones and on Linux, provides near-instantaneous global search-based selection among the streets, points of interest and cities in a map of Western Europe (~10 million entries) and is being integrated into a commercial navigation application. The patent-pending FlashFind technology is equally capable of searching other coarsely structured text-based or text-annotated spatial or non-spatial content, e.g. collections of e-books on an e-reader device. In contrast to server or desktop search solutions, FlashFind is optimized for the limitations of mobile devices in terms of CPU, main memory, and interaction constraints (e.g. having only a phone keypad). Ongoing work is targeted at fuzzy search, at new applications and device classes, and at integrating our search technology with other innovative input methods.
Citations: 0
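To make "incremental and prefix-based" search concrete, here is a minimal sketch that uses a sorted in-memory list and binary search. It is a generic textbook technique, not the patent-pending FlashFind method, whose internals the abstract does not describe; the class name `PrefixIndex` and the sample entries are hypothetical.

```python
import bisect


class PrefixIndex:
    """Case-insensitive prefix lookup over a sorted list of entries."""

    def __init__(self, entries):
        pairs = sorted((entry.lower(), entry) for entry in entries)
        self._keys = [key for key, _ in pairs]
        self._names = [name for _, name in pairs]

    def search(self, prefix, limit=10):
        """Return up to `limit` entries whose lowercase form starts with `prefix`."""
        p = prefix.lower()
        i = bisect.bisect_left(self._keys, p)  # first key >= prefix
        results = []
        while i < len(self._keys) and len(results) < limit and self._keys[i].startswith(p):
            results.append(self._names[i])
            i += 1
        return results


if __name__ == "__main__":
    index = PrefixIndex(["Berlin", "Bern", "Bergen", "Paris"])
    typed = ""
    for key in "berl":  # simulate one query per key press
        typed += key
        print(typed, index.search(typed))
```

Each key press reruns the lookup on the longer prefix, so the candidate list shrinks as the user types. A real on-device index over millions of entries would need a more compact structure than Python lists.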
Enhanced IPTV services through time synchronisation
Pub Date : 2010-06-07 DOI: 10.1109/ISCE.2010.5522743
Lourdes Beloqui Yuste, H. Melvin
User interactivity and customisation are key features that must be promoted to help distinguish IPTV from satellite/aerial/cable TV alternatives. Furthermore, IPTV providers also have to compete with Internet TV by providing extra features to justify the cost. In this paper, we present a number of scenarios whereby IPTV providers can achieve this through the use of synchronised time. Integrating and tightly synchronising, in real time, multiple user-selected media streams that are logically and temporally related potentially adds significant extra value to any IPTV platform. It also raises significant research challenges. We outline the prototype developed for a TV/Text Feed scenario and present some details of ongoing and future work.
Citations: 6
A joint multi rate optimization framework for video adaptation in H.264/AVC
Pub Date : 2010-06-07 DOI: 10.1109/ISCE.2010.5523738
M. Semsarzadeh, M. Hashemi
Due to the explosive growth in ubiquitous multimedia access, one needs to adapt the considerable amount of pre-encoded video content to a wide variety of user constraints such as different resolution, bitrate, frame rate, processing power and platform. Current adaptation frameworks do not take the encoder into account. In other words, the encoder optimizes its output for the base layer only, regardless of the remaining adaptation layers and the resulting distortion once the adaptation is applied. In this paper, a joint multi rate optimization algorithm is proposed for H.264/AVC that generates an encoded bitstream with the minimum distortion across all the layers combined, hence providing an improved experience for a larger number of users. The proposed scheme, which only supports bitrate adaptation, is independent of the adaptation algorithm and is applicable to most existing adaptation frameworks. Simulation results indicate that the proposed scheme improves the video quality by up to 0.8 dB. The quality improvement of the adapted content is more significant at lower bitrates, where even small improvements are visually noticeable due to the inherently low quality at these bitrates.
Citations: 3
Scalable high quality nonlinear up-scaler with guaranteed real time performance
Pub Date : 2010-06-07 DOI: 10.1109/ISCE.2010.5523734
S. Schiemenz, C. Hentschel
Spatial image resizing is a very important issue for pixel-oriented displays with variable input formats. A known example is a scaling factor of two for up-scaling from Standard Definition (SD) to High Definition (HD) format. We investigated an up-scaler for rational factors with nonlinear detail enhancement using the principle of Priority Processing. Priority Processing can guarantee real-time performance on programmable platforms even with limited resources. Our up-scaler adapts to the available hardware resources and delivers good output quality over a wide range of scalability.
Citations: 4
Optimized temporal scalability for H.264 based codecs and its applications to video conferencing
Pub Date : 2010-06-07 DOI: 10.1109/ISCE.2010.5523739
H. Cycon, D. Marpe, T. Schmidt, M. Wahlisch, Martin Winken
In this paper, we describe and analyze a low-complexity scalable video codec that extends our H.264 implementation DAVC. We can show that DSVC, our temporally scaled codec, attains an RD performance identical to the non-scaled version at a comparable configuration. We achieve this by QP cascading, a method of assigning gradually refining quantization parameters to the declining temporal layers. The different quantization of frames does not lead to visually distinguishable quality fluctuations. This video codec is the core component of a software-based multipoint videoconference system, which works without an MCU on a hybrid P2P network structure.
Citations: 3
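QP cascading, named in the abstract above, can be sketched as a per-frame rule that derives the quantization parameter from the frame's temporal layer. The abstract gives no concrete values, so everything here is assumed: a dyadic hierarchy with a power-of-two GOP size, a fixed QP step per layer, and coarser quantization for higher (more disposable) layers, which is the common convention. The function names are hypothetical; the only hard fact used is that H.264 QP values lie in the range 0 to 51.

```python
H264_MAX_QP = 51  # H.264 quantization parameters range from 0 to 51


def temporal_layer(frame_idx, gop_size):
    """Temporal layer of a frame in a dyadic hierarchy (gop_size is a power of two)."""
    pos = frame_idx % gop_size
    if pos == 0:
        return 0  # key frames form the base layer
    num_layers = gop_size.bit_length() - 1      # log2(gop_size)
    trailing_zeros = (pos & -pos).bit_length() - 1
    return num_layers - trailing_zeros


def cascaded_qp(frame_idx, base_qp, gop_size, step=1):
    """Assumed rule: add `step` to the base QP for every temporal layer above 0."""
    qp = base_qp + step * temporal_layer(frame_idx, gop_size)
    return min(H264_MAX_QP, qp)


if __name__ == "__main__":
    for i in range(9):
        print(i, temporal_layer(i, 8), cascaded_qp(i, base_qp=30, gop_size=8))
```

With a GOP size of 8, frames 0 and 8 sit in layer 0, frame 4 in layer 1, frames 2 and 6 in layer 2, and the odd frames in layer 3, so dropping the highest layer halves the frame rate.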
Single-channel speech separation using zero-phase models
Pub Date : 2010-06-07 DOI: 10.1109/ISCE.2010.5523701
Y. Lee, Chul Kwak, I. Lee, O. Kwon
This paper addresses the problem of single-channel speech separation to extract and enhance the desired speech signals from mixed speech signals. We propose a new speech separation algorithm that utilizes both magnitude and phase information and can be applied to multimedia mobile communication and navigation systems. Conventionally, phase information has been neglected in speech signal processing. In the proposed method, however, we formulate a new probabilistic phase-based speech estimator based on zero-phase models to improve the speech separation performance. In the speech separation experiments, the proposed method is shown to improve the speaker-to-interference ratio (SIR) by 2.2 dB compared to the system using magnitude models only. When only the phase-based speech estimator is used for speech separation, the SIR improves by 0.8 dB. These results indicate that the proposed phase-based speech estimation method achieves a significant SIR improvement over the previous magnitude-based method.
Citations: 1
Semantic MobileTiles: A tool for movie content understanding on mobile phones
Pub Date : 2010-06-07 DOI: 10.1109/ISCE.2010.5522766
K. Glasman, Alex Glasman, E. Grinenko, B. Falchuk
Web-accessible video sites are currently some of the most visited on the mobile Web. It is expected that the number of these sites will continue to grow and that customers will want more powerful exploration tools. Recently, we described an innovative content-based tool for video browsing called MobileTiles. In this paper we describe Semantic MobileTiles, an improved version of MobileTiles based on semantically meaningful key frame selection. We propose a method of approximation of semantically compressed versions of short movies. We found that a good approximation can be achieved if pairs of key frames from each shot boundary are used to represent a sequence of shots. Semantic MobileTiles provides a novel abstraction/understanding tool that enables users to get the main idea of a short movie at a glance. Our algorithm is both effective and computationally simple.
Citations: 1
Implementation of an embedded H.264 live video streaming system
Pub Date : 2010-06-07 DOI: 10.1109/ISCE.2010.5523699
N. Vun, M. Ansary
This paper presents the methodologies used to integrate an open-source LIVE555-based data streamer with a baseline H.264 encoder running on Texas Instruments' DaVinci-based embedded platform. The implemented system is able to stream live H.264 encoded video over the network for display on remote stations using the VLC media player. The system provides an embedded platform for implementing a smart surveillance camera system, in which video analysis can be performed locally on the embedded platform, making use of the H.264 motion vector information, to minimize the streaming throughput.
Citations: 14