首页 > 最新文献

2006 IEEE International Conference on Multimedia and Expo最新文献

英文 中文
SIM: Scalable Island Multicast for Peer-to-Peer Media Streaming SIM:点对点媒体流的可扩展孤岛多播
Pub Date : 2006-07-09 DOI: 10.1109/ICME.2006.262668
Xing Jin, Kan-Leung Cheng, S. Chan
Despite the fact that global multicast is still not possible in today's Internet, many local networks are already multicast-capable (the so-called multicast "islands"). However, most application-layer multicast (ALM) protocols for streaming has not taken advantage of the underlying IP multicast capability. As IP multicast is more efficient, it would be beneficial if ALM can take advantage of such capability in building overlay trees. In this paper, we propose a fully distributed protocol called Scalable Island Multicast (SIM), which effectively integrates IP multicast and ALM. Hosts in SIM first form an overlay tree using a scalable protocol. They then detect IP multicast islands and employ IP multicast whenever possible. Through simulations on Internet-like topologies, we show that SIM achieves much lower end-to-end delay and link stress as compared with traditional ALM protocols
尽管在今天的Internet中仍然不可能实现全局多播,但许多本地网络已经具有多播能力(所谓的多播“孤岛”)。然而,大多数用于流的应用层多播(ALM)协议没有利用底层IP多播功能。由于IP组播的效率更高,如果ALM能够利用这种能力构建覆盖树将是有益的。本文提出了一种完全分布式的可扩展岛组播(SIM)协议,它有效地集成了IP组播和ALM。SIM中的主机首先使用可扩展协议形成覆盖树。然后,它们检测IP组播孤岛,并尽可能使用IP组播。通过对类互联网拓扑的仿真,我们发现与传统的ALM协议相比,SIM实现了更低的端到端延迟和链路应力
{"title":"SIM: Scalable Island Multicast for Peer-to-Peer Media Streaming","authors":"Xing Jin, Kan-Leung Cheng, S. Chan","doi":"10.1109/ICME.2006.262668","DOIUrl":"https://doi.org/10.1109/ICME.2006.262668","url":null,"abstract":"Despite the fact that global multicast is still not possible in today's Internet, many local networks are already multicast-capable (the so-called multicast \"islands\"). However, most application-layer multicast (ALM) protocols for streaming has not taken advantage of the underlying IP multicast capability. As IP multicast is more efficient, it would be beneficial if ALM can take advantage of such capability in building overlay trees. In this paper, we propose a fully distributed protocol called Scalable Island Multicast (SIM), which effectively integrates IP multicast and ALM. Hosts in SIM first form an overlay tree using a scalable protocol. They then detect IP multicast islands and employ IP multicast whenever possible. Through simulations on Internet-like topologies, we show that SIM achieves much lower end-to-end delay and link stress as compared with traditional ALM protocols","PeriodicalId":339258,"journal":{"name":"2006 IEEE International Conference on Multimedia and Expo","volume":"135 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2006-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127350254","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 27
Broadcast Video Program Summarization using Face Tracks 基于人脸轨迹的广播视频节目摘要
Pub Date : 2006-07-09 DOI: 10.1109/ICME.2006.262715
K. A. Peker, I. Otsuka, Ajay Divakaran
We present a novel video summarization and skimming technique using face detection on broadcast video programs. We take the faces in video as our primary target as they constitute the focus of most consumer video programs. We detect face tracks in video and define face-scene fragments based on start and end of face tracks. We define a fast-forward skimming method using frames selected from fragments, thus covering all the faces and their interactions in the video program. We also define novel constraints for a smooth and visually representative summary, and construct longer but smoother summaries
提出了一种基于人脸检测的广播视频节目视频摘要与浏览技术。我们把视频中的人脸作为我们的首要目标,因为他们构成了大多数消费视频节目的焦点。我们检测视频中的人脸轨迹,并根据人脸轨迹的起点和终点来定义人脸场景片段。我们定义了一种快速浏览方法,使用从片段中选择的帧,从而覆盖视频程序中的所有面孔及其相互作用。我们还为流畅和具有视觉代表性的摘要定义了新的约束,并构建了更长但更流畅的摘要
{"title":"Broadcast Video Program Summarization using Face Tracks","authors":"K. A. Peker, I. Otsuka, Ajay Divakaran","doi":"10.1109/ICME.2006.262715","DOIUrl":"https://doi.org/10.1109/ICME.2006.262715","url":null,"abstract":"We present a novel video summarization and skimming technique using face detection on broadcast video programs. We take the faces in video as our primary target as they constitute the focus of most consumer video programs. We detect face tracks in video and define face-scene fragments based on start and end of face tracks. We define a fast-forward skimming method using frames selected from fragments, thus covering all the faces and their interactions in the video program. We also define novel constraints for a smooth and visually representative summary, and construct longer but smoother summaries","PeriodicalId":339258,"journal":{"name":"2006 IEEE International Conference on Multimedia and Expo","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2006-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130093529","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
SIKAS: A Scalable Distributed Key Management Scheme for Dynamic Collaborative Groups SIKAS:动态协作组的可伸缩分布式密钥管理方案
Pub Date : 2006-07-09 DOI: 10.1109/ICME.2006.262753
Jiang Zhang, Jian-Guang Luo, Bin Li, Shiqiang Yang
The increasing popularity of distributed and collaborative applications prompts the need for secure communication in collaborative groups. Some distributed collaborative key management protocols have been proposed to provide group communication privacy and data confidentiality for collaborative groups. However, most of them rekey on each member change, and the costs of group rekeying can be quite substantial for large groups with frequent membership changes. In this paper, we propose a scalable distributed key management scheme using a distributed one-way function tree named SIKAS which can significantly reduce the computation and communication costs of maintaining the group key based upon period-based group rekeying. A comparison with previous work has shown that SIKAS provides scalability and rekeying efficiency while preserving both distributed and collaborative properties
分布式和协作应用程序的日益普及促使需要在协作组中进行安全通信。提出了一些分布式协作密钥管理协议,为协作组提供组通信隐私和数据机密性。但是,它们中的大多数在每次成员更改时都需要进行密钥更新,对于成员频繁更改的大型组来说,进行组密钥更新的成本可能相当高。本文提出了一种可扩展的分布式密钥管理方案,该方案使用分布式单向功能树SIKAS,可以显著减少基于周期的组密钥更新的组密钥维护计算和通信开销。与先前工作的比较表明,SIKAS在保留分布式和协作特性的同时提供了可扩展性和更新密钥的效率
{"title":"SIKAS: A Scalable Distributed Key Management Scheme for Dynamic Collaborative Groups","authors":"Jiang Zhang, Jian-Guang Luo, Bin Li, Shiqiang Yang","doi":"10.1109/ICME.2006.262753","DOIUrl":"https://doi.org/10.1109/ICME.2006.262753","url":null,"abstract":"The increasing popularity of distributed and collaborative applications prompts the need for secure communication in collaborative groups. Some distributed collaborative key management protocols have been proposed to provide group communication privacy and data confidentiality for collaborative groups. However, most of them rekey on each member change, and the costs of group rekeying can be quite substantial for large groups with frequent membership changes. In this paper, we propose a scalable distributed key management scheme using a distributed one-way function tree named SIKAS which can significantly reduce the computation and communication costs of maintaining the group key based upon period-based group rekeying. A comparison with previous work has shown that SIKAS provides scalability and rekeying efficiency while preserving both distributed and collaborative properties","PeriodicalId":339258,"journal":{"name":"2006 IEEE International Conference on Multimedia and Expo","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2006-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132372347","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Receiver-Based Optimization for Video Delivery Over Wireless Links 基于接收器的无线视频传输优化
Pub Date : 2006-07-09 DOI: 10.1109/ICME.2006.262637
Carri W. Chan, J. Apostolopoulos, Yan Li, N. Bambos
We consider transfer of video frames over a time-varying wireless channel. When the channel is good, the transmitter can send frames at a higher rate than the receiver can consume them via playout. In that case, we introduce the idea of admitting new frames even when the receiver buffer is full, by selectively evicting frames already in the buffer; we can also control the playout rate, so as to optimize the tradeoff between video distortion and the time to freeze when the channel turns bad and frames arrive at a lower rate than should be played out. The decision/control problem of whether to admit a new frame, which already stored one to evict to accommodate the new one, and at what rate to play out frames is formulated within a dynamic programming framework, and an interesting connection to the Knapsack problem is made. Application of the idea in a relevant simple system shows significant performance gains, indicating that it is a promising approach for improving video delivery performance over challenging wireless channels
我们考虑在时变无线信道上传输视频帧。当信道良好时,发送端发送帧的速率比接收端通过播放接收帧的速率要高。在这种情况下,我们引入了允许新帧的想法,即使接收端缓冲区已满,通过选择性地清除缓冲区中已经存在的帧;我们还可以控制播放速率,以优化视频失真和当信道变坏和帧到达的速率低于应该播放的速率时冻结的时间之间的权衡。在动态规划框架中提出了是否接纳新帧、是否将已存储的帧驱逐以适应新帧以及以何种速率播放帧的决策/控制问题,并与背包问题建立了有趣的联系。在相关简单系统中的应用显示了显著的性能提高,表明它是一种有前途的方法,可以在具有挑战性的无线信道上提高视频传输性能
{"title":"Receiver-Based Optimization for Video Delivery Over Wireless Links","authors":"Carri W. Chan, J. Apostolopoulos, Yan Li, N. Bambos","doi":"10.1109/ICME.2006.262637","DOIUrl":"https://doi.org/10.1109/ICME.2006.262637","url":null,"abstract":"We consider transfer of video frames over a time-varying wireless channel. When the channel is good, the transmitter can send frames at a higher rate than the receiver can consume them via playout. In that case, we introduce the idea of admitting new frames even when the receiver buffer is full, by selectively evicting frames already in the buffer; we can also control the playout rate, so as to optimize the tradeoff between video distortion and the time to freeze when the channel turns bad and frames arrive at a lower rate than should be played out. The decision/control problem of whether to admit a new frame, which already stored one to evict to accommodate the new one, and at what rate to play out frames is formulated within a dynamic programming framework, and an interesting connection to the Knapsack problem is made. Application of the idea in a relevant simple system shows significant performance gains, indicating that it is a promising approach for improving video delivery performance over challenging wireless channels","PeriodicalId":339258,"journal":{"name":"2006 IEEE International Conference on Multimedia and Expo","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2006-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128868100","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
An Architecture Design of Threshold-Based Best-Basis Algorithm 基于阈值的最优基算法体系结构设计
Pub Date : 2006-07-09 DOI: 10.1109/icme.2006.262770
S. Aroutchelvame, K. Raahemifar
The best-basis algorithm has gained much importance on textured-based image compression and denoising of signals. In this paper, an architecture for the wavelet-packet based best-basis algorithm for images is proposed. The paper also describes the architecture for best-tree selection from 2D wavelet packet decomposition. The precision analysis of the proposed architecture is also discussed and the result shows that increase in the precision of input pixel greatly increases the signal-to-noise ratio (SNR) per pixel whereas increase in the precision of filter coefficient does not greatly help in improving the SNR value. The proposed architecture is described in VHDL at the RTL level, simulated successfully for its functional correctness and implemented in an FPGA
最佳基算法在基于纹理的图像压缩和信号去噪中具有重要意义。本文提出了一种基于小波包的图像最佳基算法的结构。本文还描述了从二维小波包分解中选择最佳树的体系结构。对所提出的结构进行了精度分析,结果表明,提高输入像元的精度可大大提高每像元的信噪比,而提高滤波系数的精度对提高信噪比没有太大帮助。在RTL级别用VHDL描述了所提出的体系结构,对其功能正确性进行了成功的仿真,并在FPGA中实现
{"title":"An Architecture Design of Threshold-Based Best-Basis Algorithm","authors":"S. Aroutchelvame, K. Raahemifar","doi":"10.1109/icme.2006.262770","DOIUrl":"https://doi.org/10.1109/icme.2006.262770","url":null,"abstract":"The best-basis algorithm has gained much importance on textured-based image compression and denoising of signals. In this paper, an architecture for the wavelet-packet based best-basis algorithm for images is proposed. The paper also describes the architecture for best-tree selection from 2D wavelet packet decomposition. The precision analysis of the proposed architecture is also discussed and the result shows that increase in the precision of input pixel greatly increases the signal-to-noise ratio (SNR) per pixel whereas increase in the precision of filter coefficient does not greatly help in improving the SNR value. The proposed architecture is described in VHDL at the RTL level, simulated successfully for its functional correctness and implemented in an FPGA","PeriodicalId":339258,"journal":{"name":"2006 IEEE International Conference on Multimedia and Expo","volume":"368 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2006-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126705594","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A Motion Vector Recovery Algorithm for Temporal Error Concealment using Optical Flow in H.264 Video Coding H.264视频编码中基于光流的时间错误隐藏运动矢量恢复算法
Pub Date : 2006-07-09 DOI: 10.1109/ICME.2006.262880
Donghyung Kim, Sanghyup Cho, Jechang Jeong
For the improvement of coding efficiency, the H.264 standard uses new coding tools. Among the tools, motion estimation using smaller block sizes leads to higher correlation between the motion vectors of neighboring blocks. This characteristic of H.264 is useful for motion vector recovery to conceal a lost macroblock. In this paper, we propose the motion vector recovery method based on optical flow in H.264. We first determine optical flow region to alleviate the complexity, and choose initial value of flow velocity using neighboring motion vectors of a lost macroblock. And then the proposed method recovers the motion vectors of 4times4 blocks included in a lost macroblock using the weighted average of obtained flow velocities. Simulation results show that our proposed method gives higher objective and subjective visual qualities than conventional approaches
为了提高编码效率,H.264标准采用了新的编码工具。在这些工具中,使用较小块大小的运动估计导致相邻块的运动向量之间的相关性更高。H.264的这个特性对于运动矢量恢复很有用,可以隐藏丢失的宏块。本文提出了H.264中基于光流的运动矢量恢复方法。我们首先确定光流区域以降低复杂度,然后利用丢失宏块的相邻运动向量选择流速度的初始值。然后利用得到的流速加权平均,恢复丢失宏块中包含的4times4块的运动向量。仿真结果表明,该方法比传统方法具有更高的主客观视觉质量
{"title":"A Motion Vector Recovery Algorithm for Temporal Error Concealment using Optical Flow in H.264 Video Coding","authors":"Donghyung Kim, Sanghyup Cho, Jechang Jeong","doi":"10.1109/ICME.2006.262880","DOIUrl":"https://doi.org/10.1109/ICME.2006.262880","url":null,"abstract":"For the improvement of coding efficiency, the H.264 standard uses new coding tools. Among the tools, motion estimation using smaller block sizes leads to higher correlation between the motion vectors of neighboring blocks. This characteristic of H.264 is useful for motion vector recovery to conceal a lost macroblock. In this paper, we propose the motion vector recovery method based on optical flow in H.264. We first determine optical flow region to alleviate the complexity, and choose initial value of flow velocity using neighboring motion vectors of a lost macroblock. And then the proposed method recovers the motion vectors of 4times4 blocks included in a lost macroblock using the weighted average of obtained flow velocities. Simulation results show that our proposed method gives higher objective and subjective visual qualities than conventional approaches","PeriodicalId":339258,"journal":{"name":"2006 IEEE International Conference on Multimedia and Expo","volume":"51 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2006-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126793816","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Improving Speaker Diarization by Cross EM Refinement 基于交叉EM改进的说话人特征化
Pub Date : 2006-07-09 DOI: 10.1109/ICME.2006.262927
Huazhong Ning, Wei Xu, Yihong Gong, Thomas S. Huang
In this paper, we present a new speaker diarization system that improves the accuracy of traditional hierarchical clustering-based methods with little increase in computational cost. Our contributions are mainly two fold. First, we include a preprocessing called "local clustering" before the hierarchical clustering algorithm to merge very similar adjacent speech segments. This local clustering aims to reduce the number of segments to be clustered by the hierarchical clustering, so as to dramatically increase the processing speed. Second, we perform a postprocessing called "cross EM refinement" to purify the clusters generated by the hierarchical clustering. This algorithm is based on the idea of cross validation and EM algorithm. Our experimental evaluations show that the proposed cross EM refinement approach reduces the speaker diarization error by up to 56%, with an average reduction of 22% compared to the traditional hierarchical clustering method
在本文中,我们提出了一种新的说话人分类系统,它提高了传统的基于层次聚类的方法的准确性,而计算成本却没有增加。我们的贡献主要有两方面。首先,我们在分层聚类算法之前包含一个称为“局部聚类”的预处理,以合并非常相似的相邻语音片段。这种局部聚类的目的是通过分层聚类来减少需要聚类的段数,从而大大提高处理速度。其次,我们执行一个称为“交叉EM细化”的后处理来净化由分层聚类产生的聚类。该算法基于交叉验证和EM算法的思想。实验结果表明,与传统的分层聚类方法相比,本文提出的交叉EM细化方法可将说话人特征化误差降低56%,平均降低22%
{"title":"Improving Speaker Diarization by Cross EM Refinement","authors":"Huazhong Ning, Wei Xu, Yihong Gong, Thomas S. Huang","doi":"10.1109/ICME.2006.262927","DOIUrl":"https://doi.org/10.1109/ICME.2006.262927","url":null,"abstract":"In this paper, we present a new speaker diarization system that improves the accuracy of traditional hierarchical clustering-based methods with little increase in computational cost. Our contributions are mainly two fold. First, we include a preprocessing called \"local clustering\" before the hierarchical clustering algorithm to merge very similar adjacent speech segments. This local clustering aims to reduce the number of segments to be clustered by the hierarchical clustering, so as to dramatically increase the processing speed. Second, we perform a postprocessing called \"cross EM refinement\" to purify the clusters generated by the hierarchical clustering. This algorithm is based on the idea of cross validation and EM algorithm. Our experimental evaluations show that the proposed cross EM refinement approach reduces the speaker diarization error by up to 56%, with an average reduction of 22% compared to the traditional hierarchical clustering method","PeriodicalId":339258,"journal":{"name":"2006 IEEE International Conference on Multimedia and Expo","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2006-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126859732","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Applications of Autostereoscopic Displays in Ophthalmologic Studies 自立体显示器在眼科研究中的应用
Pub Date : 2006-07-09 DOI: 10.1109/ICME.2006.262603
Xiaoyi Jiang, D. Mojon
Autostereoscopic displays are an emerging technology which provide 3D viewing experiences without the need of glasses or other encumbering viewing aids. In this paper we explore their potential in ophthalmologic studies. While the vast majority of applications of autostereoscopic displays in other fields is merely based on producing 3D viewing effects, we can distinguish between several classes of ophthalmologic tasks in which autostereoscopic displays play a very different role. Three concrete applications are described. With the steady improvements in autostereoscopic displays we expect to develop qualitatively new ophthalmologic tests in future
自动立体显示器是一项新兴技术,它提供3D观看体验,而不需要眼镜或其他妨碍观看的辅助设备。本文探讨了它们在眼科研究中的潜力。虽然绝大多数的应用在其他领域的自动立体显示器仅仅是基于产生3D观看效果,我们可以区分几种类型的眼科任务,其中自动立体显示器发挥了非常不同的作用。介绍了三种具体应用。随着自立体显示技术的不断进步,我们期望在未来开发出新的眼科检测方法
{"title":"Applications of Autostereoscopic Displays in Ophthalmologic Studies","authors":"Xiaoyi Jiang, D. Mojon","doi":"10.1109/ICME.2006.262603","DOIUrl":"https://doi.org/10.1109/ICME.2006.262603","url":null,"abstract":"Autostereoscopic displays are an emerging technology which provide 3D viewing experiences without the need of glasses or other encumbering viewing aids. In this paper we explore their potential in ophthalmologic studies. While the vast majority of applications of autostereoscopic displays in other fields is merely based on producing 3D viewing effects, we can distinguish between several classes of ophthalmologic tasks in which autostereoscopic displays play a very different role. Three concrete applications are described. With the steady improvements in autostereoscopic displays we expect to develop qualitatively new ophthalmologic tests in future","PeriodicalId":339258,"journal":{"name":"2006 IEEE International Conference on Multimedia and Expo","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2006-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121430969","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Enhanced Semi-Supervised Learning for Automatic Video Annotation 用于自动视频注释的增强半监督学习
Pub Date : 2006-07-09 DOI: 10.1109/ICME.2006.262823
Meng Wang, Xiansheng Hua, Lirong Dai, Yan Song
For automatic semantic annotation of large-scale video database, the insufficiency of labeled training samples is a major obstacle. General semi-supervised learning algorithms can help solve the problem but the improvement is limited. In this paper, two semi-supervised learning algorithms, self-training and co-training, are enhanced by exploring the temporal consistency of semantic concepts in video sequences. In the enhanced algorithms, instead of individual shots, time-constraint shot clusters are taken as the basic sample units, in which most mis-classifications can be corrected before they are applied for re-training, thus more accurate statistical models can be obtained. Experiments show that enhanced self-training/co-training significantly improves the performance of video annotation
对于大规模视频数据库的自动语义标注,标记训练样本的不足是一个主要障碍。一般的半监督学习算法可以帮助解决这个问题,但改进是有限的。本文通过探索视频序列中语义概念的时间一致性,对自训练和协同训练两种半监督学习算法进行了改进。在增强算法中,以时间约束的投篮聚类代替单个投篮作为基本样本单位,可以在重新训练之前纠正大多数错误分类,从而获得更准确的统计模型。实验表明,增强的自训练/协同训练显著提高了视频标注的性能
{"title":"Enhanced Semi-Supervised Learning for Automatic Video Annotation","authors":"Meng Wang, Xiansheng Hua, Lirong Dai, Yan Song","doi":"10.1109/ICME.2006.262823","DOIUrl":"https://doi.org/10.1109/ICME.2006.262823","url":null,"abstract":"For automatic semantic annotation of large-scale video database, the insufficiency of labeled training samples is a major obstacle. General semi-supervised learning algorithms can help solve the problem but the improvement is limited. In this paper, two semi-supervised learning algorithms, self-training and co-training, are enhanced by exploring the temporal consistency of semantic concepts in video sequences. In the enhanced algorithms, instead of individual shots, time-constraint shot clusters are taken as the basic sample units, in which most mis-classifications can be corrected before they are applied for re-training, thus more accurate statistical models can be obtained. Experiments show that enhanced self-training/co-training significantly improves the performance of video annotation","PeriodicalId":339258,"journal":{"name":"2006 IEEE International Conference on Multimedia and Expo","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2006-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126561518","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
Low Latency Video Streaming Over Peer-To-Peer Networks 低延迟视频流在点对点网络
Pub Date : 2006-07-09 DOI: 10.1109/ICME.2006.262472
Eric Setton, Jeonghun Noh, B. Girod
We study peer-to-peer multicast streaming, where a source distributes real-time video to a large population of hosts by making use of their forwarding capacity rather than relying on dedicated media servers. We present a distributed streaming protocol which builds and maintains multiple multicast trees. The protocol is combined with an adaptive scheduling algorithm which ensures packets destined to a large number of peers, or particularly important to decode the video, are sent in priority. Experiments carried out over a simulated network of up to 3000 peers illustrate the performance of the protocol. For low latency video streaming, the prioritization algorithm offers performance gains, especially for large audiences and low latencies
我们研究了点对点多播流,其中一个源通过利用它们的转发能力而不是依赖于专用的媒体服务器,将实时视频分发给大量主机。提出了一种分布式流协议,该协议可以构建和维护多个组播树。该协议与自适应调度算法相结合,确保发送到大量对等体或对视频解码特别重要的数据包优先发送。在多达3000个节点的模拟网络上进行的实验证明了该协议的性能。对于低延迟视频流,优先级算法提供了性能提升,特别是对于大量观众和低延迟
{"title":"Low Latency Video Streaming Over Peer-To-Peer Networks","authors":"Eric Setton, Jeonghun Noh, B. Girod","doi":"10.1109/ICME.2006.262472","DOIUrl":"https://doi.org/10.1109/ICME.2006.262472","url":null,"abstract":"We study peer-to-peer multicast streaming, where a source distributes real-time video to a large population of hosts by making use of their forwarding capacity rather than relying on dedicated media servers. We present a distributed streaming protocol which builds and maintains multiple multicast trees. The protocol is combined with an adaptive scheduling algorithm which ensures packets destined to a large number of peers, or particularly important to decode the video, are sent in priority. Experiments carried out over a simulated network of up to 3000 peers illustrate the performance of the protocol. For low latency video streaming, the prioritization algorithm offers performance gains, especially for large audiences and low latencies","PeriodicalId":339258,"journal":{"name":"2006 IEEE International Conference on Multimedia and Expo","volume":"15 4","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2006-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114114667","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 39
期刊
2006 IEEE International Conference on Multimedia and Expo
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1