2012 IEEE International Conference on Multimedia and Expo Workshops: Latest Papers

Random Subspace Method for Gait Recognition
Pub Date : 2012-07-09 DOI: 10.1109/ICMEW.2012.55
Yu Guan, Chang-Tsun Li, Yongjian Hu
Overfitting is a common problem for gait recognition algorithms when the gallery gait sequences used for training are acquired under a single walking condition. In this paper, we propose an approach based on the random subspace method (RSM) to address this overfitting problem. First, two-dimensional Principal Component Analysis (2DPCA) is adopted to obtain the full hypothesis space (i.e., the eigenspace). Multiple inductive biases (i.e., subspaces) are then constructed, each with basis vectors randomly chosen from the initial eigenspace. This procedure not only largely avoids overfitting but also facilitates dimension reduction. The final classification is made by a decision committee that applies majority voting to the labeling results of all the subspaces. Experimental results on the benchmark USF Human ID gait database show that the proposed method is a feasible framework for gait recognition under unknown walking conditions.
Citations: 21
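The random-subspace voting idea in the gait recognition abstract above can be illustrated with a minimal sketch. It assumes each sample has already been projected onto the full eigenspace and flattened into a list of coefficients. The 2DPCA step itself is not shown, and the nearest-neighbor base classifier, the function names, and the default parameters are illustrative assumptions rather than the authors' settings.

```python
import random
from collections import Counter


def nearest_label(probe, gallery, labels, dims):
    """1-nearest-neighbor label using only the coordinates listed in dims."""
    best_label, best_dist = None, float("inf")
    for feat, label in zip(gallery, labels):
        dist = sum((probe[d] - feat[d]) ** 2 for d in dims)
        if dist < best_dist:
            best_label, best_dist = label, dist
    return best_label


def rsm_classify(probe, gallery, labels, n_subspaces=11, subspace_dim=2, seed=0):
    """Majority vote over classifiers built on random coordinate subsets.

    Each coordinate stands for one basis vector of the full eigenspace
    (an assumption of this sketch).
    """
    rng = random.Random(seed)
    full_dim = len(probe)
    votes = []
    for _ in range(n_subspaces):
        # One random subspace: a random subset of the basis vectors.
        dims = rng.sample(range(full_dim), subspace_dim)
        votes.append(nearest_label(probe, gallery, labels, dims))
    return Counter(votes).most_common(1)[0][0]
```

Because every committee member sees only `subspace_dim` of the available coordinates, each one is also a reduced-dimension classifier, which is the dimension reduction side effect the abstract mentions.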
Cloud-Based E-Health Multimedia Framework for Heterogeneous Network
Pub Date : 2012-07-09 DOI: 10.1109/ICMEW.2012.84
Atif Alamri
There is a strong need to improve the transmission mechanism for eHealth management and multimedia services over a heterogeneous network. Efficient access to patient health information, and its communication and delivery across various devices, both face challenges. Specifically, the devices involved include those with computation and power supply limitations, and HDTVs, which have a high bandwidth requirement but limited resources. Network resource limitations are mostly characterized by throughput variations and by delay or transmission errors. An adaptive video bit rate is therefore a desirable key feature for multimedia health services. In this paper, we propose and demonstrate a cloud computing framework that uses scalable video coding. Its efficient video compression relieves heterogeneous devices when they execute heavier multimedia applications, use database storage, or deliver mobile health services over limited bandwidth resources. Simulation results show that the proposed scheme achieves a significant improvement in PSNR-Y gain compared to the existing scheme.
Citations: 16
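The e-health abstract above reports its result as a PSNR-Y gain but does not say how it was measured. As a hedged illustration, the sketch below uses the standard PSNR definition applied to the luma (Y) plane of 8-bit video. The function name and the list-of-rows frame format are assumptions of this sketch.

```python
import math


def psnr_y(reference, decoded, peak=255.0):
    """PSNR in dB between two equally sized luma planes given as 2D lists."""
    squared_error = 0.0
    count = 0
    for ref_row, dec_row in zip(reference, decoded):
        for r, d in zip(ref_row, dec_row):
            squared_error += (r - d) ** 2
            count += 1
    if squared_error == 0:
        return math.inf  # identical planes: no distortion to measure
    mse = squared_error / count
    return 10.0 * math.log10(peak * peak / mse)
```

Under this definition, a PSNR-Y gain between two schemes would be `psnr_y(ref, proposed) - psnr_y(ref, baseline)` for the same reference frame.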
An Automatic Multi-sample 3D Face Registration Method Based on Thin Plate Spline and Deformable Model
Pub Date : 2012-07-09 DOI: 10.1109/ICMEW.2012.85
Wenyu Qin, Yongli Hu, Yanfeng Sun, Baocai Yin
Non-rigid registration of 3D facial surfaces is a crucial step in a variety of computer vision tasks. In this paper, we present a fully automatic 3D face registration method based on the thin plate spline (TPS) and a deformable model. To model the non-rigid modality of complex 3D facial surfaces, the thin plate spline is adopted to represent the transformation between 3D faces. The farthest point sampling (FPS) method is used to generate the control points for the thin plate spline transformation automatically. Registration proceeds in two phases. First, a preliminary registration is obtained by closest-point searching between the TPS-transformed reference and the target. Then, multi-sample registration is performed to improve the precision of the registration by using a dynamic reference produced from the deformable model. Countermeasures to eliminate outliers are presented in both phases. Experiments on the BU-3DFE and BJUT-3D face databases show that the proposed method is effective and robust.
Citations: 6
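The farthest point sampling step named in the face registration abstract above can be sketched as follows. This is a generic greedy version that uses straight-line Euclidean distance between vertices. The abstract does not say whether the authors use Euclidean or on-surface (geodesic) distance, so that choice, the function name, and the fixed starting vertex are assumptions.

```python
import math


def farthest_point_sampling(points, k, start=0):
    """Greedily pick k point indices, each farthest from those already picked."""
    k = min(k, len(points))
    chosen = [start]
    # Distance from every point to its nearest chosen point so far.
    min_dist = [math.dist(p, points[start]) for p in points]
    while len(chosen) < k:
        nxt = max(range(len(points)), key=lambda i: min_dist[i])
        chosen.append(nxt)
        for i, p in enumerate(points):
            min_dist[i] = min(min_dist[i], math.dist(p, points[nxt]))
    return chosen
```

The selected indices spread evenly over the point set, which is why the technique suits automatic placement of spline control points.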
Fragment Reduction on Mobile GPU with Content Adaptive Sampling
Pub Date : 2012-07-09 DOI: 10.1109/ICMEW.2012.115
Chia-Yang Chang, Yu-Jung Chen, Chia-Ming Chang, Shao-Yi Chien
Fragment shaders in a graphics pipeline compute the color of each pixel, which involves lighting, texture loading, and other calculations. The required computing power is proportional to the number of input fragments. To improve the power efficiency of mobile GPUs, a content adaptive sampling scheme is proposed to reduce the number of fragments. The proposed scheme is based on tile-based traversal. For each 4×4 tile, only some of the fragments are sampled and rendered with the original shader program; if the content checking condition is passed, the values of the other fragments are interpolated from these rendered fragments. With this approach, the sampling patterns are decided adaptively, and more samples are used in complex regions to avoid quality degradation. Experimental results show that about 30% to 50% of fragments can be eliminated while high image quality is still maintained. The proposed scheme can be employed to reduce power consumption and increase the frame rate of mobile GPUs.
Citations: 0
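The per-tile sample, check, and interpolate flow described in the fragment reduction abstract above can be sketched on the CPU as follows. The abstract does not specify the sampling patterns or the content checking condition, so this sketch assumes the simplest possible choices: the four tile corners are shaded, the check is a threshold on their value range, and the rest is filled by bilinear interpolation. The `shade` callback and all parameter names are hypothetical.

```python
def render_tile(shade, x0, y0, size=4, threshold=0.05):
    """Render one tile; return (tile values, number of shader invocations).

    shade(x, y) is a hypothetical stand-in for the original fragment shader
    and returns one float per fragment.
    """
    last = size - 1
    corners = {
        (0, 0): shade(x0, y0),
        (last, 0): shade(x0 + last, y0),
        (0, last): shade(x0, y0 + last),
        (last, last): shade(x0 + last, y0 + last),
    }
    shaded = 4
    values = list(corners.values())
    # Assumed content check: the sampled values barely differ.
    smooth = max(values) - min(values) <= threshold
    tile = [[0.0] * size for _ in range(size)]
    for j in range(size):
        for i in range(size):
            if (i, j) in corners:
                tile[j][i] = corners[(i, j)]
            elif smooth:
                # Bilinear interpolation from the four shaded corners.
                u, v = i / last, j / last
                top = corners[(0, 0)] * (1 - u) + corners[(last, 0)] * u
                bottom = corners[(0, last)] * (1 - u) + corners[(last, last)] * u
                tile[j][i] = top * (1 - v) + bottom * v
            else:
                # Complex content: fall back to shading every fragment.
                tile[j][i] = shade(x0 + i, y0 + j)
                shaded += 1
    return tile, shaded
```

A corner-only check like this one misses detail that sits strictly inside the tile, which is one reason an adaptive scheme would use denser sampling patterns in complex regions.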
A Novel Automatic Hierachical Approach to Music Genre Classification
Pub Date : 2012-07-09 DOI: 10.1109/ICMEW.2012.104
H. Ariyaratne, Dengsheng Zhang
Automatic music genre classification is an important component of Music Information Retrieval (MIR). It has gained a lot of attention lately due to the rapid growth in the use of digital music. Past work in this area has already produced a number of audio features and classification techniques; however, genre classification remains an unsolved problem. In this paper, we explore a hybrid unsupervised/supervised top-down hierarchical classification approach. Most existing work on hierarchical music genre classification relies on human-built trees and taxonomies; however, these hierarchies may not always translate well into machine classification problems. Therefore, we explore an automatic approach that constructs a classification tree through subspace cluster analysis. Experimental results validate the tree building algorithm and provide a new research direction for automatic genre classification. We also address the scarcity of publicly available music datasets by introducing a new dataset containing genre, artist and album labels.
Citations: 23
Distributed Augmented Reality Systems: How Much Performance is Enough?
Pub Date : 2012-07-09 DOI: 10.1109/ICMEW.2012.64
M. Chouiten, Jean-Yves Didier, M. Mallem
This paper presents architectural performance requirements for a framework dedicated to Augmented Reality (AR) applications. This work serves as a starting point for evaluating a framework named ARCS (Augmented Reality Component Systems), which allows rapid prototyping, development, debugging and monitoring of AR applications. The article first introduces the need for such a dedicated framework and the importance of distribution for AR. It also identifies the main features that an AR framework should offer and the key performance criteria it should satisfy. These are partially inspired by the SPE (Software Performance Engineering) distributed architecture evaluation approach. These criteria are application specific. Thus, we first classify distributed AR applications into scenarios and extract the relevant criteria for each. We also provide a custom assessment protocol inspired by the SPE evaluation methodology and based on the previously identified criteria. This assessment is application independent, as it is based on generic design patterns and the most likely distribution scenarios. Keeping in mind the identified criteria and handling the existing framework's constraints, a reliable generic AR framework should satisfy all criteria for each application class. Assessment protocol results for each criterion are compared to the typical features and performance that an AR system should offer.
Citations: 7
Classification-Based Adaptive Compression Method for Computer Screen Image
Pub Date : 2012-07-09 DOI: 10.1109/ICMEW.2012.9
Yanfei Shen, Jintao Li, Zhenmin Zhu, Yun Song
In this paper, a classification-based adaptive compression method for computer screen images is presented. The method first divides the screen image into non-overlapping 16×16 blocks, and then classifies every block into one of three types (text/graphic, pictorial, or hybrid) based on the characteristics of its histogram distribution and its number of colors. For complex text/graphic blocks, k-means clustering is used to reduce the number of colors and thereby improve compression performance. Finally, text/graphic blocks are coded by our proposed lossless coding method, hybrid blocks by a hybrid coding method, and pictorial blocks by an H.264-like intra coding method. Experimental results show that the proposed block classification method accurately distinguishes the three block types, and that the color clustering method effectively reduces the number of colors in complex text/graphic blocks. The compression performance and subjective image quality of the proposed method can outperform those of JPEG and JP2k.
Citations: 6
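The block classification step in the screen image compression abstract above relies on the color histogram and the number of colors in each 16×16 block. The abstract gives no thresholds or decision rules, so the following is only a plausible sketch. The three cut-off parameters, their default values, and the decision order are assumptions.

```python
from collections import Counter


def classify_block(pixels, few_colors=8, many_colors=64, dominant_share=0.9):
    """Label a block as 'text/graphic', 'pictorial', or 'hybrid'.

    pixels is a flat list of hashable color values for one block.
    All thresholds are illustrative, not taken from the paper.
    """
    hist = Counter(pixels)
    n_colors = len(hist)
    if n_colors <= few_colors:
        # A handful of flat colors: typical of text and simple graphics.
        return "text/graphic"
    # Share of the block covered by its most frequent colors.
    top = sum(count for _, count in hist.most_common(few_colors))
    if n_colors >= many_colors and top / len(pixels) < dominant_share:
        # Many colors and no dominant ones: typical of natural images.
        return "pictorial"
    return "hybrid"
```

A block labeled `text/graphic` with slightly too many colors is the case where the abstract's k-means color reduction would apply before lossless coding.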
A Visual Search User Study on the Influences of Aspect Ratio Distortion of Preview Thumbnails
Pub Date : 2012-07-09 DOI: 10.1109/ICMEW.2012.101
David Ahlström, Klaus Schöffmann
Most image and video retrieval tools used for large-scale media collections present query results as thumbnails arranged in a grid-like display with each thumbnail preserving the aspect ratio of its corresponding source image or video. Often, the outcome of a query is a set of thumbnails with different aspect ratios, thus a varying amount of padding space is used between the thumbnails in the display. This results in a visually erratic display that conflicts with interface design rules and aesthetic principles stipulating alignment and the usage of straight visual lines to guide the human eye while scanning the display. A solution is to create equally sized thumbnails by using cropping algorithms. However, this may remove useful search information. We investigated a simple alternative: to distort thumbnails to the same aspect ratio in order to provide a calm and structured display with straight lines between thumbnails. In a user experiment we evaluated whether and how much such a horizontal distortion can be applied without hampering visual search performance. The results show that distortion does not notably influence error rate and visual search time.
Citations: 1
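As a small worked example of the horizontal distortion discussed in the thumbnail study above, the sketch below computes how much a thumbnail must be stretched or squeezed in width to reach a common aspect ratio at fixed height. The 4:3 target and the function name are illustrative assumptions; the abstract does not state which ratios were tested.

```python
def horizontal_distortion(width, height, target_ratio=4 / 3):
    """Return (new_width, stretch) that brings a thumbnail to target_ratio.

    Height is kept; stretch < 1 squeezes the image, stretch > 1 widens it.
    """
    new_width = round(height * target_ratio)
    stretch = new_width / width
    return new_width, stretch
```

For a 1600×900 source (16:9) and a 4:3 target, the width shrinks to 1200 pixels, a horizontal scale factor of 0.75.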
Memetic Communication Media - Concepts, Technologies, Applications
Pub Date : 2012-07-09 DOI: 10.1109/ICMEW.2012.51
K. Jantke, J. Fujima, O. Arnold, A. Schulz
Memetics is an approach to interpret, understand and possibly manage communication and knowledge evolution in a Darwinistic way. Meme Media are implementing Memetics. Intelligent Pad has been the earliest Meme Media middleware. Webble Technology is the most recent form of Intelligent Pad. Webbles are objects in a browser window that allow for direct manipulation by drag and drop. One may pick up any Webble and move it over any other one for operational combination. The new Webble is plugged into the previous one and data between them flow through predefined slots. Human users may reconfigure the slot connections of Webbles as desired. The concepts and the technology are setting the stage for new ways of human communication including playful learning. Webbles at the human users' fingertips allow for exploration and explanation by direct manipulation, for collaboration as well as for competition, and for trial and error investigation.
Citations: 4
Non-rigid 3D Model Retrieval Using Set of Local Statistical Features
Pub Date : 2012-07-09 DOI: 10.1109/ICMEW.2012.109
Yuki Ohkita, Yuya Ohishi, T. Furuya, Ryutarou Ohbuchi
Various algorithms have been developed for shape-based retrieval of non-rigid 3D models with invariance to articulation and/or global deformation. Most of these algorithms assume that 3D models have mathematically well-defined representations, e.g., closed manifold meshes. They are thus not applicable to other types of shape models, for example, those defined as polygon soup. This paper proposes a 3D model retrieval algorithm that accepts diverse 3D shape representations and is able to compare non-rigid 3D models. The algorithm describes a 3D model with a set of hundreds to thousands of local statistical 3D features. These features are integrated into one feature vector per 3D model using a bag-of-features approach, both for efficiency in comparing 3D models and for invariance to articulation and global deformation. Experimental evaluation showed that the algorithm performed well for non-rigid 3D model retrieval.
Citations: 31
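The bag-of-features step mentioned in the abstract of "Non-rigid 3D Model Retrieval Using Set of Local Statistical Features" can be illustrated with a minimal sketch. Everything specific here is an assumption for illustration, not the authors' method: the local features are random 8-D vectors, the codebook is learned with plain k-means, the histogram is L1-normalized, and models are compared with an L1 distance. The function names `build_codebook`, `bag_of_features`, and `l1_distance` are hypothetical.

```python
import numpy as np

def build_codebook(features, k, iters=10, seed=0):
    """Learn k 'visual words' with plain k-means (assumed clustering step)."""
    rng = np.random.default_rng(seed)
    idx = rng.choice(len(features), size=k, replace=False)
    centers = features[idx].astype(float)
    for _ in range(iters):
        # Assign each local feature to its nearest center.
        d = np.linalg.norm(features[:, None, :] - centers[None, :, :], axis=2)
        labels = d.argmin(axis=1)
        for j in range(k):
            members = features[labels == j]
            if len(members):  # keep the old center if a cluster is empty
                centers[j] = members.mean(axis=0)
    return centers

def bag_of_features(local_features, codebook):
    """Quantize one model's local features into an L1-normalized histogram."""
    d = np.linalg.norm(local_features[:, None, :] - codebook[None, :, :], axis=2)
    words = d.argmin(axis=1)
    hist = np.bincount(words, minlength=len(codebook)).astype(float)
    return hist / hist.sum()

def l1_distance(h1, h2):
    """Compare two models by the L1 distance of their histograms (assumed metric)."""
    return float(np.abs(h1 - h2).sum())

# Synthetic stand-ins for local features; real ones would come from 3D models.
rng = np.random.default_rng(42)
train = rng.normal(size=(500, 8))      # pooled local features for the codebook
codebook = build_codebook(train, k=16)
model_a = rng.normal(size=(300, 8))    # local feature set of one model
hist_a = bag_of_features(model_a, codebook)
hist_shuffled = bag_of_features(model_a[::-1], codebook)
print(hist_a.shape, l1_distance(hist_a, hist_shuffled))
```

Because the histogram only counts which codebook entries occur, it ignores the order and spatial arrangement of the local features. That orderless property is what a bag-of-features representation relies on when each model is reduced to a single fixed-length vector for comparison.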
Journal
2012 IEEE International Conference on Multimedia and Expo Workshops