首页 > 最新文献

2015 IEEE International Symposium on Multimedia (ISM)最新文献

英文 中文
Nonlocal Adaptive In-Loop Filter via Content-Dependent Soft-Thresholding for HEVC 基于内容相关软阈值的HEVC非局部自适应环内滤波
Pub Date : 2015-12-01 DOI: 10.1109/ISM.2015.56
Xinfeng Zhang, Weisi Lin, Shiqi Wang, Siwei Ma
In-loop filters have been widely utilized in latest video coding standards to improve the video coding efficiency by reducing compression artifacts. However, existing in-loop filters only utilize image local correlations, leading to limited performance improvement. In this paper, we explore a novel adaptive in-loop filter by means of the nonlocal similar content to improve the quality of reconstructed video frames. In our proposed filter, the input video frame is first divided into different image patch groups based on their similarity, and then a soft-thresholding method is applied to the singular values of matrices composed of image patches in every group. Since compression noise is highly correlated with image content, we propose a group-wise threshold estimation method based on image statistical characteristics, coding modes and quantization parameters. To ensure the filtering efficiency, slice level control flags are utilized and determined based on the distortion changes after filtering. The proposed in-loop filter is integrated into HM7.0, and experimental results show that it can significantly improve the performance of HEVC on top of the state-of-the-art in-loop filters.
环内滤波器在最新的视频编码标准中得到了广泛的应用,通过减少压缩伪影来提高视频编码效率。然而,现有的环内滤波器只利用图像局部相关性,导致性能提高有限。本文研究了一种利用非局部相似内容的自适应环内滤波器,以提高重构视频帧的质量。在我们提出的滤波器中,首先根据输入视频帧的相似度将其划分为不同的图像补丁组,然后对每组图像补丁组成的矩阵的奇异值采用软阈值分割方法。由于压缩噪声与图像内容高度相关,我们提出了一种基于图像统计特征、编码模式和量化参数的分组阈值估计方法。为了保证滤波效率,利用了片电平控制标志,并根据滤波后的失真变化来确定。将所提出的环内滤波器集成到HM7.0中,实验结果表明,在现有环内滤波器的基础上,该滤波器能显著提高HEVC的性能。
{"title":"Nonlocal Adaptive In-Loop Filter via Content-Dependent Soft-Thresholding for HEVC","authors":"Xinfeng Zhang, Weisi Lin, Shiqi Wang, Siwei Ma","doi":"10.1109/ISM.2015.56","DOIUrl":"https://doi.org/10.1109/ISM.2015.56","url":null,"abstract":"In-loop filters have been widely utilized in latest video coding standards to improve the video coding efficiency by reducing compression artifacts. However, existing in-loop filters only utilize image local correlations, leading to limited performance improvement. In this paper, we explore a novel adaptive in-loop filter by means of the nonlocal similar content to improve the quality of reconstructed video frames. In our proposed filter, the input video frame is first divided into different image patch groups based on their similarity, and then a soft-thresholding method is applied to the singular values of matrices composed of image patches in every group. Since compression noise is highly correlated with image content, we propose a group-wise threshold estimation method based on image statistical characteristics, coding modes and quantization parameters. To ensure the filtering efficiency, slice level control flags are utilized and determined based on the distortion changes after filtering. The proposed in-loop filter is integrated into HM7.0, and experimental results show that it can significantly improve the performance of HEVC on top of the state-of-the-art in-loop filters.","PeriodicalId":250353,"journal":{"name":"2015 IEEE International Symposium on Multimedia (ISM)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134438213","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Error Protection with Extended Dual Frame Motion Compensation 扩展双帧运动补偿的错误保护
Pub Date : 2015-12-01 DOI: 10.1109/ISM.2015.103
Da Liu, Li Wang, Chunyan Li, Y. Hao, Fang Yin
In this paper, an error resilient coding structure is proposed. Firstly an extended dual frame motion compensation (DFMC) coding structure is proposed, and then an end-to-end distortion model with error resilient filter is used for mode decision in Macroblock (MB) level rate distortion cost decision, finally the number of header bits packet in HQF is determined in frame level rate cost decision. Experimental results show that the proposed method can achieve better performance than previous schemes.
本文提出了一种抗错误编码结构。首先提出了一种扩展的双帧运动补偿(DFMC)编码结构,然后在Macroblock (MB)级速率失真代价决策中采用带错误弹性滤波器的端到端失真模型进行模式决策,最后在帧级速率代价决策中确定HQF中的报头比特数。实验结果表明,该方法比以往的方法具有更好的性能。
{"title":"Error Protection with Extended Dual Frame Motion Compensation","authors":"Da Liu, Li Wang, Chunyan Li, Y. Hao, Fang Yin","doi":"10.1109/ISM.2015.103","DOIUrl":"https://doi.org/10.1109/ISM.2015.103","url":null,"abstract":"In this paper, an error resilient coding structure is proposed. Firstly an extended dual frame motion compensation (DFMC) coding structure is proposed, and then an end-to-end distortion model with error resilient filter is used for mode decision in Macroblock (MB) level rate distortion cost decision, finally the number of header bits packet in HQF is determined in frame level rate cost decision. Experimental results show that the proposed method can achieve better performance than previous schemes.","PeriodicalId":250353,"journal":{"name":"2015 IEEE International Symposium on Multimedia (ISM)","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115434985","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
On Influencing Mobile Live Video Broadcasting Users 浅谈影响移动视频直播用户
Pub Date : 2015-12-01 DOI: 10.1109/ISM.2015.110
Stefan Wilk, Dimitri Wulffert, W. Effelsberg
A recent trend in user-generated video is to broadcast live video from mobile phones. Mobile broadcasting platforms such as YouNow or Periscope understood this trend and attract multiple thousands of concurrent views per second. Amateur produced mobile live video often suffers from only a limited duration of the recordings. Usually, the live recordings do not cover entire events. To address this problem our approach tries to understand how to increase the recording duration by integrating features for influencing the user behavior including gamification.
最近用户生成视频的一个趋势是通过手机直播视频。YouNow或Periscope等移动广播平台理解了这一趋势,并吸引了每秒数千次的并发观看。业余制作的移动直播视频通常只有有限的录制时间。通常,现场录音不会覆盖整个事件。为了解决这个问题,我们的方法试图理解如何通过整合影响用户行为的功能(包括游戏化)来增加记录时间。
{"title":"On Influencing Mobile Live Video Broadcasting Users","authors":"Stefan Wilk, Dimitri Wulffert, W. Effelsberg","doi":"10.1109/ISM.2015.110","DOIUrl":"https://doi.org/10.1109/ISM.2015.110","url":null,"abstract":"A recent trend in user-generated video is to broadcast live video from mobile phones. Mobile broadcasting platforms such as YouNow or Periscope understood this trend and attract multiple thousands of concurrent views per second. Amateur produced mobile live video often suffers from only a limited duration of the recordings. Usually, the live recordings do not cover entire events. To address this problem our approach tries to understand how to increase the recording duration by integrating features for influencing the user behavior including gamification.","PeriodicalId":250353,"journal":{"name":"2015 IEEE International Symposium on Multimedia (ISM)","volume":"35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114982875","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
MOOC-DASH: A DASH System for Delivering High-Quality MOOCs Videos MOOC-DASH:提供高质量mooc视频的DASH系统
Pub Date : 2015-12-01 DOI: 10.1109/ISM.2015.15
Yi Wang, Wenjun Wu, Yihua Lou
Adaptive video streaming is very important for delivering high-quality video content of MOOCs (Massive Online Open Courses) to online learners because they often have Internet connections with different levels of bandwidth. Although DASH (Dynamic adaptive streaming over HTTP) is widely accepted as a viable streaming technology to implement scalable Internet streaming using HTTP transport, few research efforts have been made to investigate how to apply this relatively new technology on improving the quality of experience (QoE) of MOOC video streaming. This paper proposes a DASH scheme for MOOC video streaming (MOOC-DASH) to improve QoE of DASH-based MOOC. This scheme consists of content-aware ROI-based video encoding for MOOCs video and bitrate selection algorithm to provide a high-quality and smooth video streaming service to online learners. Experimental results demonstrate that it can effectively reduce the bandwidth of the video content and improve QoE of MOOC Streaming.
自适应视频流对于向在线学习者提供高质量的mooc(大规模在线开放课程)视频内容非常重要,因为他们通常有不同带宽水平的互联网连接。尽管DASH (Dynamic adaptive streaming over HTTP)被广泛认为是一种可行的流媒体技术,可以使用HTTP传输实现可扩展的互联网流媒体,但很少有人研究如何将这种相对较新的技术应用于提高MOOC视频流媒体的体验质量(QoE)。为了提高基于DASH的MOOC视频流的QoE,本文提出了一种面向MOOC视频流的DASH方案(MOOC-DASH)。该方案由面向mooc视频的基于内容感知roi的视频编码和比特率选择算法组成,为在线学习者提供高质量、流畅的视频流服务。实验结果表明,该方法可以有效地降低视频内容的带宽,提高MOOC流媒体的QoE。
{"title":"MOOC-DASH: A DASH System for Delivering High-Quality MOOCs Videos","authors":"Yi Wang, Wenjun Wu, Yihua Lou","doi":"10.1109/ISM.2015.15","DOIUrl":"https://doi.org/10.1109/ISM.2015.15","url":null,"abstract":"Adaptive video streaming is very important for delivering high-quality video content of MOOCs (Massive Online Open Courses) to online learners because they often have Internet connections with different levels of bandwidth. Although DASH (Dynamic adaptive streaming over HTTP) is widely accepted as a viable streaming technology to implement scalable Internet streaming using HTTP transport, few research efforts have been made to investigate how to apply this relatively new technology on improving the quality of experience (QoE) of MOOC video streaming. This paper proposes a DASH scheme for MOOC video streaming (MOOC-DASH) to improve QoE of DASH-based MOOC. This scheme consists of content-aware ROI-based video encoding for MOOCs video and bitrate selection algorithm to provide a high-quality and smooth video streaming service to online learners. Experimental results demonstrate that it can effectively reduce the bandwidth of the video content and improve QoE of MOOC Streaming.","PeriodicalId":250353,"journal":{"name":"2015 IEEE International Symposium on Multimedia (ISM)","volume":"386 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116328619","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
An Automatic Video Reinforcing System Based on Popularity Rating of Scenes and Level of Detail Controlling 基于场景人气分级和细节控制的视频自动增强系统
Pub Date : 2015-12-01 DOI: 10.1109/ISM.2015.31
Yuanyuan Wang, Yukiko Kawai, K. Sumiya, Y. Ishikawa
With the advance of video-on-demand (VOD) services such as Netfix, users are able to watch many kinds of videos anytime and anywhere. While watching a video, recently, users often search related information about it through the Web by using mobile PC. However, users cannot satisfactorily understand and enjoy it because the video keeps playing when they search about it. It is necessary to detect various questions of the video to supplement their related information about each scene for automatic search. However, only one video includes various topics of each scene, furthermore, viewers have different levels of knowledge. Therefore, we have developed a novel automatic video reinforcing system, called TV-Binder, it generates new video contents from one video stream related to viewers' interests and knowledge by adding other related contents (i.e., YouTube videos, images or maps) and by removing unnecessary original scenes, based on topics of each scene. As a result, viewers can satisfy and joyfully watch modified video contents without searching anything. At first, our system extract topics and detect their scenes of a video stream by using closed captions. The system then searches other necessary contents and determines unwanted original scenes based on popularity rating of each original scene and level of detail (LOD) controlling under time pressure. Through this, TV-Binder can automatically generate video contents are classified into four quadrants by two axes, one is digest and detailed videos, the other one is videos for experts with knowledge about particular topics and ordinary viewers without special knowledge. In this paper, we discuss our automatic video reinforcing system and an evaluation of its effectiveness.
随着像Netfix这样的视频点播(VOD)服务的发展,用户可以随时随地观看多种视频。最近,用户在观看视频时,经常使用移动PC通过网络搜索相关信息。但是,用户在搜索的时候,视频一直在播放,不能很好地理解和享受。需要对视频中的各种问题进行检测,以补充每个场景的相关信息进行自动搜索。然而,只有一个视频包含了每个场景的各种主题,而且观众的知识水平也不同。因此,我们开发了一种新颖的自动视频强化系统,称为TV-Binder,它根据每个场景的主题,通过添加其他相关内容(即YouTube视频,图像或地图)并删除不必要的原始场景,从一个视频流中生成与观众的兴趣和知识相关的新视频内容。因此,观众无需搜索任何内容,就可以满意而愉快地观看修改后的视频内容。首先,我们的系统通过使用封闭字幕提取主题并检测视频流中的主题场景。然后,系统根据每个原始场景的受欢迎程度和在时间压力下控制的细节水平(LOD),搜索其他必要的内容,并确定不需要的原始场景。通过这种方式,TV-Binder可以自动生成视频内容,通过两个轴将视频内容分为四个象限,一个是摘要和详细的视频,另一个是具有特定主题知识的专家和没有专业知识的普通观众的视频。本文讨论了我们的自动视频增强系统,并对其有效性进行了评价。
{"title":"An Automatic Video Reinforcing System Based on Popularity Rating of Scenes and Level of Detail Controlling","authors":"Yuanyuan Wang, Yukiko Kawai, K. Sumiya, Y. Ishikawa","doi":"10.1109/ISM.2015.31","DOIUrl":"https://doi.org/10.1109/ISM.2015.31","url":null,"abstract":"With the advance of video-on-demand (VOD) services such as Netfix, users are able to watch many kinds of videos anytime and anywhere. While watching a video, recently, users often search related information about it through the Web by using mobile PC. However, users cannot satisfactorily understand and enjoy it because the video keeps playing when they search about it. It is necessary to detect various questions of the video to supplement their related information about each scene for automatic search. However, only one video includes various topics of each scene, furthermore, viewers have different levels of knowledge. Therefore, we have developed a novel automatic video reinforcing system, called TV-Binder, it generates new video contents from one video stream related to viewers' interests and knowledge by adding other related contents (i.e., YouTube videos, images or maps) and by removing unnecessary original scenes, based on topics of each scene. As a result, viewers can satisfy and joyfully watch modified video contents without searching anything. At first, our system extract topics and detect their scenes of a video stream by using closed captions. The system then searches other necessary contents and determines unwanted original scenes based on popularity rating of each original scene and level of detail (LOD) controlling under time pressure. Through this, TV-Binder can automatically generate video contents are classified into four quadrants by two axes, one is digest and detailed videos, the other one is videos for experts with knowledge about particular topics and ordinary viewers without special knowledge. In this paper, we discuss our automatic video reinforcing system and an evaluation of its effectiveness.","PeriodicalId":250353,"journal":{"name":"2015 IEEE International Symposium on Multimedia (ISM)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129747243","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Customer Behavior Recognition in Retail Store from Surveillance Camera 基于监控摄像头的零售商店顾客行为识别
Pub Date : 2015-12-01 DOI: 10.1109/ISM.2015.52
Jingwen Liu, Yanlei Gu, S. Kamijo
The analysis of customer behavior from surveillance camera is one of the most important open topics for marketing. We develop a system to recognize different customer behaviors on the front of shelf: no interest, viewing, turning to shelf, touching, picking and returning to shelf and picking and putting into basket, which show customer's increasing interest to products. In the proposed system, head orientation, body orientation, and arm action, the multiple cues are integrated for the customer behavior recognition. The proposed system discretizes the head and body orientation of customer into 8 directions to estimate whether the customer is looking or turning to the merchandise shelf. Semi-Supervised Learning method is applied to optimize the training dataset and to generate an accurate classifier. As for the arm action recognition, a novel combined hand feature (CHF), which includes hand trajectory, tracking status and the relative position between hand and shopping basket, is proposed to describe different arm actions. The CHF is classified by Dynamic Bayesian Network (DBN) into different arm actions. A series of experiments demonstrate the effectiveness of the proposed methods and the performance to the developed system.
从监控摄像机中分析客户行为是市场营销中最重要的开放话题之一。我们开发了一个系统来识别顾客在货架前的不同行为:不感兴趣、观看、转向货架、触摸、采摘并返回货架、采摘并放入篮子,这表明顾客对产品的兴趣在增加。在该系统中,将头部方向、身体方向和手臂动作等多种线索整合在一起进行顾客行为识别。该系统将顾客的头部和身体方向离散为8个方向,以估计顾客是在看还是转向商品货架。采用半监督学习方法优化训练数据集,生成准确的分类器。在手臂动作识别方面,提出了一种新的组合手特征(CHF)来描述不同的手臂动作,该特征包括手的运动轨迹、跟踪状态以及手与购物篮之间的相对位置。动态贝叶斯网络(DBN)将CHF分类为不同的臂动作。一系列的实验证明了所提方法的有效性和所开发系统的性能。
{"title":"Customer Behavior Recognition in Retail Store from Surveillance Camera","authors":"Jingwen Liu, Yanlei Gu, S. Kamijo","doi":"10.1109/ISM.2015.52","DOIUrl":"https://doi.org/10.1109/ISM.2015.52","url":null,"abstract":"The analysis of customer behavior from surveillance camera is one of the most important open topics for marketing. We develop a system to recognize different customer behaviors on the front of shelf: no interest, viewing, turning to shelf, touching, picking and returning to shelf and picking and putting into basket, which show customer's increasing interest to products. In the proposed system, head orientation, body orientation, and arm action, the multiple cues are integrated for the customer behavior recognition. The proposed system discretizes the head and body orientation of customer into 8 directions to estimate whether the customer is looking or turning to the merchandise shelf. Semi-Supervised Learning method is applied to optimize the training dataset and to generate an accurate classifier. As for the arm action recognition, a novel combined hand feature (CHF), which includes hand trajectory, tracking status and the relative position between hand and shopping basket, is proposed to describe different arm actions. The CHF is classified by Dynamic Bayesian Network (DBN) into different arm actions. A series of experiments demonstrate the effectiveness of the proposed methods and the performance to the developed system.","PeriodicalId":250353,"journal":{"name":"2015 IEEE International Symposium on Multimedia (ISM)","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130050573","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 19
Human Action Recognition Using Hybrid Centroid Canonical Correlation Analysis 基于混合质心典型相关分析的人体动作识别
Pub Date : 2015-12-01 DOI: 10.1109/ISM.2015.118
Nour El-Din El-Madany, Yifeng He, L. Guan
Human action recognition is a hot research topic in image analysis and computer vision. In this paper, we propose Hybrid Centroid Canonical Correlation Analysis (HCCCA) and multi-set HCCCA for multimodal information analysis and fusion. Furthermore, we present a novel human action recognition framework by using multi-set HCCCA to fuse multimodal features, which include the hierarchal pyramid Depth Motion Map (DMM) for the depth images, the Histogram of Oriented Displacement (HOD) for the skeleton, and the statistical measurements for the accelerometer. The proposed framework was evaluated using two datasets MSR Action 3D dataset and UTD multimodal human action dataset. The experimental results demonstrated that the proposed framework can achieve a higher average accuracy compared to several existing methods.
人体动作识别是图像分析和计算机视觉领域的研究热点。本文提出了混合质心典型相关分析(Hybrid Centroid Canonical Correlation Analysis, HCCCA)和多集典型相关分析(multi-set HCCCA),用于多模态信息的分析和融合。此外,我们提出了一种新的人类动作识别框架,利用多集HCCCA融合多模态特征,包括用于深度图像的分层金字塔深度运动图(DMM),用于骨骼的定向位移直方图(HOD)和用于加速度计的统计测量。使用两个数据集MSR动作3D数据集和UTD多模态人类动作数据集对所提出的框架进行了评估。实验结果表明,与现有的几种方法相比,该框架具有更高的平均精度。
{"title":"Human Action Recognition Using Hybrid Centroid Canonical Correlation Analysis","authors":"Nour El-Din El-Madany, Yifeng He, L. Guan","doi":"10.1109/ISM.2015.118","DOIUrl":"https://doi.org/10.1109/ISM.2015.118","url":null,"abstract":"Human action recognition is a hot research topic in image analysis and computer vision. In this paper, we propose Hybrid Centroid Canonical Correlation Analysis (HCCCA) and multi-set HCCCA for multimodal information analysis and fusion. Furthermore, we present a novel human action recognition framework by using multi-set HCCCA to fuse multimodal features, which include the hierarchal pyramid Depth Motion Map (DMM) for the depth images, the Histogram of Oriented Displacement (HOD) for the skeleton, and the statistical measurements for the accelerometer. The proposed framework was evaluated using two datasets MSR Action 3D dataset and UTD multimodal human action dataset. The experimental results demonstrated that the proposed framework can achieve a higher average accuracy compared to several existing methods.","PeriodicalId":250353,"journal":{"name":"2015 IEEE International Symposium on Multimedia (ISM)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128892190","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Fast Face Model Reconstruction and Synthesis Using an RGB-D Camera and Its Subjective Evaluation RGB-D相机快速人脸模型重建与合成及其主观评价
Pub Date : 2015-12-01 DOI: 10.1109/ISM.2015.107
T. Yamasaki, I. Nakamura, K. Aizawa
It is difficult to show a frontal face in video chatting because there is a gap between a display and a camera. We propose a method for real-time face reorientation by creating a 2.5-D face model from a single RGB-D camera and synthesizing the rotated face model with the original face image. Our method uses two kinds face models complementarily: a point cloud based model and a generic face model fitted to the user. We conducted subjective evaluation and confirmed the validity of our proposed system.
在视频聊天中,由于显示器和摄像头之间的间隙,很难显示正面的脸。我们提出了一种实时人脸重定向方法,该方法通过单个RGB-D相机创建2.5维人脸模型,并将旋转后的人脸模型与原始人脸图像合成。我们的方法互补使用两种人脸模型:基于点云的人脸模型和适合用户的通用人脸模型。我们进行了主观评价,并确认了我们提出的制度的有效性。
{"title":"Fast Face Model Reconstruction and Synthesis Using an RGB-D Camera and Its Subjective Evaluation","authors":"T. Yamasaki, I. Nakamura, K. Aizawa","doi":"10.1109/ISM.2015.107","DOIUrl":"https://doi.org/10.1109/ISM.2015.107","url":null,"abstract":"It is difficult to show a frontal face in video chatting because there is a gap between a display and a camera. We propose a method for real-time face reorientation by creating a 2.5-D face model from a single RGB-D camera and synthesizing the rotated face model with the original face image. Our method uses two kinds face models complementarily: a point cloud based model and a generic face model fitted to the user. We conducted subjective evaluation and confirmed the validity of our proposed system.","PeriodicalId":250353,"journal":{"name":"2015 IEEE International Symposium on Multimedia (ISM)","volume":"251 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131964832","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
User Reachability in Multi-Apps Environment 多应用环境下的用户可达性
Pub Date : 2015-12-01 DOI: 10.1109/ISM.2015.91
Kundan Singh
Recent progress in web real-time communication (WebRTC) promotes multi-apps environment by creating islands of communication apps where users of one website or service cannot easily communicate with those of another. We describe the architecture and implementation of a multi-platform system to do user reachability in multiple communication services where users decide how they want to be reached on multiple apps, e.g., in an organization that has voice-over-IP, web conferencing and messaging from different vendors. Our architecture separates the user contacts from reachability apps, supports user and endpoint driven reachability policies, and has several independent and non-interoperable WebRTC-based apps for two-way and multi-party multimedia communication. Our flexible implementation can be used for enterprise or personal communications, or as a white-labeled app for consumers of a business.
web实时通信(WebRTC)的最新进展通过创建通信应用孤岛来促进多应用环境,其中一个网站或服务的用户无法轻松地与另一个网站或服务的用户进行通信。我们描述了一个多平台系统的架构和实现,以在多种通信服务中实现用户可达性,其中用户决定如何在多个应用程序上达到他们,例如,在一个具有ip语音、web会议和来自不同供应商的消息传递的组织中。我们的架构将用户联系人从可达性应用程序中分离出来,支持用户和端点驱动的可达性策略,并且有几个独立的、不可互操作的基于webrtc的应用程序,用于双向和多方多媒体通信。我们的灵活实现可以用于企业或个人通信,也可以作为企业消费者的白色标签应用程序。
{"title":"User Reachability in Multi-Apps Environment","authors":"Kundan Singh","doi":"10.1109/ISM.2015.91","DOIUrl":"https://doi.org/10.1109/ISM.2015.91","url":null,"abstract":"Recent progress in web real-time communication (WebRTC) promotes multi-apps environment by creating islands of communication apps where users of one website or service cannot easily communicate with those of another. We describe the architecture and implementation of a multi-platform system to do user reachability in multiple communication services where users decide how they want to be reached on multiple apps, e.g., in an organization that has voice-over-IP, web conferencing and messaging from different vendors. Our architecture separates the user contacts from reachability apps, supports user and endpoint driven reachability policies, and has several independent and non-interoperable WebRTC-based apps for two-way and multi-party multimedia communication. Our flexible implementation can be used for enterprise or personal communications, or as a white-labeled app for consumers of a business.","PeriodicalId":250353,"journal":{"name":"2015 IEEE International Symposium on Multimedia (ISM)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131587666","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Implications of Short Term Memory Research for the Design of Spaced Repetition Based Mobile Learning Games 短期记忆研究对基于间隔重复的移动学习游戏设计的启示
Pub Date : 2015-12-01 DOI: 10.1109/ISM.2015.13
Florian Schimanke, Sophie Ribbers, R. Mertens, O. Vornberger
Spaced repetition learning is an approach for choosing the most efficient intervals between rehearsing learning content. Typically used for tasks like learning vocabulary it also offers great potential for content selection in learning games. Learning games do, however differ from classic spaced repetition learning approaches in that content is not only accessed when indicated by a spaced repetition scheduling algorithm but also when the users simply want to play the game or when they decide to play the game multiple times in a row. In these cases, short term memory effects might mask learning effects in user performance, leading to faulty inputs to the calculation of spaced repetition interval lengths. This paper reviews current research literature on the interaction of short term and long term memory in order to determine how short term memory effects can be coped with in the context of spaced repetition based learning games.
间隔重复学习是一种在排练学习内容之间选择最有效间隔的方法。它通常用于学习词汇等任务,也为学习游戏中的内容选择提供了巨大的潜力。然而,学习型游戏与经典的间隔重复学习方法的不同之处在于,内容不仅是在间隔重复调度算法指示的情况下访问,而且当用户只是想玩游戏或决定连续多次玩游戏时也会访问。在这些情况下,短期记忆效应可能掩盖了用户表现中的学习效应,导致在计算间隔重复时间长度时输入错误。本文回顾了目前关于短期记忆和长期记忆相互作用的研究文献,以确定在基于间隔重复的学习游戏中如何应对短期记忆效应。
{"title":"Implications of Short Term Memory Research for the Design of Spaced Repetition Based Mobile Learning Games","authors":"Florian Schimanke, Sophie Ribbers, R. Mertens, O. Vornberger","doi":"10.1109/ISM.2015.13","DOIUrl":"https://doi.org/10.1109/ISM.2015.13","url":null,"abstract":"Spaced repetition learning is an approach for choosing the most efficient intervals between rehearsing learning content. Typically used for tasks like learning vocabulary it also offers great potential for content selection in learning games. Learning games do, however differ from classic spaced repetition learning approaches in that content is not only accessed when indicated by a spaced repetition scheduling algorithm but also when the users simply want to play the game or when they decide to play the game multiple times in a row. In these cases, short term memory effects might mask learning effects in user performance, leading to faulty inputs to the calculation of spaced repetition interval lengths. This paper reviews current research literature on the interaction of short term and long term memory in order to determine how short term memory effects can be coped with in the context of spaced repetition based learning games.","PeriodicalId":250353,"journal":{"name":"2015 IEEE International Symposium on Multimedia (ISM)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125359317","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
期刊
2015 IEEE International Symposium on Multimedia (ISM)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1