Pub Date : 2017-10-01DOI: 10.1109/ICVRV.2017.00099
Shujuan Tian, Xun Luo, Dan Lu, Yi Chen
Internet users spend most of their time reading and using web pages. Because of the reason there is a high demand for the user browsing experience to be pleasant. Experiments show that when people look at objects, the first impression that causes visual response is the color of the object. Therefore, it is of great significance to explore how web color schemes affect user behavior. This paper presents a framework that can automatically recolor the web pages based on reference palettes. Moreover, it analyzes and explores the effect of web color schemes on user behavior through a series of user studies. The preliminary experimental results show that the color scheme has a statistically significant effect on user preferences. The other factors include gender and hearing capability.
{"title":"Study on the Effect of Web Color Scheme on User Behavior","authors":"Shujuan Tian, Xun Luo, Dan Lu, Yi Chen","doi":"10.1109/ICVRV.2017.00099","DOIUrl":"https://doi.org/10.1109/ICVRV.2017.00099","url":null,"abstract":"Internet users spend most of their time reading and using web pages. Because of the reason there is a high demand for the user browsing experience to be pleasant. Experiments show that when people look at objects, the first impression that causes visual response is the color of the object. Therefore, it is of great significance to explore how web color schemes affect user behavior. This paper presents a framework that can automatically recolor the web pages based on reference palettes. Moreover, it analyzes and explores the effect of web color schemes on user behavior through a series of user studies. The preliminary experimental results show that the color scheme has a statistically significant effect on user preferences. The other factors include gender and hearing capability.","PeriodicalId":187934,"journal":{"name":"2017 International Conference on Virtual Reality and Visualization (ICVRV)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129343029","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2017-10-01DOI: 10.1109/icvrv.2017.00082
Zewei Tao, Yun Pan, Anying Chen, Licheng Wang
We introduce an open-source web-based platform that integrated multiple methods for visualizing Shor's algorithm. We mainly focus on three different approaches which are widely used in the field of visualizing qubit and quantum algorithms. These methods include Bloch sphere, quantum circuit and probability distribution map. We combine these geometrical methods and abstract the level of quantum circuit in order to introduce the well-known Shor's algorithm more explicitly. Our platform provides a direct and comprehensible perspective for better understanding the basic principles of quantum computation and how the features of quantum algorithms reduce the time complexity of certain problems. It also provides an interactive way for users to easily test the Shor's factoring algorithm. With further improvement and development, potential capacity can be proved in the field of visualization of quantum computation.
{"title":"ShorVis: A Comprehensive Case Study of Quantum Computing Visualization","authors":"Zewei Tao, Yun Pan, Anying Chen, Licheng Wang","doi":"10.1109/icvrv.2017.00082","DOIUrl":"https://doi.org/10.1109/icvrv.2017.00082","url":null,"abstract":"We introduce an open-source web-based platform that integrated multiple methods for visualizing Shor's algorithm. We mainly focus on three different approaches which are widely used in the field of visualizing qubit and quantum algorithms. These methods include Bloch sphere, quantum circuit and probability distribution map. We combine these geometrical methods and abstract the level of quantum circuit in order to introduce the well-known Shor's algorithm more explicitly. Our platform provides a direct and comprehensible perspective for better understanding the basic principles of quantum computation and how the features of quantum algorithms reduce the time complexity of certain problems. It also provides an interactive way for users to easily test the Shor's factoring algorithm. With further improvement and development, potential capacity can be proved in the field of visualization of quantum computation.","PeriodicalId":187934,"journal":{"name":"2017 International Conference on Virtual Reality and Visualization (ICVRV)","volume":"62 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128694435","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2017-10-01DOI: 10.1109/ICVRV.2017.00101
Xiaohui Liao, Jinliang Niu, Hao Wang, Bingbing Du
When proceeding new employee orientation training in electrical industry, it is indispensable to lead new workers in a safe environment. However, due to the deficiency in the skill and stressfulness in mind, most freshmen would not conduct proper operation on the substation device, which would cause heavily person injury and grid accident. To that end, we proposed and established a virtual simulation system of 500kV substation. With the latest technology adopted, our system can bring the immersive experience to the trainees. The system has fulfilled the substation roaming and animation demonstration; The system implements the parameter display of the substation equipment and the simultaneous speech reading; The bottom of the system is well developed and extensible, which lays the foundation for the development of the incoming functions. Through experiments, the system meets the requirements of substation training, which can be used for new employee training and undergraduate cognitive learning.
{"title":"Research on Virtual Reality Simulation Training System of Substation","authors":"Xiaohui Liao, Jinliang Niu, Hao Wang, Bingbing Du","doi":"10.1109/ICVRV.2017.00101","DOIUrl":"https://doi.org/10.1109/ICVRV.2017.00101","url":null,"abstract":"When proceeding new employee orientation training in electrical industry, it is indispensable to lead new workers in a safe environment. However, due to the deficiency in the skill and stressfulness in mind, most freshmen would not conduct proper operation on the substation device, which would cause heavily person injury and grid accident. To that end, we proposed and established a virtual simulation system of 500kV substation. With the latest technology adopted, our system can bring the immersive experience to the trainees. The system has fulfilled the substation roaming and animation demonstration; The system implements the parameter display of the substation equipment and the simultaneous speech reading; The bottom of the system is well developed and extensible, which lays the foundation for the development of the incoming functions. Through experiments, the system meets the requirements of substation training, which can be used for new employee training and undergraduate cognitive learning.","PeriodicalId":187934,"journal":{"name":"2017 International Conference on Virtual Reality and Visualization (ICVRV)","volume":"54 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116378232","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Numerous experiments have shown that there are similar gestures in visual-based gesture recognition. In order to solve the problem, this paper proposes a new algorithm based on Convolution neural network. According to the model test results, the confusion matrix is established. And according to the correspondence between each gesture and the predicted result, misjudgment probability matrix is established. Based on the misjudgment probability matrix, we correct the gestures that have been incorrectly identified by the Convolution neural network model. After this algorithm, the recognition rate of similar gestures is increased by 5% to 12%. The innovation of this paper lies in the secondary error correction of the wrong gesture of Convolution neural network structure.
{"title":"An Intelligent Discovery and Error Correction Algorithm for Misunderstanding Gesture Based on Probabilistic Statistics Model","authors":"Kaiyun Sun, Zhiquan Feng, Changsheng Ai, Yingjun Li, Jun Wei, Xiaohui Yang, Xiaopei Guo","doi":"10.23940/ijpe.18.01.p10.89100","DOIUrl":"https://doi.org/10.23940/ijpe.18.01.p10.89100","url":null,"abstract":"Numerous experiments have shown that there are similar gestures in visual-based gesture recognition. In order to solve the problem, this paper proposes a new algorithm based on Convolution neural network. According to the model test results, the confusion matrix is established. And according to the correspondence between each gesture and the predicted result, misjudgment probability matrix is established. Based on the misjudgment probability matrix, we correct the gestures that have been incorrectly identified by the Convolution neural network model. After this algorithm, the recognition rate of similar gestures is increased by 5% to 12%. The innovation of this paper lies in the secondary error correction of the wrong gesture of Convolution neural network structure.","PeriodicalId":187934,"journal":{"name":"2017 International Conference on Virtual Reality and Visualization (ICVRV)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117149454","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2017-10-01DOI: 10.1109/ICVRV.2017.00102
Han Guo, Xiangyu Kong, Dongmei Niu, Xiuyang Zhao
Interest points are significant to 3D matching, recognition and 3D retrieval. We use bilateral filtering to extract 3D interest points. Differing from prior work, a novel approach is proposed to select the local neighborhood which is named trust area. And we also use multi-scale DoG to compute the saliency of the vertices. The interest points are extracted by saliency.
{"title":"Extraction of Interest Points on 3D Meshes Based on Bilateral Filtering","authors":"Han Guo, Xiangyu Kong, Dongmei Niu, Xiuyang Zhao","doi":"10.1109/ICVRV.2017.00102","DOIUrl":"https://doi.org/10.1109/ICVRV.2017.00102","url":null,"abstract":"Interest points are significant to 3D matching, recognition and 3D retrieval. We use bilateral filtering to extract 3D interest points. Differing from prior work, a novel approach is proposed to select the local neighborhood which is named trust area. And we also use multi-scale DoG to compute the saliency of the vertices. The interest points are extracted by saliency.","PeriodicalId":187934,"journal":{"name":"2017 International Conference on Virtual Reality and Visualization (ICVRV)","volume":"59 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115288688","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2017-10-01DOI: 10.1109/ICVRV.2017.00087
Zhiyong Tu, W. Jiang, Jinyuan Jia
In the massive multi-player online game scenes single P2P topology cannot sufficiently accomplish a variety of mission. To strengthen player experience, rapidly respond client request and improve P2P network scalability and flexibility. Firstly, we propose network architecture, which is hierarchical hybrid DVE-P2P based on interested cluster to achieve this purpose. Secondly, based on this network architecture and topology we propose three logical layers along with two physical layers to guarantee its security, flexibility, reliability, scalability and high throughput. Thirdly, based on Area of Interest we propose interested cluster to quickly share similarity resource with other peer in own cluster, selecting super node with excellent performance as central, virtual server, which is connected to Chord ring, to manage load balancing of interested cluster, at same time we present algorithms and strategies to load balancing for different peer, interested cluster and server. Finally, we perform experiments to show that our approaches significantly improve the DVE system's response speed, throughput, dynamic load balancing, load index and stability.
{"title":"Hierarchical Hybrid DVE-P2P Networking Based on Interests Clustering","authors":"Zhiyong Tu, W. Jiang, Jinyuan Jia","doi":"10.1109/ICVRV.2017.00087","DOIUrl":"https://doi.org/10.1109/ICVRV.2017.00087","url":null,"abstract":"In the massive multi-player online game scenes single P2P topology cannot sufficiently accomplish a variety of mission. To strengthen player experience, rapidly respond client request and improve P2P network scalability and flexibility. Firstly, we propose network architecture, which is hierarchical hybrid DVE-P2P based on interested cluster to achieve this purpose. Secondly, based on this network architecture and topology we propose three logical layers along with two physical layers to guarantee its security, flexibility, reliability, scalability and high throughput. Thirdly, based on Area of Interest we propose interested cluster to quickly share similarity resource with other peer in own cluster, selecting super node with excellent performance as central, virtual server, which is connected to Chord ring, to manage load balancing of interested cluster, at same time we present algorithms and strategies to load balancing for different peer, interested cluster and server. Finally, we perform experiments to show that our approaches significantly improve the DVE system's response speed, throughput, dynamic load balancing, load index and stability.","PeriodicalId":187934,"journal":{"name":"2017 International Conference on Virtual Reality and Visualization (ICVRV)","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121998837","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2017-10-01DOI: 10.1109/ICVRV.2017.00053
Riming Sun, Nannan Li, Shengfa Wang, Lin Ji, Zhenyu Wang
Distortion representation is the key to the rectification of distorted document images. The text-lines are considered to be one of the most significant features of the images, which are extensively used by a majority of rectification algorithms. However, it is quite a challenge to accurately extract the text-lines of document images with distortions and other disruptive factors, such as non-textural objects. In this approach, we present a general document rectification method based on local distortion representation that is depicted by text-features instead of the text-lines. Specially, firstly, according to the similarity of local distortion, we divide the document image into local blocks. Secondly, a text-feature is exploited to depict the warping distortion of each block by considering the skew angle. Then, the rectification problem is formulated utilizing a reverse strategy according to the text-features. Finally, a perspective distortion is restored by making use of random sample consensus. The proposed method is appropriate for document images of multi-column layouts, multi-type fonts and non-textural objects. Various experiments have demonstrated the flexibility and high performance of the approach.
{"title":"The Rectification of Document Images Using Text-features","authors":"Riming Sun, Nannan Li, Shengfa Wang, Lin Ji, Zhenyu Wang","doi":"10.1109/ICVRV.2017.00053","DOIUrl":"https://doi.org/10.1109/ICVRV.2017.00053","url":null,"abstract":"Distortion representation is the key to the rectification of distorted document images. The text-lines are considered to be one of the most significant features of the images, which are extensively used by a majority of rectification algorithms. However, it is quite a challenge to accurately extract the text-lines of document images with distortions and other disruptive factors, such as non-textural objects. In this approach, we present a general document rectification method based on local distortion representation that is depicted by text-features instead of the text-lines. Specially, firstly, according to the similarity of local distortion, we divide the document image into local blocks. Secondly, a text-feature is exploited to depict the warping distortion of each block by considering the skew angle. Then, the rectification problem is formulated utilizing a reverse strategy according to the text-features. Finally, a perspective distortion is restored by making use of random sample consensus. The proposed method is appropriate for document images of multi-column layouts, multi-type fonts and non-textural objects. Various experiments have demonstrated the flexibility and high performance of the approach.","PeriodicalId":187934,"journal":{"name":"2017 International Conference on Virtual Reality and Visualization (ICVRV)","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131605537","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2017-10-01DOI: 10.1109/ICVRV.2017.00015
Yufeng Chen, Bo Zhang, Xuying Zhao, Zhixuan Li
Neural network is difficult to understand the invariance of input data, which is one of the causes of weak neural network generalization. So the researchers usually carry out data augmentation method on the training set, which makes the neural network remember different deformation patterns. We propose an invariant information learning framework:original CNN+Spatial information Function Zone(SFZ). This framework uses correlation matrix method instead of data augmentation method to make the neural network have the ability to learn the invariance of input data. Finally, our experiment shows that CNN+SFZ can effectively help improve generalization ability without data augmentation. In the absence of data augmentation for the training set, the network with SFZ reduced the error rate by 9.01% over the original network.
{"title":"Invariant Information Learning for Image Recognition","authors":"Yufeng Chen, Bo Zhang, Xuying Zhao, Zhixuan Li","doi":"10.1109/ICVRV.2017.00015","DOIUrl":"https://doi.org/10.1109/ICVRV.2017.00015","url":null,"abstract":"Neural network is difficult to understand the invariance of input data, which is one of the causes of weak neural network generalization. So the researchers usually carry out data augmentation method on the training set, which makes the neural network remember different deformation patterns. We propose an invariant information learning framework:original CNN+Spatial information Function Zone(SFZ). This framework uses correlation matrix method instead of data augmentation method to make the neural network have the ability to learn the invariance of input data. Finally, our experiment shows that CNN+SFZ can effectively help improve generalization ability without data augmentation. In the absence of data augmentation for the training set, the network with SFZ reduced the error rate by 9.01% over the original network.","PeriodicalId":187934,"journal":{"name":"2017 International Conference on Virtual Reality and Visualization (ICVRV)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123138891","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2017-10-01DOI: 10.1109/ICVRV.2017.00083
Zhi-Ping Ding, Qian Liu, Qing Wang
Unlike conventional light field camera that records spatial and angular information explicitly, the focused light field camera implicitly collects angular samplings in microimages behind the micro-lens array. Without directly decoded sub-apertures, it is difficult to estimate disparity for focused light field camera. On the other hand, disparity estimation is a critical step for sub-aperture rendering from raw image. It is hence a typical "chicken-and-egg" problem. In this paper we propose a two-stage method for disparity estimation from the raw image. Compared with previous approaches which treat all pixels in a micro-image as a same disparity label, a segmentation-tree based cost aggregation is introduced to provide a more robust disparity estimation for each pixel, which optimizes the disparity of low-texture areas and yields sharper occlusion boundaries. After sub-apertures are rendered from the raw image using initial estimation, the optimal one is globally regularized using the reference sub-aperture image. Experimental results on real scene datasets have demonstrated advantages of our method over previous work, especially in low-texture areas and occlusion boundaries.
{"title":"Disparity Estimation for Focused Light Field Camera Using Cost Aggregation in Micro-Images","authors":"Zhi-Ping Ding, Qian Liu, Qing Wang","doi":"10.1109/ICVRV.2017.00083","DOIUrl":"https://doi.org/10.1109/ICVRV.2017.00083","url":null,"abstract":"Unlike conventional light field camera that records spatial and angular information explicitly, the focused light field camera implicitly collects angular samplings in microimages behind the micro-lens array. Without directly decoded sub-apertures, it is difficult to estimate disparity for focused light field camera. On the other hand, disparity estimation is a critical step for sub-aperture rendering from raw image. It is hence a typical \"chicken-and-egg\" problem. In this paper we propose a two-stage method for disparity estimation from the raw image. Compared with previous approaches which treat all pixels in a micro-image as a same disparity label, a segmentation-tree based cost aggregation is introduced to provide a more robust disparity estimation for each pixel, which optimizes the disparity of low-texture areas and yields sharper occlusion boundaries. After sub-apertures are rendered from the raw image using initial estimation, the optimal one is globally regularized using the reference sub-aperture image. Experimental results on real scene datasets have demonstrated advantages of our method over previous work, especially in low-texture areas and occlusion boundaries.","PeriodicalId":187934,"journal":{"name":"2017 International Conference on Virtual Reality and Visualization (ICVRV)","volume":"65 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129681809","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2017-10-01DOI: 10.1109/ICVRV.2017.00105
Ying Luo, Li Song, Rong Xie, Chuanfei Luo
In recent years, there has been a significant interest towards virtual reality applications, especially omnidirectional videos. Due to the huge resolutions, omnidirectional videos are difficult to codec and stream in term of traditional multimedia tools. In this paper, a view-dependent video encapsulation solution is proposed. Videos with different resolutions are partly extracted to combine to encapsulate mixed-resolution file according to user's viewport, which can effectively save bandwidth without much noticeable quality impacts. Our solution results can decrease file size to 75% with a 110° × 110° viewport and provide full resolution video inside the FOV.
{"title":"View-Dependent Omnidirectional Video Encapsulation Using Multiple Tracks","authors":"Ying Luo, Li Song, Rong Xie, Chuanfei Luo","doi":"10.1109/ICVRV.2017.00105","DOIUrl":"https://doi.org/10.1109/ICVRV.2017.00105","url":null,"abstract":"In recent years, there has been a significant interest towards virtual reality applications, especially omnidirectional videos. Due to the huge resolutions, omnidirectional videos are difficult to codec and stream in term of traditional multimedia tools. In this paper, a view-dependent video encapsulation solution is proposed. Videos with different resolutions are partly extracted to combine to encapsulate mixed-resolution file according to user's viewport, which can effectively save bandwidth without much noticeable quality impacts. Our solution results can decrease file size to 75% with a 110° × 110° viewport and provide full resolution video inside the FOV.","PeriodicalId":187934,"journal":{"name":"2017 International Conference on Virtual Reality and Visualization (ICVRV)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129982322","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}