首页 > 最新文献

Int. J. Semantic Comput.最新文献

英文 中文
Discriminative Robust Head-Pose and Gaze Estimation Using Kernel-DMCCA Features Fusion 基于核- dmcca特征融合的判别鲁棒头部姿态和凝视估计
Pub Date : 2020-03-01 DOI: 10.1142/s1793351x20500014
Salah Rabba, M. Kyan, Lei Gao, A. Quddus, A. S. Zandi, L. Guan
There remain outstanding challenges for improving accuracy of multi-feature information for head-pose and gaze estimation. The proposed framework employs discriminative analysis for head-pose and gaze estimation using kernel discriminative multiple canonical correlation analysis (K-DMCCA). The feature extraction component of the framework includes spatial indexing, statistical and geometrical elements. Head-pose and gaze estimation is constructed by feature aggregation and transforming features into a higher dimensional space using K-DMCCA for accurate estimation. The two main contributions are: Enhancing fusion performance through the use of kernel-based DMCCA, and by introducing an improved iris region descriptor based on quadtree. The overall approach is also inclusive of statistical and geometrical indexing that are calibration free (does not require any subsequent adjustment). We validate the robustness of the proposed framework across a wide variety of datasets, which consist of different modalities (RGB and Depth), constraints (wide range of head-poses, not only frontal), quality (accurately labelled for validation), occlusion (due to glasses, hair bang, facial hair) and illumination. Our method achieved an accurate head-pose and gaze estimation of 4.8∘ using Cave, 4.6∘ using MPII, 5.1∘ using ACS, 5.9∘ using EYEDIAP, 4.3∘ using OSLO and 4.6∘ using UULM datasets.
提高多特征信息在头部姿态和注视估计中的准确性仍然是一个突出的挑战。该框架采用判别分析方法对头部姿态和凝视进行估计,并使用核判别多重典型相关分析(K-DMCCA)进行估计。该框架的特征提取部分包括空间索引、统计元素和几何元素。头部姿态和凝视估计是通过特征聚合构建的,并使用K-DMCCA将特征转换到更高的维度空间进行精确估计。主要有两个方面的贡献:通过使用基于核的DMCCA来提高融合性能,以及通过引入基于四叉树的改进虹膜区域描述符来提高融合性能。总体方法还包括不需要校准的统计和几何索引(不需要任何后续调整)。我们在各种各样的数据集上验证了所提出框架的鲁棒性,这些数据集包括不同的模态(RGB和Depth)、约束(大范围的头部姿势,不仅是正面)、质量(准确标记以进行验证)、遮挡(由于眼镜、发刘海、面部毛发)和照明。我们的方法实现了对4.8°Cave、4.6°MPII、5.1°ACS、5.9°EYEDIAP、4.3°OSLO和4.6°UULM数据集的准确头部姿势和凝视估计。
{"title":"Discriminative Robust Head-Pose and Gaze Estimation Using Kernel-DMCCA Features Fusion","authors":"Salah Rabba, M. Kyan, Lei Gao, A. Quddus, A. S. Zandi, L. Guan","doi":"10.1142/s1793351x20500014","DOIUrl":"https://doi.org/10.1142/s1793351x20500014","url":null,"abstract":"There remain outstanding challenges for improving accuracy of multi-feature information for head-pose and gaze estimation. The proposed framework employs discriminative analysis for head-pose and gaze estimation using kernel discriminative multiple canonical correlation analysis (K-DMCCA). The feature extraction component of the framework includes spatial indexing, statistical and geometrical elements. Head-pose and gaze estimation is constructed by feature aggregation and transforming features into a higher dimensional space using K-DMCCA for accurate estimation. The two main contributions are: Enhancing fusion performance through the use of kernel-based DMCCA, and by introducing an improved iris region descriptor based on quadtree. The overall approach is also inclusive of statistical and geometrical indexing that are calibration free (does not require any subsequent adjustment). We validate the robustness of the proposed framework across a wide variety of datasets, which consist of different modalities (RGB and Depth), constraints (wide range of head-poses, not only frontal), quality (accurately labelled for validation), occlusion (due to glasses, hair bang, facial hair) and illumination. Our method achieved an accurate head-pose and gaze estimation of 4.8∘ using Cave, 4.6∘ using MPII, 5.1∘ using ACS, 5.9∘ using EYEDIAP, 4.3∘ using OSLO and 4.6∘ using UULM datasets.","PeriodicalId":217956,"journal":{"name":"Int. J. Semantic Comput.","volume":"85 1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126505611","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Improved Semantic Segmentation of Water Bodies and Land in SAR Images Using Generative Adversarial Networks 基于生成对抗网络的SAR图像水体和陆地的改进语义分割
Pub Date : 2020-03-01 DOI: 10.1142/s1793351x20400036
M. Pai, Vaibhav Mehrotra, Ujjwal Verma, R. Pai
The availability of computationally efficient and powerful Deep Learning frameworks and high-resolution satellite imagery has created new approach for developing complex applications in the field of remote sensing. The easy access to abundant image data repository made available by different satellites of space agencies such as Copernicus, Landsat, etc. has opened various avenues of research in monitoring the world’s oceans, land, rivers, etc. The challenging research problem in this direction is the accurate identification and subsequent segmentation of surface water in images in the microwave spectrum. In the recent years, deep learning methods for semantic segmentation are the preferred choice given its high accuracy and ease of use. One major bottleneck in semantic segmentation pipelines is the manual annotation of data. This paper proposes Generative Adversarial Networks (GANs) on the training data (images and their corresponding labels) to create an enhanced dataset on which the networks can be trained, therefore, reducing human effort of manual labeling. Further, the research also proposes the use of deep-learning approaches such as U-Net and FCN-8 to perform an efficient segmentation of auto annotated, enhanced data of water body and land. The experimental results show that the U-Net model without GAN achieves superior performance on SAR images with pixel accuracy of 0.98 and F1 score of 0.9923. However, when augmented with GANs, the results saw a rise in these metrics with PA of 0.99 and F1 score of 0.9954.
计算效率高且功能强大的深度学习框架和高分辨率卫星图像的可用性为开发遥感领域的复杂应用创造了新的方法。哥白尼、陆地卫星等空间机构的不同卫星提供了丰富的图像数据储存库,这为监测世界海洋、陆地、河流等开辟了各种研究途径。微波波谱图像中地表水的准确识别和后续分割是该方向研究的难点。近年来,由于深度学习方法具有较高的准确性和易用性,因此成为语义分割的首选方法。语义分割管道的一个主要瓶颈是数据的手工标注。本文提出了基于训练数据(图像及其相应标签)的生成对抗网络(GANs),以创建一个增强的数据集,从而可以在该数据集上训练网络,从而减少人工标记的工作量。此外,该研究还提出了使用U-Net和FCN-8等深度学习方法对水体和土地的自动注释增强数据进行有效分割。实验结果表明,不含GAN的U-Net模型在SAR图像上的像素精度为0.98,F1分数为0.9923,具有较好的性能。然而,当添加gan时,结果显示这些指标的PA值为0.99,F1值为0.9954。
{"title":"Improved Semantic Segmentation of Water Bodies and Land in SAR Images Using Generative Adversarial Networks","authors":"M. Pai, Vaibhav Mehrotra, Ujjwal Verma, R. Pai","doi":"10.1142/s1793351x20400036","DOIUrl":"https://doi.org/10.1142/s1793351x20400036","url":null,"abstract":"The availability of computationally efficient and powerful Deep Learning frameworks and high-resolution satellite imagery has created new approach for developing complex applications in the field of remote sensing. The easy access to abundant image data repository made available by different satellites of space agencies such as Copernicus, Landsat, etc. has opened various avenues of research in monitoring the world’s oceans, land, rivers, etc. The challenging research problem in this direction is the accurate identification and subsequent segmentation of surface water in images in the microwave spectrum. In the recent years, deep learning methods for semantic segmentation are the preferred choice given its high accuracy and ease of use. One major bottleneck in semantic segmentation pipelines is the manual annotation of data. This paper proposes Generative Adversarial Networks (GANs) on the training data (images and their corresponding labels) to create an enhanced dataset on which the networks can be trained, therefore, reducing human effort of manual labeling. Further, the research also proposes the use of deep-learning approaches such as U-Net and FCN-8 to perform an efficient segmentation of auto annotated, enhanced data of water body and land. The experimental results show that the U-Net model without GAN achieves superior performance on SAR images with pixel accuracy of 0.98 and F1 score of 0.9923. However, when augmented with GANs, the results saw a rise in these metrics with PA of 0.99 and F1 score of 0.9954.","PeriodicalId":217956,"journal":{"name":"Int. J. Semantic Comput.","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134281043","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 22
Deep Learning-Based Stair Segmentation and Behavioral Cloning for Autonomous Stair Climbing 基于深度学习的自主爬楼梯分割与行为克隆
Pub Date : 2019-12-26 DOI: 10.1142/s1793351x1940021x
Navid Panchi, Khush Agrawal, Unmesh Patil, Aniket Gujarathi, Aman Jain, Harsha Namdeo, S. Chiddarwar
Mobile robots are widely used in the surveillance industry, for military and industrial applications. In order to carry out surveillance tasks like urban search and rescue operation, the ability to...
移动机器人广泛应用于监控行业,用于军事和工业应用。为了执行监视任务,如城市搜救行动,能力…
{"title":"Deep Learning-Based Stair Segmentation and Behavioral Cloning for Autonomous Stair Climbing","authors":"Navid Panchi, Khush Agrawal, Unmesh Patil, Aniket Gujarathi, Aman Jain, Harsha Namdeo, S. Chiddarwar","doi":"10.1142/s1793351x1940021x","DOIUrl":"https://doi.org/10.1142/s1793351x1940021x","url":null,"abstract":"Mobile robots are widely used in the surveillance industry, for military and industrial applications. In order to carry out surveillance tasks like urban search and rescue operation, the ability to...","PeriodicalId":217956,"journal":{"name":"Int. J. Semantic Comput.","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132096820","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Towards Adaptive Ontology Visualization - Predicting User Success from Behavioral Data 面向自适应本体可视化——从行为数据预测用户成功
Pub Date : 2019-12-26 DOI: 10.1142/s1793351x1940018x
Bo Fu, B. Steichen, Wenlu Zhang
Ontology visualization plays an important role in human data interaction by offering clarity and insight for complex structured datasets. Recent usability studies of ontology visualization techniqu...
本体可视化通过为复杂的结构化数据集提供清晰度和洞察力,在人类数据交互中发挥着重要作用。本体可视化技术的可用性研究进展…
{"title":"Towards Adaptive Ontology Visualization - Predicting User Success from Behavioral Data","authors":"Bo Fu, B. Steichen, Wenlu Zhang","doi":"10.1142/s1793351x1940018x","DOIUrl":"https://doi.org/10.1142/s1793351x1940018x","url":null,"abstract":"Ontology visualization plays an important role in human data interaction by offering clarity and insight for complex structured datasets. Recent usability studies of ontology visualization techniqu...","PeriodicalId":217956,"journal":{"name":"Int. J. Semantic Comput.","volume":"65 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125441308","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Motion Planning and Control with Randomized Payloads on Real Robot Using Deep Reinforcement Learning 基于深度强化学习的真实机器人随机载荷运动规划与控制
Pub Date : 2019-12-26 DOI: 10.1142/s1793351x19400233
A. Demir, Volkan Sezer
In this study, a unified motion planner with low level controller for continuous control of a differential drive mobile robot under variable payload values is presented. The deep reinforcement agen...
本文提出了一种统一的运动规划器和低电平控制器,用于变载荷值下差动驱动移动机器人的连续控制。深层加固…
{"title":"Motion Planning and Control with Randomized Payloads on Real Robot Using Deep Reinforcement Learning","authors":"A. Demir, Volkan Sezer","doi":"10.1142/s1793351x19400233","DOIUrl":"https://doi.org/10.1142/s1793351x19400233","url":null,"abstract":"In this study, a unified motion planner with low level controller for continuous control of a differential drive mobile robot under variable payload values is presented. The deep reinforcement agen...","PeriodicalId":217956,"journal":{"name":"Int. J. Semantic Comput.","volume":"81 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117085029","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Learning-Based Adaptive Management of QoS and Energy for Mobile Robotic Missions 基于学习的移动机器人任务QoS和能量自适应管理
Pub Date : 2019-12-01 DOI: 10.1142/s1793351x19400221
Dinh-Khanh Ho, K. B. Chehida, Benoît Miramond, M. Auguin
Mobile robotic systems are normally confronted with the shortage of on-board resources such as computing capabilities and energy, as well as significantly influenced by the dynamics of surrounding environmental conditions. This context requires adaptive decisions at run-time that react to the dynamic and uncertain operational circumstances for guaranteeing the performance requirements while respecting the other constraints. In this paper, we propose a reinforcement learning (RL)-based approach for Quality of Service QoS and energy-aware autonomous robotic mission manager. The mobile robotic mission manager leverages the idea of (RL) by monitoring actively the state of performance and energy consumption of the mission and then selecting the best mapping parameter configuration by evaluating an accumulative reward feedback balancing between QoS and energy. As a case study, we apply this methodology to an autonomous navigation mission. Our simulation results demonstrate the efficiency of the proposed management framework and provide a promising solution for the real mobile robotic systems.
移动机器人系统通常面临着计算能力和能源等机载资源的不足,以及受周围环境条件动态影响较大。此上下文需要在运行时对动态和不确定的操作环境做出自适应决策,以在尊重其他约束的同时保证性能需求。本文提出了一种基于强化学习(RL)的服务质量QoS和能量感知自主机器人任务管理器方法。移动机器人任务管理器利用(RL)的思想,主动监测任务的性能状态和能量消耗,然后通过评估QoS和能量之间的累积奖励反馈平衡来选择最佳映射参数配置。作为案例研究,我们将此方法应用于自主导航任务。仿真结果证明了所提出的管理框架的有效性,为实际的移动机器人系统提供了一个有希望的解决方案。
{"title":"Learning-Based Adaptive Management of QoS and Energy for Mobile Robotic Missions","authors":"Dinh-Khanh Ho, K. B. Chehida, Benoît Miramond, M. Auguin","doi":"10.1142/s1793351x19400221","DOIUrl":"https://doi.org/10.1142/s1793351x19400221","url":null,"abstract":"Mobile robotic systems are normally confronted with the shortage of on-board resources such as computing capabilities and energy, as well as significantly influenced by the dynamics of surrounding environmental conditions. This context requires adaptive decisions at run-time that react to the dynamic and uncertain operational circumstances for guaranteeing the performance requirements while respecting the other constraints. In this paper, we propose a reinforcement learning (RL)-based approach for Quality of Service QoS and energy-aware autonomous robotic mission manager. The mobile robotic mission manager leverages the idea of (RL) by monitoring actively the state of performance and energy consumption of the mission and then selecting the best mapping parameter configuration by evaluating an accumulative reward feedback balancing between QoS and energy. As a case study, we apply this methodology to an autonomous navigation mission. Our simulation results demonstrate the efficiency of the proposed management framework and provide a promising solution for the real mobile robotic systems.","PeriodicalId":217956,"journal":{"name":"Int. J. Semantic Comput.","volume":"225 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124480237","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Decrease Product Rating Uncertainty Through Focused Reviews Solicitation 通过集中评论征集减少产品评级的不确定性
Pub Date : 2019-12-01 DOI: 10.1142/s1793351x19400208
Nhat X. T. Le, Ryan Rivas, James M. Flegal, Vagelis Hristidis
Customer reviews are an essential resource to reduce an online product’s uncertainty, which has been shown to be a critical factor for its purchase decision. Existing e-commerce platforms typically ask users to write free-form text reviews, which are sometimes augmented by a small set of predefined questions, e.g. “rate the product description’s accuracy from 1 to 5.” In this paper, we argue that this “passive” style of review solicitation is suboptimal in achieving low-uncertainty “review profiles” for products. Its key drawback is that some product aspects receive a very large number of reviews while other aspects do not have enough reviews to draw confident conclusions. Therefore, we hypothesize that we can achieve lower-uncertainty review profiles by carefully selecting which aspects users are asked to rate. To test this hypothesis, we propose various techniques to dynamically select which aspects to ask users to rate given the current review profile of a product. We use Bayesian inference principles to define reasonable review profile uncertainty measures; specifically, via an aspect’s rating variance. We compare our proposed aspect selection techniques to several baselines on several review profile uncertainty measures. Experimental results on two real-world datasets show that our methods lead to better review profile uncertainty compared to aspect selection baselines and traditional passive review solicitations. Moreover, we present and evaluate a hybrid solicitation method that combines the advantages of both active and passive review solicitations.
顾客评论是减少在线产品不确定性的重要资源,这已被证明是其购买决策的关键因素。现有的电子商务平台通常要求用户撰写自由格式的文本评论,有时还会增加一组预定义的问题,例如“将产品描述的准确性从1到5打分”。在本文中,我们认为这种“被动”的评论请求风格在实现产品的低不确定性“评论概要”方面是次优的。它的主要缺点是某些产品方面收到了大量的评论,而其他方面没有足够的评论来得出自信的结论。因此,我们假设我们可以通过仔细选择用户被要求评价的方面来实现低不确定性的审查概要。为了验证这一假设,我们提出了各种技术来动态选择要求用户对给定产品的当前评论配置文件进行评分的方面。利用贝叶斯推理原理定义合理的评审剖面不确定度度量;具体来说,通过一个方面的评级差异。我们将我们提出的方面选择技术与几个审查概要不确定性度量的几个基线进行比较。在两个真实数据集上的实验结果表明,与方面选择基线和传统的被动评论请求相比,我们的方法具有更好的评论轮廓不确定性。此外,我们提出并评估了一种混合征集方法,结合了主动和被动评审征集的优点。
{"title":"Decrease Product Rating Uncertainty Through Focused Reviews Solicitation","authors":"Nhat X. T. Le, Ryan Rivas, James M. Flegal, Vagelis Hristidis","doi":"10.1142/s1793351x19400208","DOIUrl":"https://doi.org/10.1142/s1793351x19400208","url":null,"abstract":"Customer reviews are an essential resource to reduce an online product’s uncertainty, which has been shown to be a critical factor for its purchase decision. Existing e-commerce platforms typically ask users to write free-form text reviews, which are sometimes augmented by a small set of predefined questions, e.g. “rate the product description’s accuracy from 1 to 5.” In this paper, we argue that this “passive” style of review solicitation is suboptimal in achieving low-uncertainty “review profiles” for products. Its key drawback is that some product aspects receive a very large number of reviews while other aspects do not have enough reviews to draw confident conclusions. Therefore, we hypothesize that we can achieve lower-uncertainty review profiles by carefully selecting which aspects users are asked to rate. To test this hypothesis, we propose various techniques to dynamically select which aspects to ask users to rate given the current review profile of a product. We use Bayesian inference principles to define reasonable review profile uncertainty measures; specifically, via an aspect’s rating variance. We compare our proposed aspect selection techniques to several baselines on several review profile uncertainty measures. Experimental results on two real-world datasets show that our methods lead to better review profile uncertainty compared to aspect selection baselines and traditional passive review solicitations. Moreover, we present and evaluate a hybrid solicitation method that combines the advantages of both active and passive review solicitations.","PeriodicalId":217956,"journal":{"name":"Int. J. Semantic Comput.","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117036283","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Evaluation of Feature Learning Methods for Voice Disorder Detection 语音障碍检测的特征学习方法评价
Pub Date : 2019-12-01 DOI: 10.1142/s1793351x19400191
Hongzhao Guan, Alexander Lerch
Voice disorder is a frequently encountered health issue. Many people, however, either cannot afford to visit a professional doctor or neglect to take good care of their voice. In order to give a patient a preliminary diagnosis without using professional medical devices, previous research has shown that the detection of voice disorders can be carried out by utilizing machine learning and acoustic features extracted from voice recordings. Considering the increasing popularity of deep learning, feature learning and transfer learning, this study explores the possibilities of using these methods to assign voice recordings into one of two classes—Normal and Pathological. While the results show the general viability of deep learning and feature learning for the automatic recognition of voice disorders, they also lead to discussions on how to choose a pre-trained model when using transfer learning for this task. Furthermore, the results demonstrate the shortcomings of the existing datasets for voice disorder detection such as insufficient dataset size and lack of generality.
声音障碍是一个经常遇到的健康问题。然而,许多人要么负担不起看专业医生的费用,要么忽视了照顾好自己的声音。为了在不使用专业医疗设备的情况下对患者进行初步诊断,之前的研究表明,可以通过利用机器学习和从录音中提取的声学特征来检测语音障碍。考虑到深度学习、特征学习和迁移学习的日益普及,本研究探讨了使用这些方法将录音分为正常和病理两类的可能性。虽然结果显示深度学习和特征学习在语音障碍自动识别方面的总体可行性,但它们也引发了关于如何在使用迁移学习进行此任务时选择预训练模型的讨论。此外,研究结果还揭示了现有语音障碍检测数据集的不足,如数据集大小不足和缺乏通用性。
{"title":"Evaluation of Feature Learning Methods for Voice Disorder Detection","authors":"Hongzhao Guan, Alexander Lerch","doi":"10.1142/s1793351x19400191","DOIUrl":"https://doi.org/10.1142/s1793351x19400191","url":null,"abstract":"Voice disorder is a frequently encountered health issue. Many people, however, either cannot afford to visit a professional doctor or neglect to take good care of their voice. In order to give a patient a preliminary diagnosis without using professional medical devices, previous research has shown that the detection of voice disorders can be carried out by utilizing machine learning and acoustic features extracted from voice recordings. Considering the increasing popularity of deep learning, feature learning and transfer learning, this study explores the possibilities of using these methods to assign voice recordings into one of two classes—Normal and Pathological. While the results show the general viability of deep learning and feature learning for the automatic recognition of voice disorders, they also lead to discussions on how to choose a pre-trained model when using transfer learning for this task. Furthermore, the results demonstrate the shortcomings of the existing datasets for voice disorder detection such as insufficient dataset size and lack of generality.","PeriodicalId":217956,"journal":{"name":"Int. J. Semantic Comput.","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122935004","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Wavelet-packets Associated with Support Vector Machine Are Effective for Monophone Sorting in Music Signals 结合支持向量机的小波包对音乐信号中的单声道进行分类是有效的
Pub Date : 2019-09-01 DOI: 10.1142/s1793351x19500028
Rafael Rubiati Scalvenzi, R. Guido, N. Marranghello
An abstract interpretation is usually required to analyze acoustic compositions. Nevertheless, there is much signal processing-related research focusing on music processing and similar topics. In that context, the semantic information contained in the melody involving major and minor chords, sharps and flats associated with semibreve, minim, crotchet, quaver, semiquaver and demisemiquaver notes can help in the study of musical sounds. Thus, multiresolution analysis based on discrete wavelet-packet transform (DWPT) associated with a support vector machine (SVM) is used in this paper to inspect and classify those signals, correlating them with a respective acoustic pattern. Results over hundreds of inputs provided almost full accuracy, reassuring the efficacy of the proposed approach for both off-line and real-time usage.
分析声学作品通常需要抽象的解释。然而,有很多与信号处理相关的研究集中在音乐处理和类似的主题上。在这种情况下,旋律中包含的语义信息,包括大调和小调和弦,与半音、小音、八分音符、八分音符、半八分音符和半八分音符相关的升调和降调,可以帮助研究音乐的声音。因此,本文使用基于离散小波包变换(DWPT)和支持向量机(SVM)的多分辨率分析来检查和分类这些信号,并将它们与各自的声学模式相关联。数百个输入的结果提供了几乎完全的准确性,保证了所提出的方法在离线和实时使用中的有效性。
{"title":"Wavelet-packets Associated with Support Vector Machine Are Effective for Monophone Sorting in Music Signals","authors":"Rafael Rubiati Scalvenzi, R. Guido, N. Marranghello","doi":"10.1142/s1793351x19500028","DOIUrl":"https://doi.org/10.1142/s1793351x19500028","url":null,"abstract":"An abstract interpretation is usually required to analyze acoustic compositions. Nevertheless, there is much signal processing-related research focusing on music processing and similar topics. In that context, the semantic information contained in the melody involving major and minor chords, sharps and flats associated with semibreve, minim, crotchet, quaver, semiquaver and demisemiquaver notes can help in the study of musical sounds. Thus, multiresolution analysis based on discrete wavelet-packet transform (DWPT) associated with a support vector machine (SVM) is used in this paper to inspect and classify those signals, correlating them with a respective acoustic pattern. Results over hundreds of inputs provided almost full accuracy, reassuring the efficacy of the proposed approach for both off-line and real-time usage.","PeriodicalId":217956,"journal":{"name":"Int. J. Semantic Comput.","volume":"23 2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128396817","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
User Perception of Situated Product Recommendations in Augmented Reality 增强现实中定位产品推荐的用户感知
Pub Date : 2019-09-01 DOI: 10.1142/s1793351x19400129
Brandon Huynh, Adam Ibrahim, YunSuk Chang, Tobias Höllerer, J. O'Donovan
Augmented reality (AR) interfaces increasingly utilize artificial intelligence systems to tailor content and experiences to the user. We explore the effects of one such system — a recommender system for online shopping — which allows customers to view personalized product recommendations in the physical spaces where they might be used. We describe results of a [Formula: see text] condition exploratory study in which recommendation quality was varied across three user interface types. Our results highlight potential differences in user perception of the recommended objects in an AR environment. Specifically, users rate product recommendations significantly higher in AR and in a 3D browser interface, and show a significant increase in trust in the recommender system, compared to a web interface with 2D product images. Through semi-structured interviews, we gather participant feedback which suggests AR interfaces perform better due to their ability to view products within the physical context where they will be used.
增强现实(AR)界面越来越多地利用人工智能系统为用户定制内容和体验。我们探索了一个这样的系统——在线购物推荐系统——的效果,它允许客户在可能使用它们的物理空间中查看个性化的产品推荐。我们描述了一个[公式:见文本]条件探索性研究的结果,其中推荐质量在三种用户界面类型中是不同的。我们的研究结果突出了AR环境中用户对推荐对象感知的潜在差异。具体来说,与带有2D产品图像的web界面相比,用户在AR和3D浏览器界面中对产品推荐的评价明显更高,并且对推荐系统的信任度显著增加。通过半结构化访谈,我们收集了参与者的反馈,这些反馈表明AR界面表现更好,因为它们能够在使用产品的物理环境中查看产品。
{"title":"User Perception of Situated Product Recommendations in Augmented Reality","authors":"Brandon Huynh, Adam Ibrahim, YunSuk Chang, Tobias Höllerer, J. O'Donovan","doi":"10.1142/s1793351x19400129","DOIUrl":"https://doi.org/10.1142/s1793351x19400129","url":null,"abstract":"Augmented reality (AR) interfaces increasingly utilize artificial intelligence systems to tailor content and experiences to the user. We explore the effects of one such system — a recommender system for online shopping — which allows customers to view personalized product recommendations in the physical spaces where they might be used. We describe results of a [Formula: see text] condition exploratory study in which recommendation quality was varied across three user interface types. Our results highlight potential differences in user perception of the recommended objects in an AR environment. Specifically, users rate product recommendations significantly higher in AR and in a 3D browser interface, and show a significant increase in trust in the recommender system, compared to a web interface with 2D product images. Through semi-structured interviews, we gather participant feedback which suggests AR interfaces perform better due to their ability to view products within the physical context where they will be used.","PeriodicalId":217956,"journal":{"name":"Int. J. Semantic Comput.","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130120207","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
期刊
Int. J. Semantic Comput.
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1