首页 > 最新文献

2018 First International Conference on Artificial Intelligence for Industries (AI4I)最新文献

英文 中文
Facing Digital Agriculture Challenges with Knowledge Engineering 用知识工程应对数字农业挑战
Marcelo Nery, R. Santos, W. Santos, Vítor Lourenço, M. Moreno
Knowledge Engineering is key to enable knowledge extraction, representation and reasoning, leading to better business insights and decisions. Current advances in machine learning and new trends in AI are bringing a plethora of algorithms capable of performing advanced pattern recognition and data classification. The ability to link, to organize and to query the outputs of these algorithms as well as the ability to handle huge amounts of data and its multiple sources is crucial to maximize the potential of such advances, specially over large datasets. This paper presents challenges in the context of digital agriculture and our position in moving forward with these capabilities whilst using knowledge engineering techniques.
知识工程是实现知识提取、表示和推理的关键,从而产生更好的业务见解和决策。当前机器学习的进步和人工智能的新趋势带来了大量能够执行高级模式识别和数据分类的算法。链接、组织和查询这些算法输出的能力,以及处理大量数据及其多个来源的能力,对于最大限度地发挥这些进步的潜力至关重要,特别是在大型数据集上。本文提出了数字农业背景下的挑战,以及我们在利用知识工程技术推进这些能力方面的地位。
{"title":"Facing Digital Agriculture Challenges with Knowledge Engineering","authors":"Marcelo Nery, R. Santos, W. Santos, Vítor Lourenço, M. Moreno","doi":"10.1109/AI4I.2018.8665708","DOIUrl":"https://doi.org/10.1109/AI4I.2018.8665708","url":null,"abstract":"Knowledge Engineering is key to enable knowledge extraction, representation and reasoning, leading to better business insights and decisions. Current advances in machine learning and new trends in AI are bringing a plethora of algorithms capable of performing advanced pattern recognition and data classification. The ability to link, to organize and to query the outputs of these algorithms as well as the ability to handle huge amounts of data and its multiple sources is crucial to maximize the potential of such advances, specially over large datasets. This paper presents challenges in the context of digital agriculture and our position in moving forward with these capabilities whilst using knowledge engineering techniques.","PeriodicalId":133657,"journal":{"name":"2018 First International Conference on Artificial Intelligence for Industries (AI4I)","volume":"118 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126059323","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Applying Machine Learning to Service Assurance in Network Function Virtualization Environment 机器学习在网络功能虚拟化环境下服务保障中的应用
Zhu Zhou, T. Zhang
With the complexity, heterogeneity, and scale of today's networks, service assurance is becoming increasingly complicated. Meanwhile, significant amounts of telemetry data are collected on virtual network functions; it has been proposed that machine learning can be used to predict/forecast key performance indicators by analyzing this data and then taking actions to prevent severe service degradation. In this paper, we demonstrate the process of telemetry data collecting and filtering, feature dimension reduction, and machine learning algorithm selection for detecting packet loss in a NFV based vEPC test system.
随着当今网络的复杂性、异构性和规模,服务保证变得越来越复杂。同时,在虚拟网络功能上采集了大量的遥测数据;有人提出,机器学习可以通过分析这些数据来预测/预测关键性能指标,然后采取措施防止严重的服务退化。在本文中,我们演示了基于NFV的vEPC测试系统中遥测数据的收集和过滤、特征降维和机器学习算法的选择过程,以检测数据包丢失。
{"title":"Applying Machine Learning to Service Assurance in Network Function Virtualization Environment","authors":"Zhu Zhou, T. Zhang","doi":"10.1109/AI4I.2018.8665716","DOIUrl":"https://doi.org/10.1109/AI4I.2018.8665716","url":null,"abstract":"With the complexity, heterogeneity, and scale of today's networks, service assurance is becoming increasingly complicated. Meanwhile, significant amounts of telemetry data are collected on virtual network functions; it has been proposed that machine learning can be used to predict/forecast key performance indicators by analyzing this data and then taking actions to prevent severe service degradation. In this paper, we demonstrate the process of telemetry data collecting and filtering, feature dimension reduction, and machine learning algorithm selection for detecting packet loss in a NFV based vEPC test system.","PeriodicalId":133657,"journal":{"name":"2018 First International Conference on Artificial Intelligence for Industries (AI4I)","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115035770","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
A Fraud Detection Decision Support System via Human On-Line Behavior Characterization and Machine Learning 基于人类在线行为表征和机器学习的欺诈检测决策支持系统
Gian Antonio Susto, M. Terzi, Chiara Masiero, S. Pampuri, A. Schirru
On-line and phone banking frauds are responsible for millions of dollars loss every year. In this work, we propose a Machine Learning-based Decision Support System to automatically associate a risk factor to each transaction performed through an on-line/mobile banking system. The proposed approach has a hierarchical architecture: First, an unsupervised Machine Learning module is used to detect abnormal patterns or wrongly labeled transactions; then, a supervised module provides a risk factor for the transactions that were not marked as anomalies in the previous step. Our solution exploits personal and historical information about the user, statistics that describe online traffic generated on the online/mobile banking system, and features extracted from motives of the transactions. The proposed approach deals with dataset unbalancing effectively. Moreover, it has been validated on a large database of transactions and on-line traffic provided by an industrial partner.
网上和电话银行诈骗每年造成数百万美元的损失。在这项工作中,我们提出了一个基于机器学习的决策支持系统,可以自动将风险因素与通过在线/移动银行系统执行的每笔交易关联起来。提出的方法具有层次结构:首先,使用无监督机器学习模块来检测异常模式或错误标记的交易;然后,受监督的模块为在前一步中未标记为异常的事务提供风险因素。我们的解决方案利用有关用户的个人和历史信息,描述在线/移动银行系统上产生的在线流量的统计数据,以及从交易动机中提取的特征。该方法有效地处理了数据集不平衡问题。此外,它已在一个工业合作伙伴提供的大型交易和在线流量数据库上得到验证。
{"title":"A Fraud Detection Decision Support System via Human On-Line Behavior Characterization and Machine Learning","authors":"Gian Antonio Susto, M. Terzi, Chiara Masiero, S. Pampuri, A. Schirru","doi":"10.1109/AI4I.2018.8665694","DOIUrl":"https://doi.org/10.1109/AI4I.2018.8665694","url":null,"abstract":"On-line and phone banking frauds are responsible for millions of dollars loss every year. In this work, we propose a Machine Learning-based Decision Support System to automatically associate a risk factor to each transaction performed through an on-line/mobile banking system. The proposed approach has a hierarchical architecture: First, an unsupervised Machine Learning module is used to detect abnormal patterns or wrongly labeled transactions; then, a supervised module provides a risk factor for the transactions that were not marked as anomalies in the previous step. Our solution exploits personal and historical information about the user, statistics that describe online traffic generated on the online/mobile banking system, and features extracted from motives of the transactions. The proposed approach deals with dataset unbalancing effectively. Moreover, it has been validated on a large database of transactions and on-line traffic provided by an industrial partner.","PeriodicalId":133657,"journal":{"name":"2018 First International Conference on Artificial Intelligence for Industries (AI4I)","volume":"85 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122071639","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Balanced Mini-Batch Training for Imbalanced Image Data Classification with Neural Network 基于神经网络的不平衡图像数据分类的平衡小批训练
Ryota Shimizu, Kosuke Asako, Hiroki Ojima, Shohei Morinaga, M. Hamada, T. Kuroda
We propose a novel method of training neural networks for industrial image classification that can reduce the effect of imbalanced data in supervised training. We considered visual quality inspection of industrial products as an image-classification task and attempted to solve this with a convolutional neural network; however, a problem of imbalanced data emerged in supervised training in which the neural network cannot optimize parameters. Since most industrial products are not defective, samples of defective products were fewer than those of the non-defective products; this difference in the number of samples causes an imbalance in training data. A neural network trained with imbalanced data often has varied levels of precision in determining each class depending on the difference in the number of class samples in the training data, which is a significant problem in industrial quality inspection. As a solution to this problem, we propose a balanced mini-batch training method that can virtually balance the class ratio of training samples. In an experiment, the neural network trained with the proposed method achieved higher classification ability than that trained with over-sampled or undersampled data for two types of imbalanced image datasets.
提出了一种新的工业图像分类神经网络训练方法,可以减少监督训练中不平衡数据的影响。我们认为工业产品的视觉质量检测是一个图像分类任务,并试图用卷积神经网络来解决这个问题;然而,在监督训练中出现了数据不平衡的问题,神经网络无法优化参数。由于大多数工业产品不存在缺陷,因此缺陷产品的样品少于非缺陷产品的样品;样本数量的差异导致了训练数据的不平衡。用不平衡数据训练的神经网络,由于训练数据中类别样本数量的不同,在确定每个类别时往往具有不同程度的精度,这是工业质量检测中的一个重要问题。为了解决这一问题,我们提出了一种平衡的小批量训练方法,该方法实际上可以平衡训练样本的类比。在实验中,使用本文方法训练的神经网络对两类不平衡图像数据集的分类能力优于使用过采样或欠采样数据训练的神经网络。
{"title":"Balanced Mini-Batch Training for Imbalanced Image Data Classification with Neural Network","authors":"Ryota Shimizu, Kosuke Asako, Hiroki Ojima, Shohei Morinaga, M. Hamada, T. Kuroda","doi":"10.1109/AI4I.2018.8665709","DOIUrl":"https://doi.org/10.1109/AI4I.2018.8665709","url":null,"abstract":"We propose a novel method of training neural networks for industrial image classification that can reduce the effect of imbalanced data in supervised training. We considered visual quality inspection of industrial products as an image-classification task and attempted to solve this with a convolutional neural network; however, a problem of imbalanced data emerged in supervised training in which the neural network cannot optimize parameters. Since most industrial products are not defective, samples of defective products were fewer than those of the non-defective products; this difference in the number of samples causes an imbalance in training data. A neural network trained with imbalanced data often has varied levels of precision in determining each class depending on the difference in the number of class samples in the training data, which is a significant problem in industrial quality inspection. As a solution to this problem, we propose a balanced mini-batch training method that can virtually balance the class ratio of training samples. In an experiment, the neural network trained with the proposed method achieved higher classification ability than that trained with over-sampled or undersampled data for two types of imbalanced image datasets.","PeriodicalId":133657,"journal":{"name":"2018 First International Conference on Artificial Intelligence for Industries (AI4I)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129939177","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 18
Image Processing and Image Pattern Recognition a Programming Tutorial 图像处理和图像模式识别编程教程
A. Chakraborty
Image recognition is a major area of application of machine learning - evolving at a rapid pace with a number of programming platforms available to developers. While each platform has its own uniqueness, the methodology of image recognition consists of a sequence of image processing tasks, development of a classifier algorithm, training and testing followed by deployment. This tutorial will delve into the programming aspects of image processing including thresholding, contouring and template matching. In order to provide practical hands on programming this tutorial will closely look at three real life applications of image pattern recognition namely ALPR using Tesseract OCR and will touch upon using CNN for character detection. The tutorial will explain the algorithm, implementation of pseudocode through Python using two major platforms: OpenCV and Tensorflow.
图像识别是机器学习应用的一个主要领域,随着许多编程平台的发展,它正在快速发展。虽然每个平台都有自己的独特性,但图像识别的方法包括一系列图像处理任务、分类器算法的开发、训练和测试,然后是部署。本教程将深入研究图像处理的编程方面,包括阈值,轮廓和模板匹配。为了提供编程的实际操作,本教程将密切关注图像模式识别的三个实际应用,即使用Tesseract OCR的ALPR,并将触及使用CNN进行字符检测。本教程将解释算法,通过Python使用两个主要平台实现伪代码:OpenCV和Tensorflow。
{"title":"Image Processing and Image Pattern Recognition a Programming Tutorial","authors":"A. Chakraborty","doi":"10.1109/AI4I.2018.8665702","DOIUrl":"https://doi.org/10.1109/AI4I.2018.8665702","url":null,"abstract":"Image recognition is a major area of application of machine learning - evolving at a rapid pace with a number of programming platforms available to developers. While each platform has its own uniqueness, the methodology of image recognition consists of a sequence of image processing tasks, development of a classifier algorithm, training and testing followed by deployment. This tutorial will delve into the programming aspects of image processing including thresholding, contouring and template matching. In order to provide practical hands on programming this tutorial will closely look at three real life applications of image pattern recognition namely ALPR using Tesseract OCR and will touch upon using CNN for character detection. The tutorial will explain the algorithm, implementation of pseudocode through Python using two major platforms: OpenCV and Tensorflow.","PeriodicalId":133657,"journal":{"name":"2018 First International Conference on Artificial Intelligence for Industries (AI4I)","volume":"38 1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124267978","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Towards #consistentAI Position Paper 迈向#consistentAI立场文件
Debojyoti Dutta, Amit Kumar Saha, Johnu George, Xinyuan Huang, Ramdoot Pydipaty, Purushotham Kamath, L. Tucker
Even though AI/ML is of strategic importance for many companies, it is not easy to come up with an AI life cycle in a multi-cloud world that is being increasingly embraced. In this paper, we present our position on an AI strategy that is future proof and does not force vendor lock in.
尽管人工智能/机器学习对许多公司具有战略重要性,但在越来越多的人接受的多云世界中,提出人工智能生命周期并不容易。在本文中,我们提出了我们对AI策略的立场,该策略是面向未来的,并且不会强制供应商锁定。
{"title":"Towards #consistentAI Position Paper","authors":"Debojyoti Dutta, Amit Kumar Saha, Johnu George, Xinyuan Huang, Ramdoot Pydipaty, Purushotham Kamath, L. Tucker","doi":"10.1109/AI4I.2018.8665683","DOIUrl":"https://doi.org/10.1109/AI4I.2018.8665683","url":null,"abstract":"Even though AI/ML is of strategic importance for many companies, it is not easy to come up with an AI life cycle in a multi-cloud world that is being increasingly embraced. In this paper, we present our position on an AI strategy that is future proof and does not force vendor lock in.","PeriodicalId":133657,"journal":{"name":"2018 First International Conference on Artificial Intelligence for Industries (AI4I)","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128950927","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Ai4i 2018 Organizing Committee Ai4i 2018组委会
{"title":"Ai4i 2018 Organizing Committee","authors":"","doi":"10.1109/ai4i.2018.8665703","DOIUrl":"https://doi.org/10.1109/ai4i.2018.8665703","url":null,"abstract":"","PeriodicalId":133657,"journal":{"name":"2018 First International Conference on Artificial Intelligence for Industries (AI4I)","volume":"63 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129095084","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Sufficient Statistics for Optimal Decentralized Control in System of Systems 系统的最优分散控制的充分统计量
Nikhil Nigam, S. Lall, P. Hovareshti, Kristopher L. Ezra, L. Mockus, D. Tolani, Shawn Sloan
Research in multi-agent systems has mostly focused on heuristic/semi-heuristic methods for control, which lack in robustness and generalizability. Control theoretic techniques guarantee stability (and often optimality), but the results are limited in scope. Hence, there is a need to design intelligent control techniques as a function of sub-system dynamics, network structure and control/decision processes. We are developing S4C - a control theoretic framework for analysis and design of interacting robotic systems. We use “sufficient statistics” to generalize the separation principle - enabling decoupled optimal control and estimation. These techniques are applied to a missile guidance problem, demonstrating robustness to sensor/process noise. An agent-based simulation architecture has also been developed and used for studies. In addition, we use a verification and validation approach based on Gaussian process regression to test for cases where modeling assumptions are relaxed.
多智能体系统的研究主要集中在启发式/半启发式控制方法上,缺乏鲁棒性和泛化性。控制理论技术保证了稳定性(通常是最优性),但结果在范围上是有限的。因此,有必要设计智能控制技术作为子系统动力学,网络结构和控制/决策过程的功能。我们正在开发S4C -一个用于交互机器人系统分析和设计的控制理论框架。我们使用“充分统计量”来推广分离原理-实现解耦的最优控制和估计。这些技术应用于导弹制导问题,证明了对传感器/过程噪声的鲁棒性。基于智能体的仿真体系结构也被开发出来并用于研究。此外,我们使用基于高斯过程回归的验证和验证方法来测试建模假设放松的情况。
{"title":"Sufficient Statistics for Optimal Decentralized Control in System of Systems","authors":"Nikhil Nigam, S. Lall, P. Hovareshti, Kristopher L. Ezra, L. Mockus, D. Tolani, Shawn Sloan","doi":"10.1109/AI4I.2018.8665711","DOIUrl":"https://doi.org/10.1109/AI4I.2018.8665711","url":null,"abstract":"Research in multi-agent systems has mostly focused on heuristic/semi-heuristic methods for control, which lack in robustness and generalizability. Control theoretic techniques guarantee stability (and often optimality), but the results are limited in scope. Hence, there is a need to design intelligent control techniques as a function of sub-system dynamics, network structure and control/decision processes. We are developing S4C - a control theoretic framework for analysis and design of interacting robotic systems. We use “sufficient statistics” to generalize the separation principle - enabling decoupled optimal control and estimation. These techniques are applied to a missile guidance problem, demonstrating robustness to sensor/process noise. An agent-based simulation architecture has also been developed and used for studies. In addition, we use a verification and validation approach based on Gaussian process regression to test for cases where modeling assumptions are relaxed.","PeriodicalId":133657,"journal":{"name":"2018 First International Conference on Artificial Intelligence for Industries (AI4I)","volume":"75 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121356691","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Assisting Seismic Image Interpretations with Hyperknowledge 利用超知识辅助地震图像解译
M. Moreno, R. Santos, Reinaldo Silva, W. Santos, Renato Cerqueira
Seismic data interpretation process is a time consuming and knowledge intensive process. Recently, research community proposed machine learning techniques to extract information from seismic images, aiming at assisting this interpretation process. Although useful, these techniques solve just part of the seismic interpretation problem. They focus on identifying specific features (e.g. salt diapirs, reservoir facies, mini-basins) but they fail in identifying and analyzing the spatial correlation among them. In this work we propose the use of hyper knowledge specifications to address this issue. The main contribution of this work is not only to present hyper knowledge templates to this problem, but also the discussions about how to map hyperknowledge as a knowledge graph as well as creating a reasoning engine that exploits the knowledge graph representation.
地震资料解释过程是一个耗时、知识密集的过程。最近,研究团体提出了机器学习技术来从地震图像中提取信息,旨在帮助这一解释过程。这些技术虽然有用,但只能解决部分地震解释问题。它们侧重于识别特定特征(如盐底辟、储层相、小型盆地),但未能识别和分析它们之间的空间相关性。在这项工作中,我们建议使用超知识规范来解决这个问题。本工作的主要贡献不仅在于提出了解决该问题的超知识模板,而且还讨论了如何将超知识映射为知识图,以及如何创建利用知识图表示的推理引擎。
{"title":"Assisting Seismic Image Interpretations with Hyperknowledge","authors":"M. Moreno, R. Santos, Reinaldo Silva, W. Santos, Renato Cerqueira","doi":"10.1109/AI4I.2018.8665691","DOIUrl":"https://doi.org/10.1109/AI4I.2018.8665691","url":null,"abstract":"Seismic data interpretation process is a time consuming and knowledge intensive process. Recently, research community proposed machine learning techniques to extract information from seismic images, aiming at assisting this interpretation process. Although useful, these techniques solve just part of the seismic interpretation problem. They focus on identifying specific features (e.g. salt diapirs, reservoir facies, mini-basins) but they fail in identifying and analyzing the spatial correlation among them. In this work we propose the use of hyper knowledge specifications to address this issue. The main contribution of this work is not only to present hyper knowledge templates to this problem, but also the discussions about how to map hyperknowledge as a knowledge graph as well as creating a reasoning engine that exploits the knowledge graph representation.","PeriodicalId":133657,"journal":{"name":"2018 First International Conference on Artificial Intelligence for Industries (AI4I)","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114536752","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Detection Sound Source Direction in 3D Space Using Convolutional Neural Networks 基于卷积神经网络的三维空间声源方向检测
Xiaofeng Yue, Guangzhi Qu, Bo Liu, Anyi Liu
Sound source detection and localization have a lot of practical uses in many industrial settings. Most of sound source direction detection algorithms in literature are designed to identify the angle of sound source in a 2D space. In this work, we propose to use convolutional neural networks to detect the sound source direction in a 3D space. This algorithm is based on the generalized cross correlation method with phase transform (GCC-PHAT) [1] to derive time delay of arrival (TDOA). By using a convolutional neural network model, this algorithm can be applied and deployed. In addition, by modifying GCC-PHAT formula, this approach also works of multiple sound sources detection. Simulation experimental results on single sound source and multiple sound sources detection show the proposed system could work in most situations.
声源检测和定位在许多工业环境中有许多实际用途。文献中大多数声源方向检测算法都是为了识别二维空间中声源的角度。在这项工作中,我们提出使用卷积神经网络来检测三维空间中的声源方向。该算法基于相位变换广义互相关法(GCC-PHAT)[1]推导到达时延(TDOA)。通过使用卷积神经网络模型,可以实现该算法的应用和部署。此外,通过修改GCC-PHAT公式,该方法也适用于多声源检测。单声源和多声源检测的仿真实验结果表明,该系统在大多数情况下都能正常工作。
{"title":"Detection Sound Source Direction in 3D Space Using Convolutional Neural Networks","authors":"Xiaofeng Yue, Guangzhi Qu, Bo Liu, Anyi Liu","doi":"10.1109/AI4I.2018.8665693","DOIUrl":"https://doi.org/10.1109/AI4I.2018.8665693","url":null,"abstract":"Sound source detection and localization have a lot of practical uses in many industrial settings. Most of sound source direction detection algorithms in literature are designed to identify the angle of sound source in a 2D space. In this work, we propose to use convolutional neural networks to detect the sound source direction in a 3D space. This algorithm is based on the generalized cross correlation method with phase transform (GCC-PHAT) [1] to derive time delay of arrival (TDOA). By using a convolutional neural network model, this algorithm can be applied and deployed. In addition, by modifying GCC-PHAT formula, this approach also works of multiple sound sources detection. Simulation experimental results on single sound source and multiple sound sources detection show the proposed system could work in most situations.","PeriodicalId":133657,"journal":{"name":"2018 First International Conference on Artificial Intelligence for Industries (AI4I)","volume":"80 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124909567","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
期刊
2018 First International Conference on Artificial Intelligence for Industries (AI4I)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1