首页 > 最新文献

2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)最新文献

英文 中文
Interpretable Approximation of a Deep Reinforcement Learning Agent as a Set of If-Then Rules 作为一组If-Then规则的深度强化学习代理的可解释逼近
S. Nageshrao, Bruno Costa, Dimitar Filev
In many industrial applications, one of the major bottlenecks in using advanced learning-based methods (such as reinforcement learning) for controls is the lack of interpretability of the trained agent. In this paper, we present a methodology for translating a trained reinforcement learning agent into a set of simple and easy to interpret if-then rules by using the proven universal approximation property of the rules with fuzzy predicates. Proposed methodology combines the optimality of reinforcement learning with interpretability of the theory of approximate reasoning, thus making reinforcement learning-based solutions more accessible to industrial practitioners. The framework presented in this paper has the potential to help address the fundamental problem in widespread adoption of reinforcement learning in industrial applications.
在许多工业应用中,使用基于高级学习的方法(如强化学习)进行控制的主要瓶颈之一是训练后的代理缺乏可解释性。在本文中,我们提出了一种方法,通过使用已证明的带有模糊谓词的规则的普遍近似性质,将训练好的强化学习代理转换为一组简单且易于解释的if-then规则。所提出的方法将强化学习的最优性与近似推理理论的可解释性相结合,从而使基于强化学习的解决方案更容易被工业从业者所接受。本文提出的框架有可能帮助解决在工业应用中广泛采用强化学习的基本问题。
{"title":"Interpretable Approximation of a Deep Reinforcement Learning Agent as a Set of If-Then Rules","authors":"S. Nageshrao, Bruno Costa, Dimitar Filev","doi":"10.1109/ICMLA.2019.00041","DOIUrl":"https://doi.org/10.1109/ICMLA.2019.00041","url":null,"abstract":"In many industrial applications, one of the major bottlenecks in using advanced learning-based methods (such as reinforcement learning) for controls is the lack of interpretability of the trained agent. In this paper, we present a methodology for translating a trained reinforcement learning agent into a set of simple and easy to interpret if-then rules by using the proven universal approximation property of the rules with fuzzy predicates. Proposed methodology combines the optimality of reinforcement learning with interpretability of the theory of approximate reasoning, thus making reinforcement learning-based solutions more accessible to industrial practitioners. The framework presented in this paper has the potential to help address the fundamental problem in widespread adoption of reinforcement learning in industrial applications.","PeriodicalId":436714,"journal":{"name":"2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134009517","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Generating Near and Far Analogies for Educational Applications: Progress and Challenges 在教育应用中产生远近类比:进展与挑战
M. Boger, A. Laverghetta, Nikolai Fetisov, John Licato
Analogical reasoning, it has been argued, fundamentally underlies many cognitive processes and is an important marker of developmental cognition. This connection suggests that the clever use of analogical reasoning tasks can improve cognitive performance in specific ways, thus leading to clear educational applications, as recent psychological work has confirmed. However, currently there are no known methods to either solve or generate analogical word problems, at least to a degree of reliability that would be necessary before such educational applications are possible. To address these concerns we present work to both solve and generate analogy word problems: First, given an analogy word problem, our algorithm performs a parallel random walk through the semantic network ConceptNet to limit the number of choices that are then considered by a vector embedding. We achieve an improvement in accuracy beyond existing state-of-the-art. Second, we explore a method for automatically generating explainable n-step analogy word problems, and analyze the results.
类比推理一直被认为是许多认知过程的基础,是发展认知的重要标志。这种联系表明,巧妙地使用类比推理任务可以以特定的方式提高认知表现,从而导致明确的教育应用,正如最近的心理学研究所证实的那样。然而,目前还没有已知的方法来解决或产生类似的单词问题,至少在这种教育应用成为可能之前,没有一定程度的可靠性。为了解决这些问题,我们提出了解决和生成类比词问题的工作:首先,给定一个类比词问题,我们的算法在语义网络ConceptNet中执行并行随机漫步,以限制向量嵌入所考虑的选择数量。我们实现了精度的提高,超越了现有的最先进的技术。其次,我们探索了一种自动生成可解释的n步类比词问题的方法,并分析了结果。
{"title":"Generating Near and Far Analogies for Educational Applications: Progress and Challenges","authors":"M. Boger, A. Laverghetta, Nikolai Fetisov, John Licato","doi":"10.1109/ICMLA.2019.00316","DOIUrl":"https://doi.org/10.1109/ICMLA.2019.00316","url":null,"abstract":"Analogical reasoning, it has been argued, fundamentally underlies many cognitive processes and is an important marker of developmental cognition. This connection suggests that the clever use of analogical reasoning tasks can improve cognitive performance in specific ways, thus leading to clear educational applications, as recent psychological work has confirmed. However, currently there are no known methods to either solve or generate analogical word problems, at least to a degree of reliability that would be necessary before such educational applications are possible. To address these concerns we present work to both solve and generate analogy word problems: First, given an analogy word problem, our algorithm performs a parallel random walk through the semantic network ConceptNet to limit the number of choices that are then considered by a vector embedding. We achieve an improvement in accuracy beyond existing state-of-the-art. Second, we explore a method for automatically generating explainable n-step analogy word problems, and analyze the results.","PeriodicalId":436714,"journal":{"name":"2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)","volume":"91 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133529007","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
SVM-Based Segmentation of Home Appliance Energy Measurements 基于svm的家电能耗测量分割
Marc Wenninger, Dominik Stecher, Jochen Schmidt
Generating a more detailed understanding of domestic electricity demand is a major topic for energy suppliers and householders in times of climate change. Over the years there have been many studies on consumption feedback systems to inform householders, disaggregation algorithms for Non-Intrusive-Load-Monitoring (NILM), Real-Time-Pricing (RTP) to promote supply aware behavior through monetary incentives and appliance usage prediction algorithms. While these studies are vital steps towards energy awareness, one of the most fundamental challenges has not yet been tackled: Automated detection of start and stop of usage cycles of household appliances. We argue that most research efforts in this area will benefit from a reliable segmentation method to provide accurate usage information. We propose a SVM-based segmentation method for home appliances such as dishwashers and washing machines. The method is evaluated using manually annotated electricity measurements of five different appliances recorded over two years in multiple households.
在气候变化时期,更详细地了解国内电力需求是能源供应商和家庭的一个主要课题。多年来,已经有许多关于消费反馈系统的研究,以告知家庭,非侵入式负荷监测(NILM)的分解算法,实时定价(RTP)通过货币激励和家电使用预测算法来促进供应意识行为。虽然这些研究是提高能源意识的重要一步,但最根本的挑战之一尚未得到解决:家用电器使用周期的启动和停止自动检测。我们认为,该领域的大多数研究工作将受益于可靠的分割方法,以提供准确的使用信息。我们提出了一种基于svm的家用电器分割方法,如洗碗机和洗衣机。该方法是通过对多个家庭在两年内记录的五种不同电器的手动注释电力测量来评估的。
{"title":"SVM-Based Segmentation of Home Appliance Energy Measurements","authors":"Marc Wenninger, Dominik Stecher, Jochen Schmidt","doi":"10.1109/ICMLA.2019.00272","DOIUrl":"https://doi.org/10.1109/ICMLA.2019.00272","url":null,"abstract":"Generating a more detailed understanding of domestic electricity demand is a major topic for energy suppliers and householders in times of climate change. Over the years there have been many studies on consumption feedback systems to inform householders, disaggregation algorithms for Non-Intrusive-Load-Monitoring (NILM), Real-Time-Pricing (RTP) to promote supply aware behavior through monetary incentives and appliance usage prediction algorithms. While these studies are vital steps towards energy awareness, one of the most fundamental challenges has not yet been tackled: Automated detection of start and stop of usage cycles of household appliances. We argue that most research efforts in this area will benefit from a reliable segmentation method to provide accurate usage information. We propose a SVM-based segmentation method for home appliances such as dishwashers and washing machines. The method is evaluated using manually annotated electricity measurements of five different appliances recorded over two years in multiple households.","PeriodicalId":436714,"journal":{"name":"2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130279786","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Temporal Modeling of Deterioration Patterns and Clustering for Disease Prediction of ALS Patients ALS患者疾病预测恶化模式的时间模型和聚类
Dan Halbersberg, B. Lerner
Amyotrophic lateral sclerosis (ALS) is a neurodegenerative disease, lasting from the day of onset until death. Factors such as the progression rate and pattern of the disease vary greatly among patients, making it difficult to achieve accurate predictions about ALS. To accurately predict ALS disease state and deterioration, we propose a novel approach that combines: a) sequence clustering based on dynamic time warping for separation among patients with diverse ALS deterioration patterns, b) sequential pattern mining for discovery of deterioration changes that patients of the same type may have in common, and c) deterioration-based patient next-state prediction. Using a clinical dataset, we demonstrate the advantage of the proposed approach in terms of classification accuracy and deterioration detection compared to other classification methods and temporal models such as long short-term memory.
肌萎缩性侧索硬化症(ALS)是一种神经退行性疾病,从发病之日起一直持续到死亡。诸如疾病进展率和模式等因素在患者之间差异很大,因此很难实现对ALS的准确预测。为了准确预测ALS疾病状态和恶化,我们提出了一种新的方法,该方法结合了:a)基于动态时间扭曲的序列聚类,用于分离不同ALS恶化模式的患者;b)序列模式挖掘,用于发现相同类型患者可能具有的恶化变化;c)基于恶化的患者下一状态预测。使用临床数据集,我们证明了与其他分类方法和时间模型(如长短期记忆)相比,所提出的方法在分类准确性和退化检测方面的优势。
{"title":"Temporal Modeling of Deterioration Patterns and Clustering for Disease Prediction of ALS Patients","authors":"Dan Halbersberg, B. Lerner","doi":"10.1109/ICMLA.2019.00019","DOIUrl":"https://doi.org/10.1109/ICMLA.2019.00019","url":null,"abstract":"Amyotrophic lateral sclerosis (ALS) is a neurodegenerative disease, lasting from the day of onset until death. Factors such as the progression rate and pattern of the disease vary greatly among patients, making it difficult to achieve accurate predictions about ALS. To accurately predict ALS disease state and deterioration, we propose a novel approach that combines: a) sequence clustering based on dynamic time warping for separation among patients with diverse ALS deterioration patterns, b) sequential pattern mining for discovery of deterioration changes that patients of the same type may have in common, and c) deterioration-based patient next-state prediction. Using a clinical dataset, we demonstrate the advantage of the proposed approach in terms of classification accuracy and deterioration detection compared to other classification methods and temporal models such as long short-term memory.","PeriodicalId":436714,"journal":{"name":"2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130330356","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Denoising Internet Delay Measurements using Weak Supervision 基于弱监督的网络时延测量去噪
A. Muthukumar, Ramakrishnan Durairajan
To understand the delay characteristics of the Internet, a myriad of measurement tools and techniques are proposed by the researchers in academia and industry. Datasets from such measurement tools are curated to facilitate analyses at a later time. Despite the benefits of these tools and datasets, the systematic interpretation of measurements in the face of measurement noise. Unfortunately, state-of-the-art denoising techniques are labor-intensive and ineffective. To tackle this problem, we develop NoMoNoise, an open-source framework for denoising latency measurements by leveraging the recent advancements in weak-supervised learning. NoMoNoise can generate measurement noise labels that could be integrated into the inference and control logic to remove and/or repair noisy measurements in an automated and rapid fashion. We evaluate the efficacy of NoMoNoise in a lab-based setting and a real-world setting by applying it on CAIDA's Ark dataset and show that NoMoNoise can remove noisy measurements effectively with high accuracy.
为了了解互联网的延迟特性,学术界和工业界的研究人员提出了无数的测量工具和技术。来自这些测量工具的数据集经过整理,以方便以后的分析。尽管这些工具和数据集有好处,但面对测量噪声的测量系统解释。不幸的是,最先进的去噪技术是劳动密集型和无效的。为了解决这个问题,我们开发了NoMoNoise,这是一个开源框架,通过利用弱监督学习的最新进展来去噪延迟测量。NoMoNoise可以生成测量噪声标签,可以集成到推理和控制逻辑中,以自动和快速的方式去除和/或修复噪声测量。我们通过将NoMoNoise应用于CAIDA的Ark数据集,在实验室环境和现实环境中评估了NoMoNoise的有效性,并表明NoMoNoise可以有效地去除噪声测量,并且精度很高。
{"title":"Denoising Internet Delay Measurements using Weak Supervision","authors":"A. Muthukumar, Ramakrishnan Durairajan","doi":"10.1109/ICMLA.2019.00089","DOIUrl":"https://doi.org/10.1109/ICMLA.2019.00089","url":null,"abstract":"To understand the delay characteristics of the Internet, a myriad of measurement tools and techniques are proposed by the researchers in academia and industry. Datasets from such measurement tools are curated to facilitate analyses at a later time. Despite the benefits of these tools and datasets, the systematic interpretation of measurements in the face of measurement noise. Unfortunately, state-of-the-art denoising techniques are labor-intensive and ineffective. To tackle this problem, we develop NoMoNoise, an open-source framework for denoising latency measurements by leveraging the recent advancements in weak-supervised learning. NoMoNoise can generate measurement noise labels that could be integrated into the inference and control logic to remove and/or repair noisy measurements in an automated and rapid fashion. We evaluate the efficacy of NoMoNoise in a lab-based setting and a real-world setting by applying it on CAIDA's Ark dataset and show that NoMoNoise can remove noisy measurements effectively with high accuracy.","PeriodicalId":436714,"journal":{"name":"2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125649665","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Gender Estimation from a Hybrid of Face, Upper and Full Body Images at Varying Body Poses 从不同身体姿势的面部,上身和全身图像的混合性别估计
O. Iloanusi, C. Mbah
High gender classification accuracies have been recorded with high-resolution faces under controlled conditions. However, real-life scenarios are faced with challenges not limited to high pose variations in subjects, poor visibility, occlusion, and distance from camera. These have led to the current trend in estimating gender from full body images, notwithstanding the challenges posed by partial body images in a typical life scenario. We demonstrate that there are certain sections in a body image, the face, upper or lower body that are useful for recognition at near or far distances. Given the challenges of body captured at far distance or partially showing body in a photo, we therefore propose a combination of three classifiers for gender estimation from face; upper and full body from single-shot image. Our results in far compared to near distance images suggest that gender is best estimated from a hybrid of face; upper and full body images under challenging conditions.
在控制条件下,高分辨率人脸的性别分类准确率很高。然而,现实生活场景面临的挑战不仅限于高姿态变化的主题,能见度差,遮挡和距离相机。这导致了目前从全身图像估计性别的趋势,尽管在典型的生活场景中部分身体图像带来了挑战。我们证明了身体图像的某些部分,面部,上半身或下半身,对于近距离或远距离的识别是有用的。考虑到远距离拍摄身体或在照片中部分显示身体的挑战,我们因此提出了三种分类器的组合,用于面部性别估计;上身和全身来自单张照片。与近距离图像相比,我们的研究结果表明,从面部混合图像中可以最好地估计出性别;挑战性条件下的上半身和全身图像。
{"title":"Gender Estimation from a Hybrid of Face, Upper and Full Body Images at Varying Body Poses","authors":"O. Iloanusi, C. Mbah","doi":"10.1109/ICMLA.2019.00312","DOIUrl":"https://doi.org/10.1109/ICMLA.2019.00312","url":null,"abstract":"High gender classification accuracies have been recorded with high-resolution faces under controlled conditions. However, real-life scenarios are faced with challenges not limited to high pose variations in subjects, poor visibility, occlusion, and distance from camera. These have led to the current trend in estimating gender from full body images, notwithstanding the challenges posed by partial body images in a typical life scenario. We demonstrate that there are certain sections in a body image, the face, upper or lower body that are useful for recognition at near or far distances. Given the challenges of body captured at far distance or partially showing body in a photo, we therefore propose a combination of three classifiers for gender estimation from face; upper and full body from single-shot image. Our results in far compared to near distance images suggest that gender is best estimated from a hybrid of face; upper and full body images under challenging conditions.","PeriodicalId":436714,"journal":{"name":"2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132396798","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Design of a Cost-Effective Deep Convolutional Neural Network–Based Scheme for Diagnosing Faults in Smart Grids 基于深度卷积神经网络的智能电网故障诊断方案设计
Hossein Hassani, Maryam Farajzadeh-Zanjani, R. Razavi-Far, M. Saif, V. Palade
There has been a growing interest in using smart grids due to their capability in delivering automated and distributed energy level to the consumption units. However, in order to guarantee the safe and reliable delivery of the high-quality power from the generation units to the consumers, smart grids need to be equipped with diagnostic systems. This paper presents an efficient data-driven scheme for diagnosing faults in smart grids. In order to reduce the computational burden and monitor the state of the system with a lower number of smart meters, a method based on the affinity propagation clustering algorithm is suggested for the placement of meters, that makes use of the graph-based representation of the system. The collected voltage data measurements from the installed meters are then decomposed by matching pursuit decomposition in order to generate informative features. Extracted features are then used to train a convolutional neural network, and the constructed deep learning model is then tested using unseen samples of normal and faulty conditions. Simulation results based on the IEEE 39–Bus System demonstrate the effectiveness of the proposed data-driven fault diagnostic system.
由于智能电网能够向消费单位提供自动化和分布式的能源水平,人们对使用智能电网的兴趣越来越大。然而,为了保证发电机组将高质量的电力安全可靠地输送到用户手中,智能电网需要配备诊断系统。提出了一种基于数据驱动的智能电网故障诊断方案。为了在智能电表数量较少的情况下减少系统的计算负担和监控系统的状态,提出了一种基于亲和传播聚类算法的电表放置方法,该方法利用系统的基于图的表示。然后通过匹配追踪分解对安装的仪表收集的电压数据进行分解,以生成信息特征。然后,提取的特征用于训练卷积神经网络,然后使用未见过的正常和故障条件样本测试构建的深度学习模型。基于IEEE 39总线系统的仿真结果验证了数据驱动故障诊断系统的有效性。
{"title":"Design of a Cost-Effective Deep Convolutional Neural Network–Based Scheme for Diagnosing Faults in Smart Grids","authors":"Hossein Hassani, Maryam Farajzadeh-Zanjani, R. Razavi-Far, M. Saif, V. Palade","doi":"10.1109/ICMLA.2019.00232","DOIUrl":"https://doi.org/10.1109/ICMLA.2019.00232","url":null,"abstract":"There has been a growing interest in using smart grids due to their capability in delivering automated and distributed energy level to the consumption units. However, in order to guarantee the safe and reliable delivery of the high-quality power from the generation units to the consumers, smart grids need to be equipped with diagnostic systems. This paper presents an efficient data-driven scheme for diagnosing faults in smart grids. In order to reduce the computational burden and monitor the state of the system with a lower number of smart meters, a method based on the affinity propagation clustering algorithm is suggested for the placement of meters, that makes use of the graph-based representation of the system. The collected voltage data measurements from the installed meters are then decomposed by matching pursuit decomposition in order to generate informative features. Extracted features are then used to train a convolutional neural network, and the constructed deep learning model is then tested using unseen samples of normal and faulty conditions. Simulation results based on the IEEE 39–Bus System demonstrate the effectiveness of the proposed data-driven fault diagnostic system.","PeriodicalId":436714,"journal":{"name":"2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)","volume":"257 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134103142","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Acoustic Scene Classification Using Deep Mixtures of Pre-trained Convolutional Neural Networks 基于深度混合预训练卷积神经网络的声学场景分类
Truc The Nguyen, Alexander Fuchs, F. Pernkopf
We propose a heterogeneous system of Deep Mixture of Experts (DMoEs) models using different Convolutional Neural Networks (CNNs) for acoustic scene classification (ASC). Each DMoEs module is a mixture of different parallel CNN structures weighted by a gating network. All CNNs use the same input data. The CNN architectures play the role of experts extracting a variety of features. The experts are pre-trained, and kept fixed (frozen) for the DMoEs model. The DMoEs is post-trained by optimizing weights of the gating network, which estimates the contribution of the experts in the mixture. In order to enhance the performance, we use an ensemble of three DMoEs modules each with different pairs of inputs and individual CNN models. The input pairs are spectrogram combinations of binaural audio and mono audio as well as their pre-processed variations using harmonic-percussive source separation (HPSS) and nearest neighbor filters (NNFs). The classification result of the proposed system is 72.1% improving the baseline by around 12% (absolute) on the development data of DCASE 2018 challenge task 1A.
我们提出了一个使用不同卷积神经网络(cnn)进行声场景分类(ASC)的深度混合专家(DMoEs)模型的异构系统。每个DMoEs模块是由门控网络加权的不同并行CNN结构的混合物。所有cnn使用相同的输入数据。CNN架构扮演专家的角色,提取各种特征。专家经过预先训练,并为DMoEs模型保持固定(冻结)。通过优化门控网络的权值对DMoEs进行后训练,估计混合专家的贡献。为了提高性能,我们使用了三个DMoEs模块的集成,每个模块都有不同的输入对和单独的CNN模型。输入对是双耳音频和单声道音频的频谱图组合,以及它们使用谐波冲击源分离(HPSS)和最近邻滤波器(NNFs)进行预处理的变化。在DCASE 2018挑战任务1A的开发数据上,该系统的分类结果为72.1%,比基线提高了约12%(绝对)。
{"title":"Acoustic Scene Classification Using Deep Mixtures of Pre-trained Convolutional Neural Networks","authors":"Truc The Nguyen, Alexander Fuchs, F. Pernkopf","doi":"10.1109/ICMLA.2019.00151","DOIUrl":"https://doi.org/10.1109/ICMLA.2019.00151","url":null,"abstract":"We propose a heterogeneous system of Deep Mixture of Experts (DMoEs) models using different Convolutional Neural Networks (CNNs) for acoustic scene classification (ASC). Each DMoEs module is a mixture of different parallel CNN structures weighted by a gating network. All CNNs use the same input data. The CNN architectures play the role of experts extracting a variety of features. The experts are pre-trained, and kept fixed (frozen) for the DMoEs model. The DMoEs is post-trained by optimizing weights of the gating network, which estimates the contribution of the experts in the mixture. In order to enhance the performance, we use an ensemble of three DMoEs modules each with different pairs of inputs and individual CNN models. The input pairs are spectrogram combinations of binaural audio and mono audio as well as their pre-processed variations using harmonic-percussive source separation (HPSS) and nearest neighbor filters (NNFs). The classification result of the proposed system is 72.1% improving the baseline by around 12% (absolute) on the development data of DCASE 2018 challenge task 1A.","PeriodicalId":436714,"journal":{"name":"2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)","volume":"49 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134398568","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Understanding Early Childhood Obesity via Interpretation of Machine Learning Model Predictions 通过解释机器学习模型预测来理解早期儿童肥胖
Xueqin Pang, C. Forrest, F. Lê-Scherban, A. Masino
Obesity, as an independent risk factor for increased morbidity and mortality throughout the lifecycle, is a major health issue in the United States. Pediatric obesity is a strong risk factor for adult obesity, as it tends to be stable and tracks into adulthood. Therefore, prevention of childhood obesity is urgently required for reduction in obesity prevalence and obesity related comorbidities. In this paper, the general pediatric obesity development pattern and the onset time period of early childhood obesity was identified via analysis of approximately 11 million pediatric clinical encounters of 860,510 unique individuals. XGBoost model was developed to predict at age 2 years if individuals would develop obesity in early childhood. The model is generalized to both males and females, and achieved an AUC of 81% (± 0.1%). Obesity associated risk factors were further analyzed via interpretation of the XGBoost model predictions. Besides known predictive factors such as weight, height, race, and ethnicity, new factors such as body temperature and respiratory rate were also identified. As body temperature and respiratory rate are related to human metabolism, novel physiologic mechanisms that cause these associations might be discovered in future research. We decomposed model recall to different age ranges when obesity incidence occurred. The model recall for individuals with obesity incidence between 24–36 months was 97.63%, while recall for obesity incidence between 72–84 months was 48.96%, suggesting obesity is less predictable further in the future. Since obesity is largely affected by evolving factors such as life style, diet, and living environment, it is possible that obesity prevention may be achieved via changes in adjustable factors.
肥胖作为整个生命周期中发病率和死亡率增加的一个独立风险因素,是美国的一个主要健康问题。儿童肥胖是成人肥胖的一个强大的危险因素,因为它往往是稳定的,并持续到成年。因此,迫切需要预防儿童肥胖,以减少肥胖患病率和肥胖相关的合并症。在本文中,通过分析约1100万儿科临床就诊的860,510个独特个体,确定了儿童肥胖的一般发展模式和儿童早期肥胖的发病时间。XGBoost模型用于预测个体在2岁时是否会在儿童早期出现肥胖。该模型适用于男性和女性,AUC为81%(±0.1%)。通过对XGBoost模型预测的解释,进一步分析了肥胖相关的风险因素。除了已知的预测因素,如体重、身高、种族和民族,新的因素,如体温和呼吸频率也被确定。由于体温和呼吸频率与人体代谢有关,在未来的研究中可能会发现导致这些关联的新的生理机制。我们将模型召回率分解到肥胖发生的不同年龄范围。24-36个月肥胖个体的模型召回率为97.63%,72-84个月肥胖个体的模型召回率为48.96%,表明未来肥胖的可预测性较差。由于肥胖在很大程度上受生活方式、饮食和生活环境等进化因素的影响,因此有可能通过改变可调节因素来预防肥胖。
{"title":"Understanding Early Childhood Obesity via Interpretation of Machine Learning Model Predictions","authors":"Xueqin Pang, C. Forrest, F. Lê-Scherban, A. Masino","doi":"10.1109/ICMLA.2019.00235","DOIUrl":"https://doi.org/10.1109/ICMLA.2019.00235","url":null,"abstract":"Obesity, as an independent risk factor for increased morbidity and mortality throughout the lifecycle, is a major health issue in the United States. Pediatric obesity is a strong risk factor for adult obesity, as it tends to be stable and tracks into adulthood. Therefore, prevention of childhood obesity is urgently required for reduction in obesity prevalence and obesity related comorbidities. In this paper, the general pediatric obesity development pattern and the onset time period of early childhood obesity was identified via analysis of approximately 11 million pediatric clinical encounters of 860,510 unique individuals. XGBoost model was developed to predict at age 2 years if individuals would develop obesity in early childhood. The model is generalized to both males and females, and achieved an AUC of 81% (± 0.1%). Obesity associated risk factors were further analyzed via interpretation of the XGBoost model predictions. Besides known predictive factors such as weight, height, race, and ethnicity, new factors such as body temperature and respiratory rate were also identified. As body temperature and respiratory rate are related to human metabolism, novel physiologic mechanisms that cause these associations might be discovered in future research. We decomposed model recall to different age ranges when obesity incidence occurred. The model recall for individuals with obesity incidence between 24–36 months was 97.63%, while recall for obesity incidence between 72–84 months was 48.96%, suggesting obesity is less predictable further in the future. Since obesity is largely affected by evolving factors such as life style, diet, and living environment, it is possible that obesity prevention may be achieved via changes in adjustable factors.","PeriodicalId":436714,"journal":{"name":"2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131694589","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 17
Lean Training Data Generation for Planar Object Detection Models in Unsteady Logistics Contexts 非定常物流环境下平面目标检测模型的精益训练数据生成
Laura Dörr, Felix Brandt, Anne Meyer, Martin Pouls
Supervised deep learning has become the state of the art method for object detection and is used in many application areas such as autonomous driving, manufacturing industries or security systems. The acquisition of annotated data sets for the training of neural networks is highly time-consuming and error-prone. Thus, the supervised training of such object detection models is not feasible in some cases. This holds for the task of logistics transport label detection, as this use-case stands out by requiring highly specialized, quickly adapting models whilst allowing for little manual efforts in the data preparation and training process. We propose an easy training data generation method enabling the fully automated training of specialized models for the task of logistics transport label detection. For data synthesis, we stitch instances of the transport labels to be detected into background images whilst using image degradation and augmentation methods. We evaluate the employment of both use-case-specific, carefully selected background images and randomly selected real-world background images. Further, we compare two different data generation approaches: one generating realistically looking images and a simpler one making do without any manual image annotation. We examine and evaluate the introduced method on a new and publicly available example data set relevant for logistics transport label detection. We show that accurate models can be trained exclusively on synthetic training data and we compare their performance to models trained on real, manually annotated images.
监督深度学习已经成为物体检测的最新方法,并被用于自动驾驶、制造业或安全系统等许多应用领域。为神经网络训练获取带注释的数据集是非常耗时且容易出错的。因此,这种目标检测模型的监督训练在某些情况下是不可行的。这适用于物流运输标签检测任务,因为这个用例需要高度专业化、快速适应的模型,同时在数据准备和培训过程中只需要很少的人工工作。我们提出了一种简单的训练数据生成方法,使物流运输标签检测任务的专业模型的全自动训练成为可能。对于数据合成,我们将待检测的传输标签实例拼接到背景图像中,同时使用图像退化和增强方法。我们评估了用例特定的、精心选择的背景图像和随机选择的真实世界背景图像的使用情况。此外,我们比较了两种不同的数据生成方法:一种生成逼真的图像,另一种更简单,不需要任何手动图像注释。我们在一个与物流运输标签检测相关的新的和公开可用的示例数据集上检查和评估所引入的方法。我们证明了精确的模型可以只在合成的训练数据上训练,我们将它们的性能与在真实的、手动注释的图像上训练的模型进行了比较。
{"title":"Lean Training Data Generation for Planar Object Detection Models in Unsteady Logistics Contexts","authors":"Laura Dörr, Felix Brandt, Anne Meyer, Martin Pouls","doi":"10.1109/ICMLA.2019.00062","DOIUrl":"https://doi.org/10.1109/ICMLA.2019.00062","url":null,"abstract":"Supervised deep learning has become the state of the art method for object detection and is used in many application areas such as autonomous driving, manufacturing industries or security systems. The acquisition of annotated data sets for the training of neural networks is highly time-consuming and error-prone. Thus, the supervised training of such object detection models is not feasible in some cases. This holds for the task of logistics transport label detection, as this use-case stands out by requiring highly specialized, quickly adapting models whilst allowing for little manual efforts in the data preparation and training process. We propose an easy training data generation method enabling the fully automated training of specialized models for the task of logistics transport label detection. For data synthesis, we stitch instances of the transport labels to be detected into background images whilst using image degradation and augmentation methods. We evaluate the employment of both use-case-specific, carefully selected background images and randomly selected real-world background images. Further, we compare two different data generation approaches: one generating realistically looking images and a simpler one making do without any manual image annotation. We examine and evaluate the introduced method on a new and publicly available example data set relevant for logistics transport label detection. We show that accurate models can be trained exclusively on synthetic training data and we compare their performance to models trained on real, manually annotated images.","PeriodicalId":436714,"journal":{"name":"2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133253828","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
期刊
2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1