首页 > 最新文献

2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)最新文献

英文 中文
Interpretable Approximation of a Deep Reinforcement Learning Agent as a Set of If-Then Rules 作为一组If-Then规则的深度强化学习代理的可解释逼近
S. Nageshrao, Bruno Costa, Dimitar Filev
In many industrial applications, one of the major bottlenecks in using advanced learning-based methods (such as reinforcement learning) for controls is the lack of interpretability of the trained agent. In this paper, we present a methodology for translating a trained reinforcement learning agent into a set of simple and easy to interpret if-then rules by using the proven universal approximation property of the rules with fuzzy predicates. Proposed methodology combines the optimality of reinforcement learning with interpretability of the theory of approximate reasoning, thus making reinforcement learning-based solutions more accessible to industrial practitioners. The framework presented in this paper has the potential to help address the fundamental problem in widespread adoption of reinforcement learning in industrial applications.
在许多工业应用中,使用基于高级学习的方法(如强化学习)进行控制的主要瓶颈之一是训练后的代理缺乏可解释性。在本文中,我们提出了一种方法,通过使用已证明的带有模糊谓词的规则的普遍近似性质,将训练好的强化学习代理转换为一组简单且易于解释的if-then规则。所提出的方法将强化学习的最优性与近似推理理论的可解释性相结合,从而使基于强化学习的解决方案更容易被工业从业者所接受。本文提出的框架有可能帮助解决在工业应用中广泛采用强化学习的基本问题。
{"title":"Interpretable Approximation of a Deep Reinforcement Learning Agent as a Set of If-Then Rules","authors":"S. Nageshrao, Bruno Costa, Dimitar Filev","doi":"10.1109/ICMLA.2019.00041","DOIUrl":"https://doi.org/10.1109/ICMLA.2019.00041","url":null,"abstract":"In many industrial applications, one of the major bottlenecks in using advanced learning-based methods (such as reinforcement learning) for controls is the lack of interpretability of the trained agent. In this paper, we present a methodology for translating a trained reinforcement learning agent into a set of simple and easy to interpret if-then rules by using the proven universal approximation property of the rules with fuzzy predicates. Proposed methodology combines the optimality of reinforcement learning with interpretability of the theory of approximate reasoning, thus making reinforcement learning-based solutions more accessible to industrial practitioners. The framework presented in this paper has the potential to help address the fundamental problem in widespread adoption of reinforcement learning in industrial applications.","PeriodicalId":436714,"journal":{"name":"2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134009517","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Generating Near and Far Analogies for Educational Applications: Progress and Challenges 在教育应用中产生远近类比:进展与挑战
M. Boger, A. Laverghetta, Nikolai Fetisov, John Licato
Analogical reasoning, it has been argued, fundamentally underlies many cognitive processes and is an important marker of developmental cognition. This connection suggests that the clever use of analogical reasoning tasks can improve cognitive performance in specific ways, thus leading to clear educational applications, as recent psychological work has confirmed. However, currently there are no known methods to either solve or generate analogical word problems, at least to a degree of reliability that would be necessary before such educational applications are possible. To address these concerns we present work to both solve and generate analogy word problems: First, given an analogy word problem, our algorithm performs a parallel random walk through the semantic network ConceptNet to limit the number of choices that are then considered by a vector embedding. We achieve an improvement in accuracy beyond existing state-of-the-art. Second, we explore a method for automatically generating explainable n-step analogy word problems, and analyze the results.
类比推理一直被认为是许多认知过程的基础,是发展认知的重要标志。这种联系表明,巧妙地使用类比推理任务可以以特定的方式提高认知表现,从而导致明确的教育应用,正如最近的心理学研究所证实的那样。然而,目前还没有已知的方法来解决或产生类似的单词问题,至少在这种教育应用成为可能之前,没有一定程度的可靠性。为了解决这些问题,我们提出了解决和生成类比词问题的工作:首先,给定一个类比词问题,我们的算法在语义网络ConceptNet中执行并行随机漫步,以限制向量嵌入所考虑的选择数量。我们实现了精度的提高,超越了现有的最先进的技术。其次,我们探索了一种自动生成可解释的n步类比词问题的方法,并分析了结果。
{"title":"Generating Near and Far Analogies for Educational Applications: Progress and Challenges","authors":"M. Boger, A. Laverghetta, Nikolai Fetisov, John Licato","doi":"10.1109/ICMLA.2019.00316","DOIUrl":"https://doi.org/10.1109/ICMLA.2019.00316","url":null,"abstract":"Analogical reasoning, it has been argued, fundamentally underlies many cognitive processes and is an important marker of developmental cognition. This connection suggests that the clever use of analogical reasoning tasks can improve cognitive performance in specific ways, thus leading to clear educational applications, as recent psychological work has confirmed. However, currently there are no known methods to either solve or generate analogical word problems, at least to a degree of reliability that would be necessary before such educational applications are possible. To address these concerns we present work to both solve and generate analogy word problems: First, given an analogy word problem, our algorithm performs a parallel random walk through the semantic network ConceptNet to limit the number of choices that are then considered by a vector embedding. We achieve an improvement in accuracy beyond existing state-of-the-art. Second, we explore a method for automatically generating explainable n-step analogy word problems, and analyze the results.","PeriodicalId":436714,"journal":{"name":"2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)","volume":"91 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133529007","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
SVM-Based Segmentation of Home Appliance Energy Measurements 基于svm的家电能耗测量分割
Marc Wenninger, Dominik Stecher, Jochen Schmidt
Generating a more detailed understanding of domestic electricity demand is a major topic for energy suppliers and householders in times of climate change. Over the years there have been many studies on consumption feedback systems to inform householders, disaggregation algorithms for Non-Intrusive-Load-Monitoring (NILM), Real-Time-Pricing (RTP) to promote supply aware behavior through monetary incentives and appliance usage prediction algorithms. While these studies are vital steps towards energy awareness, one of the most fundamental challenges has not yet been tackled: Automated detection of start and stop of usage cycles of household appliances. We argue that most research efforts in this area will benefit from a reliable segmentation method to provide accurate usage information. We propose a SVM-based segmentation method for home appliances such as dishwashers and washing machines. The method is evaluated using manually annotated electricity measurements of five different appliances recorded over two years in multiple households.
在气候变化时期,更详细地了解国内电力需求是能源供应商和家庭的一个主要课题。多年来,已经有许多关于消费反馈系统的研究,以告知家庭,非侵入式负荷监测(NILM)的分解算法,实时定价(RTP)通过货币激励和家电使用预测算法来促进供应意识行为。虽然这些研究是提高能源意识的重要一步,但最根本的挑战之一尚未得到解决:家用电器使用周期的启动和停止自动检测。我们认为,该领域的大多数研究工作将受益于可靠的分割方法,以提供准确的使用信息。我们提出了一种基于svm的家用电器分割方法,如洗碗机和洗衣机。该方法是通过对多个家庭在两年内记录的五种不同电器的手动注释电力测量来评估的。
{"title":"SVM-Based Segmentation of Home Appliance Energy Measurements","authors":"Marc Wenninger, Dominik Stecher, Jochen Schmidt","doi":"10.1109/ICMLA.2019.00272","DOIUrl":"https://doi.org/10.1109/ICMLA.2019.00272","url":null,"abstract":"Generating a more detailed understanding of domestic electricity demand is a major topic for energy suppliers and householders in times of climate change. Over the years there have been many studies on consumption feedback systems to inform householders, disaggregation algorithms for Non-Intrusive-Load-Monitoring (NILM), Real-Time-Pricing (RTP) to promote supply aware behavior through monetary incentives and appliance usage prediction algorithms. While these studies are vital steps towards energy awareness, one of the most fundamental challenges has not yet been tackled: Automated detection of start and stop of usage cycles of household appliances. We argue that most research efforts in this area will benefit from a reliable segmentation method to provide accurate usage information. We propose a SVM-based segmentation method for home appliances such as dishwashers and washing machines. The method is evaluated using manually annotated electricity measurements of five different appliances recorded over two years in multiple households.","PeriodicalId":436714,"journal":{"name":"2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130279786","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Temporal Modeling of Deterioration Patterns and Clustering for Disease Prediction of ALS Patients ALS患者疾病预测恶化模式的时间模型和聚类
Dan Halbersberg, B. Lerner
Amyotrophic lateral sclerosis (ALS) is a neurodegenerative disease, lasting from the day of onset until death. Factors such as the progression rate and pattern of the disease vary greatly among patients, making it difficult to achieve accurate predictions about ALS. To accurately predict ALS disease state and deterioration, we propose a novel approach that combines: a) sequence clustering based on dynamic time warping for separation among patients with diverse ALS deterioration patterns, b) sequential pattern mining for discovery of deterioration changes that patients of the same type may have in common, and c) deterioration-based patient next-state prediction. Using a clinical dataset, we demonstrate the advantage of the proposed approach in terms of classification accuracy and deterioration detection compared to other classification methods and temporal models such as long short-term memory.
肌萎缩性侧索硬化症(ALS)是一种神经退行性疾病,从发病之日起一直持续到死亡。诸如疾病进展率和模式等因素在患者之间差异很大,因此很难实现对ALS的准确预测。为了准确预测ALS疾病状态和恶化,我们提出了一种新的方法,该方法结合了:a)基于动态时间扭曲的序列聚类,用于分离不同ALS恶化模式的患者;b)序列模式挖掘,用于发现相同类型患者可能具有的恶化变化;c)基于恶化的患者下一状态预测。使用临床数据集,我们证明了与其他分类方法和时间模型(如长短期记忆)相比,所提出的方法在分类准确性和退化检测方面的优势。
{"title":"Temporal Modeling of Deterioration Patterns and Clustering for Disease Prediction of ALS Patients","authors":"Dan Halbersberg, B. Lerner","doi":"10.1109/ICMLA.2019.00019","DOIUrl":"https://doi.org/10.1109/ICMLA.2019.00019","url":null,"abstract":"Amyotrophic lateral sclerosis (ALS) is a neurodegenerative disease, lasting from the day of onset until death. Factors such as the progression rate and pattern of the disease vary greatly among patients, making it difficult to achieve accurate predictions about ALS. To accurately predict ALS disease state and deterioration, we propose a novel approach that combines: a) sequence clustering based on dynamic time warping for separation among patients with diverse ALS deterioration patterns, b) sequential pattern mining for discovery of deterioration changes that patients of the same type may have in common, and c) deterioration-based patient next-state prediction. Using a clinical dataset, we demonstrate the advantage of the proposed approach in terms of classification accuracy and deterioration detection compared to other classification methods and temporal models such as long short-term memory.","PeriodicalId":436714,"journal":{"name":"2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130330356","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Deep Ensemble Network for Quantification and Severity Assessment of Knee Osteoarthritis 用于膝骨关节炎量化和严重程度评估的深度集成网络
Mohammed Bany Muhammad, A. Moinuddin, M. Lee, Yanfei Zhang, V. Abedi, R. Zand, M. Yeasin
The assessment of knee joint gap and severity of Osteoarthritis (OA) is subjective and often inaccurate. The main source of error is due to the judgement of human expert from low resolution images (i.e., X-ray images). To address the problem, we developed an ensemble of Deep Learning (DL) model to objectively score the severity of OA only from the radiometric images. The proposed method consists of two main modules. First, we developed a scale invariant and aspect ratio preserving automatic localization and characterization of the kneecap area. Second, we developed multiple instances of "hyper parameter optimized" DL models and fused them using ensemble classification to score the severity of OA. In this implementation, we used three convolutional neural networks to improve the bias-variance trade-off, and boost accuracy and generalization. We tested our modeling framework using a collection of 4,796 X-ray images from Osteoarthritis Initiative (OAI). Our results show a higher performance (~ 2-8%) when compared to the state-of-the-art methods. Finally, this machine learning-based methodology provides a pipeline in decision support system for assessing and quantifying the OA severity.
评估膝关节间隙和骨关节炎(OA)的严重程度是主观的,往往不准确。误差的主要来源是人类专家对低分辨率图像(即x射线图像)的判断。为了解决这个问题,我们开发了一个深度学习(DL)集成模型,仅从放射图像中客观地评分OA的严重程度。该方法包括两个主要模块。首先,我们开发了一种保留尺度不变量和纵横比的膝盖骨区域自动定位和表征方法。其次,我们开发了多个“超参数优化”深度学习模型实例,并使用集成分类对它们进行融合,对OA的严重程度进行评分。在这个实现中,我们使用了三个卷积神经网络来改善偏差-方差权衡,并提高准确性和泛化。我们使用骨关节炎倡议(OAI)收集的4,796张x射线图像来测试我们的建模框架。与最先进的方法相比,我们的结果显示出更高的性能(~ 2-8%)。最后,基于机器学习的方法为OA严重程度的评估和量化提供了决策支持系统的管道。
{"title":"Deep Ensemble Network for Quantification and Severity Assessment of Knee Osteoarthritis","authors":"Mohammed Bany Muhammad, A. Moinuddin, M. Lee, Yanfei Zhang, V. Abedi, R. Zand, M. Yeasin","doi":"10.1109/ICMLA.2019.00163","DOIUrl":"https://doi.org/10.1109/ICMLA.2019.00163","url":null,"abstract":"The assessment of knee joint gap and severity of Osteoarthritis (OA) is subjective and often inaccurate. The main source of error is due to the judgement of human expert from low resolution images (i.e., X-ray images). To address the problem, we developed an ensemble of Deep Learning (DL) model to objectively score the severity of OA only from the radiometric images. The proposed method consists of two main modules. First, we developed a scale invariant and aspect ratio preserving automatic localization and characterization of the kneecap area. Second, we developed multiple instances of \"hyper parameter optimized\" DL models and fused them using ensemble classification to score the severity of OA. In this implementation, we used three convolutional neural networks to improve the bias-variance trade-off, and boost accuracy and generalization. We tested our modeling framework using a collection of 4,796 X-ray images from Osteoarthritis Initiative (OAI). Our results show a higher performance (~ 2-8%) when compared to the state-of-the-art methods. Finally, this machine learning-based methodology provides a pipeline in decision support system for assessing and quantifying the OA severity.","PeriodicalId":436714,"journal":{"name":"2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)","volume":"66 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115078464","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
A Deep Learning Approach to Distributed Anomaly Detection for Edge Computing 边缘计算分布式异常检测的深度学习方法
Okwudili M. Ezeme, Q. Mahmoud, Akramul Azim
One of the multiplier effects of the boom in mobile technologies ranging from cell phones to computers and wearables like smart watches is that every public and private common spaces are now dotted with Wi-Fi hotspots. These hotspots provide the convenience of accessing the internet on-the-go for either play or work. Also, with the increased automation of our daily routines by our mobile devices via a multitude of applications, our vulnerability to cyber fraud or attacks becomes higher too. Hence, the need for heightened security that is capable of detecting anomalies on-the-fly. However, these edge devices connected to the local area network come with diverse capabilities with varying degrees of limitations in compute and energy resources. Therefore, running a process-based anomaly detector is not given a high priority in these devices because; a) the primary functions of the applications running on the devices is not security; therefore, the device allocates much of its resources into satisfying the primary duty of the applications. b) the volume and velocity of the data are high. Therefore, in this paper, we introduce a multi-node (nodes and devices are used interchangeably in the paper) ad-hoc network that uses a novel offloading scheme to bring an online anomaly detection capability on the kernel events to the nodes in the network. We test the framework in a Wi-Fi-based ad-hoc network made up of several devices, and the results confirm our hypothesis that the scheme can reduce latency and increase the throughput of the anomaly detector, thereby making online anomaly detection in the edge possible without sacrificing the accuracy of the deep recurrent neural network.
从手机到电脑,再到智能手表等可穿戴设备,移动技术的蓬勃发展带来的一个乘数效应是,每一个公共和私人公共空间现在都点缀着Wi-Fi热点。这些热点提供了方便的上网,无论是玩还是工作。此外,随着我们的移动设备通过大量应用程序提高了日常生活的自动化程度,我们对网络欺诈或攻击的脆弱性也变得更高。因此,需要提高安全性,能够在飞行中检测异常情况。然而,这些连接到局域网的边缘设备具有不同的功能,在计算和能源资源方面有不同程度的限制。因此,在这些设备中,运行基于进程的异常检测器没有被赋予高优先级,因为;A)设备上运行的应用程序的主要功能不是安全;因此,设备将其大部分资源分配给满足应用程序的主要任务。B)数据的量和速度都很高。因此,在本文中,我们引入了一个多节点(节点和设备在本文中互换使用)自组织网络,该网络使用一种新颖的卸载方案,为网络中的节点提供对内核事件的在线异常检测能力。我们在由多个设备组成的基于wi - fi的ad-hoc网络中对该框架进行了测试,结果证实了我们的假设,即该方案可以减少延迟并增加异常检测器的吞吐量,从而在不牺牲深度递归神经网络准确性的情况下实现边缘的在线异常检测。
{"title":"A Deep Learning Approach to Distributed Anomaly Detection for Edge Computing","authors":"Okwudili M. Ezeme, Q. Mahmoud, Akramul Azim","doi":"10.1109/ICMLA.2019.00169","DOIUrl":"https://doi.org/10.1109/ICMLA.2019.00169","url":null,"abstract":"One of the multiplier effects of the boom in mobile technologies ranging from cell phones to computers and wearables like smart watches is that every public and private common spaces are now dotted with Wi-Fi hotspots. These hotspots provide the convenience of accessing the internet on-the-go for either play or work. Also, with the increased automation of our daily routines by our mobile devices via a multitude of applications, our vulnerability to cyber fraud or attacks becomes higher too. Hence, the need for heightened security that is capable of detecting anomalies on-the-fly. However, these edge devices connected to the local area network come with diverse capabilities with varying degrees of limitations in compute and energy resources. Therefore, running a process-based anomaly detector is not given a high priority in these devices because; a) the primary functions of the applications running on the devices is not security; therefore, the device allocates much of its resources into satisfying the primary duty of the applications. b) the volume and velocity of the data are high. Therefore, in this paper, we introduce a multi-node (nodes and devices are used interchangeably in the paper) ad-hoc network that uses a novel offloading scheme to bring an online anomaly detection capability on the kernel events to the nodes in the network. We test the framework in a Wi-Fi-based ad-hoc network made up of several devices, and the results confirm our hypothesis that the scheme can reduce latency and increase the throughput of the anomaly detector, thereby making online anomaly detection in the edge possible without sacrificing the accuracy of the deep recurrent neural network.","PeriodicalId":436714,"journal":{"name":"2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115715692","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Hybrid Condition Monitoring for Power Electronic Systems 电力电子系统的混合状态监测
Nikola Marković, T. Stoetzel, V. Staudt, D. Kolossa
This paper proposes a novel approach for condition monitoring of power electronic systems. When monitoring the state of a power system, reliability is crucial, as this type of system is usually operated continuously for long periods of time, and as both missed faults as well as false detections can easily become prohibitively expensive. Recently, machine-learning-based methods for fault detection of power systems have gained popularity, since they can overcome many of the constrains of model-based techniques. Most of these methods train classifiers for different states of the system under test, and thus, the problem of fault detection becomes a problem of classification. In this paper we compare two of such recent techniques. We show that despite good results, it cannot reasonably be expected that the state classification is solved perfectly for every instant of time, which makes the application of such classifiers infeasible in practical systems. In order to overcome these issues, we propose to re-formulate the task into one of hybrid—neural and statistical—cross-temporal hypothesis testing. This novel hybrid framework allows us to build upon the previous machine-learning-based classification approaches, and to achieve full reliability on a challenging dataset of fault monitoring measurements for a buck-converter.
提出了一种电力电子系统状态监测的新方法。在监测电力系统的状态时,可靠性是至关重要的,因为这种类型的系统通常是长时间连续运行的,而且遗漏的故障和错误的检测都很容易变得非常昂贵。最近,基于机器学习的电力系统故障检测方法越来越受欢迎,因为它们可以克服基于模型技术的许多限制。这些方法大多针对被测系统的不同状态训练分类器,因此,故障检测问题变成了分类问题。在本文中,我们比较了两种这样的新技术。我们表明,尽管结果很好,但不能合理地期望状态分类在每个时刻都得到完美解决,这使得这种分类器在实际系统中的应用不可行。为了克服这些问题,我们建议将任务重新制定为混合神经和统计跨时间假设检验之一。这种新颖的混合框架使我们能够在之前基于机器学习的分类方法的基础上构建,并在具有挑战性的buck转换器故障监测测量数据集上实现完全可靠性。
{"title":"Hybrid Condition Monitoring for Power Electronic Systems","authors":"Nikola Marković, T. Stoetzel, V. Staudt, D. Kolossa","doi":"10.1109/ICMLA.2019.00275","DOIUrl":"https://doi.org/10.1109/ICMLA.2019.00275","url":null,"abstract":"This paper proposes a novel approach for condition monitoring of power electronic systems. When monitoring the state of a power system, reliability is crucial, as this type of system is usually operated continuously for long periods of time, and as both missed faults as well as false detections can easily become prohibitively expensive. Recently, machine-learning-based methods for fault detection of power systems have gained popularity, since they can overcome many of the constrains of model-based techniques. Most of these methods train classifiers for different states of the system under test, and thus, the problem of fault detection becomes a problem of classification. In this paper we compare two of such recent techniques. We show that despite good results, it cannot reasonably be expected that the state classification is solved perfectly for every instant of time, which makes the application of such classifiers infeasible in practical systems. In order to overcome these issues, we propose to re-formulate the task into one of hybrid—neural and statistical—cross-temporal hypothesis testing. This novel hybrid framework allows us to build upon the previous machine-learning-based classification approaches, and to achieve full reliability on a challenging dataset of fault monitoring measurements for a buck-converter.","PeriodicalId":436714,"journal":{"name":"2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)","volume":"57 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114160327","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Uncertainty-Aware Personalized Readability Assessments for Second Language Learners 面向第二语言学习者的不确定性个性化可读性评估
Yo Ehara
Assessing whether an ungraded second language learner can read a given text quickly is important for further instructing and supporting the learner, particularly when evaluating numerous ungraded learners from diverse backgrounds. Second language acquisition (SLA) studies have tackled such assessment tasks wherein only a single short vocabulary test result is available to assess a learner; such studies have shown that the text-coverage, i.e., the percentage of words the learner knows in the text, is the key assessment measure. Currently, count-based percentages are used, in which each word in the given text is classified as being known or unknown to the learner, and the words classified as known are then simply counted. When each word is classified, we can also obtain an uncertainty value as to how likely each word is known to the learner. Although such values can be informative for a readability assessment, how to leverage these values to guarantee their use as an assessment measure that is comparable to that of the previous values remains unclear. We propose a novel framework that allows assessment methods to be uncertainty-aware while guaranteeing comparability to the text-coverage threshold. Such methods involve a computationally complex problem, for which we also propose a practical algorithm. In addition, we propose a neural-network based classifier from which we can obtain better uncertainty values. For evaluation, we created a crowdsourcing-based dataset in which a learner takes both vocabulary and readability tests. The best method under our framework outperformed conventional methods.
评估未分级的第二语言学习者是否能够快速阅读给定的文本对于进一步指导和支持学习者非常重要,特别是在评估来自不同背景的众多未分级的学习者时。第二语言习得(SLA)研究已经解决了这样的评估任务,其中只有一个简短的词汇测试结果可用来评估学习者;这些研究表明,文本覆盖率,即学习者在文本中认识的单词的百分比,是关键的评估指标。目前,使用基于计数的百分比,其中给定文本中的每个单词被学习者分类为已知或未知,然后简单地计数被分类为已知的单词。当每个单词被分类时,我们还可以获得一个不确定性值,即每个单词被学习者知道的可能性有多大。尽管这些值可以为可读性评估提供信息,但是如何利用这些值来保证将它们用作与先前值相当的评估度量仍然不清楚。我们提出了一种新的框架,允许评估方法在保证与文本覆盖阈值的可比性的同时具有不确定性意识。这种方法涉及计算复杂的问题,我们也提出了一种实用的算法。此外,我们提出了一种基于神经网络的分类器,从中我们可以获得更好的不确定性值。为了评估,我们创建了一个基于众包的数据集,学习者在其中接受词汇量和可读性测试。在我们的框架下,最佳方法优于传统方法。
{"title":"Uncertainty-Aware Personalized Readability Assessments for Second Language Learners","authors":"Yo Ehara","doi":"10.1109/ICMLA.2019.00307","DOIUrl":"https://doi.org/10.1109/ICMLA.2019.00307","url":null,"abstract":"Assessing whether an ungraded second language learner can read a given text quickly is important for further instructing and supporting the learner, particularly when evaluating numerous ungraded learners from diverse backgrounds. Second language acquisition (SLA) studies have tackled such assessment tasks wherein only a single short vocabulary test result is available to assess a learner; such studies have shown that the text-coverage, i.e., the percentage of words the learner knows in the text, is the key assessment measure. Currently, count-based percentages are used, in which each word in the given text is classified as being known or unknown to the learner, and the words classified as known are then simply counted. When each word is classified, we can also obtain an uncertainty value as to how likely each word is known to the learner. Although such values can be informative for a readability assessment, how to leverage these values to guarantee their use as an assessment measure that is comparable to that of the previous values remains unclear. We propose a novel framework that allows assessment methods to be uncertainty-aware while guaranteeing comparability to the text-coverage threshold. Such methods involve a computationally complex problem, for which we also propose a practical algorithm. In addition, we propose a neural-network based classifier from which we can obtain better uncertainty values. For evaluation, we created a crowdsourcing-based dataset in which a learner takes both vocabulary and readability tests. The best method under our framework outperformed conventional methods.","PeriodicalId":436714,"journal":{"name":"2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114474099","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Echo Doppler Flow Classification and Goodness Assessment with Convolutional Neural Networks 基于卷积神经网络的回波多普勒流分类及优度评价
Ghada Zamzmi, L. Hsu, Wen Li, V. Sachdev, Sameer Kiran Antani
Doppler Echocardiography is critical for measuring abnormal cardiac function and diagnosing valvular stenosis and regurgitation. The current practice for assessing and interpreting Doppler echo images is time-consuming and depends highly on the experience of the operator. The limitations of this practice can be mitigated using fully automated intelligent systems. Essential first steps toward comprehensive computer-assisted Doppler echocardiographic interpretation include automatic classification into view/flow categories and goodness assessment of these flows. In this paper, we propose a deep learning-based method for Doppler flow classification and goodness assessment. The method has been trained on labeled images representing a wide range of real-world clinical variation. Our method, when evaluated on unseen data, achieved overall accuracies of 91.6% and 88.9% for flow classification and goodness assessment, respectively. While further research is needed, these results are encouraging and prove the feasibility of using fully automated intelligent systems for analyzing and interpreting Doppler echo images.
多普勒超声心动图是测量异常心功能和诊断瓣膜狭窄和反流的关键。目前评估和解释多普勒回波图像的做法是耗时的,并且高度依赖于操作员的经验。使用全自动智能系统可以减轻这种做法的局限性。全面的计算机辅助多普勒超声心动图解释的基本第一步包括自动分类为视点/血流类别和对这些血流的良好评估。本文提出了一种基于深度学习的多普勒流分类和优度评价方法。该方法已经在代表广泛的现实世界临床变化的标记图像上进行了训练。当对未见过的数据进行评估时,我们的方法在流量分类和优度评估方面的总体准确率分别为91.6%和88.9%。虽然需要进一步的研究,但这些结果令人鼓舞,并证明了使用全自动智能系统分析和解释多普勒回波图像的可行性。
{"title":"Echo Doppler Flow Classification and Goodness Assessment with Convolutional Neural Networks","authors":"Ghada Zamzmi, L. Hsu, Wen Li, V. Sachdev, Sameer Kiran Antani","doi":"10.1109/ICMLA.2019.00283","DOIUrl":"https://doi.org/10.1109/ICMLA.2019.00283","url":null,"abstract":"Doppler Echocardiography is critical for measuring abnormal cardiac function and diagnosing valvular stenosis and regurgitation. The current practice for assessing and interpreting Doppler echo images is time-consuming and depends highly on the experience of the operator. The limitations of this practice can be mitigated using fully automated intelligent systems. Essential first steps toward comprehensive computer-assisted Doppler echocardiographic interpretation include automatic classification into view/flow categories and goodness assessment of these flows. In this paper, we propose a deep learning-based method for Doppler flow classification and goodness assessment. The method has been trained on labeled images representing a wide range of real-world clinical variation. Our method, when evaluated on unseen data, achieved overall accuracies of 91.6% and 88.9% for flow classification and goodness assessment, respectively. While further research is needed, these results are encouraging and prove the feasibility of using fully automated intelligent systems for analyzing and interpreting Doppler echo images.","PeriodicalId":436714,"journal":{"name":"2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123006048","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Deep Learning and Thresholding with Class-Imbalanced Big Data 类不平衡大数据下的深度学习与阈值
Justin M. Johnson, T. Khoshgoftaar
Class imbalance is a regularly occurring problem in machine learning that has been studied extensively over the last two decades. Various methods for addressing class imbalance have been introduced, including algorithm-level methods, datalevel methods, and hybrid methods. While these methods are well studied using traditional machine learning algorithms, there are relatively few studies that explore their application to deep neural networks. Thresholding, in particular, is rarely discussed in the deep learning with class imbalance literature. This paper addresses this gap by conducting a systematic study on the application of thresholding with deep neural networks using a Big Data Medicare fraud data set. We use random oversampling (ROS), random under-sampling (RUS), and a hybrid ROS-RUS to create 15 training distributions with varying levels of class imbalance. With the fraudulent class size ranging from 0.03%–60%, we identify optimal classification thresholds for each distribution on random validation sets and then score the thresholds on a 20% holdout test set. Through repetition and statistical analysis, confidence intervals show that the default threshold is never optimal when training data is imbalanced. Results also show that the optimal threshold outperforms the default threshold in nearly all cases, and linear models indicate a strong linear relationship between the minority class size and the optimal decision threshold. To the best of our knowledge, this is the first study to provide statistical results that describe optimal classification thresholds for deep neural networks over a range of class distributions.
类不平衡是机器学习中经常出现的问题,在过去的二十年里得到了广泛的研究。介绍了解决类不平衡的各种方法,包括算法级方法、数据级方法和混合方法。虽然使用传统的机器学习算法对这些方法进行了很好的研究,但探索其在深度神经网络中的应用的研究相对较少。特别是阈值,在具有阶级不平衡的深度学习文献中很少被讨论。本文通过使用大数据医疗欺诈数据集对阈值与深度神经网络的应用进行系统研究,解决了这一差距。我们使用随机过采样(ROS)、随机欠采样(RUS)和混合ROS-RUS来创建15个具有不同等级不平衡的训练分布。在欺诈类大小范围为0.03%-60%的情况下,我们在随机验证集上为每个分布确定了最佳分类阈值,然后在20%的拒绝测试集上对阈值进行评分。通过重复和统计分析,置信区间表明,当训练数据不平衡时,默认阈值永远不是最优的。结果还表明,在几乎所有情况下,最优阈值都优于默认阈值,线性模型表明,少数类大小与最优决策阈值之间存在很强的线性关系。据我们所知,这是第一个提供统计结果来描述深度神经网络在一系列类别分布上的最佳分类阈值的研究。
{"title":"Deep Learning and Thresholding with Class-Imbalanced Big Data","authors":"Justin M. Johnson, T. Khoshgoftaar","doi":"10.1109/ICMLA.2019.00134","DOIUrl":"https://doi.org/10.1109/ICMLA.2019.00134","url":null,"abstract":"Class imbalance is a regularly occurring problem in machine learning that has been studied extensively over the last two decades. Various methods for addressing class imbalance have been introduced, including algorithm-level methods, datalevel methods, and hybrid methods. While these methods are well studied using traditional machine learning algorithms, there are relatively few studies that explore their application to deep neural networks. Thresholding, in particular, is rarely discussed in the deep learning with class imbalance literature. This paper addresses this gap by conducting a systematic study on the application of thresholding with deep neural networks using a Big Data Medicare fraud data set. We use random oversampling (ROS), random under-sampling (RUS), and a hybrid ROS-RUS to create 15 training distributions with varying levels of class imbalance. With the fraudulent class size ranging from 0.03%–60%, we identify optimal classification thresholds for each distribution on random validation sets and then score the thresholds on a 20% holdout test set. Through repetition and statistical analysis, confidence intervals show that the default threshold is never optimal when training data is imbalanced. Results also show that the optimal threshold outperforms the default threshold in nearly all cases, and linear models indicate a strong linear relationship between the minority class size and the optimal decision threshold. To the best of our knowledge, this is the first study to provide statistical results that describe optimal classification thresholds for deep neural networks over a range of class distributions.","PeriodicalId":436714,"journal":{"name":"2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)","volume":"58 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114950196","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
期刊
2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1