2022 International Joint Conference on Neural Networks (IJCNN)最新文献_第5页

An Explainable Tool to Support Age-related Macular Degeneration Diagnosis 支持年龄相关性黄斑变性诊断的一个可解释的工具

2022 International Joint Conference on Neural Networks (IJCNN)

Pub Date : 2022-07-18 DOI: 10.1109/IJCNN55064.2022.9892895

Lourdes Martínez-Villaseñor, Hiram Ponce, Antonieta Martínez-Velasco, Luis Miralles-Pechuán

Artificial intelligence and deep learning, in particu-lar, have gained large attention in the ophthalmology community due to the possibility of processing large amounts of data and dig-itized ocular images. Intelligent systems are developed to support the diagnosis and treatment of a number of ophthalmic diseases such as age-related macular degeneration (AMD), glaucoma and retinopathy of prematurity. Hence, explainability is necessary to gain trust and therefore the adoption of these critical decision support systems. Visual explanations have been proposed for AMD diagnosis only when optical coherence tomography (OCT) images are used, but interpretability using other inputs (i.e. data point-based features) for AMD diagnosis is rather limited. In this paper, we propose a practical tool to support AMD diagnosis based on Artificial Hydrocarbon Networks (AHN) with different kinds of input data such as demographic characteristics, features known as risk factors for AMD, and genetic variants obtained from DNA genotyping. The proposed explainer, namely eXplainable Artificial Hydrocarbon Networks (XAHN) is able to get global and local interpretations of the AHN model. An explainability assessment of the XAHN explainer was applied to clinicians for getting feedback from the tool. We consider the XAHN explainer tool will be beneficial to support expert clinicians in AMD diagnosis, especially where input data are not visual.

特别是人工智能和深度学习，由于有可能处理大量数据和数字化的眼部图像，在眼科界受到了很大的关注。智能系统的开发是为了支持许多眼科疾病的诊断和治疗，如年龄相关性黄斑变性(AMD)、青光眼和早产儿视网膜病变。因此，可解释性对于获得信任和采用这些关键决策支持系统是必要的。仅当使用光学相干断层扫描(OCT)图像时，才提出了AMD诊断的视觉解释，但使用其他输入(即基于数据点的特征)进行AMD诊断的可解释性相当有限。在本文中，我们提出了一种实用的工具来支持基于人工碳氢化合物网络(AHN)的AMD诊断，该网络具有不同类型的输入数据，如人口统计学特征，AMD的危险因素特征以及从DNA基因分型中获得的遗传变异。提出的解释器，即可解释人工碳氢化合物网络(XAHN)，能够获得AHN模型的全局和局部解释。XAHN解释器的可解释性评估应用于临床医生，以获得该工具的反馈。我们认为XAHN解释器工具将有利于支持专业临床医生诊断AMD，特别是在输入数据不是可视化的情况下。

{"title":"An Explainable Tool to Support Age-related Macular Degeneration Diagnosis","authors":"Lourdes Martínez-Villaseñor, Hiram Ponce, Antonieta Martínez-Velasco, Luis Miralles-Pechuán","doi":"10.1109/IJCNN55064.2022.9892895","DOIUrl":"https://doi.org/10.1109/IJCNN55064.2022.9892895","url":null,"abstract":"Artificial intelligence and deep learning, in particu-lar, have gained large attention in the ophthalmology community due to the possibility of processing large amounts of data and dig-itized ocular images. Intelligent systems are developed to support the diagnosis and treatment of a number of ophthalmic diseases such as age-related macular degeneration (AMD), glaucoma and retinopathy of prematurity. Hence, explainability is necessary to gain trust and therefore the adoption of these critical decision support systems. Visual explanations have been proposed for AMD diagnosis only when optical coherence tomography (OCT) images are used, but interpretability using other inputs (i.e. data point-based features) for AMD diagnosis is rather limited. In this paper, we propose a practical tool to support AMD diagnosis based on Artificial Hydrocarbon Networks (AHN) with different kinds of input data such as demographic characteristics, features known as risk factors for AMD, and genetic variants obtained from DNA genotyping. The proposed explainer, namely eXplainable Artificial Hydrocarbon Networks (XAHN) is able to get global and local interpretations of the AHN model. An explainability assessment of the XAHN explainer was applied to clinicians for getting feedback from the tool. We consider the XAHN explainer tool will be beneficial to support expert clinicians in AMD diagnosis, especially where input data are not visual.","PeriodicalId":106974,"journal":{"name":"2022 International Joint Conference on Neural Networks (IJCNN)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128533900","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Multi-scale Local Region Relation Attention in Convolutional Neural Networks for Facial Action Unit Intensity Prediction 基于卷积神经网络的多尺度局部关系关注面部动作单元强度预测

2022 International Joint Conference on Neural Networks (IJCNN)

Pub Date : 2022-07-18 DOI: 10.1109/IJCNN55064.2022.9892729

Anrui Wang, Weiyang Chen

Facial Action Unit (FAU) intensity can describe the degree of change in the appearance of a specific location on the face and can be used for the analysis of human facial behavior. Due to the subtle changes in FAU, FAU intensity prediction still faces great challenges. Previous works using attention mechanisms for FAU intensity prediction either simply crop the FAU regions or directly use attention mechanisms to obtain local representations of FAUs, but these methods do not capture FAU intensity features at different scales and locations well. In addition, the dependencies between FAUs also contain important information. In this paper, we propose a multi-scale local-region relational attention model based on convolutional neural networks (CNN) for FAU intensity prediction. Specifically, we first reflect the relationship between FAUs by adjusting the luminance values of face images to capture local features with pixel-level relationships. Then, we use the introduced multi-scale local area relational attention model to extract the local attention latent relational features of FAU. Finally, we combine local attention potential relationship features, facial geometry information, and deep global features captured using an autoencoder to achieve robust FAU intensity prediction. The method is evaluated on the public benchmark dataset DISFA, and experimental results show that our method achieves comparable performance to state-of-the-art methods and validates the effectiveness of a multi-scale local-region relational attention model for FAU intensity prediction.

面部动作单元(FAU)强度可以描述面部特定位置的外观变化程度，可用于分析人类面部行为。由于FAU的微妙变化，FAU强度预测仍面临很大挑战。以往使用注意机制进行FAU强度预测的工作要么简单地剪裁FAU区域，要么直接使用注意机制获得FAU的局部表征，但这些方法都不能很好地捕捉不同尺度和位置的FAU强度特征。此外，fau之间的依赖关系也包含重要的信息。本文提出了一种基于卷积神经网络(CNN)的多尺度局部区域关系注意模型，用于FAU强度预测。具体来说，我们首先通过调整人脸图像的亮度值来反映fau之间的关系，以捕获具有像素级关系的局部特征。然后，利用引入的多尺度局部关系注意模型提取FAU的局部注意潜在关系特征。最后，我们结合局部注意潜在关系特征、面部几何信息和使用自编码器捕获的深度全局特征，实现鲁棒FAU强度预测。在公共基准数据集DISFA上对该方法进行了评估，实验结果表明，该方法达到了与现有方法相当的性能，验证了多尺度局部区域关系关注模型用于FAU强度预测的有效性。

{"title":"Multi-scale Local Region Relation Attention in Convolutional Neural Networks for Facial Action Unit Intensity Prediction","authors":"Anrui Wang, Weiyang Chen","doi":"10.1109/IJCNN55064.2022.9892729","DOIUrl":"https://doi.org/10.1109/IJCNN55064.2022.9892729","url":null,"abstract":"Facial Action Unit (FAU) intensity can describe the degree of change in the appearance of a specific location on the face and can be used for the analysis of human facial behavior. Due to the subtle changes in FAU, FAU intensity prediction still faces great challenges. Previous works using attention mechanisms for FAU intensity prediction either simply crop the FAU regions or directly use attention mechanisms to obtain local representations of FAUs, but these methods do not capture FAU intensity features at different scales and locations well. In addition, the dependencies between FAUs also contain important information. In this paper, we propose a multi-scale local-region relational attention model based on convolutional neural networks (CNN) for FAU intensity prediction. Specifically, we first reflect the relationship between FAUs by adjusting the luminance values of face images to capture local features with pixel-level relationships. Then, we use the introduced multi-scale local area relational attention model to extract the local attention latent relational features of FAU. Finally, we combine local attention potential relationship features, facial geometry information, and deep global features captured using an autoencoder to achieve robust FAU intensity prediction. The method is evaluated on the public benchmark dataset DISFA, and experimental results show that our method achieves comparable performance to state-of-the-art methods and validates the effectiveness of a multi-scale local-region relational attention model for FAU intensity prediction.","PeriodicalId":106974,"journal":{"name":"2022 International Joint Conference on Neural Networks (IJCNN)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128703473","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Comparing the Robustness of Classical and Deep Learning Techniques for Text Classification 比较经典和深度学习技术在文本分类中的鲁棒性

2022 International Joint Conference on Neural Networks (IJCNN)

Pub Date : 2022-07-18 DOI: 10.1109/IJCNN55064.2022.9892242

Quynh Tran, Krystsina Shpileuskaya, Elaine Zaunseder, Larissa Putzar, S. Blankenburg

Deep learning algorithms achieve exceptional accuracies in various tasks. Despite this success, those models are known to be prone to errors, i.e. low in robustness, due to differences between training and production environment. One might assume that more model complexity translates directly to more robustness. Therefore, we compare simple, classical models (logistic regression, support vector machine) with complex deep learning techniques (convolutional neural networks, transformers) to provide novel insights into the robustness of machine learning systems. In our approach, we assess the robustness by developing and applying three realistic perturbations, mimicking scanning, typing, and speech recognition errors occurring in inputs for text classification tasks. Hence, we performed a thorough study analyzing the impact of different perturbations with variable strengths on character and word level. A noteworthy finding is that algorithms with low complexity can achieve high robustness. Additionally, we demonstrate that augmented training regarding a specific perturbation can strengthen the chosen models' robustness against other perturbations without reducing their accuracy. Our results can impact the selection of machine learning models and provide a guideline on how to examine the robustness of text classification methods for real-world applications. Moreover, our implementation is publicly available, which contributes to the development of more robust machine learning systems.

深度学习算法在各种任务中实现了卓越的准确性。尽管取得了成功，但由于训练和生产环境之间的差异，这些模型容易出错，即鲁棒性较低。有人可能会认为，更多的模型复杂性直接转化为更强的鲁棒性。因此，我们将简单的经典模型(逻辑回归、支持向量机)与复杂的深度学习技术(卷积神经网络、变压器)进行比较，以提供对机器学习系统鲁棒性的新见解。在我们的方法中，我们通过开发和应用三种现实的扰动来评估鲁棒性，模拟文本分类任务输入中出现的扫描、打字和语音识别错误。因此，我们进行了深入的研究，分析了不同强度的扰动对字符和单词水平的影响。一个值得注意的发现是，低复杂度的算法可以获得高鲁棒性。此外，我们证明了关于特定扰动的增强训练可以增强所选模型对其他扰动的鲁棒性，而不会降低其准确性。我们的研究结果可以影响机器学习模型的选择，并为如何检查现实世界应用中文本分类方法的鲁棒性提供指导。此外，我们的实现是公开的，这有助于开发更强大的机器学习系统。

{"title":"Comparing the Robustness of Classical and Deep Learning Techniques for Text Classification","authors":"Quynh Tran, Krystsina Shpileuskaya, Elaine Zaunseder, Larissa Putzar, S. Blankenburg","doi":"10.1109/IJCNN55064.2022.9892242","DOIUrl":"https://doi.org/10.1109/IJCNN55064.2022.9892242","url":null,"abstract":"Deep learning algorithms achieve exceptional accuracies in various tasks. Despite this success, those models are known to be prone to errors, i.e. low in robustness, due to differences between training and production environment. One might assume that more model complexity translates directly to more robustness. Therefore, we compare simple, classical models (logistic regression, support vector machine) with complex deep learning techniques (convolutional neural networks, transformers) to provide novel insights into the robustness of machine learning systems. In our approach, we assess the robustness by developing and applying three realistic perturbations, mimicking scanning, typing, and speech recognition errors occurring in inputs for text classification tasks. Hence, we performed a thorough study analyzing the impact of different perturbations with variable strengths on character and word level. A noteworthy finding is that algorithms with low complexity can achieve high robustness. Additionally, we demonstrate that augmented training regarding a specific perturbation can strengthen the chosen models' robustness against other perturbations without reducing their accuracy. Our results can impact the selection of machine learning models and provide a guideline on how to examine the robustness of text classification methods for real-world applications. Moreover, our implementation is publicly available, which contributes to the development of more robust machine learning systems.","PeriodicalId":106974,"journal":{"name":"2022 International Joint Conference on Neural Networks (IJCNN)","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125125169","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 4

Dual-stream Speech Dereverberation Network Using Long-term and Short-term Cues 使用长期和短期线索的双流语音去噪网络

2022 International Joint Conference on Neural Networks (IJCNN)

Pub Date : 2022-07-18 DOI: 10.1109/IJCNN55064.2022.9892662

Nan Li, Meng Ge, Longbiao Wang, J. Dang

For reverberation, the current speech is usually influenced by the previous frames. Traditional neural network-based speech dereverberation (SD) methods directly map the current speech frame that only has short-term cues to clean speech or learn a mask, which can not utilize long-term information to remove late reverberation and further limit SD's ability. To address this issue, we propose a dual-stream speech dereverberation network (DualSDNet) using long-term and short-term cues. First, we analyze the effectiveness of using a finite impulse response (FIR) based on long-term information recorded filter by reverberation generation progress. Second, to make full use of both long-term and short-term information, we further design a dual-stream network, it can map both long and short speech to high-dimensional representation and pay more attention to a more helpful time index. The results of the REVERB Challenge data show that our DualSDNet consistently outperforms the state-of-the-art SD baselines.

对于混响，当前的语音通常受到前一帧的影响。传统的基于神经网络的语音去混响(SD)方法直接映射只有短期线索的当前语音帧来清理语音或学习掩码，不能利用长期信息去除后期混响，进一步限制了SD的能力。为了解决这个问题，我们提出了一个使用长期和短期线索的双流语音去噪网络(DualSDNet)。首先，通过混响产生过程分析了基于长期信息记录的有限脉冲响应(FIR)滤波器的有效性。其次，为了充分利用长期和短期信息，我们进一步设计了一个双流网络，它可以将长语音和短语音映射到高维表示，并且更注重一个更有用的时间指标。REVERB Challenge数据的结果表明，我们的DualSDNet始终优于最先进的SD基线。

引用次数: 0

HierRL: Hierarchical Reinforcement Learning for Task Scheduling in Distributed Systems 分布式系统中任务调度的分层强化学习

2022 International Joint Conference on Neural Networks (IJCNN)

Pub Date : 2022-07-18 DOI: 10.1109/IJCNN55064.2022.9892507

Yanxia Guan, Yuntao Liu, Yuan Li, Xinhai Xu

The distributed system Ray has attracted much attention for many decision-making applications. It provides a flexible and powerful distributed running mechanism for the training of the learning algorithms, which could map the computation tasks to the resources automatically. Task scheduling is a critical component in Ray, adopting a two-layer structure. It uses a simple general scheduling principle, which leaves much space to optimize. In this paper, we will study the two-layer scheduling problem in Ray, setting it as an optimization problem. We firstly present a comprehensive formulation for the problem and point out that it is a NP-hard problem. Then we design a hierarchical reinforcement learning method, named HierRL, which consists of a high-level agent and a low-level agent. Sophisticated state space, action space, and reward function are designed for this method. In the high level, we devise a value-based reinforcement learning method, which allocates a task to an appropriate node of the low level. With tasks allocated from the high level and generated from applications, a low-level reinforcement learning method is constructed to select tasks from the queue to be executed. A hierarchical policy learning method is introduced for the training of the two-layer agents. Finally, we simulate the two-layer scheduling procedure in a public platform, Cloudsim, with tasks from a real Dataset generated by the Alibaba Cluster Trace Program. The results show that the proposed method performs much better than the original scheduling method of Ray.

分布式系统Ray在许多决策应用中受到了广泛的关注。它为学习算法的训练提供了一种灵活而强大的分布式运行机制，可以自动将计算任务映射到资源上。任务调度是Ray的关键组成部分，采用两层结构。它使用了一个简单的通用调度原则，为优化留出了很大的空间。本文将研究Ray中的双层调度问题，并将其作为一个优化问题。我们首先给出了这个问题的一个综合表述，并指出它是一个np困难问题。然后，我们设计了一种分层强化学习方法，称为HierRL，它由一个高级智能体和一个低级智能体组成。该方法设计了复杂的状态空间、动作空间和奖励函数。在高层，我们设计了一种基于值的强化学习方法，该方法将任务分配给低层的适当节点。通过从高层分配任务并从应用程序生成任务，构建了一种低级强化学习方法来从队列中选择要执行的任务。提出了一种分层策略学习方法，用于两层智能体的训练。最后，我们利用阿里集群跟踪程序生成的真实数据集中的任务，在公共平台Cloudsim中模拟了双层调度过程。结果表明，该方法的调度性能明显优于原有的Ray调度方法。

{"title":"HierRL: Hierarchical Reinforcement Learning for Task Scheduling in Distributed Systems","authors":"Yanxia Guan, Yuntao Liu, Yuan Li, Xinhai Xu","doi":"10.1109/IJCNN55064.2022.9892507","DOIUrl":"https://doi.org/10.1109/IJCNN55064.2022.9892507","url":null,"abstract":"The distributed system Ray has attracted much attention for many decision-making applications. It provides a flexible and powerful distributed running mechanism for the training of the learning algorithms, which could map the computation tasks to the resources automatically. Task scheduling is a critical component in Ray, adopting a two-layer structure. It uses a simple general scheduling principle, which leaves much space to optimize. In this paper, we will study the two-layer scheduling problem in Ray, setting it as an optimization problem. We firstly present a comprehensive formulation for the problem and point out that it is a NP-hard problem. Then we design a hierarchical reinforcement learning method, named HierRL, which consists of a high-level agent and a low-level agent. Sophisticated state space, action space, and reward function are designed for this method. In the high level, we devise a value-based reinforcement learning method, which allocates a task to an appropriate node of the low level. With tasks allocated from the high level and generated from applications, a low-level reinforcement learning method is constructed to select tasks from the queue to be executed. A hierarchical policy learning method is introduced for the training of the two-layer agents. Finally, we simulate the two-layer scheduling procedure in a public platform, Cloudsim, with tasks from a real Dataset generated by the Alibaba Cluster Trace Program. The results show that the proposed method performs much better than the original scheduling method of Ray.","PeriodicalId":106974,"journal":{"name":"2022 International Joint Conference on Neural Networks (IJCNN)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129894692","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

Q-SYM2 and Automatic Scrap Classification a joint solution for the Circular economy and sustainability of Steel Manufacturing, to ensure the scrap yard operates competitively Q-SYM2和自动废料分类是钢铁制造循环经济和可持续性的联合解决方案，以确保废料场的竞争力

2022 International Joint Conference on Neural Networks (IJCNN)

Pub Date : 2022-07-18 DOI: 10.1109/IJCNN55064.2022.9892611

Davide Armellini, M. Ometto, Cristiano Ponton

A hype topic, one that now has become an established idea, is the possibility to increase plant efficiency by gaining and applying a better awareness of how scrap is performing in the melting process. Scrap management becomes the key point in cost reduction since it could comprise up to the 50% of the overall production costs. Technological innovations promise to be the driver to improving raw material management, shortening its acquisition time and reducing the waste during the metallurgical process. Expensive raw materials require a huge involvement of plant resources, and are highly dependent on the human factor. All the quality and logistics decisions belong to the judgment of the operators, increasing the chance of non-conformities (e.g., erroneous classification, material discharged in the wrong location, error loading material in the buckets). To overcome these issues, online classification of the scrap is the keystone. Starting from the arrival of scrap at the plant, through the acceptance of the delivery note and the check-in of the carriers, Automatic Scrap Classification gives support to inbound-scrap control and classification, enabling real-time traceability of the scrap inside the bays. The Quality Control System will benefit from all the details of the material used in production. Danieli Automation implemented the Q-ASC a system that, leveraging Artificial Intelligence (AI) and deep learning techniques, can assist scrap classification procedures through computer vision and automatic scrap recognition. The goal of scrap identification is to localize and assign a specific class label to a given visual sample of scrap or inert/hazardous material. The classification can be conducted using different methodologies based on material shapes or dimensions. Q-ASC is the entry point for the Scrap Yard Management and can be considered as the central data hub for managing the scrap inbound to the plant, connecting all the systems requiring reliable scrap data.

一个炒作的话题，现在已经成为一个既定的想法，是通过更好地了解废料在熔化过程中的表现来提高工厂效率的可能性。废料管理成为降低成本的关键，因为它可能占总生产成本的50%。技术创新有望成为改善原材料管理，缩短其获取时间和减少冶金过程中的浪费的驱动力。昂贵的原材料需要大量的植物资源参与，并且高度依赖于人为因素。所有的质量和物流决策都属于操作员的判断，增加了不合格的机会(例如，错误的分类，物料排在错误的位置，物料在桶中装载错误)。为了克服这些问题，废钢在线分类是关键。从废料到达工厂开始，通过接受交货单和承运人的登记，自动废料分类为进站废料控制和分类提供支持，使废料在仓库内的实时可追溯性成为可能。质量控制系统将受益于生产中使用的材料的所有细节。达涅利自动化实施了Q-ASC系统，该系统利用人工智能(AI)和深度学习技术，可以通过计算机视觉和自动废料识别来辅助废料分类程序。废料识别的目标是对废料或惰性/有害材料的给定视觉样品进行定位和分配特定的类别标签。分类可以根据材料的形状或尺寸使用不同的方法进行。Q-ASC是废料场管理的入口点，可以被视为管理进入工厂的废料的中央数据中心，连接所有需要可靠废料数据的系统。

{"title":"Q-SYM2 and Automatic Scrap Classification a joint solution for the Circular economy and sustainability of Steel Manufacturing, to ensure the scrap yard operates competitively","authors":"Davide Armellini, M. Ometto, Cristiano Ponton","doi":"10.1109/IJCNN55064.2022.9892611","DOIUrl":"https://doi.org/10.1109/IJCNN55064.2022.9892611","url":null,"abstract":"A hype topic, one that now has become an established idea, is the possibility to increase plant efficiency by gaining and applying a better awareness of how scrap is performing in the melting process. Scrap management becomes the key point in cost reduction since it could comprise up to the 50% of the overall production costs. Technological innovations promise to be the driver to improving raw material management, shortening its acquisition time and reducing the waste during the metallurgical process. Expensive raw materials require a huge involvement of plant resources, and are highly dependent on the human factor. All the quality and logistics decisions belong to the judgment of the operators, increasing the chance of non-conformities (e.g., erroneous classification, material discharged in the wrong location, error loading material in the buckets). To overcome these issues, online classification of the scrap is the keystone. Starting from the arrival of scrap at the plant, through the acceptance of the delivery note and the check-in of the carriers, Automatic Scrap Classification gives support to inbound-scrap control and classification, enabling real-time traceability of the scrap inside the bays. The Quality Control System will benefit from all the details of the material used in production. Danieli Automation implemented the Q-ASC a system that, leveraging Artificial Intelligence (AI) and deep learning techniques, can assist scrap classification procedures through computer vision and automatic scrap recognition. The goal of scrap identification is to localize and assign a specific class label to a given visual sample of scrap or inert/hazardous material. The classification can be conducted using different methodologies based on material shapes or dimensions. Q-ASC is the entry point for the Scrap Yard Management and can be considered as the central data hub for managing the scrap inbound to the plant, connecting all the systems requiring reliable scrap data.","PeriodicalId":106974,"journal":{"name":"2022 International Joint Conference on Neural Networks (IJCNN)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130313353","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

DeepSet: Deep Learning-based Recommendation with Setwise Preference DeepSet:基于深度学习的推荐和设置偏好

2022 International Joint Conference on Neural Networks (IJCNN)

Pub Date : 2022-07-18 DOI: 10.1109/IJCNN55064.2022.9892627

Lin Li, Weike Pan, Guanliang Chen, Zhong Ming

Recommendation methods based on deep learning frameworks have drastically increased over recent years, covering virtually all the sub-topics in recommender systems. Among these topics, one-class collaborative filtering (OCCF) as a fundamental problem has been studied most extensively. However, most of existing deep learning-based OCCF methods are essentially focused on either defining new prediction rules by replacing conventional shallow and linear inner products with a variety of neural architectures, or learning more expressive user and item factors with neural networks, which may still suffer from the inferior recommendation performance due to the underlying preference assumptions typically defined on single items. In this paper, we propose to address the limitation and justify the capacity of deep learning-based recommendation methods by adapting the setwise preference to the underlying assumption during the model learning process. Specifically, we propose a new setwise preference assumption under the neural recommendation frameworks and devise a general solution named DeepSet, which aims to enhance the learning abilities of neural collaborative filtering methods by activating the setwise preference at different neural layers, namely 1) the feature input layer, 2) the feature output layer, and 3) the prediction layer. Extensive experiments on four commonly used datasets show that our solution can effectively boost the performance of existing deep learning based methods without introducing any new model parameters.

近年来，基于深度学习框架的推荐方法急剧增加，几乎涵盖了推荐系统中的所有子主题。其中，一类协同过滤(OCCF)作为一个基本问题得到了最广泛的研究。然而，大多数现有的基于深度学习的OCCF方法本质上都集中在通过用各种神经结构取代传统的浅线性内积来定义新的预测规则，或者用神经网络学习更具表现力的用户和项目因素，由于通常在单个项目上定义的潜在偏好假设，这些方法仍然可能受到较差的推荐性能的影响。在本文中，我们提出通过在模型学习过程中将设置偏好与潜在假设相适应来解决基于深度学习的推荐方法的局限性并证明其能力。具体而言，我们在神经推荐框架下提出了一个新的集合偏好假设，并设计了一个通用的解决方案DeepSet，旨在通过激活不同神经层的集合偏好来增强神经协同过滤方法的学习能力，即1)特征输入层，2)特征输出层和3)预测层。在四个常用数据集上的大量实验表明，我们的解决方案可以有效地提高现有基于深度学习的方法的性能，而无需引入任何新的模型参数。

{"title":"DeepSet: Deep Learning-based Recommendation with Setwise Preference","authors":"Lin Li, Weike Pan, Guanliang Chen, Zhong Ming","doi":"10.1109/IJCNN55064.2022.9892627","DOIUrl":"https://doi.org/10.1109/IJCNN55064.2022.9892627","url":null,"abstract":"Recommendation methods based on deep learning frameworks have drastically increased over recent years, covering virtually all the sub-topics in recommender systems. Among these topics, one-class collaborative filtering (OCCF) as a fundamental problem has been studied most extensively. However, most of existing deep learning-based OCCF methods are essentially focused on either defining new prediction rules by replacing conventional shallow and linear inner products with a variety of neural architectures, or learning more expressive user and item factors with neural networks, which may still suffer from the inferior recommendation performance due to the underlying preference assumptions typically defined on single items. In this paper, we propose to address the limitation and justify the capacity of deep learning-based recommendation methods by adapting the setwise preference to the underlying assumption during the model learning process. Specifically, we propose a new setwise preference assumption under the neural recommendation frameworks and devise a general solution named DeepSet, which aims to enhance the learning abilities of neural collaborative filtering methods by activating the setwise preference at different neural layers, namely 1) the feature input layer, 2) the feature output layer, and 3) the prediction layer. Extensive experiments on four commonly used datasets show that our solution can effectively boost the performance of existing deep learning based methods without introducing any new model parameters.","PeriodicalId":106974,"journal":{"name":"2022 International Joint Conference on Neural Networks (IJCNN)","volume":"52 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126898189","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Abnormal Event Detection with Self-guiding Multi-instance Ranking Framework 基于自导向多实例排序框架的异常事件检测

2022 International Joint Conference on Neural Networks (IJCNN)

Pub Date : 2022-07-18 DOI: 10.1109/IJCNN55064.2022.9892231

Y. Liu, Jing Liu, Wei Ni, Liang Song

The detection of abnormal events in surveillance videos with weak supervision is a challenging task, which tries to temporally find abnormal frames using readily accessible video-level labels. In this paper, we propose a self-guiding multi-instance ranking (SMR) framework, which has explored task-specific deep representations and considered the temporal correlations between video clips. Specifically, we apply a clustering algorithm to fine-tune the features extracted by the pre-trained 3D-convolutional-based models. Besides, the clustering module can generate clip-level labels for abnormal videos, and the pseudo-labels are in part used to supervise the training of the multi-instance regression. While implementing the regression module, we compare the effectiveness of various recurrent neural networks, and the results demonstrate the necessity of temporal correlations for weakly supervised video anomaly detection tasks. Experimental results on two standard benchmarks reveal that the SMR framework is comparable to the state-of-the-art approaches, with frame-level AUCs of 81.7% and 92.4% on the UCF-crime and UCSD Ped2 datasets respectively. Additionally, ablation studies and visualization results prove the effectiveness of the component, and our framework can accurately locate abnormal events.

在监管薄弱的监控视频中，异常事件的检测是一项具有挑战性的任务，它试图利用易于获取的视频级别标签来临时发现异常帧。在本文中，我们提出了一个自引导多实例排序(SMR)框架，该框架探索了特定任务的深度表示，并考虑了视频片段之间的时间相关性。具体来说，我们应用聚类算法对预训练的基于3d卷积的模型提取的特征进行微调。此外，聚类模块可以对异常视频生成片段级标签，伪标签部分用于监督多实例回归的训练。在实现回归模块时，我们比较了各种递归神经网络的有效性，结果表明时间相关性对于弱监督视频异常检测任务的必要性。在两个标准基准上的实验结果表明，SMR框架与最先进的方法相当，在UCF-crime和UCSD Ped2数据集上，框架级auc分别为81.7%和92.4%。此外，消融研究和可视化结果证明了该组件的有效性，并且我们的框架可以准确地定位异常事件。

{"title":"Abnormal Event Detection with Self-guiding Multi-instance Ranking Framework","authors":"Y. Liu, Jing Liu, Wei Ni, Liang Song","doi":"10.1109/IJCNN55064.2022.9892231","DOIUrl":"https://doi.org/10.1109/IJCNN55064.2022.9892231","url":null,"abstract":"The detection of abnormal events in surveillance videos with weak supervision is a challenging task, which tries to temporally find abnormal frames using readily accessible video-level labels. In this paper, we propose a self-guiding multi-instance ranking (SMR) framework, which has explored task-specific deep representations and considered the temporal correlations between video clips. Specifically, we apply a clustering algorithm to fine-tune the features extracted by the pre-trained 3D-convolutional-based models. Besides, the clustering module can generate clip-level labels for abnormal videos, and the pseudo-labels are in part used to supervise the training of the multi-instance regression. While implementing the regression module, we compare the effectiveness of various recurrent neural networks, and the results demonstrate the necessity of temporal correlations for weakly supervised video anomaly detection tasks. Experimental results on two standard benchmarks reveal that the SMR framework is comparable to the state-of-the-art approaches, with frame-level AUCs of 81.7% and 92.4% on the UCF-crime and UCSD Ped2 datasets respectively. Additionally, ablation studies and visualization results prove the effectiveness of the component, and our framework can accurately locate abnormal events.","PeriodicalId":106974,"journal":{"name":"2022 International Joint Conference on Neural Networks (IJCNN)","volume":"69 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126992996","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 8

Building an Efficient Retrieval-based Dialogue System with Contrastive Learning 基于对比学习的高效检索对话系统的构建

2022 International Joint Conference on Neural Networks (IJCNN)

Pub Date : 2022-07-18 DOI: 10.1109/IJCNN55064.2022.9892217

Jiangwei Li, J. Zhong

We focus on retrieval-based dialogue systems. Such a system aims to select an appropriate response from a candidate pool for a given context. Recent methods commonly utilize powerful interaction-based pre-trained language models like BERT to achieve the goal. However, their time cost is usually not satisfying since the procedure of computing relevance scores is not efficient, especially in scenarios that require online response selection. We propose an efficient dialogue system that utilizes a representation-based BERT to address this issue, which can produce an independent representation for every response candidate and context. The relevance score can be simply calculated by the dot product. We further enhance the representation ability of this model by applying domain adaptive post-training and supervised contrastive learning fine-tuning. Experimental results on two benchmark datasets show that our method achieves competitive performance with other interaction-based models while retaining the advantage of time efficiency. We also provide an empirical and theoretical analysis of time efficiency between representation-based models and interaction-based models. The main contribution of this paper is to propose a novel methodology to build a simple but efficient dialogue system.

我们专注于基于检索的对话系统。这样的系统旨在从给定上下文的候选池中选择适当的响应。最近的方法通常利用强大的基于交互的预训练语言模型(如BERT)来实现目标。然而，由于计算相关分数的过程效率不高，特别是在需要在线选择响应的情况下，它们的时间成本通常不令人满意。我们提出了一个有效的对话系统，利用基于表示的BERT来解决这个问题，该系统可以为每个响应候选者和上下文产生独立的表示。相关性分数可以通过点积简单地计算出来。我们通过应用领域自适应后训练和监督对比学习微调来进一步增强该模型的表示能力。在两个基准数据集上的实验结果表明，该方法在保持时间效率优势的同时，取得了与其他基于交互的模型相当的性能。我们还对基于表示的模型和基于交互的模型之间的时间效率进行了实证和理论分析。本文的主要贡献是提出了一种新的方法来构建一个简单而有效的对话系统。

{"title":"Building an Efficient Retrieval-based Dialogue System with Contrastive Learning","authors":"Jiangwei Li, J. Zhong","doi":"10.1109/IJCNN55064.2022.9892217","DOIUrl":"https://doi.org/10.1109/IJCNN55064.2022.9892217","url":null,"abstract":"We focus on retrieval-based dialogue systems. Such a system aims to select an appropriate response from a candidate pool for a given context. Recent methods commonly utilize powerful interaction-based pre-trained language models like BERT to achieve the goal. However, their time cost is usually not satisfying since the procedure of computing relevance scores is not efficient, especially in scenarios that require online response selection. We propose an efficient dialogue system that utilizes a representation-based BERT to address this issue, which can produce an independent representation for every response candidate and context. The relevance score can be simply calculated by the dot product. We further enhance the representation ability of this model by applying domain adaptive post-training and supervised contrastive learning fine-tuning. Experimental results on two benchmark datasets show that our method achieves competitive performance with other interaction-based models while retaining the advantage of time efficiency. We also provide an empirical and theoretical analysis of time efficiency between representation-based models and interaction-based models. The main contribution of this paper is to propose a novel methodology to build a simple but efficient dialogue system.","PeriodicalId":106974,"journal":{"name":"2022 International Joint Conference on Neural Networks (IJCNN)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129203881","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Accuracy Evaluation of Transposed Convolution-Based Quantized Neural Networks 基于转置卷积的量化神经网络精度评价

2022 International Joint Conference on Neural Networks (IJCNN)

Pub Date : 2022-07-18 DOI: 10.1109/IJCNN55064.2022.9892671

Cristian Sestito, S. Perri, Rob Stewart

Several modern applications in the field of Artificial Intelligence exploit deep learning to make accurate decisions. Recent work on compression techniques allows for deep learning applications, such as computer vision, to run on Edge Computing devices. For instance, quantizing the precision of deep learning architectures allows Edge Computing devices to achieve high throughput at low power. Quantization has been mainly focused on multilayer perceptrons and convolution-based models for classification problems. However, its impact over more complex scenarios, such as image up-sampling, is still underexplored. This paper presents a systematic evaluation of the accuracy achieved by quantized neural networks when performing image up-sampling in three different applications: image compression/decompression, synthetic image generation and semantic segmentation. Taking into account the promising attitude of learnable filters to predict pixels, transposed convolutional layers are used for up-sampling. Experimental results based on analytical metrics show that acceptable accuracies are reached with quantization spanning between 3 and 7 bits. Based on the visual inspection, the range 2–6 bits guarantees appropriate accuracy.

人工智能领域的一些现代应用利用深度学习来做出准确的决策。最近关于压缩技术的工作允许深度学习应用程序(如计算机视觉)在边缘计算设备上运行。例如，量化深度学习架构的精度允许边缘计算设备以低功耗实现高吞吐量。量化主要集中在多层感知器和基于卷积的分类问题模型上。然而，它对更复杂场景(如图像上采样)的影响仍未得到充分探索。本文系统地评估了量化神经网络在图像压缩/解压缩、合成图像生成和语义分割三种不同应用中进行图像上采样时所达到的精度。考虑到可学习滤波器预测像素的前景，采用转置卷积层进行上采样。基于分析度量的实验结果表明，当量化范围在3 ~ 7位之间时，达到了可接受的精度。根据目视检查，2-6位范围保证了适当的精度。

{"title":"Accuracy Evaluation of Transposed Convolution-Based Quantized Neural Networks","authors":"Cristian Sestito, S. Perri, Rob Stewart","doi":"10.1109/IJCNN55064.2022.9892671","DOIUrl":"https://doi.org/10.1109/IJCNN55064.2022.9892671","url":null,"abstract":"Several modern applications in the field of Artificial Intelligence exploit deep learning to make accurate decisions. Recent work on compression techniques allows for deep learning applications, such as computer vision, to run on Edge Computing devices. For instance, quantizing the precision of deep learning architectures allows Edge Computing devices to achieve high throughput at low power. Quantization has been mainly focused on multilayer perceptrons and convolution-based models for classification problems. However, its impact over more complex scenarios, such as image up-sampling, is still underexplored. This paper presents a systematic evaluation of the accuracy achieved by quantized neural networks when performing image up-sampling in three different applications: image compression/decompression, synthetic image generation and semantic segmentation. Taking into account the promising attitude of learnable filters to predict pixels, transposed convolutional layers are used for up-sampling. Experimental results based on analytical metrics show that acceptable accuracies are reached with quantization spanning between 3 and 7 bits. Based on the visual inspection, the range 2–6 bits guarantees appropriate accuracy.","PeriodicalId":106974,"journal":{"name":"2022 International Joint Conference on Neural Networks (IJCNN)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130648767","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1