首页 > 最新文献

Proceedings of the 2023 6th International Conference on Machine Vision and Applications最新文献

英文 中文
SARAF: Searching for Adversarial Robust Activation Functions SARAF:搜索对抗鲁棒激活函数
Maghsood Salimi, Mohammad Loni, M. Sirjani, A. Cicchetti, Sara Abbaspour Asadollah
Convolutional Neural Networks (CNNs) have received great attention in the computer vision domain. However, CNNs are vulnerable to adversarial attacks, which are manipulations of input data that are imperceptible to humans but can fool the network. Several studies tried to address this issue, which can be divided into two categories: (i) training the network with adversarial examples, and (ii) optimizing the network architecture and/or hyperparameters. Although adversarial training is a sufficient defense mechanism, they suffer from requiring a large volume of training samples to cover a wide perturbation bound. Tweaking network activation functions (AFs) has been shown to provide promising results where CNNs suffer from performance loss. However, optimizing network AFs for compensating the negative impacts of adversarial attacks has not been addressed in the literature. This paper proposes the idea of searching for AFs that are robust against adversarial attacks. To this aim, we leverage the Simulated Annealing (SA) algorithm with a fast convergence time. This proposed method is called SARAF. We demonstrate the consistent effectiveness of SARAF by achieving up to 16.92%, 18.3%, and 15.57% accuracy improvement against BIM, FGSM, and PGD adversarial attacks, respectively, over ResNet-18 with ReLU AFs (baseline) trained on CIFAR-10. Meanwhile, SARAF provides a significant search efficiency compared to random search as the optimization baseline.
卷积神经网络(cnn)在计算机视觉领域受到了广泛的关注。然而,cnn很容易受到对抗性攻击,这种攻击是对输入数据的操纵,人类无法察觉,但可以欺骗网络。一些研究试图解决这个问题,它可以分为两类:(i)用对抗性示例训练网络,(ii)优化网络架构和/或超参数。尽管对抗性训练是一种足够的防御机制,但它们需要大量的训练样本来覆盖广泛的扰动范围。调整网络激活函数(AFs)已被证明可以在cnn遭受性能损失的情况下提供有希望的结果。然而,优化网络AFs以补偿对抗性攻击的负面影响在文献中尚未得到解决。本文提出了搜索对对抗性攻击具有鲁棒性的af的思想。为此,我们利用具有快速收敛时间的模拟退火(SA)算法。这种建议的方法被称为SARAF。我们通过使用在CIFAR-10上训练的ReLU af(基线)在ResNet-18上对BIM、FGSM和PGD对对性攻击的准确率分别提高了16.92%、18.3%和15.57%,证明了SARAF的一致性有效性。同时,SARAF作为优化基准,与随机搜索相比,提供了显著的搜索效率。
{"title":"SARAF: Searching for Adversarial Robust Activation Functions","authors":"Maghsood Salimi, Mohammad Loni, M. Sirjani, A. Cicchetti, Sara Abbaspour Asadollah","doi":"10.1145/3589572.3589598","DOIUrl":"https://doi.org/10.1145/3589572.3589598","url":null,"abstract":"Convolutional Neural Networks (CNNs) have received great attention in the computer vision domain. However, CNNs are vulnerable to adversarial attacks, which are manipulations of input data that are imperceptible to humans but can fool the network. Several studies tried to address this issue, which can be divided into two categories: (i) training the network with adversarial examples, and (ii) optimizing the network architecture and/or hyperparameters. Although adversarial training is a sufficient defense mechanism, they suffer from requiring a large volume of training samples to cover a wide perturbation bound. Tweaking network activation functions (AFs) has been shown to provide promising results where CNNs suffer from performance loss. However, optimizing network AFs for compensating the negative impacts of adversarial attacks has not been addressed in the literature. This paper proposes the idea of searching for AFs that are robust against adversarial attacks. To this aim, we leverage the Simulated Annealing (SA) algorithm with a fast convergence time. This proposed method is called SARAF. We demonstrate the consistent effectiveness of SARAF by achieving up to 16.92%, 18.3%, and 15.57% accuracy improvement against BIM, FGSM, and PGD adversarial attacks, respectively, over ResNet-18 with ReLU AFs (baseline) trained on CIFAR-10. Meanwhile, SARAF provides a significant search efficiency compared to random search as the optimization baseline.","PeriodicalId":296325,"journal":{"name":"Proceedings of the 2023 6th International Conference on Machine Vision and Applications","volume":"36 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127493308","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Employing Machine Learning and an OCR Validation Technique to Identify Product Category Based on Visible Packaging Features 利用机器学习和OCR验证技术识别基于可见包装特征的产品类别
Takorn Prexawanprasut, Lalita Santiworarak, Piyaporn Nurarak, Poom Juasiripukdee
Customs clearance is a challenging and time-consuming process that must be completed in the sphere of international trade. As a result, the cargo is frequently delayed at the port. If the personnel know the initial number of items, they may be able to continue with other procedures even when they are not physically present at the location. Image processing is helpful in this area since it allows for the prediction of the type of goods based on the appearance of the package. This allows for the determination of the quantity of each type of product prior to the arrival of the employees at the site. Three distinct import-export companies contributed 5,675 photos, and a machine learning approach was used to create a model that can predict the types of things that fall into one of five categories. Also, the researchers made an OCR-based classification algorithm with the goal of making machine learning work better for certain types of things that have trouble learning.
清关是国际贸易领域必须完成的一个具有挑战性和耗时的过程。结果,货物经常在港口延误。如果工作人员知道物品的初始数量,即使他们不在现场,他们也可以继续进行其他程序。图像处理在这方面很有帮助,因为它允许根据包装的外观预测商品的类型。这样就可以在员工到达现场之前确定每种产品的数量。三家不同的进出口公司提供了5675张照片,并使用机器学习方法创建了一个模型,该模型可以预测属于五类之一的事物类型。此外,研究人员还开发了一种基于ocr的分类算法,目的是让机器学习更好地处理某些难以学习的事物。
{"title":"Employing Machine Learning and an OCR Validation Technique to Identify Product Category Based on Visible Packaging Features","authors":"Takorn Prexawanprasut, Lalita Santiworarak, Piyaporn Nurarak, Poom Juasiripukdee","doi":"10.1145/3589572.3589589","DOIUrl":"https://doi.org/10.1145/3589572.3589589","url":null,"abstract":"Customs clearance is a challenging and time-consuming process that must be completed in the sphere of international trade. As a result, the cargo is frequently delayed at the port. If the personnel know the initial number of items, they may be able to continue with other procedures even when they are not physically present at the location. Image processing is helpful in this area since it allows for the prediction of the type of goods based on the appearance of the package. This allows for the determination of the quantity of each type of product prior to the arrival of the employees at the site. Three distinct import-export companies contributed 5,675 photos, and a machine learning approach was used to create a model that can predict the types of things that fall into one of five categories. Also, the researchers made an OCR-based classification algorithm with the goal of making machine learning work better for certain types of things that have trouble learning.","PeriodicalId":296325,"journal":{"name":"Proceedings of the 2023 6th International Conference on Machine Vision and Applications","volume":"64 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125101072","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Process Quality Prediction Algorithm of Multi output Workshop Based on ATT-CNN-TCN 基于ATT-CNN-TCN的多输出车间工艺质量预测算法
Bin Yi, Wenqiang Lin, Wenqi Li, Xiaohua Gao, Bing Zhou, Jun Tang
In the view of the existing workshop process quality prediction method for the process parameters related timing information mining is not sufficient, existing research does not consider the contribution of different characteristics to the prediction target difference, this paper proposes the fusion of attention mechanism, convolutional neural network and time convolutional network. The attention module adaptively allocates weight information to the input features, convolutional neural network module to deeply mine the correlation information between process parameters was used, extracts the temporal information between process sequences with time convolutional neural learning, and finally superposition the full connection network mapping to obtain the workshop process quality prediction value. After example verification, the experimental results show that the constructed model is better than other process quality prediction models in the prediction accuracy, stability and network structure.
针对现有车间工艺质量预测方法对工艺参数相关时序信息挖掘不够充分,现有研究没有考虑不同特征对预测目标差异的贡献,提出了将注意机制、卷积神经网络和时间卷积网络相融合的方法。注意模块自适应地为输入特征分配权重信息,利用卷积神经网络模块深度挖掘工艺参数之间的相关信息,利用时间卷积神经学习提取工艺序列之间的时间信息,最后将全连接网络映射叠加得到车间工艺质量预测值。经过实例验证,实验结果表明,所构建的模型在预测精度、稳定性和网络结构等方面都优于其他过程质量预测模型。
{"title":"Process Quality Prediction Algorithm of Multi output Workshop Based on ATT-CNN-TCN","authors":"Bin Yi, Wenqiang Lin, Wenqi Li, Xiaohua Gao, Bing Zhou, Jun Tang","doi":"10.1145/3589572.3589590","DOIUrl":"https://doi.org/10.1145/3589572.3589590","url":null,"abstract":"In the view of the existing workshop process quality prediction method for the process parameters related timing information mining is not sufficient, existing research does not consider the contribution of different characteristics to the prediction target difference, this paper proposes the fusion of attention mechanism, convolutional neural network and time convolutional network. The attention module adaptively allocates weight information to the input features, convolutional neural network module to deeply mine the correlation information between process parameters was used, extracts the temporal information between process sequences with time convolutional neural learning, and finally superposition the full connection network mapping to obtain the workshop process quality prediction value. After example verification, the experimental results show that the constructed model is better than other process quality prediction models in the prediction accuracy, stability and network structure.","PeriodicalId":296325,"journal":{"name":"Proceedings of the 2023 6th International Conference on Machine Vision and Applications","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128650230","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Research on Compact Quantum Classifier Based on Kernel Method 基于核方法的紧凑量子分类器研究
Ruihong Jia, Guang Yang, Min Nie, Yun Zhang
Kernel method is widely used in machine learning. At present, the connection between kernel methods and quantum computing has been gradually established, which provides a new algorithm idea for the field of quantum machine learning. Research shows that the construction of minimized quantum circuits can be reliably performed on Noisy Intermediate-Scale Quantum (NISQ) devices. This paper proposes a compact quantum classifier based on kernel method. By introducing the compact amplitude encoding, the data label of the phase corresponding to the quantum state is encoded. Compared with the proposed classifier based on quantum kernel method, it can reduce 2 quantum registers, further reduce the circuit depth, and thus reduce the algorithm complexity. The double qubit measurement is simplified to single qubit measurement. In addition, this model achieves the optimal variance in quantum circuit parameters, which can effectively save computational resources. Experimental simulation shows that the expected value measurement in the proposed classifier model is closer to the theoretical value, and the classification accuracy is more accurate. At the same time, the system model has low entanglement, which can effectively reduce the cost of the whole preparation.
核方法在机器学习中有着广泛的应用。目前,核方法与量子计算之间的联系已经逐渐建立起来,为量子机器学习领域提供了新的算法思路。研究表明,在有噪声的中尺度量子(NISQ)器件上可以可靠地构建最小化量子电路。提出了一种基于核方法的紧凑量子分类器。通过引入紧凑幅度编码,对量子态对应的相位数据标号进行编码。与所提出的基于量子核方法的分类器相比,该分类器减少了2个量子寄存器,进一步减小了电路深度,从而降低了算法复杂度。将双量子位测量简化为单量子位测量。此外,该模型实现了量子电路参数的最优方差,可以有效地节省计算资源。实验仿真表明,所提分类器模型的期望值测量值更接近理论值,分类精度更高。同时,该系统模型具有低纠缠性,可以有效降低整个制备的成本。
{"title":"Research on Compact Quantum Classifier Based on Kernel Method","authors":"Ruihong Jia, Guang Yang, Min Nie, Yun Zhang","doi":"10.1145/3589572.3589592","DOIUrl":"https://doi.org/10.1145/3589572.3589592","url":null,"abstract":"Kernel method is widely used in machine learning. At present, the connection between kernel methods and quantum computing has been gradually established, which provides a new algorithm idea for the field of quantum machine learning. Research shows that the construction of minimized quantum circuits can be reliably performed on Noisy Intermediate-Scale Quantum (NISQ) devices. This paper proposes a compact quantum classifier based on kernel method. By introducing the compact amplitude encoding, the data label of the phase corresponding to the quantum state is encoded. Compared with the proposed classifier based on quantum kernel method, it can reduce 2 quantum registers, further reduce the circuit depth, and thus reduce the algorithm complexity. The double qubit measurement is simplified to single qubit measurement. In addition, this model achieves the optimal variance in quantum circuit parameters, which can effectively save computational resources. Experimental simulation shows that the expected value measurement in the proposed classifier model is closer to the theoretical value, and the classification accuracy is more accurate. At the same time, the system model has low entanglement, which can effectively reduce the cost of the whole preparation.","PeriodicalId":296325,"journal":{"name":"Proceedings of the 2023 6th International Conference on Machine Vision and Applications","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129147980","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Integrating User Gaze with Verbal Instruction to Reliably Estimate Robotic Task Parameters in a Human-Robot Collaborative Environment 基于用户注视和语言指令的人机协作环境下机器人任务参数可靠估计
S. K. Paul, M. Nicolescu, M. Nicolescu
As robots become more ubiquitous in our daily life, it has become very important to extract task and environmental information through more natural, meaningful, and easy-to-use interaction interfaces. Not only this helps the user to adapt to (thus trust) a robot in a collaborative environment, it can supplement the core sensory information, helping the robot make reliable decisions. This paper presents a framework that combines two natural interaction interfaces: speech and gaze to reliably infer the object of interest and the robotic task parameters. The gaze estimation module utilizes pre-defined 3D facial points and matches them to a set of extracted estimated 3D facial landmarks of the users from 2D images to infer the gaze direction. Subsequently, the verbal instructions are passed through a deep learning model to extract the information relevant to a robotic task. These extracted task parameters from verbal instructions along with the estimated gaze directions are combined to detect and/or disambiguate objects in the scene to generate the final task configurations. The proposed framework shows very promising results in integrating the relevant task parameters for the intended robotic tasks in different real-world interaction scenarios.
随着机器人在我们的日常生活中越来越普遍,通过更自然、更有意义、更易于使用的交互界面提取任务和环境信息变得非常重要。这不仅有助于用户在协作环境中适应(从而信任)机器人,还可以补充核心感官信息,帮助机器人做出可靠的决策。本文提出了一个结合语音和凝视两种自然交互界面的框架,以可靠地推断感兴趣的对象和机器人任务参数。注视估计模块利用预定义的3D面部点,将其与一组从2D图像中提取的估计用户的3D面部地标进行匹配,从而推断出注视方向。随后,口头指令通过深度学习模型来提取与机器人任务相关的信息。这些从口头指令中提取的任务参数与估计的凝视方向相结合,以检测和/或消除场景中的物体的歧义,从而生成最终的任务配置。所提出的框架在整合不同现实世界交互场景中机器人任务的相关任务参数方面显示出非常有希望的结果。
{"title":"Integrating User Gaze with Verbal Instruction to Reliably Estimate Robotic Task Parameters in a Human-Robot Collaborative Environment","authors":"S. K. Paul, M. Nicolescu, M. Nicolescu","doi":"10.1145/3589572.3589580","DOIUrl":"https://doi.org/10.1145/3589572.3589580","url":null,"abstract":"As robots become more ubiquitous in our daily life, it has become very important to extract task and environmental information through more natural, meaningful, and easy-to-use interaction interfaces. Not only this helps the user to adapt to (thus trust) a robot in a collaborative environment, it can supplement the core sensory information, helping the robot make reliable decisions. This paper presents a framework that combines two natural interaction interfaces: speech and gaze to reliably infer the object of interest and the robotic task parameters. The gaze estimation module utilizes pre-defined 3D facial points and matches them to a set of extracted estimated 3D facial landmarks of the users from 2D images to infer the gaze direction. Subsequently, the verbal instructions are passed through a deep learning model to extract the information relevant to a robotic task. These extracted task parameters from verbal instructions along with the estimated gaze directions are combined to detect and/or disambiguate objects in the scene to generate the final task configurations. The proposed framework shows very promising results in integrating the relevant task parameters for the intended robotic tasks in different real-world interaction scenarios.","PeriodicalId":296325,"journal":{"name":"Proceedings of the 2023 6th International Conference on Machine Vision and Applications","volume":"76 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124519124","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Quality metrics prediction in process manufacturing based on CNN-LSTM transfer learning algorithm 基于CNN-LSTM迁移学习算法的过程制造质量指标预测
Bin Yi, Jun Tang, Wenqiang Lin, Xiaohua Gao, Bing Zhou, Junjun Fang, Yulei Gao, Wenqi Li
The prediction of production process quality indicators plays an important role in product quality and production scheduling in process industries. In order to exploit the effective information contained in the massive process data, improve the prediction accuracy of production process quality indicators and apply to the changes of processing conditions, a hybrid model quality indicator migration learning prediction method based on convolutional network (CNN) and long-short-term memory (LSTM) is proposed. Massive amounts of historical process data, operational data and date data were constructed into a continuous feature matrix with a time-sliding window. The feature vectors are first extracted using CNN, and the feature vectors are constructed in a time-series sequence and used as input data for the LSTM network. Then the LSTM network is used for quality index prediction. In this process, migration learning strategy is introduced, which reduced the training time while ensuring the training accuracy. Finally, the correctness and effectiveness of the proposed method is verified by using the process data of a tobacco factory microtobacco cutting test line as a case object.
生产过程质量指标的预测在过程工业的产品质量和生产调度中起着重要作用。为了挖掘海量过程数据中蕴含的有效信息,提高生产过程质量指标的预测精度,并适用于加工条件的变化,提出了一种基于卷积网络(CNN)和长短期记忆(LSTM)的混合模型质量指标迁移学习预测方法。将大量历史过程数据、运行数据和日期数据构建成具有时间滑动窗口的连续特征矩阵。首先使用CNN提取特征向量,并在时间序列序列中构造特征向量,作为LSTM网络的输入数据。然后利用LSTM网络进行质量指标预测。在此过程中引入迁移学习策略,在保证训练精度的同时减少了训练时间。最后,以某烟厂微烟切割试验线工艺数据为案例对象,验证了所提方法的正确性和有效性。
{"title":"Quality metrics prediction in process manufacturing based on CNN-LSTM transfer learning algorithm","authors":"Bin Yi, Jun Tang, Wenqiang Lin, Xiaohua Gao, Bing Zhou, Junjun Fang, Yulei Gao, Wenqi Li","doi":"10.1145/3589572.3589591","DOIUrl":"https://doi.org/10.1145/3589572.3589591","url":null,"abstract":"The prediction of production process quality indicators plays an important role in product quality and production scheduling in process industries. In order to exploit the effective information contained in the massive process data, improve the prediction accuracy of production process quality indicators and apply to the changes of processing conditions, a hybrid model quality indicator migration learning prediction method based on convolutional network (CNN) and long-short-term memory (LSTM) is proposed. Massive amounts of historical process data, operational data and date data were constructed into a continuous feature matrix with a time-sliding window. The feature vectors are first extracted using CNN, and the feature vectors are constructed in a time-series sequence and used as input data for the LSTM network. Then the LSTM network is used for quality index prediction. In this process, migration learning strategy is introduced, which reduced the training time while ensuring the training accuracy. Finally, the correctness and effectiveness of the proposed method is verified by using the process data of a tobacco factory microtobacco cutting test line as a case object.","PeriodicalId":296325,"journal":{"name":"Proceedings of the 2023 6th International Conference on Machine Vision and Applications","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130045792","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multi-temporal process quality prediction based on graph neural network 基于图神经网络的多时间过程质量预测
Bin Yi, Wenqi Li, Jun Tang, Xiaohua Gao, Bing Zhou, Xiaoli Xu, Peng Qin, Wenqiang Lin
For the complex dependencies of production data in time and space, a multi-temporal processing process quality prediction model GLSTM based on graph neural networks is proposed, which uses graph structure data to model the process relationships among production indicators, uses graph neural networks to aggregate spatial information among production indicators, and uses long and short term memory networks to model the complex dependencies of shop floor processing quality indicator sequences in time, and the experimental The results show that the model is able to achieve relative performance improvements of 5.40%, 15.04% and 0.30% compared to time series analysis methods.
针对生产数据在时间和空间上的复杂依赖关系,提出了一种基于图神经网络的多时态加工过程质量预测模型GLSTM,该模型利用图结构数据对生产指标间的过程关系进行建模,利用图神经网络对生产指标间的空间信息进行聚合,并利用长短期记忆网络对车间加工质量指标序列在时间上的复杂依赖关系进行建模,实验结果表明,与时间序列分析方法相比,该模型的相对性能提高了5.40%、15.04%和0.30%。
{"title":"Multi-temporal process quality prediction based on graph neural network","authors":"Bin Yi, Wenqi Li, Jun Tang, Xiaohua Gao, Bing Zhou, Xiaoli Xu, Peng Qin, Wenqiang Lin","doi":"10.1145/3589572.3589599","DOIUrl":"https://doi.org/10.1145/3589572.3589599","url":null,"abstract":"For the complex dependencies of production data in time and space, a multi-temporal processing process quality prediction model GLSTM based on graph neural networks is proposed, which uses graph structure data to model the process relationships among production indicators, uses graph neural networks to aggregate spatial information among production indicators, and uses long and short term memory networks to model the complex dependencies of shop floor processing quality indicator sequences in time, and the experimental The results show that the model is able to achieve relative performance improvements of 5.40%, 15.04% and 0.30% compared to time series analysis methods.","PeriodicalId":296325,"journal":{"name":"Proceedings of the 2023 6th International Conference on Machine Vision and Applications","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116348438","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Detection of Fibrillatory Episodes in Atrial Fibrillation Rhythms via Topology-informed Machine Learning 通过拓扑信息的机器学习检测心房颤动节律中的纤颤发作
Paul Samuel P. Ignacio
Effective and efficient methods for diagnosing cardiac conditions remain of significant importance and relevance in clinical cardiology. As such, advances in machine- and deep-learning technologies pave the way to high throughput approaches to automated classification of cardiac abnormalities. While there is rich literature on ECG-based classification of cardiac conditions, particularly on diagnosing Atrial Fibrillation, there is a dearth on algorithms that can effectively measure the onset and offset of atrial fibrillation events within an ECG. In this work, we show that an off-the-shelf machine learning algorithm can be trained on mathematically-computable shape signatures embedded within the local topology of ECGs to identify fibrillatory episodes in ECGs of AF patients. More precisely, we show that a topology-informed machine learning algorithm can accurately classify segments within an ECG as either resembling an atrial fibrillation event or not. Furthermore, we show that based on the model-provided classification of segments, a simple criterion may be used to determine whether the AF rhythm is paroxysmal or persistent.
诊断心脏疾病的有效方法在临床心脏病学中仍然具有重要意义和相关性。因此,机器和深度学习技术的进步为心脏异常自动分类的高通量方法铺平了道路。虽然有丰富的基于心电图的心脏状况分类文献,特别是在诊断房颤方面,但缺乏能够有效测量心电图中房颤事件的发作和偏移的算法。在这项工作中,我们展示了一种现成的机器学习算法可以在嵌入在心电图局部拓扑中的数学可计算的形状特征上进行训练,以识别AF患者心电图中的纤颤发作。更准确地说,我们表明拓扑信息的机器学习算法可以准确地将ECG中的片段分类为类似心房颤动事件或不类似心房颤动事件。此外,我们表明,基于模型提供的节段分类,可以使用一个简单的标准来确定AF节律是阵发性的还是持续性的。
{"title":"Detection of Fibrillatory Episodes in Atrial Fibrillation Rhythms via Topology-informed Machine Learning","authors":"Paul Samuel P. Ignacio","doi":"10.1145/3589572.3589576","DOIUrl":"https://doi.org/10.1145/3589572.3589576","url":null,"abstract":"Effective and efficient methods for diagnosing cardiac conditions remain of significant importance and relevance in clinical cardiology. As such, advances in machine- and deep-learning technologies pave the way to high throughput approaches to automated classification of cardiac abnormalities. While there is rich literature on ECG-based classification of cardiac conditions, particularly on diagnosing Atrial Fibrillation, there is a dearth on algorithms that can effectively measure the onset and offset of atrial fibrillation events within an ECG. In this work, we show that an off-the-shelf machine learning algorithm can be trained on mathematically-computable shape signatures embedded within the local topology of ECGs to identify fibrillatory episodes in ECGs of AF patients. More precisely, we show that a topology-informed machine learning algorithm can accurately classify segments within an ECG as either resembling an atrial fibrillation event or not. Furthermore, we show that based on the model-provided classification of segments, a simple criterion may be used to determine whether the AF rhythm is paroxysmal or persistent.","PeriodicalId":296325,"journal":{"name":"Proceedings of the 2023 6th International Conference on Machine Vision and Applications","volume":"159 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115636831","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Discrete Radial-Harmonic-Fourier Moments for Image Description 离散径向-谐波-傅里叶矩用于图像描述
Kejia Wang, Ziliang Ping, Y. Sheng
A new type of multi-distorted invariant discrete orthogonal moments, discrete Radial-Harmonic-Fourier moments was proposed. The kernel function of the moments was composed of radial discrete orthogonal triangular function and angular Fourier complex componential factor. The relationship between discrete Radial-Harmonic-Fourier moments and Radial-Harmonic-Fourier moments was also analyzed. The experimental results indicate that the discrete Radial-Harmonic-Fourier moments have excellent image description ability and can be effectively used as invariant image features in image analysis and pattern recognition.
提出了一种新的多畸变不变离散正交矩——离散径向-谐波-傅立叶矩。力矩的核函数由径向离散正交三角函数和角向傅立叶复分量因子组成。分析了离散径向-谐波-傅里叶矩与径向-谐波-傅里叶矩的关系。实验结果表明,离散径向-谐波-傅里叶矩具有良好的图像描述能力,可以有效地用作图像分析和模式识别中的不变图像特征。
{"title":"Discrete Radial-Harmonic-Fourier Moments for Image Description","authors":"Kejia Wang, Ziliang Ping, Y. Sheng","doi":"10.1145/3589572.3589600","DOIUrl":"https://doi.org/10.1145/3589572.3589600","url":null,"abstract":"A new type of multi-distorted invariant discrete orthogonal moments, discrete Radial-Harmonic-Fourier moments was proposed. The kernel function of the moments was composed of radial discrete orthogonal triangular function and angular Fourier complex componential factor. The relationship between discrete Radial-Harmonic-Fourier moments and Radial-Harmonic-Fourier moments was also analyzed. The experimental results indicate that the discrete Radial-Harmonic-Fourier moments have excellent image description ability and can be effectively used as invariant image features in image analysis and pattern recognition.","PeriodicalId":296325,"journal":{"name":"Proceedings of the 2023 6th International Conference on Machine Vision and Applications","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128682268","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Road Lane Segmentation Using Vehicle Trajectory Tracking and Lane Demarcation Lines 基于车辆轨迹跟踪和车道分界线的道路车道分割
Adriel Isaiah V. Amoguis, Hernand Ang Hermida, G. J. B. Madrid, Gabriel Costes Marquez, Justin Opulencia Dy, Jose Gerardo Ortile Guerrero, J. Ilao
As levels of road traffic congestion increase relative to population density, it is becoming increasingly necessary for traffic managers to have awareness of road situations in real-time to keep up with traffic management. There are already existing techniques and applications in computer vision that traffic managers use to collect real-time telemetry, such as but not limited to vehicle counting algorithms. However, these algorithms and applications may not be lane-aware. Enabling lane awareness to these systems allows them to be more granular, which enables more in-depth telemetry such as lane usage, driver pattern recognition, and anomaly detection, among others. Lane awareness in these systems are enabled by performing lane segmentation. This study investigates two approaches to this. The first approach uses vehicle trajectories to generate aggregated trajectory maps, which are then clustered to determine trajectory lane membership and to generate representative trajectories that describes the lane. On the other hand, the second approach takes an end-to-end method and uses road lane features such as demarcation lines to segment lanes. The first approach proved to be more viable as a lane segmentation algorithm compared to the second approach as it was able to segment lanes more reliably, given enough vehicle trajectories are present.
随着道路交通拥堵程度相对于人口密度的增加,交通管理人员越来越有必要实时了解道路情况,以跟上交通管理的步伐。在计算机视觉方面,交通管理人员已经使用现有的技术和应用来收集实时遥测数据,例如但不限于车辆计数算法。然而,这些算法和应用程序可能不具有车道感知功能。为这些系统启用车道感知功能可以使它们更加精细,从而实现更深入的遥测,例如车道使用情况、驾驶员模式识别和异常检测等。这些系统中的车道感知是通过执行车道分割来实现的。本研究探讨了两种方法。第一种方法使用车辆轨迹来生成聚合轨迹图,然后将其聚类以确定轨迹车道的隶属关系并生成描述车道的代表性轨迹。另一方面,第二种方法采用端到端方法,利用道路车道特征(如分界线)来分割车道。与第二种方法相比,第一种方法被证明是一种更可行的车道分割算法,因为在给定足够的车辆轨迹的情况下,它能够更可靠地分割车道。
{"title":"Road Lane Segmentation Using Vehicle Trajectory Tracking and Lane Demarcation Lines","authors":"Adriel Isaiah V. Amoguis, Hernand Ang Hermida, G. J. B. Madrid, Gabriel Costes Marquez, Justin Opulencia Dy, Jose Gerardo Ortile Guerrero, J. Ilao","doi":"10.1145/3589572.3589582","DOIUrl":"https://doi.org/10.1145/3589572.3589582","url":null,"abstract":"As levels of road traffic congestion increase relative to population density, it is becoming increasingly necessary for traffic managers to have awareness of road situations in real-time to keep up with traffic management. There are already existing techniques and applications in computer vision that traffic managers use to collect real-time telemetry, such as but not limited to vehicle counting algorithms. However, these algorithms and applications may not be lane-aware. Enabling lane awareness to these systems allows them to be more granular, which enables more in-depth telemetry such as lane usage, driver pattern recognition, and anomaly detection, among others. Lane awareness in these systems are enabled by performing lane segmentation. This study investigates two approaches to this. The first approach uses vehicle trajectories to generate aggregated trajectory maps, which are then clustered to determine trajectory lane membership and to generate representative trajectories that describes the lane. On the other hand, the second approach takes an end-to-end method and uses road lane features such as demarcation lines to segment lanes. The first approach proved to be more viable as a lane segmentation algorithm compared to the second approach as it was able to segment lanes more reliably, given enough vehicle trajectories are present.","PeriodicalId":296325,"journal":{"name":"Proceedings of the 2023 6th International Conference on Machine Vision and Applications","volume":"228 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124524930","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Proceedings of the 2023 6th International Conference on Machine Vision and Applications
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1