首页 > 最新文献

International Conference on Pattern Recognition Applications and Methods最新文献

英文 中文
PatchSVD: A Non-Uniform SVD-Based Image Compression Algorithm PatchSVD:基于非均匀 SVD 的图像压缩算法
Pub Date : 2024-06-07 DOI: 10.5220/0012488500003654
Zahra Golpayegani, Nizar Bouguila
Storing data is particularly a challenge when dealing with image data which often involves large file sizes due to the high resolution and complexity of images. Efficient image compression algorithms are crucial to better manage data storage costs. In this paper, we propose a novel region-based lossy image compression technique, called PatchSVD, based on the Singular Value Decomposition (SVD) algorithm. We show through experiments that PatchSVD outperforms SVD-based image compression with respect to three popular image compression metrics. Moreover, we compare PatchSVD compression artifacts with those of Joint Photographic Experts Group (JPEG) and SVD-based image compression and illustrate some cases where PatchSVD compression artifacts are preferable compared to JPEG and SVD artifacts.
在处理图像数据时,存储数据尤其是一项挑战,由于图像的高分辨率和复杂性,其文件大小往往很大。高效的图像压缩算法对于更好地管理数据存储成本至关重要。在本文中,我们提出了一种基于奇异值分解(SVD)算法的新型区域有损图像压缩技术,称为 PatchSVD。我们通过实验证明,就三种流行的图像压缩指标而言,PatchSVD 优于基于 SVD 的图像压缩技术。此外,我们还比较了 PatchSVD 与联合图像专家组(JPEG)和基于 SVD 的图像压缩的压缩伪影,并说明了在某些情况下 PatchSVD 的压缩伪影优于 JPEG 和 SVD 的伪影。
{"title":"PatchSVD: A Non-Uniform SVD-Based Image Compression Algorithm","authors":"Zahra Golpayegani, Nizar Bouguila","doi":"10.5220/0012488500003654","DOIUrl":"https://doi.org/10.5220/0012488500003654","url":null,"abstract":"Storing data is particularly a challenge when dealing with image data which often involves large file sizes due to the high resolution and complexity of images. Efficient image compression algorithms are crucial to better manage data storage costs. In this paper, we propose a novel region-based lossy image compression technique, called PatchSVD, based on the Singular Value Decomposition (SVD) algorithm. We show through experiments that PatchSVD outperforms SVD-based image compression with respect to three popular image compression metrics. Moreover, we compare PatchSVD compression artifacts with those of Joint Photographic Experts Group (JPEG) and SVD-based image compression and illustrate some cases where PatchSVD compression artifacts are preferable compared to JPEG and SVD artifacts.","PeriodicalId":410036,"journal":{"name":"International Conference on Pattern Recognition Applications and Methods","volume":" 11","pages":"886-893"},"PeriodicalIF":0.0,"publicationDate":"2024-06-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141371207","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
On Spectrogram Analysis in a Multiple Classifier Fusion Framework for Power Grid Classification Using Electric Network Frequency 论利用电网频率进行电网分类的多分类器融合框架中的谱图分析
Pub Date : 2024-03-27 DOI: 10.5220/0012418400003654
Georgios Tzolopoulos, Christos Korgialas, C. Kotropoulos
The Electric Network Frequency (ENF) serves as a unique signature inherent to power distribution systems. Here, a novel approach for power grid classification is developed, leveraging ENF. Spectrograms are generated from audio and power recordings across different grids, revealing distinctive ENF patterns that aid in grid classification through a fusion of classifiers. Four traditional machine learning classifiers plus a Convolutional Neural Network (CNN), optimized using Neural Architecture Search, are developed for One-vs-All classification. This process generates numerous predictions per sample, which are then compiled and used to train a shallow multi-label neural network specifically designed to model the fusion process, ultimately leading to the conclusive class prediction for each sample. Experimental findings reveal that both validation and testing accuracy outperform those of current state-of-the-art classifiers, underlining the effectiveness and robustness of the proposed methodology.
电网频率(ENF)是配电系统固有的独特特征。在此,我们开发了一种利用 ENF 进行电网分类的新方法。从不同电网的音频和电力记录中生成频谱图,揭示出独特的 ENF 模式,通过融合分类器帮助电网分类。四个传统机器学习分类器加上一个卷积神经网络 (CNN),利用神经架构搜索进行了优化,用于 "单对全 "分类。这一过程会为每个样本生成大量预测结果,然后对这些预测结果进行编译,用于训练一个专门为模拟融合过程而设计的浅层多标签神经网络,最终为每个样本生成结论性的类别预测结果。实验结果表明,验证和测试准确性均优于当前最先进的分类器,凸显了所提方法的有效性和稳健性。
{"title":"On Spectrogram Analysis in a Multiple Classifier Fusion Framework for Power Grid Classification Using Electric Network Frequency","authors":"Georgios Tzolopoulos, Christos Korgialas, C. Kotropoulos","doi":"10.5220/0012418400003654","DOIUrl":"https://doi.org/10.5220/0012418400003654","url":null,"abstract":"The Electric Network Frequency (ENF) serves as a unique signature inherent to power distribution systems. Here, a novel approach for power grid classification is developed, leveraging ENF. Spectrograms are generated from audio and power recordings across different grids, revealing distinctive ENF patterns that aid in grid classification through a fusion of classifiers. Four traditional machine learning classifiers plus a Convolutional Neural Network (CNN), optimized using Neural Architecture Search, are developed for One-vs-All classification. This process generates numerous predictions per sample, which are then compiled and used to train a shallow multi-label neural network specifically designed to model the fusion process, ultimately leading to the conclusive class prediction for each sample. Experimental findings reveal that both validation and testing accuracy outperform those of current state-of-the-art classifiers, underlining the effectiveness and robustness of the proposed methodology.","PeriodicalId":410036,"journal":{"name":"International Conference on Pattern Recognition Applications and Methods","volume":"5 9","pages":"91-99"},"PeriodicalIF":0.0,"publicationDate":"2024-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140375871","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Semantic Properties of cosine based bias scores for word embeddings 基于余弦的词嵌入偏差分数的语义特性
Pub Date : 2024-01-27 DOI: 10.48550/arXiv.2401.15499
Sarah Schröder, Alexander Schulz, Fabian Hinder, Barbara Hammer
Plenty of works have brought social biases in language models to attention and proposed methods to detect such biases. As a result, the literature contains a great deal of different bias tests and scores, each introduced with the premise to uncover yet more biases that other scores fail to detect. What severely lacks in the literature, however, are comparative studies that analyse such bias scores and help researchers to understand the benefits or limitations of the existing methods. In this work, we aim to close this gap for cosine based bias scores. By building on a geometric definition of bias, we propose requirements for bias scores to be considered meaningful for quantifying biases. Furthermore, we formally analyze cosine based scores from the literature with regard to these requirements. We underline these findings with experiments to show that the bias scores' limitations have an impact in the application case.
大量的研究已经引起了人们对语言模型中社会偏差的关注,并提出了检测这些偏差的方法。因此,文献中包含了大量不同的偏差测试和评分方法,每种方法都以发现更多其他评分方法未能发现的偏差为前提。然而,文献中严重缺乏对这些偏差评分进行分析的比较研究,这有助于研究人员了解现有方法的优点或局限性。在这项工作中,我们的目标是缩小基于余弦的偏差分数的这一差距。通过建立在偏差的几何定义基础上,我们提出了对偏差分数的要求,使其被认为对量化偏差有意义。此外,我们还根据这些要求对文献中基于余弦的分数进行了正式分析。我们通过实验强调了这些发现,以证明偏差分数的局限性在应用案例中会产生影响。
{"title":"Semantic Properties of cosine based bias scores for word embeddings","authors":"Sarah Schröder, Alexander Schulz, Fabian Hinder, Barbara Hammer","doi":"10.48550/arXiv.2401.15499","DOIUrl":"https://doi.org/10.48550/arXiv.2401.15499","url":null,"abstract":"Plenty of works have brought social biases in language models to attention and proposed methods to detect such biases. As a result, the literature contains a great deal of different bias tests and scores, each introduced with the premise to uncover yet more biases that other scores fail to detect. What severely lacks in the literature, however, are comparative studies that analyse such bias scores and help researchers to understand the benefits or limitations of the existing methods. In this work, we aim to close this gap for cosine based bias scores. By building on a geometric definition of bias, we propose requirements for bias scores to be considered meaningful for quantifying biases. Furthermore, we formally analyze cosine based scores from the literature with regard to these requirements. We underline these findings with experiments to show that the bias scores' limitations have an impact in the application case.","PeriodicalId":410036,"journal":{"name":"International Conference on Pattern Recognition Applications and Methods","volume":"73 11","pages":"160-168"},"PeriodicalIF":0.0,"publicationDate":"2024-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140491662","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Double Trouble? Impact and Detection of Duplicates in Face Image Datasets 双重麻烦?人脸图像数据集中重复图像的影响与检测
Pub Date : 2024-01-25 DOI: 10.5220/0012422500003654
Torsten Schlett, C. Rathgeb, Juan E. Tapia, Christoph Busch
Various face image datasets intended for facial biometrics research were created via web-scraping, i.e. the collection of images publicly available on the internet. This work presents an approach to detect both exactly and nearly identical face image duplicates, using file and image hashes. The approach is extended through the use of face image preprocessing. Additional steps based on face recognition and face image quality assessment models reduce false positives, and facilitate the deduplication of the face images both for intra- and inter-subject duplicate sets. The presented approach is applied to five datasets, namely LFW, TinyFace, Adience, CASIA-WebFace, and C-MS-Celeb (a cleaned MS-Celeb-1M variant). Duplicates are detected within every dataset, with hundreds to hundreds of thousands of duplicates for all except LFW. Face recognition and quality assessment experiments indicate a minor impact on the results through the duplicate removal. The final deduplication data is publicly available.
用于人脸生物识别研究的各种人脸图像数据集是通过网络抓取(即收集互联网上公开的图像)创建的。这项研究提出了一种利用文件和图像哈希值检测完全相同和几乎完全相同的重复人脸图像的方法。该方法通过使用人脸图像预处理进行扩展。基于人脸识别和人脸图像质量评估模型的附加步骤可减少误报,并有助于重复删除主体内和主体间重复集的人脸图像。所介绍的方法适用于五个数据集,即 LFW、TinyFace、Adience、CASIA-WebFace 和 C-MS-Celeb(经过清理的 MS-Celeb-1M 变体)。每个数据集中都能检测到重复数据,除 LFW 外,其他数据集中都有数百至数十万个重复数据。人脸识别和质量评估实验表明,重复删除对结果的影响很小。重复数据删除的最终结果已公开。
{"title":"Double Trouble? Impact and Detection of Duplicates in Face Image Datasets","authors":"Torsten Schlett, C. Rathgeb, Juan E. Tapia, Christoph Busch","doi":"10.5220/0012422500003654","DOIUrl":"https://doi.org/10.5220/0012422500003654","url":null,"abstract":"Various face image datasets intended for facial biometrics research were created via web-scraping, i.e. the collection of images publicly available on the internet. This work presents an approach to detect both exactly and nearly identical face image duplicates, using file and image hashes. The approach is extended through the use of face image preprocessing. Additional steps based on face recognition and face image quality assessment models reduce false positives, and facilitate the deduplication of the face images both for intra- and inter-subject duplicate sets. The presented approach is applied to five datasets, namely LFW, TinyFace, Adience, CASIA-WebFace, and C-MS-Celeb (a cleaned MS-Celeb-1M variant). Duplicates are detected within every dataset, with hundreds to hundreds of thousands of duplicates for all except LFW. Face recognition and quality assessment experiments indicate a minor impact on the results through the duplicate removal. The final deduplication data is publicly available.","PeriodicalId":410036,"journal":{"name":"International Conference on Pattern Recognition Applications and Methods","volume":"68 12","pages":"801-808"},"PeriodicalIF":0.0,"publicationDate":"2024-01-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140496716","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Detecting Brain Tumors through Multimodal Neural Networks 通过多模态神经网络检测脑肿瘤
Pub Date : 2024-01-10 DOI: 10.5220/0012608600003654
Antonio Curci, Andrea Esposito
Tumors can manifest in various forms and in different areas of the human body. Brain tumors are specifically hard to diagnose and treat because of the complexity of the organ in which they develop. Detecting them in time can lower the chances of death and facilitate the therapy process for patients. The use of Artificial Intelligence (AI) and, more specifically, deep learning, has the potential to significantly reduce costs in terms of time and resources for the discovery and identification of tumors from images obtained through imaging techniques. This research work aims to assess the performance of a multimodal model for the classification of Magnetic Resonance Imaging (MRI) scans processed as grayscale images. The results are promising, and in line with similar works, as the model reaches an accuracy of around 98%. We also highlight the need for explainability and transparency to ensure human control and safety.
肿瘤的表现形式多种多样,可发生在人体的不同部位。脑肿瘤因其发病器官的复杂性而特别难以诊断和治疗。及时发现它们可以降低患者的死亡几率,促进治疗过程。使用人工智能(AI),更具体地说是深度学习,有可能大大降低从成像技术获得的图像中发现和识别肿瘤的时间和资源成本。这项研究工作旨在评估一个多模态模型的性能,该模型用于对处理为灰度图像的磁共振成像(MRI)扫描进行分类。结果很有希望,与同类研究结果一致,因为该模型的准确率达到了 98% 左右。我们还强调了可解释性和透明度的必要性,以确保人类的控制和安全。
{"title":"Detecting Brain Tumors through Multimodal Neural Networks","authors":"Antonio Curci, Andrea Esposito","doi":"10.5220/0012608600003654","DOIUrl":"https://doi.org/10.5220/0012608600003654","url":null,"abstract":"Tumors can manifest in various forms and in different areas of the human body. Brain tumors are specifically hard to diagnose and treat because of the complexity of the organ in which they develop. Detecting them in time can lower the chances of death and facilitate the therapy process for patients. The use of Artificial Intelligence (AI) and, more specifically, deep learning, has the potential to significantly reduce costs in terms of time and resources for the discovery and identification of tumors from images obtained through imaging techniques. This research work aims to assess the performance of a multimodal model for the classification of Magnetic Resonance Imaging (MRI) scans processed as grayscale images. The results are promising, and in line with similar works, as the model reaches an accuracy of around 98%. We also highlight the need for explainability and transparency to ensure human control and safety.","PeriodicalId":410036,"journal":{"name":"International Conference on Pattern Recognition Applications and Methods","volume":"5 4","pages":"995-1000"},"PeriodicalIF":0.0,"publicationDate":"2024-01-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140511607","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Learning Independently from Causality in Multi-Agent Environments 基于因果关系的多智能体环境独立学习
Pub Date : 2023-11-05 DOI: 10.5220/0011747900003411
Rafael Pina, V. D. Silva, Corentin Artaud
Multi-Agent Reinforcement Learning (MARL) comprises an area of growing interest in the field of machine learning. Despite notable advances, there are still problems that require investigation. The lazy agent pathology is a famous problem in MARL that denotes the event when some of the agents in a MARL team do not contribute to the common goal, letting the teammates do all the work. In this work, we aim to investigate this problem from a causality-based perspective. We intend to create the bridge between the fields of MARL and causality and argue about the usefulness of this link. We study a fully decentralised MARL setup where agents need to learn cooperation strategies and show that there is a causal relation between individual observations and the team reward. The experiments carried show how this relation can be used to improve independent agents in MARL, resulting not only on better performances as a team but also on the rise of more intelligent behaviours on individual agents.
多智能体强化学习(MARL)是机器学习领域中一个越来越受关注的领域。尽管取得了显著的进展,但仍有一些问题需要调查。懒惰代理病理是MARL中一个著名的问题,它指的是MARL团队中的一些代理不为共同目标做出贡献,让团队成员做所有的工作。在这项工作中,我们旨在从基于因果关系的角度来研究这个问题。我们打算在MARL和因果关系领域之间建立一座桥梁,并讨论这种联系的有用性。我们研究了一个完全分散的MARL设置,其中智能体需要学习合作策略,并表明个人观察和团队奖励之间存在因果关系。所进行的实验表明,这种关系如何用于改进MARL中的独立代理,不仅可以提高团队的表现,还可以提高个体代理的智能行为。
{"title":"Learning Independently from Causality in Multi-Agent Environments","authors":"Rafael Pina, V. D. Silva, Corentin Artaud","doi":"10.5220/0011747900003411","DOIUrl":"https://doi.org/10.5220/0011747900003411","url":null,"abstract":"Multi-Agent Reinforcement Learning (MARL) comprises an area of growing interest in the field of machine learning. Despite notable advances, there are still problems that require investigation. The lazy agent pathology is a famous problem in MARL that denotes the event when some of the agents in a MARL team do not contribute to the common goal, letting the teammates do all the work. In this work, we aim to investigate this problem from a causality-based perspective. We intend to create the bridge between the fields of MARL and causality and argue about the usefulness of this link. We study a fully decentralised MARL setup where agents need to learn cooperation strategies and show that there is a causal relation between individual observations and the team reward. The experiments carried show how this relation can be used to improve independent agents in MARL, resulting not only on better performances as a team but also on the rise of more intelligent behaviours on individual agents.","PeriodicalId":410036,"journal":{"name":"International Conference on Pattern Recognition Applications and Methods","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130384231","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Rethinking Image-based Table Recognition Using Weakly Supervised Methods 基于图像的弱监督表识别方法的再思考
Pub Date : 2023-03-14 DOI: 10.5220/0011682600003411
N. Ly, A. Takasu, Phuc Nguyen, H. Takeda
Most of the previous methods for table recognition rely on training datasets containing many richly annotated table images. Detailed table image annotation, e.g., cell or text bounding box annotation, however, is costly and often subjective. In this paper, we propose a weakly supervised model named WSTabNet for table recognition that relies only on HTML (or LaTeX) code-level annotations of table images. The proposed model consists of three main parts: an encoder for feature extraction, a structure decoder for generating table structure, and a cell decoder for predicting the content of each cell in the table. Our system is trained end-to-end by stochastic gradient descent algorithms, requiring only table images and their ground-truth HTML (or LaTeX) representations. To facilitate table recognition with deep learning, we create and release WikiTableSet, the largest publicly available image-based table recognition dataset built from Wikipedia. WikiTableSet contains nearly 4 million English table images, 590K Japanese table images, and 640k French table images with corresponding HTML representation and cell bounding boxes. The extensive experiments on WikiTableSet and two large-scale datasets: FinTabNet and PubTabNet demonstrate that the proposed weakly supervised model achieves better, or similar accuracies compared to the state-of-the-art models on all benchmark datasets.
以前的大多数表识别方法依赖于包含许多丰富注释的表图像的训练数据集。然而,详细的表格图像注释,例如单元格或文本边界框注释,是昂贵的,而且往往是主观的。在本文中,我们提出了一个弱监督模型WSTabNet用于表识别,该模型仅依赖于表图像的HTML(或LaTeX)代码级注释。该模型由三个主要部分组成:用于特征提取的编码器、用于生成表结构的结构解码器和用于预测表中每个单元的内容的单元解码器。我们的系统通过随机梯度下降算法进行端到端的训练,只需要表图像及其基本真实的HTML(或LaTeX)表示。为了促进深度学习的表识别,我们创建并发布了WikiTableSet,这是基于维基百科构建的最大的基于图像的表识别数据集。WikiTableSet包含近400万张英文表格图像、590K张日文表格图像和640k张法文表格图像,并带有相应的HTML表示和单元格边界框。在WikiTableSet和两个大型数据集:FinTabNet和PubTabNet上进行的大量实验表明,与所有基准数据集上的最新模型相比,所提出的弱监督模型达到了更好或相似的精度。
{"title":"Rethinking Image-based Table Recognition Using Weakly Supervised Methods","authors":"N. Ly, A. Takasu, Phuc Nguyen, H. Takeda","doi":"10.5220/0011682600003411","DOIUrl":"https://doi.org/10.5220/0011682600003411","url":null,"abstract":"Most of the previous methods for table recognition rely on training datasets containing many richly annotated table images. Detailed table image annotation, e.g., cell or text bounding box annotation, however, is costly and often subjective. In this paper, we propose a weakly supervised model named WSTabNet for table recognition that relies only on HTML (or LaTeX) code-level annotations of table images. The proposed model consists of three main parts: an encoder for feature extraction, a structure decoder for generating table structure, and a cell decoder for predicting the content of each cell in the table. Our system is trained end-to-end by stochastic gradient descent algorithms, requiring only table images and their ground-truth HTML (or LaTeX) representations. To facilitate table recognition with deep learning, we create and release WikiTableSet, the largest publicly available image-based table recognition dataset built from Wikipedia. WikiTableSet contains nearly 4 million English table images, 590K Japanese table images, and 640k French table images with corresponding HTML representation and cell bounding boxes. The extensive experiments on WikiTableSet and two large-scale datasets: FinTabNet and PubTabNet demonstrate that the proposed weakly supervised model achieves better, or similar accuracies compared to the state-of-the-art models on all benchmark datasets.","PeriodicalId":410036,"journal":{"name":"International Conference on Pattern Recognition Applications and Methods","volume":"57 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-03-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114511425","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Instance Segmentation Based Graph Extraction for Handwritten Circuit Diagram Images 基于实例分割的手写电路图图像提取
Pub Date : 2023-01-09 DOI: 10.48550/arXiv.2301.03155
Johannes Bayer, Amit Kumar Roy, A. Dengel
Handwritten circuit diagrams from educational scenarios or historic sources usually exist on analogue media. For deriving their functional principles or flaws automatically, they need to be digitized, extracting their electrical graph. Recently, the base technologies for automated pipelines facilitating this process shifted from computer vision to machine learning. This paper describes an approach for extracting both the electrical components (including their terminals and describing texts) as well their interconnections (including junctions and wire hops) by the means of instance segmentation and keypoint extraction. Consequently, the resulting graph extraction process consists of a simple two-step process of model inference and trivial geometric keypoint matching. The dataset itself, its preparation, model training and post-processing are described and publicly available.
来自教育场景或历史资料的手写电路图通常存在于模拟媒体上。为了自动导出它们的功能原理或缺陷,需要对它们进行数字化,提取它们的电图。最近,促进这一过程的自动化管道的基础技术从计算机视觉转向了机器学习。本文描述了一种通过实例分割和关键点提取的方法来提取电子元件(包括它们的终端和描述文本)以及它们的互连(包括结点和跳线)的方法。因此,所得到的图提取过程由简单的模型推理和简单的几何关键点匹配两步过程组成。数据集本身、它的准备、模型训练和后处理都被描述并公开可用。
{"title":"Instance Segmentation Based Graph Extraction for Handwritten Circuit Diagram Images","authors":"Johannes Bayer, Amit Kumar Roy, A. Dengel","doi":"10.48550/arXiv.2301.03155","DOIUrl":"https://doi.org/10.48550/arXiv.2301.03155","url":null,"abstract":"Handwritten circuit diagrams from educational scenarios or historic sources usually exist on analogue media. For deriving their functional principles or flaws automatically, they need to be digitized, extracting their electrical graph. Recently, the base technologies for automated pipelines facilitating this process shifted from computer vision to machine learning. This paper describes an approach for extracting both the electrical components (including their terminals and describing texts) as well their interconnections (including junctions and wire hops) by the means of instance segmentation and keypoint extraction. Consequently, the resulting graph extraction process consists of a simple two-step process of model inference and trivial geometric keypoint matching. The dataset itself, its preparation, model training and post-processing are described and publicly available.","PeriodicalId":410036,"journal":{"name":"International Conference on Pattern Recognition Applications and Methods","volume":"79 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116011458","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Machine Fault Classification using Hamiltonian Neural Networks 基于哈密顿神经网络的机器故障分类
Pub Date : 2023-01-04 DOI: 10.48550/arXiv.2301.02243
Jer-Sheng Shen, Jawad Chowdhury, Sourav Banerjee, G. Terejanu
A new approach is introduced to classify faults in rotating machinery based on the total energy signature estimated from sensor measurements. The overall goal is to go beyond using black-box models and incorporate additional physical constraints that govern the behavior of mechanical systems. Observational data is used to train Hamiltonian neural networks that describe the conserved energy of the system for normal and various abnormal regimes. The estimated total energy function, in the form of the weights of the Hamiltonian neural network, serves as the new feature vector to discriminate between the faults using off-the-shelf classification models. The experimental results are obtained using the MaFaulDa database, where the proposed model yields a promising area under the curve (AUC) of $0.78$ for the binary classification (normal vs abnormal) and $0.84$ for the multi-class problem (normal, and $5$ different abnormal regimes).
提出了一种基于传感器测量估计的总能量特征对旋转机械故障进行分类的新方法。总体目标是超越使用黑盒模型,并结合额外的物理约束来控制机械系统的行为。观测数据用于训练描述系统在正常和各种异常状态下的守恒能量的哈密顿神经网络。估计的总能量函数以哈密顿神经网络的权重形式作为新的特征向量,使用现成的分类模型来区分故障。实验结果是使用maaulda数据库获得的,其中所提出的模型在二元分类(正常与异常)和多类问题(正常和不同异常制度)中产生了一个有希望的曲线下面积(AUC)为0.78美元和0.84美元。
{"title":"Machine Fault Classification using Hamiltonian Neural Networks","authors":"Jer-Sheng Shen, Jawad Chowdhury, Sourav Banerjee, G. Terejanu","doi":"10.48550/arXiv.2301.02243","DOIUrl":"https://doi.org/10.48550/arXiv.2301.02243","url":null,"abstract":"A new approach is introduced to classify faults in rotating machinery based on the total energy signature estimated from sensor measurements. The overall goal is to go beyond using black-box models and incorporate additional physical constraints that govern the behavior of mechanical systems. Observational data is used to train Hamiltonian neural networks that describe the conserved energy of the system for normal and various abnormal regimes. The estimated total energy function, in the form of the weights of the Hamiltonian neural network, serves as the new feature vector to discriminate between the faults using off-the-shelf classification models. The experimental results are obtained using the MaFaulDa database, where the proposed model yields a promising area under the curve (AUC) of $0.78$ for the binary classification (normal vs abnormal) and $0.84$ for the multi-class problem (normal, and $5$ different abnormal regimes).","PeriodicalId":410036,"journal":{"name":"International Conference on Pattern Recognition Applications and Methods","volume":"87 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-01-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132710041","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Evaluation of Induced Expert Knowledge in Causal Structure Learning by NOTEARS NOTEARS对因果结构学习中诱导专家知识的评价
Pub Date : 2023-01-04 DOI: 10.48550/arXiv.2301.01817
Jawad Chowdhury, Rezaur Rashid, G. Terejanu
Causal modeling provides us with powerful counterfactual reasoning and interventional mechanism to generate predictions and reason under various what-if scenarios. However, causal discovery using observation data remains a nontrivial task due to unobserved confounding factors, finite sampling, and changes in the data distribution. These can lead to spurious cause-effect relationships. To mitigate these challenges in practice, researchers augment causal learning with known causal relations. The goal of the paper is to study the impact of expert knowledge on causal relations in the form of additional constraints used in the formulation of the nonparametric NOTEARS. We provide a comprehensive set of comparative analyses of biasing the model using different types of knowledge. We found that (i) knowledge that corrects the mistakes of the NOTEARS model can lead to statistically significant improvements, (ii) constraints on active edges have a larger positive impact on causal discovery than inactive edges, and surprisingly, (iii) the induced knowledge does not correct on average more incorrect active and/or inactive edges than expected. We also demonstrate the behavior of the model and the effectiveness of domain knowledge on a real-world dataset.
因果模型为我们提供了强大的反事实推理和干预机制,在各种假设情景下产生预测和推理。然而,由于未观察到的混杂因素、有限的抽样和数据分布的变化,使用观测数据进行因果发现仍然是一项艰巨的任务。这些可能导致虚假的因果关系。为了在实践中减轻这些挑战,研究人员用已知的因果关系来增强因果学习。本文的目的是研究专家知识对非参数NOTEARS公式中附加约束形式的因果关系的影响。我们提供了一套全面的比较分析,使用不同类型的知识来偏置模型。我们发现(i)纠正NOTEARS模型错误的知识可以导致统计上显着的改进,(ii)对活动边的约束比非活动边对因果发现有更大的积极影响,令人惊讶的是,(iii)诱导知识平均没有纠正比预期更多的不正确的活动和/或非活动边。我们还展示了模型的行为和领域知识在真实数据集上的有效性。
{"title":"Evaluation of Induced Expert Knowledge in Causal Structure Learning by NOTEARS","authors":"Jawad Chowdhury, Rezaur Rashid, G. Terejanu","doi":"10.48550/arXiv.2301.01817","DOIUrl":"https://doi.org/10.48550/arXiv.2301.01817","url":null,"abstract":"Causal modeling provides us with powerful counterfactual reasoning and interventional mechanism to generate predictions and reason under various what-if scenarios. However, causal discovery using observation data remains a nontrivial task due to unobserved confounding factors, finite sampling, and changes in the data distribution. These can lead to spurious cause-effect relationships. To mitigate these challenges in practice, researchers augment causal learning with known causal relations. The goal of the paper is to study the impact of expert knowledge on causal relations in the form of additional constraints used in the formulation of the nonparametric NOTEARS. We provide a comprehensive set of comparative analyses of biasing the model using different types of knowledge. We found that (i) knowledge that corrects the mistakes of the NOTEARS model can lead to statistically significant improvements, (ii) constraints on active edges have a larger positive impact on causal discovery than inactive edges, and surprisingly, (iii) the induced knowledge does not correct on average more incorrect active and/or inactive edges than expected. We also demonstrate the behavior of the model and the effectiveness of domain knowledge on a real-world dataset.","PeriodicalId":410036,"journal":{"name":"International Conference on Pattern Recognition Applications and Methods","volume":"322 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-01-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133906041","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
期刊
International Conference on Pattern Recognition Applications and Methods
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1