首页 > 最新文献

2021 IEEE 19th World Symposium on Applied Machine Intelligence and Informatics (SAMI)最新文献

英文 中文
Efficient Neural Architecture Search for Long Short-Term Memory Networks 长短期记忆网络的高效神经结构搜索
Pub Date : 2021-01-21 DOI: 10.1109/SAMI50585.2021.9378612
Hamdi Abed, Bálint Gyires-Tóth
Automated machine learning (AutoML) is a technique which helps to determine the optimal or near-optimal model for a specific dataset and has been a focused research area during the last years. The automation of model design opens doors for non-machine learning experts to utilize machine learning models in several scenarios, which is both appealing for a wide range of researchers and for cloud services as well. Neural Architecture Search is a subfield of AutoML where the optimal artificial neural network model's architecture is generally searched with adaptive algorithms. This paper proposes a method to apply Efficient Neural Architecture Search (ENAS) to LSTM-like recurrent architecture, which uses a gating mechanism an inner memory. Using this method, the paper investigates if the handcrafted Long Short-Term Memory (LSTM) cell is an optimal or near-optimal solution of sequence modelling for a given dataset, or other, automatically defined recurrent structures outperform. The performance of vanilla LSTM, and advanced recurrent architectures designed by random search, and reinforcement learning-based ENAS are examined and compared. The proposed methods are evaluated in a text generation task on the Penn TreeBank dataset.
自动化机器学习(AutoML)是一种帮助确定特定数据集的最优或接近最优模型的技术,在过去几年中一直是一个重点研究领域。模型设计的自动化为非机器学习专家在多种场景中使用机器学习模型打开了大门,这既吸引了广泛的研究人员,也吸引了云服务。神经结构搜索是AutoML的一个子领域,一般使用自适应算法搜索最优人工神经网络模型的结构。本文提出了一种将高效神经结构搜索(ENAS)应用于类lstm循环结构的方法,该方法使用了一种内部存储器的门控机制。使用这种方法,论文研究了手工制作的长短期记忆(LSTM)单元是否是给定数据集序列建模的最优或接近最优解决方案,或者其他自动定义的循环结构优于此。对普通LSTM、随机搜索设计的高级循环架构和基于强化学习的ENAS的性能进行了检验和比较。在Penn TreeBank数据集的文本生成任务中对所提出的方法进行了评估。
{"title":"Efficient Neural Architecture Search for Long Short-Term Memory Networks","authors":"Hamdi Abed, Bálint Gyires-Tóth","doi":"10.1109/SAMI50585.2021.9378612","DOIUrl":"https://doi.org/10.1109/SAMI50585.2021.9378612","url":null,"abstract":"Automated machine learning (AutoML) is a technique which helps to determine the optimal or near-optimal model for a specific dataset and has been a focused research area during the last years. The automation of model design opens doors for non-machine learning experts to utilize machine learning models in several scenarios, which is both appealing for a wide range of researchers and for cloud services as well. Neural Architecture Search is a subfield of AutoML where the optimal artificial neural network model's architecture is generally searched with adaptive algorithms. This paper proposes a method to apply Efficient Neural Architecture Search (ENAS) to LSTM-like recurrent architecture, which uses a gating mechanism an inner memory. Using this method, the paper investigates if the handcrafted Long Short-Term Memory (LSTM) cell is an optimal or near-optimal solution of sequence modelling for a given dataset, or other, automatically defined recurrent structures outperform. The performance of vanilla LSTM, and advanced recurrent architectures designed by random search, and reinforcement learning-based ENAS are examined and compared. The proposed methods are evaluated in a text generation task on the Penn TreeBank dataset.","PeriodicalId":402414,"journal":{"name":"2021 IEEE 19th World Symposium on Applied Machine Intelligence and Informatics (SAMI)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133783646","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Modelling and implementation of Single Line Diagram data in IEC 61850 environment IEC 61850环境下单线图数据的建模和实现
Pub Date : 2021-01-21 DOI: 10.1109/SAMI50585.2021.9378622
D. Poștovei, C. Bulac, I. Tristiu, Balduino Estison Mugilila Camachi, N. Anton
The aim of this paper is to improve the automation processes of electrical power substations based on IEC 61850 standard, with the focus on data management related to data structure, path, and communication model. When modelling substations items a simple and clear approach is necessary to effectively match binary or analog data from the switchyard with the IEC 61850 data object model. The study is made by simply transforming data from a single line diagram and its functionalities into object model by explaining its hierarchy structure and how to use it to create an operational process interface. This possible through the three key components of the IEC 61850 standard: Client-Server based on TCP/IP MMS (Manufacturing Messaging Specification) which perform the monitoring and control functions, GOOSE (Generic Object-Oriented Substation Event) protocol used in applications between IEDs (Intelligent Electronic Device) like interlocking signals and trip messages and Sampled Values (SV) protocol used for fast transmission of analogue values over the network.
本文旨在改进基于IEC 61850标准的变电站自动化过程,重点研究与数据结构、路径和通信模型相关的数据管理。在对变电站项目进行建模时,需要一种简单明了的方法来有效地将开关站的二进制或模拟数据与IEC 61850数据对象模型相匹配。通过解释其层次结构以及如何使用它来创建操作过程接口,简单地将数据从单线图及其功能转换为对象模型。这可以通过IEC 61850标准的三个关键组件实现:基于TCP/IP MMS(制造消息传递规范)的客户端-服务器执行监视和控制功能,GOOSE(通用面向对象变电站事件)协议用于ied(智能电子设备)之间的应用,如联锁信号和trip消息以及用于在网络上快速传输模拟值的采样值(SV)协议。
{"title":"Modelling and implementation of Single Line Diagram data in IEC 61850 environment","authors":"D. Poștovei, C. Bulac, I. Tristiu, Balduino Estison Mugilila Camachi, N. Anton","doi":"10.1109/SAMI50585.2021.9378622","DOIUrl":"https://doi.org/10.1109/SAMI50585.2021.9378622","url":null,"abstract":"The aim of this paper is to improve the automation processes of electrical power substations based on IEC 61850 standard, with the focus on data management related to data structure, path, and communication model. When modelling substations items a simple and clear approach is necessary to effectively match binary or analog data from the switchyard with the IEC 61850 data object model. The study is made by simply transforming data from a single line diagram and its functionalities into object model by explaining its hierarchy structure and how to use it to create an operational process interface. This possible through the three key components of the IEC 61850 standard: Client-Server based on TCP/IP MMS (Manufacturing Messaging Specification) which perform the monitoring and control functions, GOOSE (Generic Object-Oriented Substation Event) protocol used in applications between IEDs (Intelligent Electronic Device) like interlocking signals and trip messages and Sampled Values (SV) protocol used for fast transmission of analogue values over the network.","PeriodicalId":402414,"journal":{"name":"2021 IEEE 19th World Symposium on Applied Machine Intelligence and Informatics (SAMI)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131161148","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A Framework for Lecture Video Segmentation from Extracted Speech Content 基于演讲内容提取的演讲视频分割框架
Pub Date : 2021-01-21 DOI: 10.1109/SAMI50585.2021.9378632
Dipesh Chand, H. Oğul
Increasing demand for lecture videos in digital libraries has raised the challenge of automatic annotation of lecture content for effective navigation of lectures by users. One direction is the prior segmentation of lecture videos to simplify several applications such as indexing, keyword spotting, and targeted search. In this study, we present a lecture video segmentation framework based on the speech content of the instructors. The framework is built upon a model that extracts textual and acoustic features from speech and uses them to identify topical segment boundaries of the lecture video. To evaluate our proposed model, we collected our own dataset containing a diverse set of 37 lecture videos and also manually created ground truth. The performance was measured by using metrics like Precision, Recall, and F1 Score and obtained 0.69, 0.58, and 0.63 respectively. We also compared our model with some previously known similar models where our model outperformed others. The overall results of the study are presented as a lecture video segmentation model, integrating various tools and techniques, and showing promising performance. Findings can be used further for research in content-based search and retrieval using speech content.
数字图书馆对讲座视频的需求日益增长,对讲座内容的自动标注提出了挑战,使用户能够有效地浏览讲座。一个方向是讲座视频的预先分割,以简化索引、关键字定位和目标搜索等几个应用。在本研究中,我们提出了一个基于讲师演讲内容的讲座视频分割框架。该框架建立在一个模型之上,该模型从语音中提取文本和声学特征,并使用它们来识别讲座视频的主题片段边界。为了评估我们提出的模型,我们收集了自己的数据集,其中包含37个不同的讲座视频,并手动创建了地面真相。使用Precision、Recall和F1 Score等指标来衡量性能,分别获得0.69、0.58和0.63。我们还将我们的模型与一些已知的类似模型进行了比较,其中我们的模型优于其他模型。该研究的总体结果以讲座视频分割模型的形式呈现,该模型集成了各种工具和技术,并显示出良好的性能。研究结果可以进一步用于基于内容的语音内容搜索和检索的研究。
{"title":"A Framework for Lecture Video Segmentation from Extracted Speech Content","authors":"Dipesh Chand, H. Oğul","doi":"10.1109/SAMI50585.2021.9378632","DOIUrl":"https://doi.org/10.1109/SAMI50585.2021.9378632","url":null,"abstract":"Increasing demand for lecture videos in digital libraries has raised the challenge of automatic annotation of lecture content for effective navigation of lectures by users. One direction is the prior segmentation of lecture videos to simplify several applications such as indexing, keyword spotting, and targeted search. In this study, we present a lecture video segmentation framework based on the speech content of the instructors. The framework is built upon a model that extracts textual and acoustic features from speech and uses them to identify topical segment boundaries of the lecture video. To evaluate our proposed model, we collected our own dataset containing a diverse set of 37 lecture videos and also manually created ground truth. The performance was measured by using metrics like Precision, Recall, and F1 Score and obtained 0.69, 0.58, and 0.63 respectively. We also compared our model with some previously known similar models where our model outperformed others. The overall results of the study are presented as a lecture video segmentation model, integrating various tools and techniques, and showing promising performance. Findings can be used further for research in content-based search and retrieval using speech content.","PeriodicalId":402414,"journal":{"name":"2021 IEEE 19th World Symposium on Applied Machine Intelligence and Informatics (SAMI)","volume":"43 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134640134","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Slip Control by Identifying the Magnetic Field of the Elements of an Asynchronous Motor 识别异步电动机元件磁场的转差控制
Pub Date : 2021-01-21 DOI: 10.1109/SAMI50585.2021.9378658
J. Novotňák, M. Oravec, Jan Hijj, Daniel Jurč
The article deals with the issue of measuring the slip of an asynchronous motor based on the identification of the magnetic field of rotating elements and the rotating field of the stator of an asynchronous motor. By measuring the magnetic field of the rotating elements of the asynchronous motor and the rotating field of the stator, it is possible to identify the corresponding frequencies. Based on this, it is possible to control the slip of the asynchronous motor. This is especially important from the point of view of speed control, reduction of heat losses and from the point of view of increasing the service life of the elements of the frequency converter. Due to the above facts, the measurement was performed on an asynchronous motor with a frequency converter. The measurement results are given in the form of the frequency spectrum of the asynchronous motor and in the form of the value corresponding to the slip of the asynchronous motor. The article also describes the design of a system for controlling the slip of an asynchronous motor.
本文在对异步电动机旋转元件磁场和定子旋转磁场进行识别的基础上,研究了异步电动机转差的测量问题。通过测量异步电动机旋转元件的磁场和定子的旋转磁场,就可以识别出相应的频率。在此基础上,实现对异步电动机转差的控制。从速度控制、减少热损失和增加变频器元件使用寿命的角度来看,这一点尤为重要。由于上述事实,测量是在异步电机与变频器上进行的。测量结果以异步电动机的频谱和异步电动机转差对应的值的形式给出。本文还介绍了异步电动机转差控制系统的设计。
{"title":"Slip Control by Identifying the Magnetic Field of the Elements of an Asynchronous Motor","authors":"J. Novotňák, M. Oravec, Jan Hijj, Daniel Jurč","doi":"10.1109/SAMI50585.2021.9378658","DOIUrl":"https://doi.org/10.1109/SAMI50585.2021.9378658","url":null,"abstract":"The article deals with the issue of measuring the slip of an asynchronous motor based on the identification of the magnetic field of rotating elements and the rotating field of the stator of an asynchronous motor. By measuring the magnetic field of the rotating elements of the asynchronous motor and the rotating field of the stator, it is possible to identify the corresponding frequencies. Based on this, it is possible to control the slip of the asynchronous motor. This is especially important from the point of view of speed control, reduction of heat losses and from the point of view of increasing the service life of the elements of the frequency converter. Due to the above facts, the measurement was performed on an asynchronous motor with a frequency converter. The measurement results are given in the form of the frequency spectrum of the asynchronous motor and in the form of the value corresponding to the slip of the asynchronous motor. The article also describes the design of a system for controlling the slip of an asynchronous motor.","PeriodicalId":402414,"journal":{"name":"2021 IEEE 19th World Symposium on Applied Machine Intelligence and Informatics (SAMI)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114081499","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Kinematics of Delta-type Parallel Robot Mechanisms via Screw Theory: A tutorial paper 基于螺旋理论的三角型并联机器人机构运动学:一篇指导论文
Pub Date : 2021-01-21 DOI: 10.1109/SAMI50585.2021.9378688
László Szűcs, P. Galambos, D. Drexler
In the past decade, parallel manipulators began gaining more attention since they can outperform their serial counterparts at several areas. The use of common parallel delta robot mechanisms is wide-spread within the industry, especially for fast pick and place applications. There is a new type of parallel mechanism on the rise, called the Generalized Triangle Parallel Robot (GTPR), where the parameters of the robot may differ from arm to arm. Due to the asymmetric structure, the kinematic description of such a mechanism is less trivial. For this reason, we wish to show that many mechanical problems become more straightforward by using screw theory througout the formalism. Screw theory uses the Plücker coordinate representation of mechanical structures. This representation leads to an elegant and tractable form of the kinematics equations. This paper presents a compact tutorial about screw theory and a joint velocity calculation example, with the complete derivation of screws and numerical results.
在过去的十年中,并行机器人开始获得更多的关注,因为它们在一些领域的表现优于串行机器人。常见的并联delta机器人机构的使用在行业内广泛传播,特别是对于快速取放应用。有一种新型并联机构正在兴起,称为广义三角形并联机器人(GTPR),该机器人的各个臂的参数可能不同。由于其结构不对称,该机构的运动学描述就不那么简单了。由于这个原因,我们希望表明,通过在整个形式主义中使用螺杆论,许多机械问题变得更加直接。螺旋理论使用plpl克尔坐标表示机械结构。这种表示导致了一种优雅和易于处理的运动学方程形式。本文给出了一个紧凑的螺旋理论教程和一个关节速度计算实例,给出了螺旋的完整推导和数值结果。
{"title":"Kinematics of Delta-type Parallel Robot Mechanisms via Screw Theory: A tutorial paper","authors":"László Szűcs, P. Galambos, D. Drexler","doi":"10.1109/SAMI50585.2021.9378688","DOIUrl":"https://doi.org/10.1109/SAMI50585.2021.9378688","url":null,"abstract":"In the past decade, parallel manipulators began gaining more attention since they can outperform their serial counterparts at several areas. The use of common parallel delta robot mechanisms is wide-spread within the industry, especially for fast pick and place applications. There is a new type of parallel mechanism on the rise, called the Generalized Triangle Parallel Robot (GTPR), where the parameters of the robot may differ from arm to arm. Due to the asymmetric structure, the kinematic description of such a mechanism is less trivial. For this reason, we wish to show that many mechanical problems become more straightforward by using screw theory througout the formalism. Screw theory uses the Plücker coordinate representation of mechanical structures. This representation leads to an elegant and tractable form of the kinematics equations. This paper presents a compact tutorial about screw theory and a joint velocity calculation example, with the complete derivation of screws and numerical results.","PeriodicalId":402414,"journal":{"name":"2021 IEEE 19th World Symposium on Applied Machine Intelligence and Informatics (SAMI)","volume":"102 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128094031","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Gender domain adaptation for automatic speech recognition 自动语音识别的性别域自适应
Pub Date : 2021-01-21 DOI: 10.1109/SAMI50585.2021.9378626
Artem Sokolov, A. Savchenko
This paper is focused on the finetuning of acoustic models for speaker adaptation goals on a given gender. We pretrained the Transformer baseline model on Librispeech-960 and conducted experiments with finetuning on the gender-specific test subsets. The obtained word error rate (WER) relatively to the baseline is up to 5% and 3% lower on male and female subsets, respectively, if the layers in the encoder and decoder are not frozen, and the tuning is started from the last checkpoints. Moreover, we adapted our base model on the complete L2 Arctic dataset of accented speech and finetuned it for particular speakers and male and female genders separately. The models trained on the gender subsets obtained 1–2% lower WER when compared to the model tuned on the whole L2 Arctic dataset. Finally, it was experimentally confirmed that the concatenation of the pretrained voice embeddings (x-vector) and embeddings from a conventional encoder cannot significantly improve the speech recognition accuracy.
本文主要研究针对特定性别的说话人自适应目标的声学模型的微调。我们在librisspeech -960上对Transformer基线模型进行了预训练,并对特定性别的测试子集进行了微调实验。如果编码器和解码器中的层没有冻结,并且从最后一个检查点开始调优,则相对于基线获得的单词错误率(WER)在男性和女性子集上分别降低5%和3%。此外,我们在完整的L2北极重音语音数据集上调整了我们的基本模型,并分别针对特定的说话者和男性和女性性别进行了微调。与在整个L2北极数据集上调整的模型相比,在性别子集上训练的模型获得的WER降低了1-2%。最后,通过实验证明,将预训练好的语音嵌入(x向量)与传统编码器的嵌入进行拼接并不能显著提高语音识别的准确率。
{"title":"Gender domain adaptation for automatic speech recognition","authors":"Artem Sokolov, A. Savchenko","doi":"10.1109/SAMI50585.2021.9378626","DOIUrl":"https://doi.org/10.1109/SAMI50585.2021.9378626","url":null,"abstract":"This paper is focused on the finetuning of acoustic models for speaker adaptation goals on a given gender. We pretrained the Transformer baseline model on Librispeech-960 and conducted experiments with finetuning on the gender-specific test subsets. The obtained word error rate (WER) relatively to the baseline is up to 5% and 3% lower on male and female subsets, respectively, if the layers in the encoder and decoder are not frozen, and the tuning is started from the last checkpoints. Moreover, we adapted our base model on the complete L2 Arctic dataset of accented speech and finetuned it for particular speakers and male and female genders separately. The models trained on the gender subsets obtained 1–2% lower WER when compared to the model tuned on the whole L2 Arctic dataset. Finally, it was experimentally confirmed that the concatenation of the pretrained voice embeddings (x-vector) and embeddings from a conventional encoder cannot significantly improve the speech recognition accuracy.","PeriodicalId":402414,"journal":{"name":"2021 IEEE 19th World Symposium on Applied Machine Intelligence and Informatics (SAMI)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123469030","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Improved Word Representations Via Summed Target and Context Embeddings 基于目标和上下文嵌入的改进词表示
Pub Date : 2021-01-21 DOI: 10.1109/SAMI50585.2021.9378672
Nancy Fulda, Nathaniel R. Robinson
Neural embedding models are often described as having an ‘embedding layer’, or a set of network activations that can be extracted from the model in order to obtain word or sentence representations. In this paper, we show via a modification of the well-known word2vec algorithm that relevant semantic information is contained throughout the entirety of the network, not just in the commonly-extracted hidden layer. This extra information can be extracted by summing embeddings from both the input and output weight matrices of a skip-gram model. Word embeddings generated via this method exhibit strong semantic structure, and are able to outperform traditionally extracted word2vec embeddings in a number of analogy tasks.
神经嵌入模型通常被描述为具有一个“嵌入层”,或者一组可以从模型中提取的网络激活,以获得单词或句子的表示。在本文中,我们通过对著名的word2vec算法的修改表明,相关的语义信息包含在整个网络中,而不仅仅是在通常提取的隐藏层中。这些额外的信息可以通过对skip-gram模型的输入和输出权重矩阵的嵌入求和来提取。通过这种方法生成的词嵌入显示出强大的语义结构,并且能够在许多类比任务中优于传统提取的word2vec嵌入。
{"title":"Improved Word Representations Via Summed Target and Context Embeddings","authors":"Nancy Fulda, Nathaniel R. Robinson","doi":"10.1109/SAMI50585.2021.9378672","DOIUrl":"https://doi.org/10.1109/SAMI50585.2021.9378672","url":null,"abstract":"Neural embedding models are often described as having an ‘embedding layer’, or a set of network activations that can be extracted from the model in order to obtain word or sentence representations. In this paper, we show via a modification of the well-known word2vec algorithm that relevant semantic information is contained throughout the entirety of the network, not just in the commonly-extracted hidden layer. This extra information can be extracted by summing embeddings from both the input and output weight matrices of a skip-gram model. Word embeddings generated via this method exhibit strong semantic structure, and are able to outperform traditionally extracted word2vec embeddings in a number of analogy tasks.","PeriodicalId":402414,"journal":{"name":"2021 IEEE 19th World Symposium on Applied Machine Intelligence and Informatics (SAMI)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124717258","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Key-Value Pair Searhing System via Tesseract OCR and Post Processing 基于Tesseract OCR和后处理的键值对搜索系统
Pub Date : 2021-01-21 DOI: 10.1109/SAMI50585.2021.9378680
Áron Zoltán Kaló, M. Sipos
Optical character recognition systems make it possible to extract text from images. In many cases, this may be sufficient, but there are cases where key-value pairs are required. In this paper, we investigate the use of the open source Tesseract OCR system, to extract text data from images, and perform a key-value pair search. Image noise needs to be minimized with image processing algorithms before recognition. It is necessary to perform so-called post processing procedures on the output of the Tesseract. These post-processors can transform the result of the recognition performed by the OCR system. Those can improve the accuracy of the information extracted during the transformation, for example with the help of regular expressions. The key value pair search is performed after these procedures.
光学字符识别系统使从图像中提取文本成为可能。在许多情况下,这可能就足够了,但是在某些情况下需要键值对。在本文中,我们研究了使用开源的Tesseract OCR系统,从图像中提取文本数据,并执行键值对搜索。在识别之前,需要使用图像处理算法将图像噪声降至最低。有必要对Tesseract的输出执行所谓的后处理程序。这些后置处理器可以对OCR系统执行的识别结果进行变换。它们可以提高在转换过程中提取信息的准确性,例如在正则表达式的帮助下。在这些过程之后执行键值对搜索。
{"title":"Key-Value Pair Searhing System via Tesseract OCR and Post Processing","authors":"Áron Zoltán Kaló, M. Sipos","doi":"10.1109/SAMI50585.2021.9378680","DOIUrl":"https://doi.org/10.1109/SAMI50585.2021.9378680","url":null,"abstract":"Optical character recognition systems make it possible to extract text from images. In many cases, this may be sufficient, but there are cases where key-value pairs are required. In this paper, we investigate the use of the open source Tesseract OCR system, to extract text data from images, and perform a key-value pair search. Image noise needs to be minimized with image processing algorithms before recognition. It is necessary to perform so-called post processing procedures on the output of the Tesseract. These post-processors can transform the result of the recognition performed by the OCR system. Those can improve the accuracy of the information extracted during the transformation, for example with the help of regular expressions. The key value pair search is performed after these procedures.","PeriodicalId":402414,"journal":{"name":"2021 IEEE 19th World Symposium on Applied Machine Intelligence and Informatics (SAMI)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130088412","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Towards Granular Knowledge Structures: Comparison of Different Approaches 走向粒状知识结构:不同方法的比较
Pub Date : 2021-01-21 DOI: 10.1109/SAMI50585.2021.9378677
Florian Stalder, Alexander Denzler, L. Mazzola
Nowadays, it is becoming essential to extract knowledge from diverse, large scale data-sources. An effective approach to make knowledge accessible and providing the necessary means for efficient reasoning to take place is through the use of knowledge graphs. The process of building knowledge graphs is usually focused on generating meaningful representations. Hence, applying structure to it, which takes into account the existence of different knowledge domains, their depth and breadth is mostly disregarded. This particular shortcoming leads to a loss of valuable information that could else be harnessed to provide various additional functionalities to an application. In other words, enhancing knowledge graphs in such a way that they can be explored similar to how Google Maps presents the world to us. By zooming in and out, different relevant aspects become visible while unnecessary noise is blended out. Granular computing by itself is more of a theorem that highlights potential benefits from the application of fuzzy and hierarchical structures. Little is said on how a potential granular knowledge graph can be built and which existing clustering algorithms can be used for this task. As such, this paper aims to provide (1) an in-depth view of which critical requirements need to be met by an algorithm to establish a granular structure, (2) the process for how different commonly used algorithms are coping with them, as well as (3) an overview that outlines the different steps in the process of establishing a granular knowledge structure. Two approaches are identified as the most promising ones: for low dimensional data, a Growing Hierarchical Self-Organizing Map (with its adaptive behaviour) and, in case of data with high dimensionality, one approach from the projective clustering family, thanks to their capability of finding strong correlation in sub-spaces of the original dimensions.
如今,从各种各样的大规模数据源中提取知识变得越来越重要。使用知识图是一种使知识易于获取并为有效推理提供必要手段的有效方法。构建知识图谱的过程通常侧重于生成有意义的表示。因此,考虑到不同知识领域的存在,对其应用结构,往往忽略了它们的深度和广度。这个特殊的缺点会导致宝贵信息的丢失,而这些信息本来可以用来为应用程序提供各种附加功能。换句话说,以一种类似于谷歌地图向我们展示世界的方式来增强知识图谱。通过放大和缩小,不同的相关方面变得可见,而不必要的噪音被混掉。颗粒计算本身更像是一个定理,强调了模糊和分层结构应用的潜在好处。关于如何构建一个潜在的颗粒知识图,以及哪些现有的聚类算法可以用于这项任务,很少有人说。因此,本文旨在提供(1)深入了解建立颗粒结构的算法需要满足哪些关键需求,(2)不同常用算法如何处理这些关键需求的过程,以及(3)概述建立颗粒知识结构过程中的不同步骤。两种方法被认为是最有前途的:对于低维数据,一个不断增长的层次自组织映射(具有自适应行为),对于高维数据,一个来自投影聚类族的方法,这要归功于它们在原始维度的子空间中发现强相关性的能力。
{"title":"Towards Granular Knowledge Structures: Comparison of Different Approaches","authors":"Florian Stalder, Alexander Denzler, L. Mazzola","doi":"10.1109/SAMI50585.2021.9378677","DOIUrl":"https://doi.org/10.1109/SAMI50585.2021.9378677","url":null,"abstract":"Nowadays, it is becoming essential to extract knowledge from diverse, large scale data-sources. An effective approach to make knowledge accessible and providing the necessary means for efficient reasoning to take place is through the use of knowledge graphs. The process of building knowledge graphs is usually focused on generating meaningful representations. Hence, applying structure to it, which takes into account the existence of different knowledge domains, their depth and breadth is mostly disregarded. This particular shortcoming leads to a loss of valuable information that could else be harnessed to provide various additional functionalities to an application. In other words, enhancing knowledge graphs in such a way that they can be explored similar to how Google Maps presents the world to us. By zooming in and out, different relevant aspects become visible while unnecessary noise is blended out. Granular computing by itself is more of a theorem that highlights potential benefits from the application of fuzzy and hierarchical structures. Little is said on how a potential granular knowledge graph can be built and which existing clustering algorithms can be used for this task. As such, this paper aims to provide (1) an in-depth view of which critical requirements need to be met by an algorithm to establish a granular structure, (2) the process for how different commonly used algorithms are coping with them, as well as (3) an overview that outlines the different steps in the process of establishing a granular knowledge structure. Two approaches are identified as the most promising ones: for low dimensional data, a Growing Hierarchical Self-Organizing Map (with its adaptive behaviour) and, in case of data with high dimensionality, one approach from the projective clustering family, thanks to their capability of finding strong correlation in sub-spaces of the original dimensions.","PeriodicalId":402414,"journal":{"name":"2021 IEEE 19th World Symposium on Applied Machine Intelligence and Informatics (SAMI)","volume":"258 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127102100","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Diversity in Ensemble Model for Classification of Data Streams with Concept Drift 概念漂移数据流分类集成模型的多样性
Pub Date : 2021-01-21 DOI: 10.1109/SAMI50585.2021.9378625
Michal Kolárik, M. Sarnovský, Ján Paralič
Data streams can be defined as the continuous stream of data in many forms coming from different sources. Data streams are usually non-stationary with continually changing their underlying structure. Solving of predictive or classification tasks on such data must consider this aspect. Traditional machine learning models applied on the drifting data may become invalid in the case when a concept change appears. To tackle this problem, we must utilize special adaptive learning models, which utilize various tools able to reflect the drifting data. One of the most popular groups of such methods are adaptive ensembles. This paper describes the work focused on the design and implementation of a novel adaptive ensemble learning model, which is based on the construction of a robust ensemble consisting of a heterogeneous set of its members. We used k-NN, Naive Bayes and Hoeffding trees as base learners and implemented an update mechanism, which considers dynamic class-weighting and Q statistics diversity calculation to ensure the diversity of the ensemble. The model was experimentally evaluated on the streaming datasets, and the effects of the diversity calculation were analyzed.
数据流可以定义为来自不同来源的多种形式的连续数据流。数据流通常是非平稳的,其底层结构不断变化。解决基于此类数据的预测或分类任务必须考虑这一方面。在概念发生变化的情况下,应用于漂移数据的传统机器学习模型可能会失效。为了解决这个问题,我们必须利用特殊的自适应学习模型,该模型利用各种能够反映漂移数据的工具。这类方法中最流行的一组是自适应集成。本文描述了一种新的自适应集成学习模型的设计和实现,该模型基于由其成员的异构集合组成的鲁棒集成的构建。我们使用k-NN、朴素贝叶斯和Hoeffding树作为基础学习器,并实现了一种更新机制,该机制考虑了动态类加权和Q统计多样性计算,以确保集合的多样性。在流数据集上对该模型进行了实验验证,并分析了分集计算的影响。
{"title":"Diversity in Ensemble Model for Classification of Data Streams with Concept Drift","authors":"Michal Kolárik, M. Sarnovský, Ján Paralič","doi":"10.1109/SAMI50585.2021.9378625","DOIUrl":"https://doi.org/10.1109/SAMI50585.2021.9378625","url":null,"abstract":"Data streams can be defined as the continuous stream of data in many forms coming from different sources. Data streams are usually non-stationary with continually changing their underlying structure. Solving of predictive or classification tasks on such data must consider this aspect. Traditional machine learning models applied on the drifting data may become invalid in the case when a concept change appears. To tackle this problem, we must utilize special adaptive learning models, which utilize various tools able to reflect the drifting data. One of the most popular groups of such methods are adaptive ensembles. This paper describes the work focused on the design and implementation of a novel adaptive ensemble learning model, which is based on the construction of a robust ensemble consisting of a heterogeneous set of its members. We used k-NN, Naive Bayes and Hoeffding trees as base learners and implemented an update mechanism, which considers dynamic class-weighting and Q statistics diversity calculation to ensure the diversity of the ensemble. The model was experimentally evaluated on the streaming datasets, and the effects of the diversity calculation were analyzed.","PeriodicalId":402414,"journal":{"name":"2021 IEEE 19th World Symposium on Applied Machine Intelligence and Informatics (SAMI)","volume":"125 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130007240","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
期刊
2021 IEEE 19th World Symposium on Applied Machine Intelligence and Informatics (SAMI)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1