首页 > 最新文献

Information Technology and Control最新文献

英文 中文
BERT-based Transfer Learning Model for COVID-19 Sentiment Analysis on Turkish Instagram Comments 基于bert的土耳其语Instagram评论COVID-19情绪分析迁移学习模型
IF 1.1 4区 计算机科学 Q3 AUTOMATION & CONTROL SYSTEMS Pub Date : 2022-09-23 DOI: 10.5755/j01.itc.51.3.30276
Habibe Karayigit, A. Akdagli, C. Aci
First seen in Wuhan, China, the coronavirus disease (COVID-19) became a worldwide epidemic. Turkey’s first reported case was announced on March 11, 2020—the day the World Health Organization declared COVID-19 is a pandemic. Due to the intense and widespread use of social media during the pandemic, determining the role and effect (i.e., positive, negative, neutral) of social media gives us important information about society's perspective on events. In our study, two datasets (i.e. Dataset1, Dataset2) consisting of Instagram comments on COVID-19 were composed between different dates of the pandemic, and the change between users' feelings and thoughts about the epidemic was analyzed. The datasets are the first publicly available Turkish datasets on the sentiment analysis of COVID-19, as far as we know. The sentiment analysis of Turkish Instagram comments was performed using Machine Learning models (i.e., Traditional Machine Learning, Deep Learning, and BERT-based Transfer Learning). In the experiments, the balanced versions of these datasets (i.e. resDataset1, resDataset2) were taken into account as well as the original ones. The BERT-based Transfer Learning model achieved the highest classification success with 0.7864 macro-averaged F1 score values in resDataset1 and 0.7120 in resDataset2. It has been proven that the use of a pre-trained language model in Turkish datasets is more successful than other models in terms of classification performance.
首先在中国武汉发现的冠状病毒病(COVID-19)已成为全球流行病。土耳其的第一例报告病例于2020年3月11日宣布,也就是世界卫生组织宣布COVID-19大流行的当天。由于大流行期间社交媒体的密集和广泛使用,确定社交媒体的作用和影响(即积极、消极、中立)为我们提供了有关社会对事件看法的重要信息。在我们的研究中,我们在疫情的不同日期之间组成了两个数据集(即Dataset1, Dataset2),这些数据集由Instagram上关于COVID-19的评论组成,并分析了用户对疫情的感受和想法的变化。据我们所知,这些数据集是土耳其首次公开提供的关于COVID-19情绪分析的数据集。使用机器学习模型(即传统机器学习,深度学习和基于bert的迁移学习)对土耳其Instagram评论进行情感分析。在实验中,考虑了这些数据集的平衡版本(即resDataset1, resDataset2)以及原始数据集。基于bert的迁移学习模型在resDataset1和resDataset2中分别以0.7864和0.7120的宏观平均F1分值取得了最高的分类成功率。已经证明,在土耳其数据集中使用预训练的语言模型在分类性能方面比其他模型更成功。
{"title":"BERT-based Transfer Learning Model for COVID-19 Sentiment Analysis on Turkish Instagram Comments","authors":"Habibe Karayigit, A. Akdagli, C. Aci","doi":"10.5755/j01.itc.51.3.30276","DOIUrl":"https://doi.org/10.5755/j01.itc.51.3.30276","url":null,"abstract":"First seen in Wuhan, China, the coronavirus disease (COVID-19) became a worldwide epidemic. Turkey’s first reported case was announced on March 11, 2020—the day the World Health Organization declared COVID-19 is a pandemic. Due to the intense and widespread use of social media during the pandemic, determining the role and effect (i.e., positive, negative, neutral) of social media gives us important information about society's perspective on events. In our study, two datasets (i.e. Dataset1, Dataset2) consisting of Instagram comments on COVID-19 were composed between different dates of the pandemic, and the change between users' feelings and thoughts about the epidemic was analyzed. The datasets are the first publicly available Turkish datasets on the sentiment analysis of COVID-19, as far as we know. The sentiment analysis of Turkish Instagram comments was performed using Machine Learning models (i.e., Traditional Machine Learning, Deep Learning, and BERT-based Transfer Learning). In the experiments, the balanced versions of these datasets (i.e. resDataset1, resDataset2) were taken into account as well as the original ones. The BERT-based Transfer Learning model achieved the highest classification success with 0.7864 macro-averaged F1 score values in resDataset1 and 0.7120 in resDataset2. It has been proven that the use of a pre-trained language model in Turkish datasets is more successful than other models in terms of classification performance.","PeriodicalId":54982,"journal":{"name":"Information Technology and Control","volume":"117 1","pages":"409-428"},"PeriodicalIF":1.1,"publicationDate":"2022-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80357852","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
A Comprehensive Study of Learning Approaches for Author Gender Identification 作者性别认同学习方法的综合研究
IF 1.1 4区 计算机科学 Q3 AUTOMATION & CONTROL SYSTEMS Pub Date : 2022-09-23 DOI: 10.5755/j01.itc.51.3.29907
Tuǧba Dalyan, H. Ayral, Özgür Özdemir
In recent years, author gender identification is an important yet challenging task in the fields of information retrieval and computational linguistics. In this paper, different learning approaches are presented to address the problem of author gender identification for Turkish articles. First, several classification algorithms are applied to the list of representations based on different paradigms: fixed-length vector representations such as Stylometric Features (SF), Bag-of-Words (BoW) and distributed word/document embeddings such as Word2vec, fastText and Doc2vec. Secondly, deep learning architectures, Convolution Neural Network (CNN), Recurrent Neural Network (RNN), special kinds of RNN such as Long-Short Term Memory (LSTM) and Gated Recurrent Unit (GRU), C-RNN, Bidirectional LSTM (bi-LSTM), Bidirectional GRU (bi-GRU), Hierarchical Attention Networks and Multi-head Attention (MHA) are designated and their comparable performances are evaluated. We conducted a variety of experiments and achieved outstanding empirical results. To conclude, ML algorithms with BoW have promising results. fast-Text is also probably suitable between embedding models. This comprehensive study contributes to literature utilizing different learning approaches based on several ways of representations. It is also first important attempt to identify author gender applying SF, fastText and DNN architectures to the Turkish language.
作者性别识别是近年来信息检索和计算语言学领域的一个重要而又具有挑战性的课题。在本文中,提出了不同的学习方法来解决作者性别认同的问题土耳其文章。首先,将几种分类算法应用于基于不同范式的表示列表:固定长度向量表示,如文体特征(SF)、词袋(BoW)和分布式词/文档嵌入,如Word2vec、fastText和Doc2vec。其次,指定了深度学习架构、卷积神经网络(CNN)、循环神经网络(RNN)、长短期记忆(LSTM)和门控循环单元(GRU)、C-RNN、双向LSTM (bi-LSTM)、双向GRU (bi-GRU)、分层注意网络和多级注意(MHA)等特殊类型的RNN,并对它们的性能进行了比较评价。我们进行了各种各样的实验,并取得了出色的实证结果。综上所述,带有BoW的ML算法有很好的效果。fast-Text也可能适用于嵌入模型之间。这项综合研究有助于文献利用基于几种表征方式的不同学习方法。这也是将SF、fastText和DNN架构应用于土耳其语来识别作者性别的第一次重要尝试。
{"title":"A Comprehensive Study of Learning Approaches for Author Gender Identification","authors":"Tuǧba Dalyan, H. Ayral, Özgür Özdemir","doi":"10.5755/j01.itc.51.3.29907","DOIUrl":"https://doi.org/10.5755/j01.itc.51.3.29907","url":null,"abstract":"In recent years, author gender identification is an important yet challenging task in the fields of information retrieval and computational linguistics. In this paper, different learning approaches are presented to address the problem of author gender identification for Turkish articles. First, several classification algorithms are applied to the list of representations based on different paradigms: fixed-length vector representations such as Stylometric Features (SF), Bag-of-Words (BoW) and distributed word/document embeddings such as Word2vec, fastText and Doc2vec. Secondly, deep learning architectures, Convolution Neural Network (CNN), Recurrent Neural Network (RNN), special kinds of RNN such as Long-Short Term Memory (LSTM) and Gated Recurrent Unit (GRU), C-RNN, Bidirectional LSTM (bi-LSTM), Bidirectional GRU (bi-GRU), Hierarchical Attention Networks and Multi-head Attention (MHA) are designated and their comparable performances are evaluated. We conducted a variety of experiments and achieved outstanding empirical results. To conclude, ML algorithms with BoW have promising results. fast-Text is also probably suitable between embedding models. This comprehensive study contributes to literature utilizing different learning approaches based on several ways of representations. It is also first important attempt to identify author gender applying SF, fastText and DNN architectures to the Turkish language.","PeriodicalId":54982,"journal":{"name":"Information Technology and Control","volume":"28 1","pages":"429-445"},"PeriodicalIF":1.1,"publicationDate":"2022-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74817409","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Human Detection Algorithm Based on Improved YOLO v4 基于改进YOLO v4的人体检测算法
IF 1.1 4区 计算机科学 Q3 AUTOMATION & CONTROL SYSTEMS Pub Date : 2022-09-23 DOI: 10.5755/j01.itc.51.3.30540
Xuan Zhou, Jianping Yi, Guokun Xie, Yajuan Jia, Genqi Xu, Min Sun
The human behavior datasets have the characteristics of complex background, diverse poses, partial occlusion, and diverse sizes. Firstly, this paper adopts YOLO v3 and YOLO v4 algorithms to detect human objects in videos, and qualitatively analyzes and compares detection performance of two algorithms on UTI, UCF101, HMDB51 and CASIA datasets. Then, this paper proposed an improved YOLO v4 algorithm since the vanilla YOLO v4 has incomplete human detection in specific video frames. Specifically, the improved YOLO v4 introduces the Ghost module in the CBM module to further reduce the number of parameters. Lateral connection is added in the CSP module to improve the feature representation capability of the network. Furthermore, we also substitute MaxPool with SoftPool in the primary SPP module, which not only avoids the feature loss, but also provides a regularization effect for the network, thus improving the generalization ability of the network. Finally, this paper qualitatively compares the detection effects of the improved YOLO v4 and primary YOLO v4 algorithm on specific datasets. The experimental results show that the improved YOLO v4 can solve the problem of complex targets in human detection tasks effectively, and further improve the detection speed.
人类行为数据集具有背景复杂、姿态多样、局部遮挡、大小不一等特点。首先,本文采用YOLO v3和YOLO v4算法对视频中的人体目标进行检测,并对两种算法在UTI、UCF101、HMDB51和CASIA数据集上的检测性能进行定性分析和比较。然后,针对普通的YOLO v4在特定视频帧中存在不完全的人类检测问题,提出了一种改进的YOLO v4算法。具体来说,改进的YOLO v4在CBM模块中引入了Ghost模块,以进一步减少参数的数量。在CSP模块中加入横向连接,提高网络的特征表示能力。此外,我们还将主SPP模块中的MaxPool替换为SoftPool,既避免了特征损失,又为网络提供了正则化效果,从而提高了网络的泛化能力。最后,对改进后的YOLO v4算法和主要YOLO v4算法在特定数据集上的检测效果进行了定性比较。实验结果表明,改进的YOLO v4能够有效解决人工检测任务中复杂目标的问题,进一步提高检测速度。
{"title":"Human Detection Algorithm Based on Improved YOLO v4","authors":"Xuan Zhou, Jianping Yi, Guokun Xie, Yajuan Jia, Genqi Xu, Min Sun","doi":"10.5755/j01.itc.51.3.30540","DOIUrl":"https://doi.org/10.5755/j01.itc.51.3.30540","url":null,"abstract":"The human behavior datasets have the characteristics of complex background, diverse poses, partial occlusion, and diverse sizes. Firstly, this paper adopts YOLO v3 and YOLO v4 algorithms to detect human objects in videos, and qualitatively analyzes and compares detection performance of two algorithms on UTI, UCF101, HMDB51 and CASIA datasets. Then, this paper proposed an improved YOLO v4 algorithm since the vanilla YOLO v4 has incomplete human detection in specific video frames. Specifically, the improved YOLO v4 introduces the Ghost module in the CBM module to further reduce the number of parameters. Lateral connection is added in the CSP module to improve the feature representation capability of the network. Furthermore, we also substitute MaxPool with SoftPool in the primary SPP module, which not only avoids the feature loss, but also provides a regularization effect for the network, thus improving the generalization ability of the network. Finally, this paper qualitatively compares the detection effects of the improved YOLO v4 and primary YOLO v4 algorithm on specific datasets. The experimental results show that the improved YOLO v4 can solve the problem of complex targets in human detection tasks effectively, and further improve the detection speed.","PeriodicalId":54982,"journal":{"name":"Information Technology and Control","volume":"27 1","pages":"485-498"},"PeriodicalIF":1.1,"publicationDate":"2022-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88400715","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Weber Global Statistics Tri- Directional Pattern (WGSTriDP): A Texture Feature Descriptor for Image Retrieval 韦伯全局统计三向模式:一种用于图像检索的纹理特征描述符
IF 1.1 4区 计算机科学 Q3 AUTOMATION & CONTROL SYSTEMS Pub Date : 2022-09-23 DOI: 10.5755/j01.itc.51.3.30795
Callins Christiyana Chelladurai, R. Vayanaperumal
The texture is a high-flying feature in an image and has been extracted to represent the image for image retrieval applications. Many texture features are being offered for image retrieval. This paper proposes a local binary pattern-based texture feature called Weber Global Statistics Tri-Directional Pattern (WGSTriDP) to retrieve the images. This pattern combines the advantages of differential excitation components in the Weber Local Binary Pattern (WLBP), sign and magnitude components in the Local Tri-Directional Pattern (LTriDP), and global statistics. Differential Excitation (DE) and Global Statistics TriDirectional Pattern (GSTriDP) are two components of WGSTriDP. The WGSTriDP gains the benefit of discrimination concerning human perception from differential excitation as well as incorporates global statistics into sign and magnitude components in the pattern derived from the local neighborhoods. The effectiveness of the pattern in image retrieval is experimented with in two benchmark databases, such as ORL (face database) and UIUC (texture database). According to the results of the experiments, WGSTriDP outperforms other local patterns in retrieving similar images from the database.
纹理是图像中的一个重要特征,它被提取出来用于图像检索应用。为图像检索提供了许多纹理特征。本文提出了一种基于局部二值模式的纹理特征Weber Global Statistics tridirectional Pattern (WGSTriDP)来检索图像。该模式结合了韦伯局部二元模式(WLBP)中的差分激励分量、局部三向模式(LTriDP)中的符号和幅度分量以及全局统计的优点。差分激励(DE)和全局统计三向模式(GSTriDP)是WGSTriDP的两个组成部分。WGSTriDP从差分激励中获得了对人类感知的区分,并将全局统计数据纳入了从局部邻域导出的模式中的符号和幅度分量。在ORL(人脸数据库)和UIUC(纹理数据库)两个基准数据库中对该模式在图像检索中的有效性进行了实验。实验结果表明,WGSTriDP在从数据库中检索相似图像方面优于其他局部模式。
{"title":"Weber Global Statistics Tri- Directional Pattern (WGSTriDP): A Texture Feature Descriptor for Image Retrieval","authors":"Callins Christiyana Chelladurai, R. Vayanaperumal","doi":"10.5755/j01.itc.51.3.30795","DOIUrl":"https://doi.org/10.5755/j01.itc.51.3.30795","url":null,"abstract":"The texture is a high-flying feature in an image and has been extracted to represent the image for image retrieval applications. Many texture features are being offered for image retrieval. This paper proposes a local binary pattern-based texture feature called Weber Global Statistics Tri-Directional Pattern (WGSTriDP) to retrieve the images. This pattern combines the advantages of differential excitation components in the Weber Local Binary Pattern (WLBP), sign and magnitude components in the Local Tri-Directional Pattern (LTriDP), and global statistics. Differential Excitation (DE) and Global Statistics TriDirectional Pattern (GSTriDP) are two components of WGSTriDP. The WGSTriDP gains the benefit of discrimination concerning human perception from differential excitation as well as incorporates global statistics into sign and magnitude components in the pattern derived from the local neighborhoods. The effectiveness of the pattern in image retrieval is experimented with in two benchmark databases, such as ORL (face database) and UIUC (texture database). According to the results of the experiments, WGSTriDP outperforms other local patterns in retrieving similar images from the database.","PeriodicalId":54982,"journal":{"name":"Information Technology and Control","volume":"38 1","pages":"515-530"},"PeriodicalIF":1.1,"publicationDate":"2022-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85339114","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Fuzzy Logic Path Planning Algorithm Based on Geometric Landmarks and Kinetic Constraints 基于几何地标和动力学约束的模糊路径规划算法
IF 1.1 4区 计算机科学 Q3 AUTOMATION & CONTROL SYSTEMS Pub Date : 2022-09-23 DOI: 10.5755/j01.itc.51.3.30016
Jinghua Wang, Ziyu Xu, Xiyu Zheng, Ziwei Liu
This paper mainly focuses on the path planning of mobile robots in complex two-dimensional terrain. It proposes a fuzzy rule-based path planning algorithm for multiple guide points by changing the spatial point-taking method and combining Dijkstra's algorithm and fuzzy logic algorithm. The planning process of this algorithm divide into three stages. The first stage identifies the edge points of the forbidden area by designing the search space, marks the feasible area widths of the edge points in X and Y directions, and marks their midpoints. The second stage uses Dijkstra's algorithm that does the road map sorting on these marked points and the starting and ending points and takes the lowest cost sequence as the search road map. In the third stage, using a fuzzy logic system to search these road signs one by one until the endpoint area is searched. The simulation results show that this algorithm can solve the complex environment that traditional fuzzy inference algorithms cannot plan. Compared with the graph search algorithm, this algorithm dramatically reduces the planning time and provides more flexible turning angles. This algorithm can better consider the robot's size and the relationship between speed and turning angles while estimating the motion state at each step compared with the sampling algorithm. This algorithm will extend to group path planning and dynamic environment planning in subsequent studies.
本文主要研究复杂二维地形下移动机器人的路径规划问题。通过改变空间取点方法,结合Dijkstra算法和模糊逻辑算法,提出了一种基于模糊规则的多导点路径规划算法。该算法的规划过程分为三个阶段。第一阶段通过设计搜索空间识别禁区边缘点,标记X、Y方向边缘点的可行区域宽度,并标记其中点。第二阶段使用Dijkstra算法,对这些标记点以及起点和终点进行路线图排序,并采用成本最低的序列作为搜索路线图。第三阶段,使用模糊逻辑系统逐个搜索路标,直到搜索到终点区域。仿真结果表明,该算法可以解决传统模糊推理算法无法规划的复杂环境。与图搜索算法相比,该算法大大缩短了规划时间,并提供了更灵活的转弯角度。与采样算法相比,该算法在估计每一步的运动状态时,能更好地考虑机器人的尺寸以及速度与转角的关系。该算法将在后续的研究中扩展到群体路径规划和动态环境规划中。
{"title":"A Fuzzy Logic Path Planning Algorithm Based on Geometric Landmarks and Kinetic Constraints","authors":"Jinghua Wang, Ziyu Xu, Xiyu Zheng, Ziwei Liu","doi":"10.5755/j01.itc.51.3.30016","DOIUrl":"https://doi.org/10.5755/j01.itc.51.3.30016","url":null,"abstract":"This paper mainly focuses on the path planning of mobile robots in complex two-dimensional terrain. It proposes a fuzzy rule-based path planning algorithm for multiple guide points by changing the spatial point-taking method and combining Dijkstra's algorithm and fuzzy logic algorithm. The planning process of this algorithm divide into three stages. The first stage identifies the edge points of the forbidden area by designing the search space, marks the feasible area widths of the edge points in X and Y directions, and marks their midpoints. The second stage uses Dijkstra's algorithm that does the road map sorting on these marked points and the starting and ending points and takes the lowest cost sequence as the search road map. In the third stage, using a fuzzy logic system to search these road signs one by one until the endpoint area is searched. The simulation results show that this algorithm can solve the complex environment that traditional fuzzy inference algorithms cannot plan. Compared with the graph search algorithm, this algorithm dramatically reduces the planning time and provides more flexible turning angles. This algorithm can better consider the robot's size and the relationship between speed and turning angles while estimating the motion state at each step compared with the sampling algorithm. This algorithm will extend to group path planning and dynamic environment planning in subsequent studies.","PeriodicalId":54982,"journal":{"name":"Information Technology and Control","volume":"7 1","pages":"499-514"},"PeriodicalIF":1.1,"publicationDate":"2022-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"75502313","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Deep Learning for Forgery Face Detection Using Fuzzy Fisher Capsule Dual Graph 基于模糊Fisher胶囊对偶图的深度学习人脸伪造检测
IF 1.1 4区 计算机科学 Q3 AUTOMATION & CONTROL SYSTEMS Pub Date : 2022-09-23 DOI: 10.5755/j01.itc.51.3.31510
P. M. Arunkumar, Y. Sangeetha, P. Raja, S. N. Sangeetha
In digital manipulation, creating fake images/videos or swapping face images/videos with another person is done by using a deep learning algorithm is termed deep fake. Fake pornography is a harmful one because of the inclusion of fake content in the hoaxes, fake news, and fraud things in the financial. The Deep Learning technique is an effective tool in the detection of deep fake images or videos. With the advancement of Generative adversarial networks (GAN) in the deep learning techniques, deep fake has become an essential one in the social media platform. This may threaten the public, therefore detection of deep fake images/videos is needed. For detecting the forged images/videos, many research works have been done and those methods are inefficient in the detection of new threats or newly created forgery images or videos, and also consumption time is high. Therefore, this paper focused on the detection of different types of fake images or videos using Fuzzy Fisher face with Capsule dual graph (FFF-CDG). The data set used in this work is FFHQ, 100K-Faces DFFD, VGG-Face2, and Wild Deep fake. The accuracy for FFHQ datasets, the existing and proposed systems obtained the accuracy of 81.5%, 89.32%, 91.35%, and 95.82% respectively.
在数字操作中,使用深度学习算法创建虚假图像/视频或与他人交换人脸图像/视频被称为深度伪造。虚假色情是一种有害的,因为在骗局、假新闻和金融欺诈中包含虚假内容。深度学习技术是检测深度虚假图像或视频的有效工具。随着生成对抗网络(GAN)在深度学习技术中的发展,深度虚假已经成为社交媒体平台中必不可少的一种。这可能会威胁到公众,因此需要检测深度假图像/视频。对于伪造图像/视频的检测,已有大量的研究工作,但这些方法在检测新的威胁或新生成的伪造图像或视频时效率低下,且耗时高。因此,本文主要研究了利用Fuzzy Fisher face with Capsule对偶图(FFF-CDG)对不同类型的假图像或假视频进行检测。本文使用的数据集为FFHQ、100K-Faces DFFD、VGG-Face2和Wild Deep fake。在FFHQ数据集上,现有系统和所提系统的准确率分别为81.5%、89.32%、91.35%和95.82%。
{"title":"Deep Learning for Forgery Face Detection Using Fuzzy Fisher Capsule Dual Graph","authors":"P. M. Arunkumar, Y. Sangeetha, P. Raja, S. N. Sangeetha","doi":"10.5755/j01.itc.51.3.31510","DOIUrl":"https://doi.org/10.5755/j01.itc.51.3.31510","url":null,"abstract":"In digital manipulation, creating fake images/videos or swapping face images/videos with another person is done by using a deep learning algorithm is termed deep fake. Fake pornography is a harmful one because of the inclusion of fake content in the hoaxes, fake news, and fraud things in the financial. The Deep Learning technique is an effective tool in the detection of deep fake images or videos. With the advancement of Generative adversarial networks (GAN) in the deep learning techniques, deep fake has become an essential one in the social media platform. This may threaten the public, therefore detection of deep fake images/videos is needed. For detecting the forged images/videos, many research works have been done and those methods are inefficient in the detection of new threats or newly created forgery images or videos, and also consumption time is high. Therefore, this paper focused on the detection of different types of fake images or videos using Fuzzy Fisher face with Capsule dual graph (FFF-CDG). The data set used in this work is FFHQ, 100K-Faces DFFD, VGG-Face2, and Wild Deep fake. The accuracy for FFHQ datasets, the existing and proposed systems obtained the accuracy of 81.5%, 89.32%, 91.35%, and 95.82% respectively.","PeriodicalId":54982,"journal":{"name":"Information Technology and Control","volume":"1 1","pages":"563-574"},"PeriodicalIF":1.1,"publicationDate":"2022-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"90734930","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
A Neighborhood Based Particle Swarm Optimization with Sine Co-sine Mutation Operator for Feature Selection 基于正弦余弦变异算子的邻域粒子群特征选择算法
IF 1.1 4区 计算机科学 Q3 AUTOMATION & CONTROL SYSTEMS Pub Date : 2022-09-23 DOI: 10.5755/j01.itc.51.3.31271
Chenye Qiu
Feature selection is a vital data pre-processing process in many practical applications. Feature selection aims to get rid of those unnecessary features and improve the performance of the classification model. In this paper, a neighborhood based particle swarm optimization with sine cosine mutation operator (NPSOSC) is proposed to select the most informative feature subset. The improvements are included to strengthen its search capacity and avoid local optima stagnation. A distance and fitness based neighborhood search strategy is developed to form stable neighborhood structures for the particles. Each particle adopts superior information from its neighborhoods and the entire swarm can search different regions of the entire search space. The second improvement incorporates a sine cosine mutation operator to enhance the exploration ability and add more randomness into the search process. The improvements will lead to an enhanced balance between exploration and exploitation. To demonstrate the performance of the proposed NPSOSC, seven well-known optimizers are compared with the NPSOSC on 16 well-regarded datasets with different difficulty levels. The experimental results and statistical tests demonstrate the excellent performance of the proposed NPSOSC in exploring the feature space and selecting the most informative features.
特征选择是许多实际应用中重要的数据预处理过程。特征选择的目的是去除那些不需要的特征,提高分类模型的性能。本文提出了一种基于邻域的带正弦余弦变异算子的粒子群优化算法(NPSOSC)来选择信息量最大的特征子集。改进包括增强其搜索能力和避免局部最优停滞。提出了一种基于距离和适应度的邻域搜索策略,为粒子形成稳定的邻域结构。每个粒子从它的邻域中获取优势信息,整个群体可以搜索整个搜索空间的不同区域。第二次改进采用了正弦余弦变异算子,增强了搜索能力,并在搜索过程中增加了更多的随机性。这些改进将使勘探和开采之间的平衡得到加强。为了证明所提出的NPSOSC的性能,在16个公认的不同难度数据集上,将7个知名的优化器与NPSOSC进行了比较。实验结果和统计测试表明,该算法在特征空间的挖掘和信息量最大的特征选择方面具有优异的性能。
{"title":"A Neighborhood Based Particle Swarm Optimization with Sine Co-sine Mutation Operator for Feature Selection","authors":"Chenye Qiu","doi":"10.5755/j01.itc.51.3.31271","DOIUrl":"https://doi.org/10.5755/j01.itc.51.3.31271","url":null,"abstract":"Feature selection is a vital data pre-processing process in many practical applications. Feature selection aims to get rid of those unnecessary features and improve the performance of the classification model. In this paper, a neighborhood based particle swarm optimization with sine cosine mutation operator (NPSOSC) is proposed to select the most informative feature subset. The improvements are included to strengthen its search capacity and avoid local optima stagnation. A distance and fitness based neighborhood search strategy is developed to form stable neighborhood structures for the particles. Each particle adopts superior information from its neighborhoods and the entire swarm can search different regions of the entire search space. The second improvement incorporates a sine cosine mutation operator to enhance the exploration ability and add more randomness into the search process. The improvements will lead to an enhanced balance between exploration and exploitation. To demonstrate the performance of the proposed NPSOSC, seven well-known optimizers are compared with the NPSOSC on 16 well-regarded datasets with different difficulty levels. The experimental results and statistical tests demonstrate the excellent performance of the proposed NPSOSC in exploring the feature space and selecting the most informative features.","PeriodicalId":54982,"journal":{"name":"Information Technology and Control","volume":"81 1","pages":"575-591"},"PeriodicalIF":1.1,"publicationDate":"2022-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89259868","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Texture Image Analysis for Larger Lattice Structure using Orthogonal Polynomial framework 基于正交多项式框架的大晶格结构纹理图像分析
IF 1.1 4区 计算机科学 Q3 AUTOMATION & CONTROL SYSTEMS Pub Date : 2022-09-23 DOI: 10.5755/j01.itc.51.3.29322
L. Ganesan, C. Umarani, M. Kaliappan, S. Vimal, Seifedine Kadry, Yunyoung Nam
An Orthogonal Polynomial Framework using 3 x 3 mathematical model has been proposed and attempted for the textureanalysis by L.Ganesan and P.Bhattacharyya during 1990. They proposed this frame work which was unified to address both edgeand texture detection. Subsequently, this work has been extended for different applications by them and by different authors overa period of time. Now the Orthogonal Polynomial Framework has been shown effective for larger grid size of (5 x 5) or (7 x 7) orhigher, to analyze textured surfaces. The image region (5 x 5) under consideration is evaluated to be textured or untextured usinga statistical approach. Once the image region is concluded to be textured, it is proposed to be described by a local descriptor,called pro5num, computed by a simple coding scheme on the individual pixels based on their computed significant variances. Thehistogram of all the pro5nums computed over the entire image, called pro5spectrum, is considered to be the global descriptor.The novelty of this scheme is that it can be used for discriminating the region under consideration is micro or macro texture,based on the range of values in the global descriptor. This method works fine for many standard texture images. The works usingthe proposed descriptors for many texture analysis problems with (5 x5) including higher grid size and applications are underprogress
1990年,L.Ganesan和P.Bhattacharyya提出并尝试了一种使用3 × 3数学模型的正交多项式框架,用于纹理分析。他们提出了统一处理边缘和纹理检测的框架。随后,这项工作在一段时间内被他们和不同的作者扩展为不同的应用。现在,正交多项式框架已被证明对(5 × 5)或(7 × 7)或更大的网格尺寸有效,以分析纹理表面。使用统计方法评估正在考虑的图像区域(5 x 5)是纹理化的还是非纹理化的。一旦得出图像区域被纹理化的结论,建议用一个称为pro5num的局部描述符来描述它,该描述符由一个简单的编码方案根据计算出的显著方差对单个像素进行计算。在整个图像上计算的所有pro5nums的直方图称为pro5spectrum,被认为是全局描述符。该方案的新颖之处在于,它可以用于根据全局描述符中的值范围来区分所考虑的区域是微观纹理还是宏观纹理。这种方法适用于许多标准纹理图像。使用所提出的描述符解决(5 × 5)纹理分析问题的工作正在进行中,包括更高的网格尺寸和应用
{"title":"Texture Image Analysis for Larger Lattice Structure using Orthogonal Polynomial framework","authors":"L. Ganesan, C. Umarani, M. Kaliappan, S. Vimal, Seifedine Kadry, Yunyoung Nam","doi":"10.5755/j01.itc.51.3.29322","DOIUrl":"https://doi.org/10.5755/j01.itc.51.3.29322","url":null,"abstract":"An Orthogonal Polynomial Framework using 3 x 3 mathematical model has been proposed and attempted for the textureanalysis by L.Ganesan and P.Bhattacharyya during 1990. They proposed this frame work which was unified to address both edgeand texture detection. Subsequently, this work has been extended for different applications by them and by different authors overa period of time. Now the Orthogonal Polynomial Framework has been shown effective for larger grid size of (5 x 5) or (7 x 7) orhigher, to analyze textured surfaces. The image region (5 x 5) under consideration is evaluated to be textured or untextured usinga statistical approach. Once the image region is concluded to be textured, it is proposed to be described by a local descriptor,called pro5num, computed by a simple coding scheme on the individual pixels based on their computed significant variances. Thehistogram of all the pro5nums computed over the entire image, called pro5spectrum, is considered to be the global descriptor.The novelty of this scheme is that it can be used for discriminating the region under consideration is micro or macro texture,based on the range of values in the global descriptor. This method works fine for many standard texture images. The works usingthe proposed descriptors for many texture analysis problems with (5 x5) including higher grid size and applications are underprogress","PeriodicalId":54982,"journal":{"name":"Information Technology and Control","volume":"178 1","pages":"531-544"},"PeriodicalIF":1.1,"publicationDate":"2022-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78351532","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
GATSum: Graph-Based Topic-Aware Abstract Text Summarization GATSum:基于图形的主题感知抽象文本摘要
IF 1.1 4区 计算机科学 Q3 AUTOMATION & CONTROL SYSTEMS Pub Date : 2022-06-23 DOI: 10.5755/j01.itc.51.2.30796
Ming Jiang, Yifan Zou, Jian Xu, Min Zhang
The purpose of text summarization is to compress a text document into a summary containing key information. abstract approaches are challenging tasks, it is necessary to design a mechanism to effectively extract salient information from the source text, and then generate a summary. However, most of the existing abstract approaches are difficult to capture global semantics, ignoring the impact of global information on obtaining important content. To solve this problem, this paper proposes a Graph-Based Topic Aware abstract Text Summarization (GTASum) framework. Specifically, GTASum seamlessly incorporates a neural topic model to discover potential topic information, which can provide document-level features for generating summaries. In addition, the model integrates the graph neural network which can effectively capture the relationship between sentences through the document representation of graph structure, and simultaneously update the local and global information. The further discussion showed that latent topics can help the model capture salient content. We conducted experiments on two datasets, and the result shows that GTASum is superior to many extractive and abstract approaches in terms of ROUGE measurement. The result of the ablation study proves that the model has the ability to capture the original subject and the correct information and improve the factual accuracy of the summarization.
文本摘要的目的是将文本文档压缩为包含关键信息的摘要。摘要方法是一项具有挑战性的任务,有必要设计一种机制来有效地从源文本中提取显著信息,然后生成摘要。然而,现有的抽象方法大多难以捕获全局语义,忽略了全局信息对获取重要内容的影响。为了解决这一问题,本文提出了一个基于图的主题感知抽象文本摘要(GTASum)框架。具体来说,GTASum无缝地集成了一个神经主题模型来发现潜在的主题信息,它可以为生成摘要提供文档级功能。此外,该模型还集成了图神经网络,通过图结构的文档表示有效地捕捉句子之间的关系,并同时更新局部和全局信息。进一步的讨论表明,潜在主题可以帮助模型捕获突出内容。我们在两个数据集上进行了实验,结果表明GTASum在ROUGE测量方面优于许多提取和抽象方法。烧蚀研究结果表明,该模型具有捕获原始主题和正确信息的能力,提高了摘要的事实准确性。
{"title":"GATSum: Graph-Based Topic-Aware Abstract Text Summarization","authors":"Ming Jiang, Yifan Zou, Jian Xu, Min Zhang","doi":"10.5755/j01.itc.51.2.30796","DOIUrl":"https://doi.org/10.5755/j01.itc.51.2.30796","url":null,"abstract":"The purpose of text summarization is to compress a text document into a summary containing key information. abstract approaches are challenging tasks, it is necessary to design a mechanism to effectively extract salient information from the source text, and then generate a summary. However, most of the existing abstract approaches are difficult to capture global semantics, ignoring the impact of global information on obtaining important content. To solve this problem, this paper proposes a Graph-Based Topic Aware abstract Text Summarization (GTASum) framework. Specifically, GTASum seamlessly incorporates a neural topic model to discover potential topic information, which can provide document-level features for generating summaries. In addition, the model integrates the graph neural network which can effectively capture the relationship between sentences through the document representation of graph structure, and simultaneously update the local and global information. The further discussion showed that latent topics can help the model capture salient content. We conducted experiments on two datasets, and the result shows that GTASum is superior to many extractive and abstract approaches in terms of ROUGE measurement. The result of the ablation study proves that the model has the ability to capture the original subject and the correct information and improve the factual accuracy of the summarization.","PeriodicalId":54982,"journal":{"name":"Information Technology and Control","volume":"98 1","pages":"345-355"},"PeriodicalIF":1.1,"publicationDate":"2022-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77334223","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
A Novel Approach for Synchronizing of Fractional Order Uncertain Chaotic Systems in the Presence of Unknown Time-Variant Delay and Disturbance 存在未知时变延迟和扰动的分数阶不确定混沌系统的一种新同步方法
IF 1.1 4区 计算机科学 Q3 AUTOMATION & CONTROL SYSTEMS Pub Date : 2022-06-23 DOI: 10.5755/j01.itc.51.2.29411
Linli Wu, Xiuwei Fu
This paper presents a new method for synchronizing between two fractional order chaotic systems in the simultaneous presence of three categories including uncertainty, external disturbance and time-varying delay. The uncertainties considered in chaotic drive and response systems are on the nonlinear functions, the external disturbances are finite with unknown upper bound,  and the delays in the nonlinear functions are 1- variable with time 2- unknown and 3- different from each other in two drive and response systems. A new hybrid method based on fuzzy, adaptive and robust techniques is proposed to achieve synchronization for a specific class of fractional order chaotic systems. The fuzzy method is used to estimate the effects of uncertainties and delayed functions, the adaptive method is employed to obtain the optimal weights of the fuzzy approximator as well as the estimation for upper bound of disturbances, and the robust method is performed to ensure the stability of synchronization and also to cover the errors of both fuzzy and adaptive methods. Simulation in MATLAB environment shows the efficiency of the proposed method in achieving the synchronization goal despite the problems of delay, disturbance and uncertainty.
提出了一种同时存在不确定性、外部干扰和时变时滞的分数阶混沌系统同步的新方法。混沌驱动与响应系统考虑的不确定性是非线性函数,外部扰动是有限的,上界未知,非线性函数的时滞在两个驱动与响应系统中是1-变量,时间为2-未知,3-不同。针对一类分数阶混沌系统,提出了一种基于模糊、自适应和鲁棒的混合同步方法。采用模糊方法估计不确定性和延迟函数的影响,采用自适应方法获得模糊逼近器的最优权值以及扰动上界的估计,并采用鲁棒方法保证同步的稳定性,同时覆盖模糊方法和自适应方法的误差。在MATLAB环境下的仿真表明,尽管存在延迟、干扰和不确定性等问题,该方法仍能有效地实现同步目标。
{"title":"A Novel Approach for Synchronizing of Fractional Order Uncertain Chaotic Systems in the Presence of Unknown Time-Variant Delay and Disturbance","authors":"Linli Wu, Xiuwei Fu","doi":"10.5755/j01.itc.51.2.29411","DOIUrl":"https://doi.org/10.5755/j01.itc.51.2.29411","url":null,"abstract":"This paper presents a new method for synchronizing between two fractional order chaotic systems in the simultaneous presence of three categories including uncertainty, external disturbance and time-varying delay. The uncertainties considered in chaotic drive and response systems are on the nonlinear functions, the external disturbances are finite with unknown upper bound,  and the delays in the nonlinear functions are 1- variable with time 2- unknown and 3- different from each other in two drive and response systems. A new hybrid method based on fuzzy, adaptive and robust techniques is proposed to achieve synchronization for a specific class of fractional order chaotic systems. The fuzzy method is used to estimate the effects of uncertainties and delayed functions, the adaptive method is employed to obtain the optimal weights of the fuzzy approximator as well as the estimation for upper bound of disturbances, and the robust method is performed to ensure the stability of synchronization and also to cover the errors of both fuzzy and adaptive methods. Simulation in MATLAB environment shows the efficiency of the proposed method in achieving the synchronization goal despite the problems of delay, disturbance and uncertainty.","PeriodicalId":54982,"journal":{"name":"Information Technology and Control","volume":"45 1","pages":"221-234"},"PeriodicalIF":1.1,"publicationDate":"2022-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88075998","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
Information Technology and Control
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1