首页 > 最新文献

Journal of Computer Science and Cybernetics最新文献

英文 中文
PROVING THE SECURITY OF AES BLOCK CIPHER BASED ON MODIFIED MIXCOLUMN 证明基于修正混合列的 AES 区块密码的安全性
Pub Date : 2024-06-10 DOI: 10.15625/1813-9663/18058
Luong Tran Thi
Block ciphers in general, Substitution-Permutation Network (SPN) block ciphers in particular are cryptographic fields widely applied today. AES is an SPN block cipher used in many security applications. However, there are many strong attacks on block ciphers as linear attacks, differential attacks, and algebraic attacks which are challenging for cryptographers. Therefore, the research to improve the security of block ciphers in general and AES, in particular, is a topic of great interest today. Along with security, the issue of the execution cost of block ciphers is also crucial in practice. In this paper, we clarify the role of the MDS matrix in increasing the branch number of the diffusion layer of the block ciphers, thereby improving the security of the block ciphers. We propose a method improving the security of the AES block cipher by changing the Mixcolumn transformation of AES using execution-efficient MDS matrices of size 4, 8, or 16. We present a method to find a new diffusion matrix of modified AES block ciphers from which to evaluate the number of fixed points and coefficient of fixed points  of the modified AES diffusion layers. In addition, we prove the branch number of the modified AES diffusion layers with MDS matrices of sizes 8, and 16. Then we also analyze the security, statistical standards and execution speed of modified AES block ciphers generated from those MDS matrices. The results show that our proposed method can significantly improve the security of the AES block cipher.
块密码,特别是置换-置换网络(SPN)块密码是当今广泛应用的密码领域。AES 是一种 SPN 区块密码,在许多安全应用中都有使用。然而,对块密码存在许多强攻击,如线性攻击、差分攻击和代数攻击,这对密码学家来说具有挑战性。因此,如何提高块密码的安全性,特别是 AES 的安全性,是当今备受关注的课题。除了安全性,块密码的执行成本问题在实践中也至关重要。本文阐明了 MDS 矩阵在增加分块密码扩散层分支数方面的作用,从而提高了分块密码的安全性。我们提出了一种方法,通过使用大小为 4、8 或 16 的高效执行 MDS 矩阵来改变 AES 的混合列变换,从而提高 AES 区块密码的安全性。我们提出了一种方法来找到修改后 AES 区块密码的新扩散矩阵,并据此评估修改后 AES 扩散层的固定点数量和固定点系数。此外,我们还证明了大小为 8 和 16 的 MDS 矩阵的改进 AES 扩散层的分支数。然后,我们还分析了由这些 MDS 矩阵生成的修正 AES 区块密码的安全性、统计标准和执行速度。结果表明,我们提出的方法能显著提高 AES 区块密码的安全性。
{"title":"PROVING THE SECURITY OF AES BLOCK CIPHER BASED ON MODIFIED MIXCOLUMN","authors":"Luong Tran Thi","doi":"10.15625/1813-9663/18058","DOIUrl":"https://doi.org/10.15625/1813-9663/18058","url":null,"abstract":"Block ciphers in general, Substitution-Permutation Network (SPN) block ciphers in particular are cryptographic fields widely applied today. AES is an SPN block cipher used in many security applications. However, there are many strong attacks on block ciphers as linear attacks, differential attacks, and algebraic attacks which are challenging for cryptographers. Therefore, the research to improve the security of block ciphers in general and AES, in particular, is a topic of great interest today. Along with security, the issue of the execution cost of block ciphers is also crucial in practice. In this paper, we clarify the role of the MDS matrix in increasing the branch number of the diffusion layer of the block ciphers, thereby improving the security of the block ciphers. We propose a method improving the security of the AES block cipher by changing the Mixcolumn transformation of AES using execution-efficient MDS matrices of size 4, 8, or 16. We present a method to find a new diffusion matrix of modified AES block ciphers from which to evaluate the number of fixed points and coefficient of fixed points  of the modified AES diffusion layers. In addition, we prove the branch number of the modified AES diffusion layers with MDS matrices of sizes 8, and 16. Then we also analyze the security, statistical standards and execution speed of modified AES block ciphers generated from those MDS matrices. The results show that our proposed method can significantly improve the security of the AES block cipher.","PeriodicalId":15444,"journal":{"name":"Journal of Computer Science and Cybernetics","volume":" 19","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141365079","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
AN IMPROVED INDEXING METHOD FOR QUERYING BIG XML FILES 一种用于查询大型 XML 文件的改进索引方法
Pub Date : 2023-12-25 DOI: 10.15625/1813-9663/19018
Dinh Duc Luong, Vuong Quang Phuong, Hoang Do Thanh Tung
The exponential growth of bioinformatics in the healthcare domain has revolutionized our understanding of DNA, proteins, and other biomolecular entities. This remarkable progress has generated an overwhelming volume of data, necessitating big data technologies for efficient storage and indexing. While big data technologies like Hadoop offer substantial support for big XML file storage, the challenges of indexing data sizes and XPath query performance persist. To enhance the efficiency of XPath queries and address the data size problem, a novel approach that is derived from the spatial indexing method of the R-tre family. The proposed method is to modify the structure of leaf nodes in the indexing tree to preserve XML-sibling connections. Then, new algorithms for constructing the new tree structure and processing sibling queries better are introduced. Experimental results demonstrate the superior efficiency of sibling XPath queries with reduced data sizes for indexing, while other XPath queries exhibit notable performance improvements. This research contributes to the development of more effective indexing methods for managing and querying large XML datasets in bioinformatics applications, ultimately advancing biomedical research and healthcare initiatives.
生物信息学在医疗保健领域的飞速发展彻底改变了我们对 DNA、蛋白质和其他生物分子实体的认识。这一令人瞩目的进步产生了大量数据,需要大数据技术来进行高效存储和索引。虽然 Hadoop 等大数据技术为大型 XML 文件存储提供了大量支持,但索引数据规模和 XPath 查询性能方面的挑战依然存在。为了提高 XPath 查询的效率并解决数据大小问题,一种源自 R-tre 系列空间索引方法的新方法应运而生。所提出的方法是修改索引树中叶节点的结构,以保留 XML 同胞连接。然后,介绍了构建新的树结构和更好地处理同胞查询的新算法。实验结果表明,在索引数据量减少的情况下,同胞 XPath 查询的效率更高,而其他 XPath 查询的性能也有显著提高。这项研究有助于开发更有效的索引方法,用于管理和查询生物信息学应用中的大型 XML 数据集,最终推动生物医学研究和医疗保健计划。
{"title":"AN IMPROVED INDEXING METHOD FOR QUERYING BIG XML FILES","authors":"Dinh Duc Luong, Vuong Quang Phuong, Hoang Do Thanh Tung","doi":"10.15625/1813-9663/19018","DOIUrl":"https://doi.org/10.15625/1813-9663/19018","url":null,"abstract":"The exponential growth of bioinformatics in the healthcare domain has revolutionized our understanding of DNA, proteins, and other biomolecular entities. This remarkable progress has generated an overwhelming volume of data, necessitating big data technologies for efficient storage and indexing. While big data technologies like Hadoop offer substantial support for big XML file storage, the challenges of indexing data sizes and XPath query performance persist. To enhance the efficiency of XPath queries and address the data size problem, a novel approach that is derived from the spatial indexing method of the R-tre family. The proposed method is to modify the structure of leaf nodes in the indexing tree to preserve XML-sibling connections. Then, new algorithms for constructing the new tree structure and processing sibling queries better are introduced. Experimental results demonstrate the superior efficiency of sibling XPath queries with reduced data sizes for indexing, while other XPath queries exhibit notable performance improvements. This research contributes to the development of more effective indexing methods for managing and querying large XML datasets in bioinformatics applications, ultimately advancing biomedical research and healthcare initiatives.","PeriodicalId":15444,"journal":{"name":"Journal of Computer Science and Cybernetics","volume":"1 3","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-12-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139157596","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
OHYEAH AT VLSP2022-EVJVQA CHALLENGE: A JOINTLY LANGUAGE-IMAGE MODEL FOR MULTILINGUAL VISUAL QUESTION ANSWERING ohyeah at vlsp2022-evjvqa challenge: a jointly language-image model for multilingual visual question answering(用于多语言视觉问题解答的语言-图像联合模型)...
Pub Date : 2023-12-25 DOI: 10.15625/1813-9663/18122
Luan Ngo Dinh, Hiếu Lê Ngọc, Long Quoc Phan
Multilingual Visual Question Answering (mVQA) is an extremely challenging task which needs to answer a question given in different languages and take the context in an image. This problem can only be addressed by the combination of Natural Language Processing and Computer Vision. In this paper, we propose applying a jointly developed model to the task of multilingual visual question answering. Specifically, we conduct experiments on a multimodal sequence-to-sequence transformer model derived from the T5 encoder-decoder architecture. Text tokens and Vision Transformer (ViT) dense image embeddings are inputs to an encoder then we used a decoder to automatically anticipate discrete text tokens. We achieved the F1-score of 0.4349 on the private test set and ranked 2nd in the EVJVQA task at the VLSP shared task 2022. For reproducing the result, the code can be found at https://github.com/DinhLuan14/VLSP2022-VQA-OhYeah
多语言视觉问题解答(mVQA)是一项极具挑战性的任务,它需要回答用不同语言提出的问题,并在图像中提取上下文。这个问题只能通过自然语言处理和计算机视觉的结合来解决。在本文中,我们建议将联合开发的模型应用于多语言视觉问题解答任务。具体来说,我们将对源自 T5 编码器-解码器架构的多模态序列-序列转换器模型进行实验。文本标记和视觉转换器(ViT)密集图像嵌入是编码器的输入,然后我们使用解码器自动预测离散文本标记。我们在私人测试集上取得了 0.4349 的 F1 分数,并在 2022 年 VLSP 共享任务中的 EVJVQA 任务中排名第二。要重现这一结果,可在 https://github.com/DinhLuan14/VLSP2022-VQA-OhYeah 上找到相关代码。
{"title":"OHYEAH AT VLSP2022-EVJVQA CHALLENGE: A JOINTLY LANGUAGE-IMAGE MODEL FOR MULTILINGUAL VISUAL QUESTION ANSWERING","authors":"Luan Ngo Dinh, Hiếu Lê Ngọc, Long Quoc Phan","doi":"10.15625/1813-9663/18122","DOIUrl":"https://doi.org/10.15625/1813-9663/18122","url":null,"abstract":"Multilingual Visual Question Answering (mVQA) is an extremely challenging task which needs to answer a question given in different languages and take the context in an image. This problem can only be addressed by the combination of Natural Language Processing and Computer Vision. In this paper, we propose applying a jointly developed model to the task of multilingual visual question answering. Specifically, we conduct experiments on a multimodal sequence-to-sequence transformer model derived from the T5 encoder-decoder architecture. Text tokens and Vision Transformer (ViT) dense image embeddings are inputs to an encoder then we used a decoder to automatically anticipate discrete text tokens. We achieved the F1-score of 0.4349 on the private test set and ranked 2nd in the EVJVQA task at the VLSP shared task 2022. For reproducing the result, the code can be found at https://github.com/DinhLuan14/VLSP2022-VQA-OhYeah","PeriodicalId":15444,"journal":{"name":"Journal of Computer Science and Cybernetics","volume":"21 7","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-12-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139157603","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A NOVEL ALGORITHM FOR FINDING ALL REDUCTS IN THE INCOMPLETE DECISION TABLE 寻找不完整判定表中所有还原的新算法
Pub Date : 2023-11-21 DOI: 10.15625/1813-9663/18680
Pham Viet Anh, Vu Duc Thi, Nguyen Ngoc Cuong
Attribute reduction, or attribute selection in the decision table, is a fundamental problem of rough set theory. Currently, many scientists are interested in and developing these issues. Unfortunately, most studies focus mainly on the complete decision table. On incomplete decision tables, researchers have proposed tolerance relations and designed attribute reduction algorithms based on different measures. However, these algorithms only return a reduct and do not preserve information in the decision tables. This paper will propose an efficient method to determine entire reducts of incomplete decision tables according to the relational database approach. In the complex case, this algorithm has exponential computational complexity. However, this algorithm has polynomial computational complexity in the different cases of databases.
属性还原或决策表中的属性选择是粗糙集理论的一个基本问题。目前,许多科学家都对这些问题很感兴趣,并在不断研究。遗憾的是,大多数研究主要集中在完整决策表上。对于不完整的决策表,研究人员提出了容差关系,并设计了基于不同度量的属性缩减算法。然而,这些算法只能返回还原结果,并不能保留决策表中的信息。本文将根据关系数据库方法,提出一种有效的方法来确定不完整决策表的整个还原。在复杂情况下,该算法的计算复杂度为指数级。然而,在数据库的不同情况下,该算法的计算复杂度为多项式。
{"title":"A NOVEL ALGORITHM FOR FINDING ALL REDUCTS IN THE INCOMPLETE DECISION TABLE","authors":"Pham Viet Anh, Vu Duc Thi, Nguyen Ngoc Cuong","doi":"10.15625/1813-9663/18680","DOIUrl":"https://doi.org/10.15625/1813-9663/18680","url":null,"abstract":"Attribute reduction, or attribute selection in the decision table, is a fundamental problem of rough set theory. Currently, many scientists are interested in and developing these issues. Unfortunately, most studies focus mainly on the complete decision table. On incomplete decision tables, researchers have proposed tolerance relations and designed attribute reduction algorithms based on different measures. However, these algorithms only return a reduct and do not preserve information in the decision tables. This paper will propose an efficient method to determine entire reducts of incomplete decision tables according to the relational database approach. In the complex case, this algorithm has exponential computational complexity. However, this algorithm has polynomial computational complexity in the different cases of databases.","PeriodicalId":15444,"journal":{"name":"Journal of Computer Science and Cybernetics","volume":"18 4","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-11-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139253590","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
THE VNPT-IT EMOTION TRANSPLANTATION APPROACH FOR VLSP 2022 针对 VLSP 2022 的 VNPT-IT 情感移植方法
Pub Date : 2023-11-21 DOI: 10.15625/1813-9663/18236
Van Thang Nguyen, Thanh Long Luong, Huan Vu
Emotional speech synthesis is a challenging task in speech processing. To build an emotional Text-to-speech (TTS) system, one would need to have a quality emotional dataset of the target speaker. However, collecting such data is difficult, sometimes even impossible. This paper presents our approach that addresses the problem of transplanting a source speaker's emotional expression to a target speaker, one of the Vietnamese Language and Speech Processsing (VLSP) 2022 TTS tasks. Our approach includes a complete data pre-processing pipeline and two training algorithms. We first train a source speaker's expressive TTS model, then adapt the voice characteristics for the target speaker. Empirical results have shown the efficacy of our method in generating the expressive speech of a speaker under a limited training data regime.
情感语音合成是语音处理中一项具有挑战性的任务。要建立一个情感文本到语音(TTS)系统,需要有一个高质量的目标说话人情感数据集。然而,收集此类数据十分困难,有时甚至是不可能的。本文介绍了我们的方法,该方法可解决将源说话者的情感表达移植到目标说话者身上的问题,这是越南语和语音处理(VLSP)2022 TTS 任务之一。我们的方法包括一个完整的数据预处理管道和两种训练算法。我们首先训练源说话者的表达式 TTS 模型,然后调整目标说话者的语音特征。经验结果表明,在有限的训练数据条件下,我们的方法能有效地生成说话人富有表现力的语音。
{"title":"THE VNPT-IT EMOTION TRANSPLANTATION APPROACH FOR VLSP 2022","authors":"Van Thang Nguyen, Thanh Long Luong, Huan Vu","doi":"10.15625/1813-9663/18236","DOIUrl":"https://doi.org/10.15625/1813-9663/18236","url":null,"abstract":"Emotional speech synthesis is a challenging task in speech processing. To build an emotional Text-to-speech (TTS) system, one would need to have a quality emotional dataset of the target speaker. However, collecting such data is difficult, sometimes even impossible. This paper presents our approach that addresses the problem of transplanting a source speaker's emotional expression to a target speaker, one of the Vietnamese Language and Speech Processsing (VLSP) 2022 TTS tasks. Our approach includes a complete data pre-processing pipeline and two training algorithms. We first train a source speaker's expressive TTS model, then adapt the voice characteristics for the target speaker. Empirical results have shown the efficacy of our method in generating the expressive speech of a speaker under a limited training data regime.","PeriodicalId":15444,"journal":{"name":"Journal of Computer Science and Cybernetics","volume":"83 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-11-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139251953","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
TAEKWONDO POSE ESTIMATION WITH DEEP LEARNING ARCHITECTURES ON ONE-DIMENSIONAL AND TWO-DIMENSIONAL DATA 在一维和二维数据上利用深度学习架构进行跆拳道姿势估计
Pub Date : 2023-11-21 DOI: 10.15625/1813-9663/18043
Dat Tien Nguyen, Chau Ngoc Ha, Ha Thanh Thi Hoang, Truong Nhat Nguyen, Tuyet Ngoc Huynh, Hai Thanh Nguyen
Practicing sports is an activity that helps people maintain and improve their health, enhance memory and concentration, reduce anxiety and stress, and train teamwork and leadership ability. With the development of science and technology, artificial intelligence in sports has become increasingly popular with the public and brings many benefits. In particular, many applications help people track and evaluate athletes' achievements in competitions. This study extracts images from Taekwondo videos and generates skeleton data from frames using the Fast Forward Moving Picture Experts Group (FFMPEG) technique using MoveNet. After that, we use deep learning architectures such as Long Short-Term Memory Networks, Convolutional Long Short-Term Memory, and Long-term Recurrent Convolutional Networks to perform the poses classification tasks in Taegeuk in Jang lessons. This work presents two approaches. The first approach uses a sequence skeleton extracted from the image by Movenet. Second, we use sequence images to train using video classification architecture. Finally, we recognize poses in sports lessons using skeleton data to remove noise in the image, such as background and extraneous objects behind the exerciser. As a result, our proposed method has achieved promising performance in pose classification tasks in an introductory Taekwondo lesson.
体育运动是一项有助于人们保持和改善健康、增强记忆力和注意力、减轻焦虑和压力、锻炼团队精神和领导能力的活动。随着科学技术的发展,人工智能在体育运动中的应用越来越受到大众的青睐,并带来了诸多益处。特别是,许多应用程序可以帮助人们跟踪和评估运动员在比赛中取得的成绩。本研究从跆拳道视频中提取图像,并利用移动网络(MoveNet)使用快速移动图像专家组(FFMPEG)技术从帧中生成骨架数据。然后,我们使用长短期记忆网络、卷积长短期记忆和长期递归卷积网络等深度学习架构来执行张氏跆拳道课程中的姿势分类任务。这项工作提出了两种方法。第一种方法使用 Movenet 从图像中提取的序列骨架。其次,我们使用序列图像来训练视频分类架构。最后,我们使用骨架数据识别体育课中的姿势,以去除图像中的噪声,如背景和运动者身后的无关物体。结果,我们提出的方法在跆拳道入门课程的姿势分类任务中取得了可喜的成绩。
{"title":"TAEKWONDO POSE ESTIMATION WITH DEEP LEARNING ARCHITECTURES ON ONE-DIMENSIONAL AND TWO-DIMENSIONAL DATA","authors":"Dat Tien Nguyen, Chau Ngoc Ha, Ha Thanh Thi Hoang, Truong Nhat Nguyen, Tuyet Ngoc Huynh, Hai Thanh Nguyen","doi":"10.15625/1813-9663/18043","DOIUrl":"https://doi.org/10.15625/1813-9663/18043","url":null,"abstract":"Practicing sports is an activity that helps people maintain and improve their health, enhance memory and concentration, reduce anxiety and stress, and train teamwork and leadership ability. With the development of science and technology, artificial intelligence in sports has become increasingly popular with the public and brings many benefits. In particular, many applications help people track and evaluate athletes' achievements in competitions. This study extracts images from Taekwondo videos and generates skeleton data from frames using the Fast Forward Moving Picture Experts Group (FFMPEG) technique using MoveNet. After that, we use deep learning architectures such as Long Short-Term Memory Networks, Convolutional Long Short-Term Memory, and Long-term Recurrent Convolutional Networks to perform the poses classification tasks in Taegeuk in Jang lessons. This work presents two approaches. The first approach uses a sequence skeleton extracted from the image by Movenet. Second, we use sequence images to train using video classification architecture. Finally, we recognize poses in sports lessons using skeleton data to remove noise in the image, such as background and extraneous objects behind the exerciser. As a result, our proposed method has achieved promising performance in pose classification tasks in an introductory Taekwondo lesson.","PeriodicalId":15444,"journal":{"name":"Journal of Computer Science and Cybernetics","volume":"59 18","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-11-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139253167","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A PLANT RECOGNITION APPROACH USING HIGH RESOLUTION NETWORK 一种基于高分辨率网络的植物识别方法
Pub Date : 2023-10-06 DOI: 10.15625/1813-9663/18144
Dang Ngan Ha, Hieu Trung Huynh
Plant species recognition plays an important role in agriculture, the pharmaceutical industry, and conservation. The traditional approaches may take days and have difficulties for non-experts. Several computer vision-based models have been proposed, which can partially assist and speed up the plant recognition process. Thanks to the development of data collection and computational systems, the models based on machine learning have considerably improved their performance in the last decades. In this paper, we present a model for plant recognition in Southeast Asia based on the high-resolution network. The evaluation is carried out on a public dataset consisting of 26 different species in Southeast Asia. It shows high accuracy in recognition.
植物物种识别在农业、医药工业和保护中起着重要的作用。传统的方法可能需要几天的时间,对非专业人士来说也有困难。已经提出了几种基于计算机视觉的模型,这些模型可以部分地辅助和加快植物识别过程。由于数据收集和计算系统的发展,基于机器学习的模型在过去几十年中大大提高了它们的性能。在本文中,我们提出了一个基于高分辨率网络的东南亚植物识别模型。该评估是在一个由东南亚26个不同物种组成的公共数据集上进行的。该方法具有较高的识别准确率。
{"title":"A PLANT RECOGNITION APPROACH USING HIGH RESOLUTION NETWORK","authors":"Dang Ngan Ha, Hieu Trung Huynh","doi":"10.15625/1813-9663/18144","DOIUrl":"https://doi.org/10.15625/1813-9663/18144","url":null,"abstract":"Plant species recognition plays an important role in agriculture, the pharmaceutical industry, and conservation. The traditional approaches may take days and have difficulties for non-experts. Several computer vision-based models have been proposed, which can partially assist and speed up the plant recognition process. Thanks to the development of data collection and computational systems, the models based on machine learning have considerably improved their performance in the last decades. In this paper, we present a model for plant recognition in Southeast Asia based on the high-resolution network. The evaluation is carried out on a public dataset consisting of 26 different species in Southeast Asia. It shows high accuracy in recognition.","PeriodicalId":15444,"journal":{"name":"Journal of Computer Science and Cybernetics","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134946454","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
EVJVQA CHALLENGE: MULTILINGUAL VISUAL QUESTION ANSWERING Evjvqa挑战:多语言视觉问答
Pub Date : 2023-09-26 DOI: 10.15625/1813-9663/18157
Nguyen, Ngan Luu-Thuy, Nguyen, Nghia Hieu, Vo, Duong T. D, Tran, Khanh Quoc, Van Nguyen, Kiet
Visual Question Answering (VQA) is a challenging task of natural language processing (NLP) and computer vision (CV), attracting significant attention from researchers. English is a resource-rich language that has witnessed various developments in datasets and models for visual question answering. Visual question answering in other languages also would be developed for resources and models. In addition, there is no multilingual dataset targeting the visual content of a particular country with its own objects and cultural characteristics. To address the weakness, we provide the research community with a benchmark dataset named EVJVQA, including 33,000+ pairs of question-answer over three languages: Vietnamese, English, and Japanese, on approximately 5,000 images taken from Vietnam for evaluating multilingual VQA systems or models. EVJVQA is used as a benchmark dataset for the challenge of multilingual visual question answering at the 9th Workshop on Vietnamese Language and Speech Processing (VLSP 2022). This task attracted 62 participant teams from various universities and organizations. In this article, we present details of the organization of the challenge, an overview of the methods employed by shared-task participants, and the results. The highest performances are 0.4392 in F1-score and 0.4009 in BLUE on the private test set. The multilingual QA systems proposed by the top 2 teams use ViT for the pre-trained vision model and mT5 for the pre-trained language model, a powerful pre-trained language model based on the transformer architecture. EVJVQA is a challenging dataset that motivates NLP and CV researchers to further explore the multilingual models or systems for visual question answering systems.
视觉问答(Visual Question answer, VQA)是自然语言处理(NLP)和计算机视觉(CV)领域的一项具有挑战性的任务,受到了研究人员的广泛关注。英语是一种资源丰富的语言,它见证了可视化问答数据集和模型的各种发展。还将为资源和模型开发其他语言的可视化问答。此外,还没有针对具有自己的对象和文化特征的特定国家的视觉内容的多语言数据集。为了解决这一弱点,我们为研究界提供了一个名为EVJVQA的基准数据集,其中包括33,000多对三种语言的问答:越南语,英语和日语,取自越南的约5,000张图像,用于评估多语言VQA系统或模型。EVJVQA在第九届越南语言和语音处理研讨会(VLSP 2022)上被用作多语言视觉问答挑战的基准数据集。这项任务吸引了来自不同大学和组织的62支参赛队伍。在本文中,我们详细介绍了挑战的组织、共享任务参与者使用的方法的概述以及结果。在私人测试集上,f1得分最高为0.4392,BLUE得分最高为0.4009。前两名团队提出的多语言QA系统使用ViT作为预训练的视觉模型,使用mT5作为预训练的语言模型,这是一种基于transformer架构的强大的预训练语言模型。EVJVQA是一个具有挑战性的数据集,激励NLP和CV研究人员进一步探索视觉问答系统的多语言模型或系统。
{"title":"EVJVQA CHALLENGE: MULTILINGUAL VISUAL QUESTION ANSWERING","authors":"Nguyen, Ngan Luu-Thuy, Nguyen, Nghia Hieu, Vo, Duong T. D, Tran, Khanh Quoc, Van Nguyen, Kiet","doi":"10.15625/1813-9663/18157","DOIUrl":"https://doi.org/10.15625/1813-9663/18157","url":null,"abstract":"Visual Question Answering (VQA) is a challenging task of natural language processing (NLP) and computer vision (CV), attracting significant attention from researchers. English is a resource-rich language that has witnessed various developments in datasets and models for visual question answering. Visual question answering in other languages also would be developed for resources and models. In addition, there is no multilingual dataset targeting the visual content of a particular country with its own objects and cultural characteristics. To address the weakness, we provide the research community with a benchmark dataset named EVJVQA, including 33,000+ pairs of question-answer over three languages: Vietnamese, English, and Japanese, on approximately 5,000 images taken from Vietnam for evaluating multilingual VQA systems or models. EVJVQA is used as a benchmark dataset for the challenge of multilingual visual question answering at the 9th Workshop on Vietnamese Language and Speech Processing (VLSP 2022). This task attracted 62 participant teams from various universities and organizations. In this article, we present details of the organization of the challenge, an overview of the methods employed by shared-task participants, and the results. The highest performances are 0.4392 in F1-score and 0.4009 in BLUE on the private test set. The multilingual QA systems proposed by the top 2 teams use ViT for the pre-trained vision model and mT5 for the pre-trained language model, a powerful pre-trained language model based on the transformer architecture. EVJVQA is a challenging dataset that motivates NLP and CV researchers to further explore the multilingual models or systems for visual question answering systems.","PeriodicalId":15444,"journal":{"name":"Journal of Computer Science and Cybernetics","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134903242","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
DATA AUGMENTATION ANALYSIS OF VEHICLE DETECTION IN AERIAL IMAGES 航拍图像中车辆检测的数据增强分析
Pub Date : 2023-09-22 DOI: 10.15625/1813-9663/18259
Khang Nguyen
Drones are increasingly used in various application domains including surveillance, agriculture, delivery, search and rescue missions. Object detection in aerial images (captured by drones) gradually gains more interest in computer vision community. However, research activities are still very few in this area due to numerous challenges such as top-view angle, small-scale object, diverse directions, and data imbalance. In this paper, we investigate different data augmentation techniques. Furthermore, we propose combining data augmentation methods to further enhance the performance of the state-of-the-art object detection methods. Extensive experiments on two datasets, namely, AERIAU, and XDUAV, demonstrate that the combination of random cropped and vertical flipped data boosts the performance of object detectors on aerial images.
无人机越来越多地用于各种应用领域,包括监视,农业,交付,搜索和救援任务。无人机拍摄的航拍图像中的目标检测逐渐受到计算机视觉界的关注。然而,由于俯视图角度、对象小尺度、方向多样、数据不平衡等诸多挑战,该领域的研究活动仍然很少。在本文中,我们研究了不同的数据增强技术。此外,我们提出结合数据增强方法,以进一步提高最先进的目标检测方法的性能。在AERIAU和XDUAV两个数据集上进行的大量实验表明,随机裁剪和垂直翻转数据的组合提高了航空图像上目标检测器的性能。
{"title":"DATA AUGMENTATION ANALYSIS OF VEHICLE DETECTION IN AERIAL IMAGES","authors":"Khang Nguyen","doi":"10.15625/1813-9663/18259","DOIUrl":"https://doi.org/10.15625/1813-9663/18259","url":null,"abstract":"Drones are increasingly used in various application domains including surveillance, agriculture, delivery, search and rescue missions. Object detection in aerial images (captured by drones) gradually gains more interest in computer vision community. However, research activities are still very few in this area due to numerous challenges such as top-view angle, small-scale object, diverse directions, and data imbalance. In this paper, we investigate different data augmentation techniques. Furthermore, we propose combining data augmentation methods to further enhance the performance of the state-of-the-art object detection methods. Extensive experiments on two datasets, namely, AERIAU, and XDUAV, demonstrate that the combination of random cropped and vertical flipped data boosts the performance of object detectors on aerial images.","PeriodicalId":15444,"journal":{"name":"Journal of Computer Science and Cybernetics","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136100800","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A NOVEL APPROACH TO MODELLING A DIAGNOSIS AND TREATMENT OF TRADITIONAL VIETNAMESE MEDICINE 越南传统医学诊断和治疗建模的新方法
Pub Date : 2023-09-22 DOI: 10.15625/1813-9663/18015
Truong Thi Hong Thuy, Nguyen Hoang Phuong
Traditional Vietnamese Medicine (TVM) is based on the experiences of thousands of years of Vietnamese people in the struggle against diseases; therefore, TVM is very important in the medical system of Vietnam. In this paper, we propose a novel model of an expert system for diagnosing disease syndromes and treating traditional Vietnamese medicine. In this model, the knowledge base consists of IF-THEN rules, in which the antecedent of a rule is an elementary conjunction of propositions and negated propositions. The inference mechanism for the diagnosis of disease syndromes and treatment of traditional Vietnamese medicine applies Abelian group operations. A comparison of the inference of our model with the fuzzy max-min inferences shows that our model can have very similar rules whose contributions sum up to high weight. On the other hand, in our model, a rule with a negative weight may diminish an effect of a rule with a good weight. This feature is absent in the systems with fuzzy max-min inferences. We have built rule patterns for the diagnosis of about 50 disease syndromes and their treatment by Herbs and Acupuncture with the cooperation of practitioners of Oriental Traditional Medicine in Vietnam. Some examples of databases and the rules for disease syndrome differentiation and treatment by herbal medicine and Acupuncture are shown. Finally, some conclusions and future works are given.
越南传统医学(TVM)以越南人民数千年来与疾病作斗争的经验为基础;因此,TVM在越南的医疗系统中非常重要。在本文中,我们提出了一种新的诊断疾病证候和治疗越南传统医学的专家系统模型。在该模型中,知识库由IF-THEN规则组成,其中规则的前提是命题和否定命题的初等合取。越医疾病证候诊断与治疗的推理机制应用阿别群操作。我们的模型与模糊最大最小推理的比较表明,我们的模型可以有非常相似的规则,这些规则的贡献总和很高。另一方面,在我们的模型中,具有负权重的规则可能会削弱具有良好权重的规则的效果。在具有模糊极大极小推理的系统中不存在这一特征。我们与越南东方传统医学医师合作,建立了约50种疾病证候的诊断和中医针灸治疗的规则模式。给出了中医辨证论治中医针灸辨证论治的一些数据库实例和规律。最后,对本文的研究工作进行了总结和展望。
{"title":"A NOVEL APPROACH TO MODELLING A DIAGNOSIS AND TREATMENT OF TRADITIONAL VIETNAMESE MEDICINE","authors":"Truong Thi Hong Thuy, Nguyen Hoang Phuong","doi":"10.15625/1813-9663/18015","DOIUrl":"https://doi.org/10.15625/1813-9663/18015","url":null,"abstract":"Traditional Vietnamese Medicine (TVM) is based on the experiences of thousands of years of Vietnamese people in the struggle against diseases; therefore, TVM is very important in the medical system of Vietnam. In this paper, we propose a novel model of an expert system for diagnosing disease syndromes and treating traditional Vietnamese medicine. In this model, the knowledge base consists of IF-THEN rules, in which the antecedent of a rule is an elementary conjunction of propositions and negated propositions. The inference mechanism for the diagnosis of disease syndromes and treatment of traditional Vietnamese medicine applies Abelian group operations. A comparison of the inference of our model with the fuzzy max-min inferences shows that our model can have very similar rules whose contributions sum up to high weight. On the other hand, in our model, a rule with a negative weight may diminish an effect of a rule with a good weight. This feature is absent in the systems with fuzzy max-min inferences. We have built rule patterns for the diagnosis of about 50 disease syndromes and their treatment by Herbs and Acupuncture with the cooperation of practitioners of Oriental Traditional Medicine in Vietnam. Some examples of databases and the rules for disease syndrome differentiation and treatment by herbal medicine and Acupuncture are shown. Finally, some conclusions and future works are given.","PeriodicalId":15444,"journal":{"name":"Journal of Computer Science and Cybernetics","volume":"44 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136100798","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of Computer Science and Cybernetics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1