首页 > 最新文献

2021 RIVF International Conference on Computing and Communication Technologies (RIVF)最新文献

英文 中文
MC-OCR Challenge 2021: Towards Document Understanding for Unconstrained Mobile-Captured Vietnamese Receipts MC-OCR挑战2021:面向无约束移动捕获越南收据的文档理解
Pub Date : 2021-08-19 DOI: 10.1109/RIVF51545.2021.9642126
Hoai Viet Nguyen, Linh Doan Bao, Hoang Viet Trinh, Hoang Huy Phan, Ta Minh Thanh
The Mobile capture receipts Optical Character Recognition (MC-OCR) [14] challenge deliver two tasks: Receipt Image Quality Evaluation and Key Information Extraction. In the first task, we introduce a regression model to map various inputs, for instance the probability of the output OCR, cropped text boxes, images to actual label. In the second task, we propose a stacked multi-model as a solution to solve this problem. The robust models are incorporated by image segmentation, image classification, text detection, text recognition, and text classification. Follow this solution, we can get vital tackle various noise receipt types: horizontal, skew, and blur receipt.
移动捕获收据光学字符识别(MC-OCR)[14]挑战包含两个任务:收据图像质量评估和关键信息提取。在第一个任务中,我们引入了一个回归模型来映射各种输入,例如输出OCR的概率,裁剪的文本框,图像到实际标签。在第二项任务中,我们提出了一个堆叠多模型来解决这个问题。鲁棒模型包括图像分割、图像分类、文本检测、文本识别和文本分类。按照这个解决方案,我们可以得到重要的解决各种噪音收据类型:水平,倾斜和模糊收据。
{"title":"MC-OCR Challenge 2021: Towards Document Understanding for Unconstrained Mobile-Captured Vietnamese Receipts","authors":"Hoai Viet Nguyen, Linh Doan Bao, Hoang Viet Trinh, Hoang Huy Phan, Ta Minh Thanh","doi":"10.1109/RIVF51545.2021.9642126","DOIUrl":"https://doi.org/10.1109/RIVF51545.2021.9642126","url":null,"abstract":"The Mobile capture receipts Optical Character Recognition (MC-OCR) [14] challenge deliver two tasks: Receipt Image Quality Evaluation and Key Information Extraction. In the first task, we introduce a regression model to map various inputs, for instance the probability of the output OCR, cropped text boxes, images to actual label. In the second task, we propose a stacked multi-model as a solution to solve this problem. The robust models are incorporated by image segmentation, image classification, text detection, text recognition, and text classification. Follow this solution, we can get vital tackle various noise receipt types: horizontal, skew, and blur receipt.","PeriodicalId":6860,"journal":{"name":"2021 RIVF International Conference on Computing and Communication Technologies (RIVF)","volume":"105 1","pages":"1-5"},"PeriodicalIF":0.0,"publicationDate":"2021-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"75689532","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
A Novel Image Watermarking Scheme Using LU Decomposition 一种新的基于LU分解的图像水印方案
Pub Date : 2021-08-19 DOI: 10.1109/RIVF51545.2021.9642085
Phuong Thi Nha, Ta Minh Thanh
In recent years, protecting copyright of digital images is an indispensable requirement for owners. To against with rapidly increasing of attacks, many techniques have been proposed in transform domain for ensuring quality of watermarked image, robustness of extracted watermark and execution time. Among these techniques, LU decomposition is considered as an outstanding technique in term of computation. However, it is that not all square matrices have an LU decomposition. Therefore, the suitable blocks need to be chosen before factorizing pixel matrices into lower and upper triangular matrix. In addition, in order to improve the invisibility of watermarked image, watermark should be embedded on one element of L matrix instead of two elements as the previous proposals. In this paper, we propose a novel image watermarking scheme which is based on strategy of LU blocks selection and an improved embedding method. Beside that, the extraction time is significantly sped up by a new solution to get out L(2,1) and L(3,1) elements of L matrix without performing LU decomposition in the extracting stage. According to the experimental results, our proposed method not only has the much better visual quality of watermarked images, but also can effectively extracts the watermark under some attacks.
近年来,保护数字图像的版权是对版权所有人不可或缺的要求。针对快速增加的水印攻击,在变换域提出了许多保证水印图像质量、水印提取的鲁棒性和执行时间的技术。在这些技术中,逻辑单元分解被认为是计算能力较强的一种技术。然而,并不是所有的方阵都有LU分解。因此,在将像素矩阵分解为上下三角矩阵之前,需要选择合适的块。此外,为了提高水印图像的不可见性,水印应该嵌入到L矩阵的一个元素上,而不是像之前的建议那样嵌入到两个元素上。本文提出了一种基于LU块选择策略和改进嵌入方法的图像水印方案。此外,在提取阶段不进行LU分解,提取出L矩阵中的L(2,1)和L(3,1)个元素,显著加快了提取时间。实验结果表明,该方法不仅具有较好的水印图像视觉质量,而且在某些攻击下也能有效地提取水印。
{"title":"A Novel Image Watermarking Scheme Using LU Decomposition","authors":"Phuong Thi Nha, Ta Minh Thanh","doi":"10.1109/RIVF51545.2021.9642085","DOIUrl":"https://doi.org/10.1109/RIVF51545.2021.9642085","url":null,"abstract":"In recent years, protecting copyright of digital images is an indispensable requirement for owners. To against with rapidly increasing of attacks, many techniques have been proposed in transform domain for ensuring quality of watermarked image, robustness of extracted watermark and execution time. Among these techniques, LU decomposition is considered as an outstanding technique in term of computation. However, it is that not all square matrices have an LU decomposition. Therefore, the suitable blocks need to be chosen before factorizing pixel matrices into lower and upper triangular matrix. In addition, in order to improve the invisibility of watermarked image, watermark should be embedded on one element of L matrix instead of two elements as the previous proposals. In this paper, we propose a novel image watermarking scheme which is based on strategy of LU blocks selection and an improved embedding method. Beside that, the extraction time is significantly sped up by a new solution to get out L(2,1) and L(3,1) elements of L matrix without performing LU decomposition in the extracting stage. According to the experimental results, our proposed method not only has the much better visual quality of watermarked images, but also can effectively extracts the watermark under some attacks.","PeriodicalId":6860,"journal":{"name":"2021 RIVF International Conference on Computing and Communication Technologies (RIVF)","volume":"1 1","pages":"1-6"},"PeriodicalIF":0.0,"publicationDate":"2021-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73029750","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Efficient Algorithm for Multiple Benefit Thresholds Problem in Online Social Networks 在线社交网络中多利益阈值问题的高效算法
Pub Date : 2021-08-19 DOI: 10.1109/RIVF51545.2021.9642099
P. H. Pham, Bich-Ngan T. Nguyen, Canh V. Pham, Nghia D. Nghia, V. Snás̃el
In the context of viral marketing in Online Social Networks (OSNs), companies often find some users (called a seed set) to initiate the spread of their product’s information so that the benefit gained exceeds a given threshold. However, in a realistic scenario, marketing strategies often change so the selection of a seed set for a particular threshold is not enough to provide an effective solution. Motivated by this phenomenon, we investigate the Multiple Benefit Thresholds (MBT), defined as follows: Given a social network under an information diffusion and a set of thresholds T = {T1, T2, … , Tk}, the problem finds seed sets S1, S2, … , Sk with the minimal cost so that their benefit gained after the influence process are at least T1, T2, … , Tk, respectively. To find the solution, we propose an efficient algorithm with theoretical guarantees, named Efficient Sampling for Selecting Multiple seed sets (ESSM) by developing an algorithmic framework and utilizing the sampling technique for estimating the objective function. We perform extensive experiments using some real networks show that the effective and performance of our algorithm, which outperforms other algorithms in term both the cost and running time.
在网络社交网络(OSNs)的病毒式营销背景下,公司通常会找到一些用户(称为种子集)来发起产品信息的传播,从而获得超过给定阈值的收益。然而,在现实情况中,营销策略经常变化,因此为特定阈值选择种子集不足以提供有效的解决方案。基于这一现象,我们研究了多重效益阈值(Multiple Benefit threshold, MBT)问题,其定义如下:给定一个信息扩散的社会网络和一组阈值T = {T1, T2,…,Tk},该问题寻找代价最小的种子集S1, S2,…,Sk,使其在影响过程后获得的效益分别至少为T1, T2,…,Tk。为了解决这个问题,我们提出了一种具有理论保证的高效算法,即高效抽样选择多种子集(ESSM),通过开发算法框架并利用抽样技术来估计目标函数。我们在一些真实网络上进行了大量的实验,结果表明我们的算法在成本和运行时间方面都优于其他算法。
{"title":"Efficient Algorithm for Multiple Benefit Thresholds Problem in Online Social Networks","authors":"P. H. Pham, Bich-Ngan T. Nguyen, Canh V. Pham, Nghia D. Nghia, V. Snás̃el","doi":"10.1109/RIVF51545.2021.9642099","DOIUrl":"https://doi.org/10.1109/RIVF51545.2021.9642099","url":null,"abstract":"In the context of viral marketing in Online Social Networks (OSNs), companies often find some users (called a seed set) to initiate the spread of their product’s information so that the benefit gained exceeds a given threshold. However, in a realistic scenario, marketing strategies often change so the selection of a seed set for a particular threshold is not enough to provide an effective solution. Motivated by this phenomenon, we investigate the Multiple Benefit Thresholds (MBT), defined as follows: Given a social network under an information diffusion and a set of thresholds T = {T1, T2, … , Tk}, the problem finds seed sets S1, S2, … , Sk with the minimal cost so that their benefit gained after the influence process are at least T1, T2, … , Tk, respectively. To find the solution, we propose an efficient algorithm with theoretical guarantees, named Efficient Sampling for Selecting Multiple seed sets (ESSM) by developing an algorithmic framework and utilizing the sampling technique for estimating the objective function. We perform extensive experiments using some real networks show that the effective and performance of our algorithm, which outperforms other algorithms in term both the cost and running time.","PeriodicalId":6860,"journal":{"name":"2021 RIVF International Conference on Computing and Communication Technologies (RIVF)","volume":"1 1","pages":"1-6"},"PeriodicalIF":0.0,"publicationDate":"2021-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85196814","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MC-OCR Challenge 2021: A Multi-modal Approach for Mobile-Captured Vietnamese Receipts Recognition MC-OCR挑战2021:移动捕获越南收据识别的多模式方法
Pub Date : 2021-08-19 DOI: 10.1109/RIVF51545.2021.9642088
Bao Hieu Tran, Duc Viet Hoang, Nguyen Manh Hiep, Pham Ngoc Bao Anh, Hoang Gia Bao, Nguyen Duc Anh, Bui Hai Phong, T. Nguyen, Phi-Le Nguyen, Thi-Lan Le
Mobile captured receipts OCR (MC-OCR) recognizes text from structured and semi-structured receipts and invoices captured by mobile devices. This process plays a critical role in streamlining document-intensive processes and office automation in many financial, accounting, and taxation areas. Although many efforts have been devoted, MC-OCR still faces significant challenges due to mobile captured images’ complexity. First, receipts might be crumpled, or the content might be blurred. Second, different from scanned images, the quality of photos taken by mobile devices shows high diversity due to the light condition and the dynamic environment (e.g., indoor, out-door, complex background, etc.) where the receipts were captured. These difficulties lead to a low accuracy of the recognition results. In this challenge, we target two tasks to address these issues, including (1) evaluating the quality of the captured receipts, and (2) recognizing required fields of the receipts. Our idea is to leverage a multi-modal approach which can take advantage of both areas: computer vision and natural language processing, two of the main interests of the RIVF community. The paper presents the BK-OCR team’s methodology and results in the Mobile-Captured Image Document Recognition for Vietnamese Receipts 2021.
移动捕获收据OCR (MC-OCR)从移动设备捕获的结构化和半结构化收据和发票中识别文本。在许多财务、会计和税务领域,该流程在简化文档密集型流程和办公自动化方面起着关键作用。尽管已经做出了许多努力,但由于移动拍摄图像的复杂性,MC-OCR仍然面临着重大挑战。首先,收据可能会被弄皱,或者内容可能会模糊不清。其次,与扫描图像不同,移动设备拍摄的照片由于拍摄收据的光线条件和动态环境(如室内、室外、复杂背景等)而呈现出高度的多样性。这些困难导致了识别结果的准确率较低。在这个挑战中,我们的目标是两个任务来解决这些问题,包括(1)评估捕获收据的质量,(2)识别收据的必要字段。我们的想法是利用一种多模式的方法,它可以利用两个领域:计算机视觉和自然语言处理,这是RIVF社区的两个主要兴趣。本文介绍了BK-OCR团队在2021年越南收据移动捕获图像文档识别中的方法和结果。
{"title":"MC-OCR Challenge 2021: A Multi-modal Approach for Mobile-Captured Vietnamese Receipts Recognition","authors":"Bao Hieu Tran, Duc Viet Hoang, Nguyen Manh Hiep, Pham Ngoc Bao Anh, Hoang Gia Bao, Nguyen Duc Anh, Bui Hai Phong, T. Nguyen, Phi-Le Nguyen, Thi-Lan Le","doi":"10.1109/RIVF51545.2021.9642088","DOIUrl":"https://doi.org/10.1109/RIVF51545.2021.9642088","url":null,"abstract":"Mobile captured receipts OCR (MC-OCR) recognizes text from structured and semi-structured receipts and invoices captured by mobile devices. This process plays a critical role in streamlining document-intensive processes and office automation in many financial, accounting, and taxation areas. Although many efforts have been devoted, MC-OCR still faces significant challenges due to mobile captured images’ complexity. First, receipts might be crumpled, or the content might be blurred. Second, different from scanned images, the quality of photos taken by mobile devices shows high diversity due to the light condition and the dynamic environment (e.g., indoor, out-door, complex background, etc.) where the receipts were captured. These difficulties lead to a low accuracy of the recognition results. In this challenge, we target two tasks to address these issues, including (1) evaluating the quality of the captured receipts, and (2) recognizing required fields of the receipts. Our idea is to leverage a multi-modal approach which can take advantage of both areas: computer vision and natural language processing, two of the main interests of the RIVF community. The paper presents the BK-OCR team’s methodology and results in the Mobile-Captured Image Document Recognition for Vietnamese Receipts 2021.","PeriodicalId":6860,"journal":{"name":"2021 RIVF International Conference on Computing and Communication Technologies (RIVF)","volume":"6 1","pages":"1-6"},"PeriodicalIF":0.0,"publicationDate":"2021-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84344788","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Self-Supervised Learning for Action Recognition by Video Denoising 基于视频去噪的自监督学习行为识别
Pub Date : 2021-08-19 DOI: 10.1109/RIVF51545.2021.9642129
T. Phung, Thi Hong Thu Ma, Van Truong Nguyen, Duc-Quang Vu
Deep learning is a data-hungry technique that is more effective when being applied to large datasets. However, large-scale annotation datasets are not always available. A new approach, such as self-supervised learning of which labels can be automatically generated, is essential. Therefore, using self- supervised learning is a new approach to state-of-the-art methods. In this paper, we introduce a new self-supervised method namely video denoising. This method requires an autoencoder model to restore original videos. The second model is proposed, which is called the discriminator. It is used for the quality evaluation of output videos from the autoencoder. By reconstructing videos, the autoencoder is learned both spatial and temporal relations of video frames to process the downstream task easily. In the experiments, we have demonstrated that our model is well transferred to the action recognition task and outperforms state- of-the-art methods on the UCF-101 and HMDB-51 datasets.
深度学习是一种数据密集型技术,在应用于大型数据集时更为有效。然而,大规模标注数据集并不总是可用的。一种新的方法,如可以自动生成标签的自我监督学习,是必不可少的。因此,使用自监督学习是一种新的方法。本文介绍了一种新的自监督方法——视频去噪。这种方法需要一个自动编码器模型来恢复原始视频。提出了第二种模型,称为鉴别器。它用于自编码器输出视频的质量评价。通过重构视频,自动编码器学习视频帧的空间和时间关系,便于后续任务的处理。在实验中,我们已经证明了我们的模型可以很好地转移到动作识别任务中,并且在UCF-101和HMDB-51数据集上优于最先进的方法。
{"title":"Self-Supervised Learning for Action Recognition by Video Denoising","authors":"T. Phung, Thi Hong Thu Ma, Van Truong Nguyen, Duc-Quang Vu","doi":"10.1109/RIVF51545.2021.9642129","DOIUrl":"https://doi.org/10.1109/RIVF51545.2021.9642129","url":null,"abstract":"Deep learning is a data-hungry technique that is more effective when being applied to large datasets. However, large-scale annotation datasets are not always available. A new approach, such as self-supervised learning of which labels can be automatically generated, is essential. Therefore, using self- supervised learning is a new approach to state-of-the-art methods. In this paper, we introduce a new self-supervised method namely video denoising. This method requires an autoencoder model to restore original videos. The second model is proposed, which is called the discriminator. It is used for the quality evaluation of output videos from the autoencoder. By reconstructing videos, the autoencoder is learned both spatial and temporal relations of video frames to process the downstream task easily. In the experiments, we have demonstrated that our model is well transferred to the action recognition task and outperforms state- of-the-art methods on the UCF-101 and HMDB-51 datasets.","PeriodicalId":6860,"journal":{"name":"2021 RIVF International Conference on Computing and Communication Technologies (RIVF)","volume":"46 1","pages":"1-6"},"PeriodicalIF":0.0,"publicationDate":"2021-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87113712","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Lightweight Model for Falling Detection 一个轻量级的坠落检测模型
Pub Date : 2021-08-19 DOI: 10.1109/RIVF51545.2021.9642122
T. Hoa, Val Randolf M. Madrid, Eliezer A. Albacea
In human activities life, accidental falls are a frequent occurrence. It can happen in children, the elderly, and even adults. Early detection of human falls is the most effective way to avoid the high risk of loss of self-control, death, or injury in humans. This means also reducing the national health system’s cost. Therefore research and development of fall detection and rescue systems are needed. Currently, the fall detection system is mainly based on wearable sensors, ambient, and vision sensors. Each method has certain advantages and limitations. The previous works usually focused on size while the speed was not often considered. Therefore, studies that aim to propose a lightweight model for Fall Detection with less complexity of memory and processing time but having reasonable accuracy are still potential. A 3-dimensional lightweight model has been proposed based on MobileNet architecture for falling detection in this paper.
在人类的活动生活中,意外跌倒是经常发生的事情。它可能发生在儿童、老年人甚至成年人身上。早期发现人类跌倒是避免人类失去自我控制、死亡或受伤的高风险的最有效方法。这也意味着降低国家卫生系统的成本。因此,有必要研究和开发坠落检测和救援系统。目前,跌倒检测系统主要基于可穿戴传感器、环境传感器和视觉传感器。每种方法都有一定的优点和局限性。以前的作品通常关注尺寸,而不经常考虑速度。因此,旨在提出一种记忆复杂性和处理时间较低但具有合理准确性的轻量级跌倒检测模型的研究仍有潜力。本文提出了一种基于MobileNet体系结构的三维轻量化跌落检测模型。
{"title":"A Lightweight Model for Falling Detection","authors":"T. Hoa, Val Randolf M. Madrid, Eliezer A. Albacea","doi":"10.1109/RIVF51545.2021.9642122","DOIUrl":"https://doi.org/10.1109/RIVF51545.2021.9642122","url":null,"abstract":"In human activities life, accidental falls are a frequent occurrence. It can happen in children, the elderly, and even adults. Early detection of human falls is the most effective way to avoid the high risk of loss of self-control, death, or injury in humans. This means also reducing the national health system’s cost. Therefore research and development of fall detection and rescue systems are needed. Currently, the fall detection system is mainly based on wearable sensors, ambient, and vision sensors. Each method has certain advantages and limitations. The previous works usually focused on size while the speed was not often considered. Therefore, studies that aim to propose a lightweight model for Fall Detection with less complexity of memory and processing time but having reasonable accuracy are still potential. A 3-dimensional lightweight model has been proposed based on MobileNet architecture for falling detection in this paper.","PeriodicalId":6860,"journal":{"name":"2021 RIVF International Conference on Computing and Communication Technologies (RIVF)","volume":"32 1","pages":"1-5"},"PeriodicalIF":0.0,"publicationDate":"2021-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87144745","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Hypercuboid-Based Machine Learning Algorithm for Malware Classification 基于超长方体的恶意软件分类机器学习算法
Pub Date : 2021-08-19 DOI: 10.1109/RIVF51545.2021.9642093
Thi Thu Trang Nguyen, Dai Tho Nguyen, Duy Loi Vu
Malware attacks have been among the most serious threats to cyber security in the last decade. Antimalware software can help safeguard information systems and minimize their exposure to the malware. Most of anti-malware programs detect malware instances based on signature or pattern matching. Data mining and machine learning techniques can be used to automatically detect models and patterns behind different types of malware variants. However, traditional machine-based learning techniques such as SVM, decision trees and naive Bayes seem to be only suitable for detecting malicious code, not effective enough for complex problems such as classification. In this article, we propose a new prototype extraction method for non-traditional prototype-based machine learning classification. The prototypes are extracted using hypercuboids. Each hypercuboid covers all training data points of a malware family. Then we choose the data points nearest to the hyperplanes as the prototypes. Malware samples will be classified based on the distances to the prototypes. Experiments results show that our proposition leads to F1 score of 96.5% for classification of known malware and 97.7% for classification of unknown malware, both better than the original prototype-based classification method.
在过去十年中,恶意软件攻击一直是网络安全面临的最严重威胁之一。反恶意软件可以帮助保护信息系统,并将其暴露于恶意软件的风险降到最低。大多数反恶意软件程序检测恶意软件实例基于签名或模式匹配。数据挖掘和机器学习技术可用于自动检测不同类型恶意软件变体背后的模型和模式。然而,传统的基于机器的学习技术,如支持向量机、决策树和朴素贝叶斯似乎只适用于检测恶意代码,对分类等复杂问题不够有效。在本文中,我们提出了一种新的基于非传统原型的机器学习分类原型提取方法。原型是用超长方体提取的。每个超长方体涵盖恶意软件家族的所有训练数据点。然后选择离超平面最近的数据点作为原型。恶意软件样本将根据与原型的距离进行分类。实验结果表明,该方法对已知恶意软件的分类F1得分为96.5%,对未知恶意软件的分类F1得分为97.7%,均优于原始的基于原型的分类方法。
{"title":"A Hypercuboid-Based Machine Learning Algorithm for Malware Classification","authors":"Thi Thu Trang Nguyen, Dai Tho Nguyen, Duy Loi Vu","doi":"10.1109/RIVF51545.2021.9642093","DOIUrl":"https://doi.org/10.1109/RIVF51545.2021.9642093","url":null,"abstract":"Malware attacks have been among the most serious threats to cyber security in the last decade. Antimalware software can help safeguard information systems and minimize their exposure to the malware. Most of anti-malware programs detect malware instances based on signature or pattern matching. Data mining and machine learning techniques can be used to automatically detect models and patterns behind different types of malware variants. However, traditional machine-based learning techniques such as SVM, decision trees and naive Bayes seem to be only suitable for detecting malicious code, not effective enough for complex problems such as classification. In this article, we propose a new prototype extraction method for non-traditional prototype-based machine learning classification. The prototypes are extracted using hypercuboids. Each hypercuboid covers all training data points of a malware family. Then we choose the data points nearest to the hyperplanes as the prototypes. Malware samples will be classified based on the distances to the prototypes. Experiments results show that our proposition leads to F1 score of 96.5% for classification of known malware and 97.7% for classification of unknown malware, both better than the original prototype-based classification method.","PeriodicalId":6860,"journal":{"name":"2021 RIVF International Conference on Computing and Communication Technologies (RIVF)","volume":"8 1","pages":"1-6"},"PeriodicalIF":0.0,"publicationDate":"2021-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81440105","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Improving Speaker Verification in Noisy Environment Using DNN Classifier 利用DNN分类器改进噪声环境下的说话人验证
Pub Date : 2021-08-19 DOI: 10.1109/RIVF51545.2021.9642074
Chung Tran Quang, Quang Minh Nguyen, Pham Ngoc Phuong, Quoc Truong Do
Speaker verification in noisy environments is still a challenging task. Previous studies have proposed speaker embeddings (x-vectors, ThinResNet) with classifier models (PLDA, cosine) to classify if an audio is spoken by a specific speaker. The verification process is defined in 3 steps: training an embedding extractor, enrollment and verification. Most studies were trying to mitigate the noisy issue by augmenting noises in the embedding extractor. This method helps the extractor to tolerate more types of noise during the inference process. However, the classification model is still sensitive in noisy environments. In this paper, we (1) evaluate the effectiveness of different speaker embedding models and classifiers in various conditions, and (2) propose a neural network classifier on top of embedding vectors and train it with data augmentation. Experimental results indicate that the proposed pipeline outperforms the traditional pipeline by 5% F1 on a clean test set and 9% F1 on noisy test sets.
噪声环境下的说话人验证仍然是一项具有挑战性的任务。先前的研究已经提出了使用分类器模型(PLDA,余弦)对说话者嵌入(x向量,ThinResNet)进行分类,以确定音频是否由特定的说话者说话。验证过程定义为3个步骤:训练嵌入提取器、登记和验证。大多数研究都试图通过增加嵌入提取器中的噪声来缓解噪声问题。这种方法有助于提取器在推理过程中容忍更多类型的噪声。但是,该分类模型在噪声环境下仍然比较敏感。在本文中,我们(1)评估了不同的说话人嵌入模型和分类器在不同条件下的有效性;(2)提出了一种基于嵌入向量的神经网络分类器,并对其进行了数据增强训练。实验结果表明,所提出的管道在无噪声测试集和有噪声测试集上的性能分别比传统管道高5%和9%。
{"title":"Improving Speaker Verification in Noisy Environment Using DNN Classifier","authors":"Chung Tran Quang, Quang Minh Nguyen, Pham Ngoc Phuong, Quoc Truong Do","doi":"10.1109/RIVF51545.2021.9642074","DOIUrl":"https://doi.org/10.1109/RIVF51545.2021.9642074","url":null,"abstract":"Speaker verification in noisy environments is still a challenging task. Previous studies have proposed speaker embeddings (x-vectors, ThinResNet) with classifier models (PLDA, cosine) to classify if an audio is spoken by a specific speaker. The verification process is defined in 3 steps: training an embedding extractor, enrollment and verification. Most studies were trying to mitigate the noisy issue by augmenting noises in the embedding extractor. This method helps the extractor to tolerate more types of noise during the inference process. However, the classification model is still sensitive in noisy environments. In this paper, we (1) evaluate the effectiveness of different speaker embedding models and classifiers in various conditions, and (2) propose a neural network classifier on top of embedding vectors and train it with data augmentation. Experimental results indicate that the proposed pipeline outperforms the traditional pipeline by 5% F1 on a clean test set and 9% F1 on noisy test sets.","PeriodicalId":6860,"journal":{"name":"2021 RIVF International Conference on Computing and Communication Technologies (RIVF)","volume":"18 1","pages":"1-5"},"PeriodicalIF":0.0,"publicationDate":"2021-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87342006","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Mining Japanese-Vietnamese multi-level parallel text corpus from Wikipedia data resource 从维基百科数据资源中挖掘日越多级平行文本语料库
Pub Date : 2021-08-19 DOI: 10.1109/RIVF51545.2021.9642108
T. Do
This paper presents the task of mining a Japanese - Vietnamese parallel text corpus from comparable data resources in application of machine translation. Data resource for this language pair is few and rare so the parallel text should be extracted at multi levels, sentence level and fragment level, to get as much data as possible. Moreover, the proposed method considers word order independently so it can be applied to different language families. The result applied on Japanese- Vietnamese Wikipedia resource shows that the proposed method increases significantly the number of extracted parallel data. The extracted multi-level parallel text contributes to the quality of machine translation as well. More than 144,000 pairs of parallel sentences and 148,000 pairs of parallel fragments had been mined and opened to the research community.
本文提出了在机器翻译应用中从可比数据资源中挖掘日越平行文本语料库的任务。由于这种语言对的数据资源很少,因此应该从句子层面和片段层面对平行文本进行多层次的提取,以获得尽可能多的数据。此外,该方法独立考虑词序,因此可以适用于不同的语系。在日语-越南语维基百科资源上的应用结果表明,该方法显著提高了提取并行数据的数量。提取的多层次平行文本也有助于提高机器翻译的质量。超过14.4万对平行句子和14.8万对平行片段已被挖掘并向研究界开放。
{"title":"Mining Japanese-Vietnamese multi-level parallel text corpus from Wikipedia data resource","authors":"T. Do","doi":"10.1109/RIVF51545.2021.9642108","DOIUrl":"https://doi.org/10.1109/RIVF51545.2021.9642108","url":null,"abstract":"This paper presents the task of mining a Japanese - Vietnamese parallel text corpus from comparable data resources in application of machine translation. Data resource for this language pair is few and rare so the parallel text should be extracted at multi levels, sentence level and fragment level, to get as much data as possible. Moreover, the proposed method considers word order independently so it can be applied to different language families. The result applied on Japanese- Vietnamese Wikipedia resource shows that the proposed method increases significantly the number of extracted parallel data. The extracted multi-level parallel text contributes to the quality of machine translation as well. More than 144,000 pairs of parallel sentences and 148,000 pairs of parallel fragments had been mined and opened to the research community.","PeriodicalId":6860,"journal":{"name":"2021 RIVF International Conference on Computing and Communication Technologies (RIVF)","volume":"47 1","pages":"1-6"},"PeriodicalIF":0.0,"publicationDate":"2021-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"90836282","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
[Copyright notice] (版权)
Pub Date : 2021-08-19 DOI: 10.1109/rivf51545.2021.9642127
{"title":"[Copyright notice]","authors":"","doi":"10.1109/rivf51545.2021.9642127","DOIUrl":"https://doi.org/10.1109/rivf51545.2021.9642127","url":null,"abstract":"","PeriodicalId":6860,"journal":{"name":"2021 RIVF International Conference on Computing and Communication Technologies (RIVF)","volume":"15 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74586101","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2021 RIVF International Conference on Computing and Communication Technologies (RIVF)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1