首页 > 最新文献

Informatica最新文献

英文 中文
LP SVM with A Novel Similarity function Outperforms Powerful LP-QP-Kernel-SVM Considering Efficient Classification 基于新颖相似函数的LP支持向量机在分类效率方面优于强大的LP- qp -核支持向量机
IF 2.9 4区 计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-09-04 DOI: 10.31449/inf.v47i8.4767
Rezaul Karim, Mahmudul Hasan, Amit Kumar Kundu, Ali Ahmed Ave
While the core quality of SVM comes from its ability to get the global optima, its classification performance also depends on computing kernels. However, while this kernel-complexity generates the power of machine, it is also responsible for the computational load to execute this kernel. Moreover, insisting on a similarity function to be a positive definite kernel demands some properties to be satisfied that seem unproductive sometimes raising a question about which similarity measures to be used for classifier. We model Vapnik’s LPSVM proposing a new similarity function replacing kernel function. Following the strategy of ”Accuracy first, speed second”, we have modelled a similarity function that is mathematically well-defined depending on analysis as well as geometry and complex enough to train the machine for generating solid generalization ability. Being consistent with the theory of learning by Balcan and Blum [1], our similarity function does not need to be a valid kernel function and demands less computational cost for executing compared to its counterpart like RBF or other kernels while provides sufficient power to the classifier using its optimal complexity. Benchmarking shows that our similarity function based LPSVM poses test error 0.86 times of the most powerful RBF based QP SVM but demands only 0.40 times of its computational cost.
虽然支持向量机的核心质量来自于其获得全局最优的能力,但其分类性能也依赖于计算核。然而,虽然这种内核复杂性产生了机器的力量,但它也负责执行该内核的计算负载。此外,坚持相似函数是正定核要求满足一些似乎无效的属性,有时会提出关于使用哪种相似度量用于分类器的问题。我们对Vapnik的LPSVM进行建模,提出一个新的相似函数来代替核函数。遵循“精度第一,速度第二”的策略,我们建立了一个相似函数,该函数根据分析和几何在数学上定义良好,并且足够复杂,可以训练机器产生可靠的泛化能力。与Balcan和Blum[1]的学习理论一致,我们的相似函数不需要是一个有效的核函数,与RBF或其他核函数相比,执行相似函数所需的计算成本更低,同时利用其最优复杂度为分类器提供足够的能力。基准测试表明,基于相似函数的LPSVM的测试误差是最强大的基于RBF的QP支持向量机的0.86倍,而计算成本仅为其0.40倍。
{"title":"LP SVM with A Novel Similarity function Outperforms Powerful LP-QP-Kernel-SVM Considering Efficient Classification","authors":"Rezaul Karim, Mahmudul Hasan, Amit Kumar Kundu, Ali Ahmed Ave","doi":"10.31449/inf.v47i8.4767","DOIUrl":"https://doi.org/10.31449/inf.v47i8.4767","url":null,"abstract":"While the core quality of SVM comes from its ability to get the global optima, its classification performance also depends on computing kernels. However, while this kernel-complexity generates the power of machine, it is also responsible for the computational load to execute this kernel. Moreover, insisting on a similarity function to be a positive definite kernel demands some properties to be satisfied that seem unproductive sometimes raising a question about which similarity measures to be used for classifier. We model Vapnik’s LPSVM proposing a new similarity function replacing kernel function. Following the strategy of ”Accuracy first, speed second”, we have modelled a similarity function that is mathematically well-defined depending on analysis as well as geometry and complex enough to train the machine for generating solid generalization ability. Being consistent with the theory of learning by Balcan and Blum [1], our similarity function does not need to be a valid kernel function and demands less computational cost for executing compared to its counterpart like RBF or other kernels while provides sufficient power to the classifier using its optimal complexity. Benchmarking shows that our similarity function based LPSVM poses test error 0.86 times of the most powerful RBF based QP SVM but demands only 0.40 times of its computational cost.","PeriodicalId":56292,"journal":{"name":"Informatica","volume":"47 1","pages":""},"PeriodicalIF":2.9,"publicationDate":"2023-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"69808614","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Low-Resource Neural Machine Translation Improvement Using Data Augmentation Strategies 基于数据增强策略的低资源神经机器翻译改进
4区 计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-08-29 DOI: 10.31449/inf.v47i3.4761
Thai Nguyen Quoc, Huong Le Thanh, Hanh Pham Van
The development of neural models has greatly improved the performance of machine translation, but these methods require large-scale parallel data, which can be difficult to obtain for low-resource language pairs. To address this issue, this research employs a pre-trained multilingual model and fine-tunes it by using a small bilingual dataset. Additionally, two data-augmentation strategies are proposed to generate new training data: (i) back-translation with the dataset from the source language; (ii) data augmentation via the English pivot language. The proposed approach is applied to the Khmer-Vietnamese machine translation. Experimental results show that our proposed approach outperforms the Google Translator model by 5.3% in terms of BLEU score on a test set of 2,000 Khmer-Vietnamese sentence pairs.
神经模型的发展极大地提高了机器翻译的性能,但这些方法需要大规模的并行数据,而对于低资源的语言对,这些数据很难获得。为了解决这个问题,本研究采用了一个预训练的多语言模型,并通过使用一个小型双语数据集对其进行微调。此外,提出了两种数据增强策略来生成新的训练数据:(i)从源语言反翻译数据集;(ii)通过英语支点语言进行数据增强。将该方法应用于高棉-越南语的机器翻译。实验结果表明,在2000个高棉-越南语句子对的测试集上,我们提出的方法在BLEU得分方面比Google Translator模型高出5.3%。
{"title":"Low-Resource Neural Machine Translation Improvement Using Data Augmentation Strategies","authors":"Thai Nguyen Quoc, Huong Le Thanh, Hanh Pham Van","doi":"10.31449/inf.v47i3.4761","DOIUrl":"https://doi.org/10.31449/inf.v47i3.4761","url":null,"abstract":"The development of neural models has greatly improved the performance of machine translation, but these methods require large-scale parallel data, which can be difficult to obtain for low-resource language pairs. To address this issue, this research employs a pre-trained multilingual model and fine-tunes it by using a small bilingual dataset. Additionally, two data-augmentation strategies are proposed to generate new training data: (i) back-translation with the dataset from the source language; (ii) data augmentation via the English pivot language. The proposed approach is applied to the Khmer-Vietnamese machine translation. Experimental results show that our proposed approach outperforms the Google Translator model by 5.3% in terms of BLEU score on a test set of 2,000 Khmer-Vietnamese sentence pairs.","PeriodicalId":56292,"journal":{"name":"Informatica","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136349756","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Motion Embedded Images: An Approach to Capture Spatial and Temporal Features for Action Recognition 运动嵌入图像:一种捕捉动作识别的空间和时间特征的方法
4区 计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-08-29 DOI: 10.31449/inf.v47i3.4755
Tri Le, Nham Huynh-Duc, Chung Thai Nguyen, Minh-Triet Tran
The demand for human activity recognition (HAR) from videos has witnessed a significant surge in various real-life applications, including video surveillance, healthcare, elderly care, among others. The explotion of short-form videos on social media platforms has further intensified the interest in this domain. This research endeavors to focus on the problem of HAR in general short videos. In contrast to still images, video clips offer both spatial and temporal information, rendering it challenging to extract complementary information on appearance from still frames and motion between frames. This research makes a two-fold contribution. Firstly, we investigate the use of motion-embedded images in a variant of two-stream Convolutional Neural Network architecture, in which one stream captures motion using combined batches of frames, while another stream employs a normal image classification ConvNet to classify static appearance. Secondly, we create a novel dataset of Southeast Asian Sports short videos that encompasses both videos with and without effects, which is a modern factor that is lacking in all currently available datasets used for benchmarking models. The proposed model is trained and evaluated on two benchmarks: UCF-101 and SEAGS-V1. The results reveal that the proposed model yields competitive performance compared to prior attempts to address the same problem.
对视频中人类活动识别(HAR)的需求在各种现实生活应用中激增,包括视频监控、医疗保健、老年人护理等。社交媒体平台对短视频的开发进一步加剧了人们对这一领域的兴趣。本研究致力于研究一般短视频中的HAR问题。与静止图像相比,视频片段提供了空间和时间信息,这使得从静止帧和帧之间的运动中提取外观的互补信息具有挑战性。这项研究有双重贡献。首先,我们研究了在两流卷积神经网络架构的变体中使用运动嵌入图像,其中一个流使用组合批次的帧捕获运动,而另一个流使用常规图像分类卷积神经网络对静态外观进行分类。其次,我们创建了一个新的东南亚体育短视频数据集,其中包括带效果和不带效果的视频,这是所有当前可用的用于基准模型的数据集所缺乏的现代因素。提出的模型在两个基准上进行了训练和评估:UCF-101和segs - v1。结果表明,与先前解决相同问题的尝试相比,所提出的模型产生了具有竞争力的性能。
{"title":"Motion Embedded Images: An Approach to Capture Spatial and Temporal Features for Action Recognition","authors":"Tri Le, Nham Huynh-Duc, Chung Thai Nguyen, Minh-Triet Tran","doi":"10.31449/inf.v47i3.4755","DOIUrl":"https://doi.org/10.31449/inf.v47i3.4755","url":null,"abstract":"The demand for human activity recognition (HAR) from videos has witnessed a significant surge in various real-life applications, including video surveillance, healthcare, elderly care, among others. The explotion of short-form videos on social media platforms has further intensified the interest in this domain. This research endeavors to focus on the problem of HAR in general short videos. In contrast to still images, video clips offer both spatial and temporal information, rendering it challenging to extract complementary information on appearance from still frames and motion between frames. This research makes a two-fold contribution. Firstly, we investigate the use of motion-embedded images in a variant of two-stream Convolutional Neural Network architecture, in which one stream captures motion using combined batches of frames, while another stream employs a normal image classification ConvNet to classify static appearance. Secondly, we create a novel dataset of Southeast Asian Sports short videos that encompasses both videos with and without effects, which is a modern factor that is lacking in all currently available datasets used for benchmarking models. The proposed model is trained and evaluated on two benchmarks: UCF-101 and SEAGS-V1. The results reveal that the proposed model yields competitive performance compared to prior attempts to address the same problem.","PeriodicalId":56292,"journal":{"name":"Informatica","volume":"45 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136349752","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Hybrid Deep Learning Approach to Keyword Spotting in Vietnamese Stele Images 越南石碑图像关键字识别的混合深度学习方法
4区 计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-08-29 DOI: 10.31449/inf.v47i3.4785
Anna Scius-Bertrand, Marc Bui, Andreas Fischer
In order to access the rich cultural heritage conveyed in Vietnamese steles, automatic reading of stone engravings would be a great support for historians, who are analyzing tens of thousands of stele images. Approaching the challenging problem with deep learning alone is difficult because the data-driven models require large representative datasets with expert human annotations, which are not available for the steles and costly to obtain. In this article, we present a hybrid approach to spot keywords in stele images that combines data-driven deep learning with knowledge-based structural modeling and matching of Chu Nom characters. The main advantage of the proposed method is that it is annotation-free, i.e. no human data annotation is required. In an experimental evaluation, we demonstrate that keywords can be successfully spotted with a mean average precision of more than 70% when a single engraving style is considered.
为了了解越南石碑所传达的丰富文化遗产,自动读取石刻将成为分析数万个石碑图像的历史学家的巨大支持。仅用深度学习来解决具有挑战性的问题是困难的,因为数据驱动的模型需要具有专家注释的大型代表性数据集,而这些数据集对于石碑来说是不可用的,并且获取成本很高。在本文中,我们提出了一种结合数据驱动的深度学习和基于知识的结构建模和Chu Nom字符匹配的混合方法来识别石碑图像中的关键词。该方法的主要优点是无需注释,即不需要人工数据注释。在实验评估中,我们证明了当考虑单一雕刻风格时,关键词可以成功地识别,平均精度超过70%。
{"title":"A Hybrid Deep Learning Approach to Keyword Spotting in Vietnamese Stele Images","authors":"Anna Scius-Bertrand, Marc Bui, Andreas Fischer","doi":"10.31449/inf.v47i3.4785","DOIUrl":"https://doi.org/10.31449/inf.v47i3.4785","url":null,"abstract":"In order to access the rich cultural heritage conveyed in Vietnamese steles, automatic reading of stone engravings would be a great support for historians, who are analyzing tens of thousands of stele images. Approaching the challenging problem with deep learning alone is difficult because the data-driven models require large representative datasets with expert human annotations, which are not available for the steles and costly to obtain. In this article, we present a hybrid approach to spot keywords in stele images that combines data-driven deep learning with knowledge-based structural modeling and matching of Chu Nom characters. The main advantage of the proposed method is that it is annotation-free, i.e. no human data annotation is required. In an experimental evaluation, we demonstrate that keywords can be successfully spotted with a mean average precision of more than 70% when a single engraving style is considered.","PeriodicalId":56292,"journal":{"name":"Informatica","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136349755","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Lightweight Multi-Objective and Many-Objective Problem Formulations for Evolutionary Neural Architecture Search with the Training-Free Performance Metric Synaptic Flow 基于无训练性能度量突触流的进化神经结构搜索的轻量级多目标和多目标问题公式
4区 计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-08-29 DOI: 10.31449/inf.v47i3.4736
An Vo, Tan Ngoc Pham, Van Bich Nguyen, Ngoc Hoang Luong
Neural architecture search (NAS) with naive problem formulations and applications of conventional search algorithms often incur prohibitive search costs due to the evaluations of many candidate architectures. For each architecture, its accuracy performance can be properly evaluated after hundreds (or thousands) of computationally expensive training epochs are performed to achieve proper network weights. A so-called zero-cost metric, Synaptic Flow, computed based on random network weight values at initialization, is found to exhibit certain correlations with the neural network test accuracy and can thus be used as an efficient proxy performance metric during the search. Besides, NAS in practice often involves not only optimizing for network accuracy performance but also optimizing for network complexity, such as model size, number of floating point operations, or latency, as well. In this article, we study various NAS problem formulations in which multiple aspects of deep neural networks are treated as multiple optimization objectives. We employ a widely-used multi-objective evolutionary algorithm, i.e., the non-dominated sorting genetic algorithm II (NSGA-II), to approximate the optimal Pareto-optimal fronts for these NAS problem formulations. Experimental results on the NAS benchmark NATS-Bench show the advantages and disadvantages of each formulation.
具有朴素问题公式的神经结构搜索(NAS)和传统搜索算法的应用通常会由于对许多候选体系结构的评估而导致过高的搜索成本。对于每种体系结构,在执行数百(或数千)个计算代价高昂的训练epoch以获得适当的网络权重后,可以正确评估其准确性性能。所谓的零成本指标Synaptic Flow是在初始化时基于随机网络权重值计算的,发现它与神经网络测试精度表现出一定的相关性,因此可以在搜索过程中用作有效的代理性能指标。此外,NAS在实践中通常不仅涉及对网络精度性能的优化,还涉及对网络复杂性的优化,例如模型大小、浮点操作的数量或延迟。在本文中,我们研究了各种NAS问题的表述,其中深度神经网络的多个方面被视为多个优化目标。我们采用了一种广泛使用的多目标进化算法,即非支配排序遗传算法II (NSGA-II),来近似这些NAS问题公式的最优帕累托最优前沿。在NAS基准NATS-Bench上的实验结果显示了每种配方的优缺点。
{"title":"Lightweight Multi-Objective and Many-Objective Problem Formulations for Evolutionary Neural Architecture Search with the Training-Free Performance Metric Synaptic Flow","authors":"An Vo, Tan Ngoc Pham, Van Bich Nguyen, Ngoc Hoang Luong","doi":"10.31449/inf.v47i3.4736","DOIUrl":"https://doi.org/10.31449/inf.v47i3.4736","url":null,"abstract":"Neural architecture search (NAS) with naive problem formulations and applications of conventional search algorithms often incur prohibitive search costs due to the evaluations of many candidate architectures. For each architecture, its accuracy performance can be properly evaluated after hundreds (or thousands) of computationally expensive training epochs are performed to achieve proper network weights. A so-called zero-cost metric, Synaptic Flow, computed based on random network weight values at initialization, is found to exhibit certain correlations with the neural network test accuracy and can thus be used as an efficient proxy performance metric during the search. Besides, NAS in practice often involves not only optimizing for network accuracy performance but also optimizing for network complexity, such as model size, number of floating point operations, or latency, as well. In this article, we study various NAS problem formulations in which multiple aspects of deep neural networks are treated as multiple optimization objectives. We employ a widely-used multi-objective evolutionary algorithm, i.e., the non-dominated sorting genetic algorithm II (NSGA-II), to approximate the optimal Pareto-optimal fronts for these NAS problem formulations. Experimental results on the NAS benchmark NATS-Bench show the advantages and disadvantages of each formulation.","PeriodicalId":56292,"journal":{"name":"Informatica","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136349976","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Complaints with Target Scope Identification on Social Media 社交媒体上有目标范围识别的投诉
4区 计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-08-29 DOI: 10.31449/inf.v47i3.4758
Kazuhiro Ito, Taichi Murayama, Shuntaro Yada, Shoko Wakamiya, Eiji Aramaki
A complaint is uttered when reality fails to meet one's expectations. Research on complaints, which contributes to our understanding of basic human behavior, has been conducted in the fields of psychology, linguistics, and marketing. Although several approaches have been implemented to the study of complaints, studies have yet focused on a target scope of complaints. Examination of a target scope of complaints is crusial because the functions of complaints, such as evocation of emotion, use of grammar, and intention, are different depending on the target scope. We first tackle the construction and release of a complaint dataset of 6,418 tweets by annotating Japanese texts collected from Twitter with labels of the target scope. Our dataset is available at url{https://github.com/sociocom/JaGUCHI}. We then benchmark the annotated dataset with several machine learning baselines and obtain the best performance of 90.4 F1-score in detecting whether a text was a complaint or not, and a micro-F1 score of 72.2 in identifying the target scope label. Finally, we conducted case studies using our model to demonstrate that identifying a target scope of complaints is useful for sociological analysis.
当现实不能满足一个人的期望时,就会抱怨。心理学、语言学和市场营销领域对抱怨的研究有助于我们对人类基本行为的理解。虽然对投诉的研究采取了几种方法,但研究的重点仍然是投诉的目标范围。检查投诉的目标范围是至关重要的,因为投诉的功能,如情感的唤起、语法的使用和意图,随着目标范围的不同而不同。我们首先通过用目标范围的标签注释从Twitter收集的日语文本,解决了包含6,418条tweet的投诉数据集的构建和发布问题。我们的数据集可以在url{https://github.com/sociocom/JaGUCHI}上找到。然后,我们用几个机器学习基线对带注释的数据集进行基准测试,在检测文本是否为投诉方面获得了90.4 f1分的最佳性能,在识别目标范围标签方面获得了72.2的微f1分。最后,我们使用我们的模型进行了案例研究,以证明确定投诉的目标范围对社会学分析是有用的。
{"title":"Complaints with Target Scope Identification on Social Media","authors":"Kazuhiro Ito, Taichi Murayama, Shuntaro Yada, Shoko Wakamiya, Eiji Aramaki","doi":"10.31449/inf.v47i3.4758","DOIUrl":"https://doi.org/10.31449/inf.v47i3.4758","url":null,"abstract":"A complaint is uttered when reality fails to meet one's expectations. Research on complaints, which contributes to our understanding of basic human behavior, has been conducted in the fields of psychology, linguistics, and marketing. Although several approaches have been implemented to the study of complaints, studies have yet focused on a target scope of complaints. Examination of a target scope of complaints is crusial because the functions of complaints, such as evocation of emotion, use of grammar, and intention, are different depending on the target scope. We first tackle the construction and release of a complaint dataset of 6,418 tweets by annotating Japanese texts collected from Twitter with labels of the target scope. Our dataset is available at url{https://github.com/sociocom/JaGUCHI}. We then benchmark the annotated dataset with several machine learning baselines and obtain the best performance of 90.4 F1-score in detecting whether a text was a complaint or not, and a micro-F1 score of 72.2 in identifying the target scope label. Finally, we conducted case studies using our model to demonstrate that identifying a target scope of complaints is useful for sociological analysis.","PeriodicalId":56292,"journal":{"name":"Informatica","volume":"66 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136349758","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Automatic Labeling Method for Subword-Phrase Recognition in Effective Text Classification 有效文本分类中子词-短语识别的自动标注方法
4区 计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-08-29 DOI: 10.31449/inf.v47i3.4742
Yusuke Kimura, Takahiro Komamizu, Kenji Hatano
Text classification methods using deep learning, which is trained with a tremendous amount of text, have achieved superior performance than traditional methods. In addition to its success, multi-task learning (MTL for short) has become a promising approach for text classification; for instance, a multi-task learning approach employs the named entity recognition as an auxiliary task for text classification, and it showcases that the auxiliary task helps make the text classification model higher classification performance. The existing MTL-based text classification methods depend on auxiliary tasks using supervised labels. Obtaining such supervision signals requires additional human and financial costs in addition to those for the main text classification task. To reduce these costs, this paper proposes a multi-task learning-based text classification framework reducing the additional costs on supervised label creation by automatically labeling phrases in texts for the auxiliary recognition task. A basic idea to realize the proposed framework is to utilize phrasal expressions consisting of subwords (called subword-phrase) and to deal with the recent situation in which the pre-trained neural language models such as BERT are designed upon subword-based tokenization to avoid out-of-vocabulary words being missed. To the best of our knowledge, there has been no text classification approach on top of subword-phrases, because subwords only sometimes express a coherent set of meanings. The proposed framework is novel in adding subword-phrase recognition as an auxiliary task and utilizing subword-phrases for text classification. It extracts subword-phrases in an unsupervised manner, particularly the statistics approach. In order to construct labels for effective subword-phrase recognition tasks, extracted subword-phrases are classified for document classes so that subword-phrases dedicated to some classes can be distinguishable. The experimental evaluation of the five popular datasets for text classification showcases the effectiveness of the involvement of the subword-phrase recognition as an auxiliary task. It also shows comparative results with the state-of-the-art method, and the comparison of various labeling schemes indicates insights for labeling common subword-phrases among several document classes.
基于深度学习的文本分类方法在大量文本的训练下取得了优于传统方法的性能。除了它的成功之外,多任务学习(简称MTL)已经成为一种很有前途的文本分类方法;例如,一种多任务学习方法将命名实体识别作为文本分类的辅助任务,并展示了辅助任务有助于文本分类模型获得更高的分类性能。现有的基于mtl的文本分类方法依赖于使用监督标签的辅助任务。获取这样的监督信号,除了主要的文本分类任务之外,还需要额外的人力和财力成本。为了降低这些成本,本文提出了一种基于多任务学习的文本分类框架,通过为辅助识别任务自动标记文本中的短语来减少监督标签创建的额外成本。实现该框架的一个基本思路是利用由子词组成的短语表达式(称为子词-短语),并处理基于子词的标记化设计的预训练神经语言模型(如BERT)以避免遗漏词汇外词的情况。据我们所知,目前还没有基于子词-短语的文本分类方法,因为子词有时只表达一组连贯的含义。该框架将子词-短语识别作为辅助任务,并利用子词-短语进行文本分类。它以一种无监督的方式提取子词短语,特别是统计方法。为了构造有效的子词-短语识别任务的标签,对提取的子词-短语进行文档类分类,使专用于某些类的子词-短语能够被区分。通过对五种常用文本分类数据集的实验评估,验证了子词-短语识别作为辅助任务的有效性。它还显示了与最先进的方法的比较结果,并且各种标记方案的比较表明了在几个文档类中标记常见子词短语的见解。
{"title":"An Automatic Labeling Method for Subword-Phrase Recognition in Effective Text Classification","authors":"Yusuke Kimura, Takahiro Komamizu, Kenji Hatano","doi":"10.31449/inf.v47i3.4742","DOIUrl":"https://doi.org/10.31449/inf.v47i3.4742","url":null,"abstract":"Text classification methods using deep learning, which is trained with a tremendous amount of text, have achieved superior performance than traditional methods. In addition to its success, multi-task learning (MTL for short) has become a promising approach for text classification; for instance, a multi-task learning approach employs the named entity recognition as an auxiliary task for text classification, and it showcases that the auxiliary task helps make the text classification model higher classification performance. The existing MTL-based text classification methods depend on auxiliary tasks using supervised labels. Obtaining such supervision signals requires additional human and financial costs in addition to those for the main text classification task. To reduce these costs, this paper proposes a multi-task learning-based text classification framework reducing the additional costs on supervised label creation by automatically labeling phrases in texts for the auxiliary recognition task. A basic idea to realize the proposed framework is to utilize phrasal expressions consisting of subwords (called subword-phrase) and to deal with the recent situation in which the pre-trained neural language models such as BERT are designed upon subword-based tokenization to avoid out-of-vocabulary words being missed. To the best of our knowledge, there has been no text classification approach on top of subword-phrases, because subwords only sometimes express a coherent set of meanings. The proposed framework is novel in adding subword-phrase recognition as an auxiliary task and utilizing subword-phrases for text classification. It extracts subword-phrases in an unsupervised manner, particularly the statistics approach. In order to construct labels for effective subword-phrase recognition tasks, extracted subword-phrases are classified for document classes so that subword-phrases dedicated to some classes can be distinguishable. The experimental evaluation of the five popular datasets for text classification showcases the effectiveness of the involvement of the subword-phrase recognition as an auxiliary task. It also shows comparative results with the state-of-the-art method, and the comparison of various labeling schemes indicates insights for labeling common subword-phrases among several document classes.","PeriodicalId":56292,"journal":{"name":"Informatica","volume":"57 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136349753","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Introduction to special issue "SOICT 2022" 《SOICT 2022》特刊简介
4区 计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-08-29 DOI: 10.31449/inf.v47i3.5142
Huynh Thi Thanh Binh, Ichiro Ide
{"title":"Introduction to special issue \"SOICT 2022\"","authors":"Huynh Thi Thanh Binh, Ichiro Ide","doi":"10.31449/inf.v47i3.5142","DOIUrl":"https://doi.org/10.31449/inf.v47i3.5142","url":null,"abstract":"","PeriodicalId":56292,"journal":{"name":"Informatica","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136349977","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A New Multimedia Web-Data Mining Approach based on Equivalence Class Evaluation Pipelined to Feature Maps onto Planar Projection 一种基于等价类评价的多媒体web数据挖掘新方法
IF 2.9 4区 计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-08-04 DOI: 10.31449/inf.v47i7.4583
Ravindar Mogili, M. Naidu, G. Narsimha
{"title":"A New Multimedia Web-Data Mining Approach based on Equivalence Class Evaluation Pipelined to Feature Maps onto Planar Projection","authors":"Ravindar Mogili, M. Naidu, G. Narsimha","doi":"10.31449/inf.v47i7.4583","DOIUrl":"https://doi.org/10.31449/inf.v47i7.4583","url":null,"abstract":"","PeriodicalId":56292,"journal":{"name":"Informatica","volume":" ","pages":""},"PeriodicalIF":2.9,"publicationDate":"2023-08-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48577968","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Robust End-to-End CNN Architecture for Efficient COVID-19 Prediction form X-ray Images with Imbalanced Data 基于数据不平衡的x射线图像高效预测COVID-19的鲁棒端到端CNN架构
IF 2.9 4区 计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-08-04 DOI: 10.31449/inf.v47i7.4790
Zakariya A. Oraibi, Safaa Albasri
{"title":"A Robust End-to-End CNN Architecture for Efficient COVID-19 Prediction form X-ray Images with Imbalanced Data","authors":"Zakariya A. Oraibi, Safaa Albasri","doi":"10.31449/inf.v47i7.4790","DOIUrl":"https://doi.org/10.31449/inf.v47i7.4790","url":null,"abstract":"","PeriodicalId":56292,"journal":{"name":"Informatica","volume":" ","pages":""},"PeriodicalIF":2.9,"publicationDate":"2023-08-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43474331","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Informatica
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1