{"title":"TextConvoNet:一种基于卷积神经网络的文本分类架构。","authors":"Sanskar Soni, Satyendra Singh Chouhan, Santosh Singh Rathore","doi":"10.1007/s10489-022-04221-9","DOIUrl":null,"url":null,"abstract":"<p><p>This paper presents, <i>TextConvoNet</i>, a novel Convolutional Neural Network (CNN) based architecture for binary and multi-class text classification problems. Most of the existing CNN-based models use one-dimensional convolving filters, where each filter specializes in extracting <i>n-grams</i> features of a particular input word embeddings (Sentence Matrix). These features can be termed as intra-sentence <i>n-gram</i> features. To the best of our knowledge, all the existing CNN models for text classification are based on the aforementioned concept. The presented <i>TextConvoNet</i> not only extracts the intra-sentence <i>n-gram</i> features but also captures the inter-sentence <i>n-gram</i> features in input text data. It uses an alternative approach for input matrix representation and applies a two-dimensional multi-scale convolutional operation on the input. We perform an experimental study on five binary and multi-class classification datasets and evaluate the performance of the <i>TextConvoNet</i> for text classification. The results are evaluated using eight performance measures, accuracy, precision, recall, f1-score, specificity, gmean1, gmean2, and Mathews correlation coefficient (MCC). Furthermore, we extensively compared presented <i>TextConvoNet</i> with machine learning, deep learning, and attention-based models. The experimental results evidenced that the presented <i>TextConvoNet</i> outperformed and yielded better performance than the other used models for text classification purposes.</p>","PeriodicalId":72260,"journal":{"name":"Applied intelligence (Dordrecht, Netherlands)","volume":"53 11","pages":"14249-14268"},"PeriodicalIF":0.0000,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9589611/pdf/","citationCount":"15","resultStr":"{\"title\":\"<i>TextConvoNet</i>: a convolutional neural network based architecture for text classification.\",\"authors\":\"Sanskar Soni, Satyendra Singh Chouhan, Santosh Singh Rathore\",\"doi\":\"10.1007/s10489-022-04221-9\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>This paper presents, <i>TextConvoNet</i>, a novel Convolutional Neural Network (CNN) based architecture for binary and multi-class text classification problems. Most of the existing CNN-based models use one-dimensional convolving filters, where each filter specializes in extracting <i>n-grams</i> features of a particular input word embeddings (Sentence Matrix). These features can be termed as intra-sentence <i>n-gram</i> features. To the best of our knowledge, all the existing CNN models for text classification are based on the aforementioned concept. The presented <i>TextConvoNet</i> not only extracts the intra-sentence <i>n-gram</i> features but also captures the inter-sentence <i>n-gram</i> features in input text data. It uses an alternative approach for input matrix representation and applies a two-dimensional multi-scale convolutional operation on the input. We perform an experimental study on five binary and multi-class classification datasets and evaluate the performance of the <i>TextConvoNet</i> for text classification. The results are evaluated using eight performance measures, accuracy, precision, recall, f1-score, specificity, gmean1, gmean2, and Mathews correlation coefficient (MCC). Furthermore, we extensively compared presented <i>TextConvoNet</i> with machine learning, deep learning, and attention-based models. The experimental results evidenced that the presented <i>TextConvoNet</i> outperformed and yielded better performance than the other used models for text classification purposes.</p>\",\"PeriodicalId\":72260,\"journal\":{\"name\":\"Applied intelligence (Dordrecht, Netherlands)\",\"volume\":\"53 11\",\"pages\":\"14249-14268\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9589611/pdf/\",\"citationCount\":\"15\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Applied intelligence (Dordrecht, Netherlands)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1007/s10489-022-04221-9\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2022/10/22 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied intelligence (Dordrecht, Netherlands)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/s10489-022-04221-9","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2022/10/22 0:00:00","PubModel":"Epub","JCR":"","JCRName":"","Score":null,"Total":0}
TextConvoNet: a convolutional neural network based architecture for text classification.
This paper presents, TextConvoNet, a novel Convolutional Neural Network (CNN) based architecture for binary and multi-class text classification problems. Most of the existing CNN-based models use one-dimensional convolving filters, where each filter specializes in extracting n-grams features of a particular input word embeddings (Sentence Matrix). These features can be termed as intra-sentence n-gram features. To the best of our knowledge, all the existing CNN models for text classification are based on the aforementioned concept. The presented TextConvoNet not only extracts the intra-sentence n-gram features but also captures the inter-sentence n-gram features in input text data. It uses an alternative approach for input matrix representation and applies a two-dimensional multi-scale convolutional operation on the input. We perform an experimental study on five binary and multi-class classification datasets and evaluate the performance of the TextConvoNet for text classification. The results are evaluated using eight performance measures, accuracy, precision, recall, f1-score, specificity, gmean1, gmean2, and Mathews correlation coefficient (MCC). Furthermore, we extensively compared presented TextConvoNet with machine learning, deep learning, and attention-based models. The experimental results evidenced that the presented TextConvoNet outperformed and yielded better performance than the other used models for text classification purposes.