首页 > 最新文献

Data Technologies and Applications最新文献

英文 中文
Identifying business information through deep learning: analyzing the tender documents of an Internet-based logistics bidding platform 通过深度学习识别商业信息——分析基于互联网的物流投标平台的投标文件
IF 1.6 4区 计算机科学 Q1 Social Sciences Pub Date : 2023-05-04 DOI: 10.1108/dta-08-2022-0308
Yingwen Yu, Jing Ma
PurposeThe tender documents, an essential data source for internet-based logistics tendering platforms, incorporate massive fine-grained data, ranging from information on tenderee, shipping location and shipping items. Automated information extraction in this area is, however, under-researched, making the extraction process a time- and effort-consuming one. For Chinese logistics tender entities, in particular, existing named entity recognition (NER) solutions are mostly unsuitable as they involve domain-specific terminologies and possess different semantic features.Design/methodology/approachTo tackle this problem, a novel lattice long short-term memory (LSTM) model, combining a variant contextual feature representation and a conditional random field (CRF) layer, is proposed in this paper for identifying valuable entities from logistic tender documents. Instead of traditional word embedding, the proposed model uses the pretrained Bidirectional Encoder Representations from Transformers (BERT) model as input to augment the contextual feature representation. Subsequently, with the Lattice-LSTM model, the information of characters and words is effectively utilized to avoid error segmentation.FindingsThe proposed model is then verified by the Chinese logistic tender named entity corpus. Moreover, the results suggest that the proposed model excels in the logistics tender corpus over other mainstream NER models. The proposed model underpins the automatic extraction of logistics tender information, enabling logistic companies to perceive the ever-changing market trends and make far-sighted logistic decisions.Originality/value(1) A practical model for logistic tender NER is proposed in the manuscript. By employing and fine-tuning BERT into the downstream task with a small amount of data, the experiment results show that the model has a better performance than other existing models. This is the first study, to the best of the authors' knowledge, to extract named entities from Chinese logistic tender documents. (2) A real logistic tender corpus for practical use is constructed and a program of the model for online-processing real logistic tender documents is developed in this work. The authors believe that the model will facilitate logistic companies in converting unstructured documents to structured data and further perceive the ever-changing market trends to make far-sighted logistic decisions.
目的招标文件是互联网物流招标平台的重要数据来源,包含了大量的细粒度数据,包括招标人信息、运输地点信息、运输物品信息等。然而,这一领域的自动化信息提取研究并不充分,使得提取过程既耗时又费力。特别是对于中国的物流招标实体,现有的命名实体识别(NER)解决方案大多不适合,因为它们涉及特定领域的术语,并且具有不同的语义特征。为了解决这个问题,本文提出了一种新的晶格长短期记忆(LSTM)模型,该模型结合了变量上下文特征表示和条件随机场(CRF)层,用于从物流投标文件中识别有价值的实体。与传统的词嵌入不同,该模型使用预训练的双向编码器表示作为输入来增强上下文特征表示。随后,利用Lattice-LSTM模型,有效地利用了字符和单词的信息,避免了错误分割。研究结果:提出的模型随后通过中国物流招标命名实体语料库进行验证。此外,研究结果表明,该模型在物流投标语料库中优于其他主流NER模型。该模型支持物流投标信息的自动提取,使物流公司能够感知不断变化的市场趋势,做出有远见的物流决策。原创性/价值(1)本文提出了一个实用的物流投标NER模型。通过将BERT应用于少量数据的下游任务中并对其进行微调,实验结果表明该模型比现有的其他模型具有更好的性能。据作者所知,这是第一次从中国物流招标文件中提取命名实体的研究。(2)构建了实用的物流实物标书语料库,开发了物流实物标书在线处理模型程序。该模型有助于物流企业将非结构化文档转化为结构化数据,进一步洞察不断变化的市场趋势,做出有远见的物流决策。
{"title":"Identifying business information through deep learning: analyzing the tender documents of an Internet-based logistics bidding platform","authors":"Yingwen Yu, Jing Ma","doi":"10.1108/dta-08-2022-0308","DOIUrl":"https://doi.org/10.1108/dta-08-2022-0308","url":null,"abstract":"PurposeThe tender documents, an essential data source for internet-based logistics tendering platforms, incorporate massive fine-grained data, ranging from information on tenderee, shipping location and shipping items. Automated information extraction in this area is, however, under-researched, making the extraction process a time- and effort-consuming one. For Chinese logistics tender entities, in particular, existing named entity recognition (NER) solutions are mostly unsuitable as they involve domain-specific terminologies and possess different semantic features.Design/methodology/approachTo tackle this problem, a novel lattice long short-term memory (LSTM) model, combining a variant contextual feature representation and a conditional random field (CRF) layer, is proposed in this paper for identifying valuable entities from logistic tender documents. Instead of traditional word embedding, the proposed model uses the pretrained Bidirectional Encoder Representations from Transformers (BERT) model as input to augment the contextual feature representation. Subsequently, with the Lattice-LSTM model, the information of characters and words is effectively utilized to avoid error segmentation.FindingsThe proposed model is then verified by the Chinese logistic tender named entity corpus. Moreover, the results suggest that the proposed model excels in the logistics tender corpus over other mainstream NER models. The proposed model underpins the automatic extraction of logistics tender information, enabling logistic companies to perceive the ever-changing market trends and make far-sighted logistic decisions.Originality/value(1) A practical model for logistic tender NER is proposed in the manuscript. By employing and fine-tuning BERT into the downstream task with a small amount of data, the experiment results show that the model has a better performance than other existing models. This is the first study, to the best of the authors' knowledge, to extract named entities from Chinese logistic tender documents. (2) A real logistic tender corpus for practical use is constructed and a program of the model for online-processing real logistic tender documents is developed in this work. The authors believe that the model will facilitate logistic companies in converting unstructured documents to structured data and further perceive the ever-changing market trends to make far-sighted logistic decisions.","PeriodicalId":56156,"journal":{"name":"Data Technologies and Applications","volume":null,"pages":null},"PeriodicalIF":1.6,"publicationDate":"2023-05-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45262725","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Risk assessment in machine learning enhanced failure mode and effects analysis 机器学习中的风险评估增强了故障模式和影响分析
IF 1.6 4区 计算机科学 Q1 Social Sciences Pub Date : 2023-05-04 DOI: 10.1108/dta-06-2022-0232
Zeping Wang, Hengte Du, Liangyan Tao, S. Javed
PurposeThe traditional failure mode and effect analysis (FMEA) has some limitations, such as the neglect of relevant historical data, subjective use of rating numbering and the less rationality and accuracy of the Risk Priority Number. The current study proposes a machine learning–enhanced FMEA (ML-FMEA) method based on a popular machine learning tool, Waikato environment for knowledge analysis (WEKA).Design/methodology/approachThis work uses the collected FMEA historical data to predict the probability of component/product failure risk by machine learning based on different commonly used classifiers. To compare the correct classification rate of ML-FMEA based on different classifiers, the 10-fold cross-validation is employed. Moreover, the prediction error is estimated by repeated experiments with different random seeds under varying initialization settings. Finally, the case of the submersible pump in Bhattacharjee et al. (2020) is utilized to test the performance of the proposed method.FindingsThe results show that ML-FMEA, based on most of the commonly used classifiers, outperforms the Bhattacharjee model. For example, the ML-FMEA based on Random Committee improves the correct classification rate from 77.47 to 90.09 per cent and area under the curve of receiver operating characteristic curve (ROC) from 80.9 to 91.8 per cent, respectively.Originality/valueThe proposed method not only enables the decision-maker to use the historical failure data and predict the probability of the risk of failure but also may pave a new way for the application of machine learning techniques in FMEA.
目的传统的失效模式与影响分析(FMEA)存在忽视相关历史数据、主观使用评级编号、风险优先级编号的合理性和准确性不高等局限性。本研究提出了一种机器学习增强的FMEA (ML-FMEA)方法,该方法基于一种流行的机器学习工具,Waikato环境for knowledge analysis (WEKA)。设计/方法/方法本工作使用收集的FMEA历史数据,通过基于不同常用分类器的机器学习来预测组件/产品故障风险的概率。为了比较不同分类器对ML-FMEA的分类正确率,采用10倍交叉验证。通过不同初始化设置下不同随机种子的重复实验估计预测误差。最后,利用Bhattacharjee等人(2020)的潜水泵案例来测试所提出方法的性能。结果表明,基于大多数常用分类器的ML-FMEA优于Bhattacharjee模型。例如,基于Random Committee的ML-FMEA将分类正确率从77.47提高到90.09%,将受试者工作特征曲线(ROC)曲线下面积从80.9%提高到91.8%。提出的方法不仅使决策者能够使用历史故障数据并预测故障风险的概率,而且为机器学习技术在FMEA中的应用铺平了新的道路。
{"title":"Risk assessment in machine learning enhanced failure mode and effects analysis","authors":"Zeping Wang, Hengte Du, Liangyan Tao, S. Javed","doi":"10.1108/dta-06-2022-0232","DOIUrl":"https://doi.org/10.1108/dta-06-2022-0232","url":null,"abstract":"PurposeThe traditional failure mode and effect analysis (FMEA) has some limitations, such as the neglect of relevant historical data, subjective use of rating numbering and the less rationality and accuracy of the Risk Priority Number. The current study proposes a machine learning–enhanced FMEA (ML-FMEA) method based on a popular machine learning tool, Waikato environment for knowledge analysis (WEKA).Design/methodology/approachThis work uses the collected FMEA historical data to predict the probability of component/product failure risk by machine learning based on different commonly used classifiers. To compare the correct classification rate of ML-FMEA based on different classifiers, the 10-fold cross-validation is employed. Moreover, the prediction error is estimated by repeated experiments with different random seeds under varying initialization settings. Finally, the case of the submersible pump in Bhattacharjee et al. (2020) is utilized to test the performance of the proposed method.FindingsThe results show that ML-FMEA, based on most of the commonly used classifiers, outperforms the Bhattacharjee model. For example, the ML-FMEA based on Random Committee improves the correct classification rate from 77.47 to 90.09 per cent and area under the curve of receiver operating characteristic curve (ROC) from 80.9 to 91.8 per cent, respectively.Originality/valueThe proposed method not only enables the decision-maker to use the historical failure data and predict the probability of the risk of failure but also may pave a new way for the application of machine learning techniques in FMEA.","PeriodicalId":56156,"journal":{"name":"Data Technologies and Applications","volume":null,"pages":null},"PeriodicalIF":1.6,"publicationDate":"2023-05-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47381854","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Channel attention-based spatial-temporal graph neural networks for traffic prediction 基于通道注意力的时空图神经网络交通预测
IF 1.6 4区 计算机科学 Q1 Social Sciences Pub Date : 2023-04-29 DOI: 10.1108/dta-09-2022-0378
Bin Wang, Fan Gao, Le Tong, Qian Zhang, Sulei Zhu
PurposeTraffic flow prediction has always been a top priority of intelligent transportation systems. There are many mature methods for short-term traffic flow prediction. However, the existing methods are often insufficient in capturing long-term spatial-temporal dependencies. To predict long-term dependencies more accurately, in this paper, a new and more effective traffic flow prediction model is proposed.Design/methodology/approachThis paper proposes a new and more effective traffic flow prediction model, named channel attention-based spatial-temporal graph neural networks. A graph convolutional network is used to extract local spatial-temporal correlations, a channel attention mechanism is used to enhance the influence of nearby spatial-temporal dependencies on decision-making and a transformer mechanism is used to capture long-term dependencies.FindingsThe proposed model is applied to two common highway datasets: METR-LA collected in Los Angeles and PEMS-BAY collected in the California Bay Area. This model outperforms the other five in terms of performance on three performance metrics a popular model.Originality/value(1) Based on the spatial-temporal synchronization graph convolution module, a spatial-temporal channel attention module is designed to increase the influence of proximity dependence on decision-making by enhancing or suppressing different channels. (2) To better capture long-term dependencies, the transformer module is introduced.
目的交通流量预测一直是智能交通系统的首要任务。短期交通流量预测有许多成熟的方法。然而,现有的方法往往不足以捕捉长期的时空依赖关系。为了更准确地预测长期相关性,本文提出了一种新的、更有效的交通流预测模型。设计/方法论/方法本文提出了一种新的、更有效的交通流预测模型,称为基于通道注意力的时空图神经网络。图卷积网络用于提取局部时空相关性,通道注意力机制用于增强附近时空相关性对决策的影响,变换器机制用于捕获长期相关性。发现所提出的模型应用于两个常见的公路数据集:洛杉矶收集的METR-LA和加利福尼亚湾区收集的PEMS-BAY。该模型在三个性能指标上的性能优于其他五个,这是一个流行的模型。独创性/价值(1)基于时空同步图卷积模块,设计了一个时空通道注意力模块,通过增强或抑制不同通道来增加邻近度依赖对决策的影响。(2) 为了更好地捕捉长期依赖关系,引入了transformer模块。
{"title":"Channel attention-based spatial-temporal graph neural networks for traffic prediction","authors":"Bin Wang, Fan Gao, Le Tong, Qian Zhang, Sulei Zhu","doi":"10.1108/dta-09-2022-0378","DOIUrl":"https://doi.org/10.1108/dta-09-2022-0378","url":null,"abstract":"PurposeTraffic flow prediction has always been a top priority of intelligent transportation systems. There are many mature methods for short-term traffic flow prediction. However, the existing methods are often insufficient in capturing long-term spatial-temporal dependencies. To predict long-term dependencies more accurately, in this paper, a new and more effective traffic flow prediction model is proposed.Design/methodology/approachThis paper proposes a new and more effective traffic flow prediction model, named channel attention-based spatial-temporal graph neural networks. A graph convolutional network is used to extract local spatial-temporal correlations, a channel attention mechanism is used to enhance the influence of nearby spatial-temporal dependencies on decision-making and a transformer mechanism is used to capture long-term dependencies.FindingsThe proposed model is applied to two common highway datasets: METR-LA collected in Los Angeles and PEMS-BAY collected in the California Bay Area. This model outperforms the other five in terms of performance on three performance metrics a popular model.Originality/value(1) Based on the spatial-temporal synchronization graph convolution module, a spatial-temporal channel attention module is designed to increase the influence of proximity dependence on decision-making by enhancing or suppressing different channels. (2) To better capture long-term dependencies, the transformer module is introduced.","PeriodicalId":56156,"journal":{"name":"Data Technologies and Applications","volume":null,"pages":null},"PeriodicalIF":1.6,"publicationDate":"2023-04-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49082434","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Machine learning approaches for prediction of ovarian cancer driver genes from mutational and network analysis 从突变和网络分析中预测卵巢癌驱动基因的机器学习方法
IF 1.6 4区 计算机科学 Q1 Social Sciences Pub Date : 2023-04-28 DOI: 10.1108/dta-03-2022-0096
Rucha Wadapurkar, S. Bapat, Rupali A. Mahajan, R. Vyas
PurposeOvarian cancer (OC) is the most common type of gynecologic cancer in the world with a high rate of mortality. Due to manifestation of generic symptoms and absence of specific biomarkers, OC is usually diagnosed at a late stage. Machine learning models can be employed to predict driver genes implicated in causative mutations.Design/methodology/approachIn the present study, a comprehensive next generation sequencing (NGS) analysis of whole exome sequences of 47 OC patients was carried out to identify clinically significant mutations. Nine functional features of 708 mutations identified were input into a machine learning classification model by employing the eXtreme Gradient Boosting (XGBoost) classifier method for prediction of OC driver genes.FindingsThe XGBoost classifier model yielded a classification accuracy of 0.946, which was superior to that obtained by other classifiers such as decision tree, Naive Bayes, random forest and support vector machine. Further, an interaction network was generated to identify and establish correlations with cancer-associated pathways and gene ontology data.Originality/valueThe final results revealed 12 putative candidate cancer driver genes, namely LAMA3, LAMC3, COL6A1, COL5A1, COL2A1, UGT1A1, BDNF, ANK1, WNT10A, FZD4, PLEKHG5 and CYP2C9, that may have implications in clinical diagnosis.
目的癌症是世界上最常见的妇科癌症,死亡率高。由于一般症状的表现和缺乏特定的生物标志物,OC通常在晚期被诊断出来。机器学习模型可以用来预测与致病突变有关的驱动基因。设计/方法/方法在本研究中,对47名OC患者的全外显子组序列进行了全面的下一代测序(NGS)分析,以确定具有临床意义的突变。通过使用用于预测OC驱动基因的极限梯度增强(XGBoost)分类器方法,将所识别的708个突变的9个功能特征输入到机器学习分类模型中。结果XGBoost分类器模型的分类精度为0.946,优于决策树、朴素贝叶斯、随机森林和支持向量机等其他分类器。此外,生成了一个交互网络,以识别和建立与癌症相关途径和基因本体数据的相关性。原创性/价值最终结果揭示了12个推定的癌症驱动基因候选,即LAMA3、LAMC3、COL6A1、COL5A1、COL2A1、UGT1A1、BDNF、ANK1、WNT10A、FZD4、PLEKHG5和CYP2C9,这些基因可能对临床诊断有影响。
{"title":"Machine learning approaches for prediction of ovarian cancer driver genes from mutational and network analysis","authors":"Rucha Wadapurkar, S. Bapat, Rupali A. Mahajan, R. Vyas","doi":"10.1108/dta-03-2022-0096","DOIUrl":"https://doi.org/10.1108/dta-03-2022-0096","url":null,"abstract":"PurposeOvarian cancer (OC) is the most common type of gynecologic cancer in the world with a high rate of mortality. Due to manifestation of generic symptoms and absence of specific biomarkers, OC is usually diagnosed at a late stage. Machine learning models can be employed to predict driver genes implicated in causative mutations.Design/methodology/approachIn the present study, a comprehensive next generation sequencing (NGS) analysis of whole exome sequences of 47 OC patients was carried out to identify clinically significant mutations. Nine functional features of 708 mutations identified were input into a machine learning classification model by employing the eXtreme Gradient Boosting (XGBoost) classifier method for prediction of OC driver genes.FindingsThe XGBoost classifier model yielded a classification accuracy of 0.946, which was superior to that obtained by other classifiers such as decision tree, Naive Bayes, random forest and support vector machine. Further, an interaction network was generated to identify and establish correlations with cancer-associated pathways and gene ontology data.Originality/valueThe final results revealed 12 putative candidate cancer driver genes, namely LAMA3, LAMC3, COL6A1, COL5A1, COL2A1, UGT1A1, BDNF, ANK1, WNT10A, FZD4, PLEKHG5 and CYP2C9, that may have implications in clinical diagnosis.","PeriodicalId":56156,"journal":{"name":"Data Technologies and Applications","volume":null,"pages":null},"PeriodicalIF":1.6,"publicationDate":"2023-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48683941","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
News recommendations based on collaborative topic modeling and collaborative filtering with generative adversarial networks 基于协同主题建模和生成对抗网络协同过滤的新闻推荐
IF 1.6 4区 计算机科学 Q1 Social Sciences Pub Date : 2023-03-30 DOI: 10.1108/dta-08-2022-0315
Duen-Ren Liu, Yang Huang, Jhen-Jie Jhao, Shin-Jye Lee
PurposeOnline news websites provide huge amounts of timely news, bringing the challenge of recommending personalized news articles. Generative adversarial networks (GAN) based on collaborative filtering (CFGAN) can achieve effective recommendation quality. However, CFGAN ignores item contents, which contain more latent preference features than just user ratings. It is important to consider both ratings and item contents in making preference predictions. This study aims to improve news recommendation by proposing a GAN-based news recommendation model considering both ratings (implicit feedback) and the latent features of news content.Design/methodology/approachThe collaborative topic modeling (CTM) can improve user preference prediction by combining matrix factorization (MF) with latent topics of item content derived from latent topic modeling. This study proposes a novel hybrid news recommendation model, Hybrid-CFGAN, which modifies the architecture of the CFGAN model with enhanced preference learning from the CTM. The proposed Hybrid-CFGAN model contains parallel neural networks – original rating-based preference learning and CTM-based preference learning, which consider both ratings and news content with user preferences derived from the CTM model. A tunable parameter is used to adjust the weights of the two preference learnings, while concatenating the preference outputs of the two parallel neural networks.FindingsThis study uses the dataset collected from an online news website, NiusNews, to conduct an experimental evaluation. The results show that the proposed Hybrid-CFGAN model can achieve better performance than the state-of-the-art GAN-based recommendation methods. The proposed novel Hybrid-CFGAN model can enhance existing GAN-based recommendation and increase the performance of preference predictions on textual content such as news articles.Originality/valueAs the existing CFGAN model does not consider content information and solely relies on history logs, it may not be effective in recommending news articles. Our proposed Hybrid-CFGAN model modified the architecture of the CFGAN generator by adding a parallel neural network to gain the relevant information from news content and user preferences derived from the CTM model. The novel idea of adjusting the preference learning from two parallel neural networks – original rating-based preference learning and CTM-based preference learning – contributes to improve the recommendation quality of the proposed model by considering both ratings and latent preferences derived from item contents. The proposed novel recommendation model can improve news recommendation, thereby increasing the commercial value of news media platforms.
目的在线新闻网站提供了大量及时的新闻,带来了个性化新闻文章推荐的挑战。基于协同过滤(CFGAN)的生成式对抗网络(GAN)可以达到有效的推荐质量。然而,CFGAN忽略了项目内容,其中包含比用户评分更多的潜在偏好特征。在进行偏好预测时,同时考虑评分和项目内容是很重要的。本研究旨在通过提出一种基于gan的新闻推荐模型,同时考虑新闻内容的评级(隐式反馈)和潜在特征,从而改进新闻推荐。设计/方法/方法协同主题建模(CTM)将矩阵分解(MF)与潜在主题建模衍生的项目内容潜在主题相结合,提高了用户偏好预测的精度。本文提出了一种新的混合新闻推荐模型hybrid -CFGAN,该模型改进了CFGAN模型的结构,增强了对CTM的偏好学习。提出的Hybrid-CFGAN模型包含并行神经网络-基于原始评级的偏好学习和基于CTM的偏好学习,它们同时考虑评级和新闻内容以及从CTM模型派生的用户偏好。在连接两个并行神经网络的偏好输出时,使用一个可调参数来调整两个偏好学习的权重。本研究使用在线新闻网站NiusNews收集的数据集进行实验评估。结果表明,所提出的Hybrid-CFGAN模型比目前基于gan的推荐方法具有更好的性能。提出的新型Hybrid-CFGAN模型可以增强现有的基于gan的推荐,并提高对文本内容(如新闻文章)的偏好预测性能。原创性/价值由于现有的CFGAN模型不考虑内容信息,仅依赖历史日志,因此可能无法有效推荐新闻文章。我们提出的Hybrid-CFGAN模型修改了CFGAN生成器的架构,通过添加一个并行神经网络从CTM模型衍生的新闻内容和用户偏好中获取相关信息。利用基于原始评分的偏好学习和基于ctm的偏好学习这两个并行神经网络调整偏好学习的新思想,通过同时考虑来自项目内容的评分和潜在偏好,有助于提高所提出模型的推荐质量。本文提出的新推荐模型可以改进新闻推荐,从而提高新闻媒体平台的商业价值。
{"title":"News recommendations based on collaborative topic modeling and collaborative filtering with generative adversarial networks","authors":"Duen-Ren Liu, Yang Huang, Jhen-Jie Jhao, Shin-Jye Lee","doi":"10.1108/dta-08-2022-0315","DOIUrl":"https://doi.org/10.1108/dta-08-2022-0315","url":null,"abstract":"PurposeOnline news websites provide huge amounts of timely news, bringing the challenge of recommending personalized news articles. Generative adversarial networks (GAN) based on collaborative filtering (CFGAN) can achieve effective recommendation quality. However, CFGAN ignores item contents, which contain more latent preference features than just user ratings. It is important to consider both ratings and item contents in making preference predictions. This study aims to improve news recommendation by proposing a GAN-based news recommendation model considering both ratings (implicit feedback) and the latent features of news content.Design/methodology/approachThe collaborative topic modeling (CTM) can improve user preference prediction by combining matrix factorization (MF) with latent topics of item content derived from latent topic modeling. This study proposes a novel hybrid news recommendation model, Hybrid-CFGAN, which modifies the architecture of the CFGAN model with enhanced preference learning from the CTM. The proposed Hybrid-CFGAN model contains parallel neural networks – original rating-based preference learning and CTM-based preference learning, which consider both ratings and news content with user preferences derived from the CTM model. A tunable parameter is used to adjust the weights of the two preference learnings, while concatenating the preference outputs of the two parallel neural networks.FindingsThis study uses the dataset collected from an online news website, NiusNews, to conduct an experimental evaluation. The results show that the proposed Hybrid-CFGAN model can achieve better performance than the state-of-the-art GAN-based recommendation methods. The proposed novel Hybrid-CFGAN model can enhance existing GAN-based recommendation and increase the performance of preference predictions on textual content such as news articles.Originality/valueAs the existing CFGAN model does not consider content information and solely relies on history logs, it may not be effective in recommending news articles. Our proposed Hybrid-CFGAN model modified the architecture of the CFGAN generator by adding a parallel neural network to gain the relevant information from news content and user preferences derived from the CTM model. The novel idea of adjusting the preference learning from two parallel neural networks – original rating-based preference learning and CTM-based preference learning – contributes to improve the recommendation quality of the proposed model by considering both ratings and latent preferences derived from item contents. The proposed novel recommendation model can improve news recommendation, thereby increasing the commercial value of news media platforms.","PeriodicalId":56156,"journal":{"name":"Data Technologies and Applications","volume":null,"pages":null},"PeriodicalIF":1.6,"publicationDate":"2023-03-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41499535","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Data mining–based stock price prediction using hybridization of technical and fundamental analysis 基于数据挖掘的技术分析和基本面分析相结合的股价预测
IF 1.6 4区 计算机科学 Q1 Social Sciences Pub Date : 2023-03-20 DOI: 10.1108/dta-04-2022-0142
Jasleen Kaur, Khushdeep Dharni
PurposeThe stock market generates massive databases of various financial companies that are highly volatile and complex. To forecast daily stock values of these companies, investors frequently use technical analysis or fundamental analysis. Data mining techniques coupled with fundamental and technical analysis types have the potential to give satisfactory results for stock market prediction. In the current paper, an effort is made to investigate the accuracy of stock market predictions by using the combined approach of variables from technical and fundamental analysis for the creation of a data mining predictive model.Design/methodology/approachWe chose 381 companies from the National Stock Exchange of India's CNX 500 index and conducted a two-stage data analysis. The first stage is identifying key fundamental variables and constructing a portfolio based on that study. Artificial neural network (ANN), support vector machines (SVM) and decision tree J48 were used to build the models. The second stage entails applying technical analysis to forecast price movements in the companies included in the portfolios. ANN and SVM techniques were used to create predictive models for all companies in the portfolios. We also estimated returns using trading decisions based on the model's output and then compared them to buy-and-hold returns and the return of the NIFTY 50 index, which served as a benchmark.FindingsThe results show that the returns of both the portfolios are higher than the benchmark buy-and-hold strategy return. It can be concluded that data mining techniques give better results, irrespective of the type of stock, and have the ability to make up for poor stocks. The comparison of returns of portfolios with the return of NIFTY as a benchmark also indicates that both the portfolios are generating higher returns as compared to the return generated by NIFTY.Originality/valueAs stock prices are influenced by both technical and fundamental indicators, the current paper explored the combined effect of technical analysis and fundamental analysis variables for Indian stock market prediction. Further, the results obtained by individual analysis have also been compared. The proposed method under study can also be utilized to determine whether to hold stocks for the long or short term using trend-based research.
股票市场产生了大量的各种金融公司数据库,这些数据库高度波动和复杂。为了预测这些公司的每日股价,投资者经常使用技术分析或基本面分析。数据挖掘技术与基础和技术分析相结合,有可能为股票市场预测提供令人满意的结果。在当前的论文中,通过使用技术和基本面分析变量的组合方法来创建数据挖掘预测模型,努力研究股票市场预测的准确性。设计/方法/方法我们从印度国家证券交易所的CNX 500指数中选择了381家公司,并进行了两阶段的数据分析。第一阶段是确定关键的基本变量,并在此基础上构建投资组合。采用人工神经网络(ANN)、支持向量机(SVM)和决策树J48建立模型。第二阶段需要应用技术分析来预测投资组合中公司的价格走势。使用人工神经网络和支持向量机技术为投资组合中的所有公司创建预测模型。我们还根据模型的输出使用交易决策来估计回报,然后将其与买入并持有的回报和作为基准的NIFTY 50指数的回报进行比较。结果表明,两种投资组合的收益均高于基准买入并持有策略的收益。可以得出结论,无论股票类型如何,数据挖掘技术都能提供更好的结果,并且有能力弥补糟糕的股票。将投资组合的回报与NIFTY的回报作为基准进行比较也表明,与NIFTY产生的回报相比,这两种投资组合产生的回报都更高。由于股票价格同时受到技术指标和基本面指标的影响,本文探讨了技术分析和基本面分析变量对印度股市预测的联合作用。此外,还对个别分析所得的结果进行了比较。研究中提出的方法也可以利用基于趋势的研究来确定是长期持有股票还是短期持有股票。
{"title":"Data mining–based stock price prediction using hybridization of technical and fundamental analysis","authors":"Jasleen Kaur, Khushdeep Dharni","doi":"10.1108/dta-04-2022-0142","DOIUrl":"https://doi.org/10.1108/dta-04-2022-0142","url":null,"abstract":"PurposeThe stock market generates massive databases of various financial companies that are highly volatile and complex. To forecast daily stock values of these companies, investors frequently use technical analysis or fundamental analysis. Data mining techniques coupled with fundamental and technical analysis types have the potential to give satisfactory results for stock market prediction. In the current paper, an effort is made to investigate the accuracy of stock market predictions by using the combined approach of variables from technical and fundamental analysis for the creation of a data mining predictive model.Design/methodology/approachWe chose 381 companies from the National Stock Exchange of India's CNX 500 index and conducted a two-stage data analysis. The first stage is identifying key fundamental variables and constructing a portfolio based on that study. Artificial neural network (ANN), support vector machines (SVM) and decision tree J48 were used to build the models. The second stage entails applying technical analysis to forecast price movements in the companies included in the portfolios. ANN and SVM techniques were used to create predictive models for all companies in the portfolios. We also estimated returns using trading decisions based on the model's output and then compared them to buy-and-hold returns and the return of the NIFTY 50 index, which served as a benchmark.FindingsThe results show that the returns of both the portfolios are higher than the benchmark buy-and-hold strategy return. It can be concluded that data mining techniques give better results, irrespective of the type of stock, and have the ability to make up for poor stocks. The comparison of returns of portfolios with the return of NIFTY as a benchmark also indicates that both the portfolios are generating higher returns as compared to the return generated by NIFTY.Originality/valueAs stock prices are influenced by both technical and fundamental indicators, the current paper explored the combined effect of technical analysis and fundamental analysis variables for Indian stock market prediction. Further, the results obtained by individual analysis have also been compared. The proposed method under study can also be utilized to determine whether to hold stocks for the long or short term using trend-based research.","PeriodicalId":56156,"journal":{"name":"Data Technologies and Applications","volume":null,"pages":null},"PeriodicalIF":1.6,"publicationDate":"2023-03-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44417754","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Music sentiment classification based on an optimized CNN-RF-QPSO model 基于优化CNN-RF-QPSO模型的音乐情感分类
IF 1.6 4区 计算机科学 Q1 Social Sciences Pub Date : 2023-03-17 DOI: 10.1108/dta-07-2022-0267
Rui Tian, Ruheng Yin, Feng Gan
PurposeMusic sentiment analysis helps to promote the diversification of music information retrieval methods. Traditional music emotion classification tasks suffer from high manual workload and low classification accuracy caused by difficulty in feature extraction and inaccurate manual determination of hyperparameter. In this paper, the authors propose an optimized convolution neural network-random forest (CNN-RF) model for music sentiment classification which is capable of optimizing the manually selected hyperparameters to improve the accuracy of music sentiment classification and reduce labor costs and human classification errors.Design/methodology/approachA CNN-RF music sentiment classification model is designed based on quantum particle swarm optimization (QPSO). First, the audio data are transformed into a Mel spectrogram, and feature extraction is conducted by a CNN. Second, the music features extracted are processed by RF algorithm to complete a preliminary emotion classification. Finally, to select the suitable hyperparameters for a CNN, the QPSO algorithm is adopted to extract the best hyperparameters and obtain the final classification results.FindingsThe model has gone through experimental validations and achieved a classification accuracy of 97 per cent for different sentiment categories with shortened training time. The proposed method with QPSO achieved 1.2 and 1.6 per cent higher accuracy than that with particle swarm optimization and genetic algorithm, respectively. The proposed model had great potential for music sentiment classification.Originality/valueThe dual contribution of this work comprises the proposed model which integrated two deep learning models and the introduction of a QPSO into model optimization. With these two innovations, the efficiency and accuracy of music emotion recognition and classification have been significantly improved.
目的音乐情感分析有助于促进音乐信息检索方法的多样化。传统的音乐情感分类任务由于特征提取困难和超参数的手工确定不准确,导致手工工作量大,分类精度低。在本文中,作者提出了一种用于音乐情感分类的优化卷积神经网络随机森林(CNN-RF)模型,该模型能够优化手动选择的超参数,以提高音乐情感分类精度,降低人工成本和人工分类误差。设计/方法论/方法基于量子粒子群优化(QPSO)设计了一个CNN-RF音乐情感分类模型。首先,将音频数据转换为梅尔谱图,并通过CNN进行特征提取。其次,通过RF算法对提取的音乐特征进行处理,完成初步的情绪分类。最后,为了为CNN选择合适的超参数,采用QPSO算法提取最佳超参数并获得最终的分类结果。发现该模型经过了实验验证,在缩短训练时间的情况下,对不同情绪类别的分类准确率达到97%。与粒子群优化和遗传算法相比,QPSO方法的精度分别提高了1.2%和1.6%。所提出的模型在音乐情感分类方面具有很大的潜力。原创性/价值这项工作的双重贡献包括所提出的模型,该模型集成了两个深度学习模型,并在模型优化中引入了QPSO。通过这两项创新,音乐情感识别和分类的效率和准确性得到了显著提高。
{"title":"Music sentiment classification based on an optimized CNN-RF-QPSO model","authors":"Rui Tian, Ruheng Yin, Feng Gan","doi":"10.1108/dta-07-2022-0267","DOIUrl":"https://doi.org/10.1108/dta-07-2022-0267","url":null,"abstract":"PurposeMusic sentiment analysis helps to promote the diversification of music information retrieval methods. Traditional music emotion classification tasks suffer from high manual workload and low classification accuracy caused by difficulty in feature extraction and inaccurate manual determination of hyperparameter. In this paper, the authors propose an optimized convolution neural network-random forest (CNN-RF) model for music sentiment classification which is capable of optimizing the manually selected hyperparameters to improve the accuracy of music sentiment classification and reduce labor costs and human classification errors.Design/methodology/approachA CNN-RF music sentiment classification model is designed based on quantum particle swarm optimization (QPSO). First, the audio data are transformed into a Mel spectrogram, and feature extraction is conducted by a CNN. Second, the music features extracted are processed by RF algorithm to complete a preliminary emotion classification. Finally, to select the suitable hyperparameters for a CNN, the QPSO algorithm is adopted to extract the best hyperparameters and obtain the final classification results.FindingsThe model has gone through experimental validations and achieved a classification accuracy of 97 per cent for different sentiment categories with shortened training time. The proposed method with QPSO achieved 1.2 and 1.6 per cent higher accuracy than that with particle swarm optimization and genetic algorithm, respectively. The proposed model had great potential for music sentiment classification.Originality/valueThe dual contribution of this work comprises the proposed model which integrated two deep learning models and the introduction of a QPSO into model optimization. With these two innovations, the efficiency and accuracy of music emotion recognition and classification have been significantly improved.","PeriodicalId":56156,"journal":{"name":"Data Technologies and Applications","volume":null,"pages":null},"PeriodicalIF":1.6,"publicationDate":"2023-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47188804","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multi-relation global context learning for session-based recommendation 基于会话推荐的多关系全局上下文学习
IF 1.6 4区 计算机科学 Q1 Social Sciences Pub Date : 2023-03-16 DOI: 10.1108/dta-07-2022-0290
Yishan Liu, Wenming Cao, Guitao Cao
PurposeSession-based recommendation aims to predict the user's next preference based on the user's recent activities. Although most existing studies consider the global characteristics of items, they only learn the global characteristics of items based on a single connection relationship, which cannot fully capture the complex transformation relationship between items. We believe that multiple relationships between items in learning sessions can improve the performance of session recommendation tasks and the scalability of recommendation models. At the same time, high-quality global features of the item help to explore the potential common preferences of users.Design/methodology/approachThis work proposes a session-based recommendation method with a multi-relation global context–enhanced network to capture this global transition relationship. Specifically, we construct a multi-relation global item graph based on a group of sessions, use a graded attention mechanism to learn different types of connection relations independently and obtain the global feature of the item according to the multi-relation weight.FindingsWe did related experiments on three benchmark datasets. The experimental results show that our proposed model is superior to the existing state-of-the-art methods, which verifies the effectiveness of our model.Originality/valueFirst, we construct a multi-relation global item graph to learn the complex transition relations of the global context of the item and effectively mine the potential association of items between different sessions. Second, our model effectively improves the scalability of the model by obtaining high-quality item global features and enables some previously unconsidered items to make it onto the candidate list.
基于会话的推荐旨在根据用户最近的活动来预测用户的下一个偏好。虽然现有的研究大多考虑了项目的全局特征,但它们只是基于单一的连接关系来学习项目的全局特征,无法充分捕捉到项目之间复杂的转换关系。我们认为,学习会话中项目之间的多重关系可以提高会话推荐任务的性能和推荐模型的可扩展性。同时,产品的高质量全球特征有助于探索用户潜在的共同偏好。设计/方法/方法本工作提出了一种基于会话的推荐方法,该方法采用多关系全局上下文增强网络来捕获这种全局转换关系。具体而言,我们基于一组会话构建了一个多关系全局项目图,使用分级注意机制独立学习不同类型的连接关系,并根据多关系权重获得项目的全局特征。我们在三个基准数据集上做了相关的实验。实验结果表明,该模型优于现有的最先进的方法,验证了该模型的有效性。首先,我们构建了一个多关系的全局项目图,学习了项目全局上下文的复杂过渡关系,有效挖掘了不同会话之间项目的潜在关联。其次,我们的模型通过获得高质量的物品全局特征,有效地提高了模型的可扩展性,并使一些之前未考虑的物品进入候选列表。
{"title":"Multi-relation global context learning for session-based recommendation","authors":"Yishan Liu, Wenming Cao, Guitao Cao","doi":"10.1108/dta-07-2022-0290","DOIUrl":"https://doi.org/10.1108/dta-07-2022-0290","url":null,"abstract":"PurposeSession-based recommendation aims to predict the user's next preference based on the user's recent activities. Although most existing studies consider the global characteristics of items, they only learn the global characteristics of items based on a single connection relationship, which cannot fully capture the complex transformation relationship between items. We believe that multiple relationships between items in learning sessions can improve the performance of session recommendation tasks and the scalability of recommendation models. At the same time, high-quality global features of the item help to explore the potential common preferences of users.Design/methodology/approachThis work proposes a session-based recommendation method with a multi-relation global context–enhanced network to capture this global transition relationship. Specifically, we construct a multi-relation global item graph based on a group of sessions, use a graded attention mechanism to learn different types of connection relations independently and obtain the global feature of the item according to the multi-relation weight.FindingsWe did related experiments on three benchmark datasets. The experimental results show that our proposed model is superior to the existing state-of-the-art methods, which verifies the effectiveness of our model.Originality/valueFirst, we construct a multi-relation global item graph to learn the complex transition relations of the global context of the item and effectively mine the potential association of items between different sessions. Second, our model effectively improves the scalability of the model by obtaining high-quality item global features and enables some previously unconsidered items to make it onto the candidate list.","PeriodicalId":56156,"journal":{"name":"Data Technologies and Applications","volume":null,"pages":null},"PeriodicalIF":1.6,"publicationDate":"2023-03-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48165487","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A new method based on ensemble time series for fast and accurate clustering 一种基于集成时间序列的快速准确聚类方法
IF 1.6 4区 计算机科学 Q1 Social Sciences Pub Date : 2023-03-15 DOI: 10.1108/dta-08-2022-0300
A. Ghorbanian, H. Razavi
PurposeThe common methods for clustering time series are the use of specific distance criteria or the use of standard clustering algorithms. Ensemble clustering is one of the common techniques used in data mining to increase the accuracy of clustering. In this study, based on segmentation, selecting the best segments, and using ensemble clustering for selected segments, a multistep approach has been developed for the whole clustering of time series data.Design/methodology/approachFirst, this approach divides the time series dataset into equal segments. In the next step, using one or more internal clustering criteria, the best segments are selected, and then the selected segments are combined for final clustering. By using a loop and how to select the best segments for the final clustering (using one criterion or several criteria simultaneously), two algorithms have been developed in different settings. A logarithmic relationship limits the number of segments created in the loop.FindingAccording to Rand's external criteria and statistical tests, at first, the best setting of the two developed algorithms has been selected. Then this setting has been compared to different algorithms in the literature on clustering accuracy and execution time. The obtained results indicate more accuracy and less execution time for the proposed approach.Originality/valueThis paper proposed a fast and accurate approach for time series clustering in three main steps. This is the first work that uses a combination of segmentation and ensemble clustering. More accuracy and less execution time are the remarkable achievements of this study.
常用的时间序列聚类方法是使用特定的距离标准或使用标准聚类算法。集成聚类是数据挖掘中用于提高聚类精度的常用技术之一。本研究提出了一种多步骤的时间序列数据整体聚类方法,该方法基于分割、选择最佳片段,并对所选片段使用集成聚类。设计/方法/方法首先,该方法将时间序列数据集分成相等的部分。下一步,使用一个或多个内部聚类标准,选择最佳段,然后将选择的段组合进行最终聚类。通过使用循环以及如何为最终聚类选择最佳片段(同时使用一个标准或几个标准),在不同的设置下开发了两种算法。对数关系限制了在循环中创建的段的数量。根据Rand的外部标准和统计检验,首先选择了两种开发算法的最佳设置。然后将此设置与文献中不同的算法在聚类精度和执行时间上进行了比较。结果表明,该方法具有较高的精度和较短的执行时间。本文提出了一种快速准确的时间序列聚类方法,分为三个主要步骤。这是第一次使用分割和集成聚类相结合的工作。准确性提高,执行时间缩短是本研究的显著成果。
{"title":"A new method based on ensemble time series for fast and accurate clustering","authors":"A. Ghorbanian, H. Razavi","doi":"10.1108/dta-08-2022-0300","DOIUrl":"https://doi.org/10.1108/dta-08-2022-0300","url":null,"abstract":"PurposeThe common methods for clustering time series are the use of specific distance criteria or the use of standard clustering algorithms. Ensemble clustering is one of the common techniques used in data mining to increase the accuracy of clustering. In this study, based on segmentation, selecting the best segments, and using ensemble clustering for selected segments, a multistep approach has been developed for the whole clustering of time series data.Design/methodology/approachFirst, this approach divides the time series dataset into equal segments. In the next step, using one or more internal clustering criteria, the best segments are selected, and then the selected segments are combined for final clustering. By using a loop and how to select the best segments for the final clustering (using one criterion or several criteria simultaneously), two algorithms have been developed in different settings. A logarithmic relationship limits the number of segments created in the loop.FindingAccording to Rand's external criteria and statistical tests, at first, the best setting of the two developed algorithms has been selected. Then this setting has been compared to different algorithms in the literature on clustering accuracy and execution time. The obtained results indicate more accuracy and less execution time for the proposed approach.Originality/valueThis paper proposed a fast and accurate approach for time series clustering in three main steps. This is the first work that uses a combination of segmentation and ensemble clustering. More accuracy and less execution time are the remarkable achievements of this study.","PeriodicalId":56156,"journal":{"name":"Data Technologies and Applications","volume":null,"pages":null},"PeriodicalIF":1.6,"publicationDate":"2023-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42318784","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A novel twin-support vector machine for binary classification to imbalanced data 一种新的双支持向量机对不平衡数据进行二值分类
IF 1.6 4区 计算机科学 Q1 Social Sciences Pub Date : 2023-03-09 DOI: 10.1108/dta-08-2022-0302
Jingyi Li, S. Chao
PurposeBinary classification on imbalanced data is a challenge; due to the imbalance of the classes, the minority class is easily masked by the majority class. However, most existing classifiers are better at identifying the majority class, thereby ignoring the minority class, which leads to classifier degradation. To address this, this paper proposes a twin-support vector machines for binary classification on imbalanced data.Design/methodology/approachIn the proposed method, the authors construct two support vector machines to focus on majority classes and minority classes, respectively. In order to promote the learning ability of the two support vector machines, a new kernel is derived for them.Findings(1) A novel twin-support vector machine is proposed for binary classification on imbalanced data, and new kernels are derived. (2) For imbalanced data, the complexity of data distribution has negative effects on classification results; however, advanced classification results can be gained and desired boundaries are learned by using optimizing kernels. (3) Classifiers based on twin architectures have more advantages than those based on single architecture for binary classification on imbalanced data.Originality/valueFor imbalanced data, the complexity of data distribution has negative effects on classification results; however, advanced classification results can be gained and desired boundaries are learned through using optimizing kernels.
目的对不平衡数据进行二值分类是一个挑战;由于阶级的不平衡,少数阶级很容易被多数阶级所掩盖。然而,现有的大多数分类器更擅长识别多数类,从而忽略了少数类,从而导致分类器退化。为了解决这个问题,本文提出了一种双支持向量机对不平衡数据进行二值分类。在提出的方法中,作者构建了两个支持向量机,分别关注多数类和少数类。(1)提出了一种新的双支持向量机对不平衡数据进行二值分类,并推导了新的核。(2)对于不平衡数据,数据分布的复杂性对分类结果有负面影响;然而,通过使用优化核可以获得高级分类结果和期望的边界。(3)对于非平衡数据的二值分类,基于双体系结构的分类器比基于单体系结构的分类器更有优势。独创性/价值对于不平衡数据,数据分布的复杂性对分类结果有负面影响;然而,通过使用优化核可以获得高级分类结果并学习到期望的边界。
{"title":"A novel twin-support vector machine for binary classification to imbalanced data","authors":"Jingyi Li, S. Chao","doi":"10.1108/dta-08-2022-0302","DOIUrl":"https://doi.org/10.1108/dta-08-2022-0302","url":null,"abstract":"PurposeBinary classification on imbalanced data is a challenge; due to the imbalance of the classes, the minority class is easily masked by the majority class. However, most existing classifiers are better at identifying the majority class, thereby ignoring the minority class, which leads to classifier degradation. To address this, this paper proposes a twin-support vector machines for binary classification on imbalanced data.Design/methodology/approachIn the proposed method, the authors construct two support vector machines to focus on majority classes and minority classes, respectively. In order to promote the learning ability of the two support vector machines, a new kernel is derived for them.Findings(1) A novel twin-support vector machine is proposed for binary classification on imbalanced data, and new kernels are derived. (2) For imbalanced data, the complexity of data distribution has negative effects on classification results; however, advanced classification results can be gained and desired boundaries are learned by using optimizing kernels. (3) Classifiers based on twin architectures have more advantages than those based on single architecture for binary classification on imbalanced data.Originality/valueFor imbalanced data, the complexity of data distribution has negative effects on classification results; however, advanced classification results can be gained and desired boundaries are learned through using optimizing kernels.","PeriodicalId":56156,"journal":{"name":"Data Technologies and Applications","volume":null,"pages":null},"PeriodicalIF":1.6,"publicationDate":"2023-03-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73540195","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Data Technologies and Applications
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1