首页 > 最新文献

Information Processing & Management最新文献

英文 中文
Bridging insight gaps in topic dependency discovery with a knowledge-inspired topic model 用知识启发的话题模型弥合话题依赖发现中的洞察力差距
IF 7.4 1区 管理学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-10-09 DOI: 10.1016/j.ipm.2024.103911
Yi-Kun Tang , Heyan Huang , Xuewen Shi , Xian-Ling Mao
Discovering intricate dependencies between topics in topic modeling is challenging due to the noisy and incomplete nature of real-world data and the inherent complexity of topic dependency relationships. In practice, certain basic dependency relationships have been manually annotated and can serve as valuable knowledge resources, enhancing the learning of topic dependencies. To this end, we propose a novel topic model, called Knowledge-Inspired Dependency-Aware Dirichlet Neural Topic Model (KDNTM). Specifically, we first propose Dependency-Aware Dirichlet Neural Topic Model (DepDirNTM), which can discover semantically coherent topics and complex dependencies between these topics from textual data. Then, we propose three methods to leverage accessible external dependency knowledge under the framework of DepDirNTM to enhance the discovery of topic dependencies. Extensive experiments on real-world corpora demonstrate that our models outperform 12 state-of-the-art baselines in terms of topic quality and multi-labeled text classification in most cases, achieving up to a 14% improvement in topic quality over the best baseline. Visualizations of the learned dependency relationships further highlight the benefits of integrating external knowledge, confirming its substantial impact on the effectiveness of topic modeling.
在主题建模中,发现主题之间错综复杂的依赖关系具有挑战性,这是因为现实世界的数据具有噪声和不完整性,而且主题依赖关系本身也很复杂。在实践中,某些基本的依赖关系已经过人工标注,可以作为宝贵的知识资源,加强对主题依赖关系的学习。为此,我们提出了一种新颖的主题模型,即知识启发的依赖关系感知 Dirichlet 神经主题模型(Knowledge-Inspired Dependency-Aware Dirichlet Neural Topic Model,KDNTM)。具体来说,我们首先提出了 "依赖感知 Dirichlet 神经主题模型"(DepDirNTM),它可以从文本数据中发现语义一致的主题以及这些主题之间复杂的依赖关系。然后,我们提出了三种方法,在 DepDirNTM 框架下利用可获取的外部依赖关系知识来增强主题依赖关系的发现。在真实世界语料库上进行的广泛实验表明,我们的模型在大多数情况下在主题质量和多标签文本分类方面都优于 12 个最先进的基线模型,在主题质量方面比最佳基线模型最多提高了 14%。学习到的依赖关系可视化进一步突出了整合外部知识的优势,证实了外部知识对主题建模效果的重大影响。
{"title":"Bridging insight gaps in topic dependency discovery with a knowledge-inspired topic model","authors":"Yi-Kun Tang ,&nbsp;Heyan Huang ,&nbsp;Xuewen Shi ,&nbsp;Xian-Ling Mao","doi":"10.1016/j.ipm.2024.103911","DOIUrl":"10.1016/j.ipm.2024.103911","url":null,"abstract":"<div><div>Discovering intricate dependencies between topics in topic modeling is challenging due to the noisy and incomplete nature of real-world data and the inherent complexity of topic dependency relationships. In practice, certain basic dependency relationships have been manually annotated and can serve as valuable knowledge resources, enhancing the learning of topic dependencies. To this end, we propose a novel topic model, called Knowledge-Inspired Dependency-Aware Dirichlet Neural Topic Model (KDNTM). Specifically, we first propose Dependency-Aware Dirichlet Neural Topic Model (DepDirNTM), which can discover semantically coherent topics and complex dependencies between these topics from textual data. Then, we propose three methods to leverage accessible external dependency knowledge under the framework of DepDirNTM to enhance the discovery of topic dependencies. Extensive experiments on real-world corpora demonstrate that our models outperform 12 state-of-the-art baselines in terms of topic quality and multi-labeled text classification in most cases, achieving up to a 14% improvement in topic quality over the best baseline. Visualizations of the learned dependency relationships further highlight the benefits of integrating external knowledge, confirming its substantial impact on the effectiveness of topic modeling.</div></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":"62 1","pages":"Article 103911"},"PeriodicalIF":7.4,"publicationDate":"2024-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142417982","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
NNEnsLeG: A novel approach for e-commerce payment fraud detection using ensemble learning and neural networks NNEnsLeG:利用集合学习和神经网络检测电子商务支付欺诈的新方法
IF 7.4 1区 管理学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-10-09 DOI: 10.1016/j.ipm.2024.103916
Qingfeng Zeng , Li Lin , Rui Jiang , Weiyu Huang , Dijia Lin
The proliferation of fraud in online shopping has accompanied the development of e-commerce, leading to substantial economic losses, and affecting consumer trust in online shopping. However, few studies have focused on fraud detection in e-commerce due to its diversity and dynamism. In this work, we conduct a feature set specifically for e-commerce payment fraud, around transactions, user behavior, and account relevance. We propose a novel comprehensive model called Neural Network Based Ensemble Learning with Generation (NNEnsLeG) for fraud detection. In this model, ensemble learning, data generation, and parameter-passing are designed to cope with extreme data imbalance, overfitting, and simulating the dynamics of fraud patterns. We evaluate the model performance in e-commerce payment fraud detection with >310,000 pieces of e-commerce account data. Then we verify the effectiveness of the model design and feature engineering through ablation experiments, and validate the generalization ability of the model in other payment fraud scenarios. The experimental results show that NNEnsLeG outperforms all the benchmarks and proves the effectiveness of generative data and parameter-passing design, presenting the practical application of the NNEnsLeG model in e-commerce payment fraud detection.
伴随着电子商务的发展,网络购物中的欺诈行为层出不穷,造成了巨大的经济损失,也影响了消费者对网络购物的信任。然而,由于电子商务的多样性和动态性,很少有研究关注电子商务中的欺诈检测。在这项工作中,我们围绕交易、用户行为和账户相关性,专门针对电子商务支付欺诈进行了特征集研究。我们提出了一种用于欺诈检测的新型综合模型,名为 "基于神经网络的集合学习与生成(NNEnsLeG)"。在该模型中,集合学习、数据生成和参数传递被设计用来应对极端数据不平衡、过拟合和模拟欺诈模式的动态变化。我们利用 31 万条电子商务账户数据评估了该模型在电子商务支付欺诈检测中的性能。然后,我们通过消融实验验证了模型设计和特征工程的有效性,并验证了模型在其他支付欺诈场景中的泛化能力。实验结果表明,NNEnsLeG优于所有基准,证明了生成数据和参数传递设计的有效性,展示了NNEnsLeG模型在电子商务支付欺诈检测中的实际应用。
{"title":"NNEnsLeG: A novel approach for e-commerce payment fraud detection using ensemble learning and neural networks","authors":"Qingfeng Zeng ,&nbsp;Li Lin ,&nbsp;Rui Jiang ,&nbsp;Weiyu Huang ,&nbsp;Dijia Lin","doi":"10.1016/j.ipm.2024.103916","DOIUrl":"10.1016/j.ipm.2024.103916","url":null,"abstract":"<div><div>The proliferation of fraud in online shopping has accompanied the development of e-commerce, leading to substantial economic losses, and affecting consumer trust in online shopping. However, few studies have focused on fraud detection in e-commerce due to its diversity and dynamism. In this work, we conduct a feature set specifically for e-commerce payment fraud, around transactions, user behavior, and account relevance. We propose a novel comprehensive model called Neural Network Based Ensemble Learning with Generation (NNEnsLeG) for fraud detection. In this model, ensemble learning, data generation, and parameter-passing are designed to cope with extreme data imbalance, overfitting, and simulating the dynamics of fraud patterns. We evaluate the model performance in e-commerce payment fraud detection with &gt;310,000 pieces of e-commerce account data. Then we verify the effectiveness of the model design and feature engineering through ablation experiments, and validate the generalization ability of the model in other payment fraud scenarios. The experimental results show that NNEnsLeG outperforms all the benchmarks and proves the effectiveness of generative data and parameter-passing design, presenting the practical application of the NNEnsLeG model in e-commerce payment fraud detection.</div></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":"62 1","pages":"Article 103916"},"PeriodicalIF":7.4,"publicationDate":"2024-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142417979","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Discovering technology opportunities of latecomers based on RGNN and patent data: The example of Huawei in self-driving vehicle industry 基于 RGNN 和专利数据发现后来者的技术机会:以自动驾驶汽车行业的华为为例
IF 7.4 1区 管理学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-10-09 DOI: 10.1016/j.ipm.2024.103908
Runzhe Zhang , Xiang Yu , Ben Zhang , Qinglan Ren , Yakun Ji
Emerging technologies provide competitive opportunities for latecomers to catch up with leading giants. As most of the extant literature indicated, types of single-dimensional relations from patent data have been revealed in technology opportunity discovery (TOD) research. Still, few have been aware of the more complex characteristics extracted from higher-dimensional patent information such as the patentee-technology relation. To derive this valuable relation for more robust results, this article introduces a novel TOD method, utilizing a recursive graph neural network (RGNN) to transform this high-dimensional information into measures of heterogeneity as internal capability, and combining it with external challenges evaluated by the competitiveness index, to identify technological opportunities. Taking the self-driving vehicle (SDV) industry with 33,347 patent families from 2010 to 2021 as the initial dataset, it shows significant performance promotions compared to previous analogous TOD models. Meanwhile, tested by recent filing patent data, the predicted opportunities are consistent with Huawei and other enterprises. Upon illuminating the intense technological competition situation among the preeminent SDV firms worldwide as a case exploration, this research contributes theoretical and practical views to the TOD research and network analysis.
新兴技术为后来者提供了赶超领先巨头的竞争机会。正如大多数现有文献所指出的,技术机会发现(TOD)研究已经揭示了专利数据中的单维度关系类型。然而,很少有人注意到从专利信息中提取的更复杂的特征,如专利与技术的关系。为了从这一有价值的关系中得出更稳健的结果,本文介绍了一种新颖的 TOD 方法,利用递归图神经网络(RGNN)将这些高维信息转化为衡量内部能力的异质性指标,并将其与竞争力指数评估的外部挑战相结合,从而发现技术机遇。以 2010 年至 2021 年自动驾驶汽车(SDV)行业的 33347 项专利族为初始数据集,与以往类似的 TOD 模型相比,其性能有显著提升。同时,通过近期申请专利数据的检验,其预测的机会与华为等企业一致。本研究以案例探索的方式揭示了全球知名 SDV 企业之间激烈的技术竞争态势,为 TOD 研究和网络分析贡献了理论和实践观点。
{"title":"Discovering technology opportunities of latecomers based on RGNN and patent data: The example of Huawei in self-driving vehicle industry","authors":"Runzhe Zhang ,&nbsp;Xiang Yu ,&nbsp;Ben Zhang ,&nbsp;Qinglan Ren ,&nbsp;Yakun Ji","doi":"10.1016/j.ipm.2024.103908","DOIUrl":"10.1016/j.ipm.2024.103908","url":null,"abstract":"<div><div>Emerging technologies provide competitive opportunities for latecomers to catch up with leading giants. As most of the extant literature indicated, types of single-dimensional relations from patent data have been revealed in technology opportunity discovery (TOD) research. Still, few have been aware of the more complex characteristics extracted from higher-dimensional patent information such as the patentee-technology relation. To derive this valuable relation for more robust results, this article introduces a novel TOD method, utilizing a recursive graph neural network (RGNN) to transform this high-dimensional information into measures of heterogeneity as internal capability, and combining it with external challenges evaluated by the competitiveness index, to identify technological opportunities. Taking the self-driving vehicle (SDV) industry with 33,347 patent families from 2010 to 2021 as the initial dataset, it shows significant performance promotions compared to previous analogous TOD models. Meanwhile, tested by recent filing patent data, the predicted opportunities are consistent with Huawei and other enterprises. Upon illuminating the intense technological competition situation among the preeminent SDV firms worldwide as a case exploration, this research contributes theoretical and practical views to the TOD research and network analysis.</div></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":"62 1","pages":"Article 103908"},"PeriodicalIF":7.4,"publicationDate":"2024-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142417980","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Robust annotation aggregation in crowdsourcing via enhanced worker ability modeling 通过增强工人能力建模实现众包中的稳健注释聚合
IF 7.4 1区 管理学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-10-01 DOI: 10.1016/j.ipm.2024.103914
Ju Chen , Jun Feng , Shenyu Zhang , Xiaodong Li , Hamza Djigal
Truth inference in crowdsourcing, which studies how to aggregate noisy and biased annotations from workers with varied expertise, is a fundamental technology powering the quality of crowdsourced annotations. Generally, confusion-matrix-based methods are more promising and worker better, as they model each worker’s ability using a confusion matrix rather than a single real value. However, the imbalanced classes and the insufficient training data caused by the K×K pattern (K refers to the number of classes) are still two major issues for the learning of confusion matrices, which call for a robust modeling structure of workers’ confusion matrices. In this article, we propose in response a Fine-Grained Bayesian Classifier Combination model (FGBCC), in which a combination of K univariate Gaussian distributions and the standard softmax function is exploited with an aim to improve the estimation of workers’ abilities. Compared to existing methods, FGBCC is capable of learning extensive worker behaviors and is less susceptible to these issues that previous methods suffer from, owing to its stronger generalization ability. Moreover, Considering the exact solution to the complex posterior is unavailable, we devise a computationally efficient algorithm to approximate the posterior. Extensive experiments on 24 real-world datasets covering a wide range of domains, verify the clear advantages of FGBCC over 11 state-of-the-art benchmark methods.
众包中的真相推理是研究如何从具有不同专业知识的工作者那里汇总有噪声和有偏差的注释,是提高众包注释质量的一项基本技术。一般来说,基于混淆矩阵的方法更有前途,工作效率也更高,因为它们使用混淆矩阵而不是单一的真实值来模拟每个工作人员的能力。然而,K×K 模式(K 指类的数量)导致的类不平衡和训练数据不足仍是混淆矩阵学习的两大问题,这就要求对工作者的混淆矩阵建立健壮的建模结构。在本文中,我们提出了细粒度贝叶斯分类器组合模型(FGBCC),利用 K 个单变量高斯分布和标准 softmax 函数的组合来改进对工人能力的估计。与现有方法相比,FGBCC 能够学习工人的广泛行为,并且由于其较强的泛化能力,不易受到以往方法所存在的这些问题的影响。此外,考虑到复杂后验的精确解不可用,我们设计了一种计算高效的算法来近似后验。在涵盖广泛领域的 24 个实际数据集上进行的大量实验验证了 FGBCC 相对于 11 种最先进基准方法的明显优势。
{"title":"Robust annotation aggregation in crowdsourcing via enhanced worker ability modeling","authors":"Ju Chen ,&nbsp;Jun Feng ,&nbsp;Shenyu Zhang ,&nbsp;Xiaodong Li ,&nbsp;Hamza Djigal","doi":"10.1016/j.ipm.2024.103914","DOIUrl":"10.1016/j.ipm.2024.103914","url":null,"abstract":"<div><div>Truth inference in crowdsourcing, which studies how to aggregate noisy and biased annotations from workers with varied expertise, is a fundamental technology powering the quality of crowdsourced annotations. Generally, confusion-matrix-based methods are more promising and worker better, as they model each worker’s ability using a confusion matrix rather than a single real value. However, the imbalanced classes and the insufficient training data caused by the <span><math><mrow><mi>K</mi><mo>×</mo><mi>K</mi></mrow></math></span> pattern (<span><math><mi>K</mi></math></span> refers to the number of classes) are still two major issues for the learning of confusion matrices, which call for a robust modeling structure of workers’ confusion matrices. In this article, we propose in response a Fine-Grained Bayesian Classifier Combination model (FGBCC), in which a combination of <span><math><mi>K</mi></math></span> univariate Gaussian distributions and the standard softmax function is exploited with an aim to improve the estimation of workers’ abilities. Compared to existing methods, FGBCC is capable of learning extensive worker behaviors and is less susceptible to these issues that previous methods suffer from, owing to its stronger generalization ability. Moreover, Considering the exact solution to the complex posterior is unavailable, we devise a computationally efficient algorithm to approximate the posterior. Extensive experiments on 24 real-world datasets covering a wide range of domains, verify the clear advantages of FGBCC over 11 state-of-the-art benchmark methods.</div></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":"62 1","pages":"Article 103914"},"PeriodicalIF":7.4,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142417978","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Explainable reasoning over temporal knowledge graphs by pre-trained language model 通过预训练语言模型对时态知识图谱进行可解释推理
IF 7.4 1区 管理学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-10-01 DOI: 10.1016/j.ipm.2024.103903
Qing Li, Guanzhong Wu
Temporal knowledge graph reasoning (TKGR) has been considered as a crucial task for modeling the evolving knowledge, aiming to infer the unknown connections between entities at specific times. Traditional TKGR methods try to aggregate structural information between entities and evolve representations of entities over distinct snapshots, while some other methods attempt to extract temporal logic rules from historical interactions. However, these methods fail to address the continuously emerging unseen entities over time and ignore the historical dependencies between entities and relations. To overcome these limitations, we propose a novel method, termed TPNet, which introduces historical information completion strategy (HICS) and pre-trained language model (PLM) to conduct explainable inductive reasoning over TKGs. Specifically, TPNet extracts reliable temporal logical paths from historical subgraphs using a temporal-correlated search strategy. For unseen entities, we utilize HICS to sample or generate paths to supplement their historical information. Besides, a PLM and a time-aware encoder are introduced to jointly encode the temporal paths, thereby comprehensively capturing dependencies between entities and relations. Moreover, the semantic similarity between the query quadruples and the extracted paths is evaluated to simultaneously optimize the representations of entities and relations. Extensive experiments on entity and relation prediction tasks are conducted to evaluate the performance of TPNet. The experimental results on four benchmark datasets demonstrate the superiority of TPNet over state-of-the-art TKGR methods, achieving improvements of 14.35%, 23.08%, 6.75% and 5.38% on MRR, respectively.
时态知识图谱推理(TKGR)被认为是对不断演化的知识进行建模的一项重要任务,其目的是推断实体之间在特定时间的未知联系。传统的时态知识图推理方法试图聚合实体间的结构信息,并在不同的快照中演化实体的表征,而其他一些方法则试图从历史交互中提取时态逻辑规则。然而,这些方法无法解决随着时间推移不断出现的未知实体问题,也忽略了实体和关系之间的历史依赖关系。为了克服这些局限性,我们提出了一种称为 TPNet 的新方法,它引入了历史信息补全策略(HICS)和预训练语言模型(PLM),对 TKG 进行可解释的归纳推理。具体来说,TPNet 使用时间相关搜索策略从历史子图中提取可靠的时间逻辑路径。对于未见实体,我们利用 HICS 来采样或生成路径,以补充其历史信息。此外,我们还引入了 PLM 和时间感知编码器来共同编码时间路径,从而全面捕捉实体和关系之间的依赖关系。此外,还评估了查询四元组和提取路径之间的语义相似性,从而同时优化实体和关系的表示。为了评估 TPNet 的性能,我们对实体和关系预测任务进行了广泛的实验。在四个基准数据集上的实验结果表明,TPNet 优于最先进的 TKGR 方法,其 MRR 分别提高了 14.35%、23.08%、6.75% 和 5.38%。
{"title":"Explainable reasoning over temporal knowledge graphs by pre-trained language model","authors":"Qing Li,&nbsp;Guanzhong Wu","doi":"10.1016/j.ipm.2024.103903","DOIUrl":"10.1016/j.ipm.2024.103903","url":null,"abstract":"<div><div>Temporal knowledge graph reasoning (TKGR) has been considered as a crucial task for modeling the evolving knowledge, aiming to infer the unknown connections between entities at specific times. Traditional TKGR methods try to aggregate structural information between entities and evolve representations of entities over distinct snapshots, while some other methods attempt to extract temporal logic rules from historical interactions. However, these methods fail to address the continuously emerging unseen entities over time and ignore the historical dependencies between entities and relations. To overcome these limitations, we propose a novel method, termed TPNet, which introduces historical information completion strategy (HICS) and pre-trained language model (PLM) to conduct explainable inductive reasoning over TKGs. Specifically, TPNet extracts reliable temporal logical paths from historical subgraphs using a temporal-correlated search strategy. For unseen entities, we utilize HICS to sample or generate paths to supplement their historical information. Besides, a PLM and a time-aware encoder are introduced to jointly encode the temporal paths, thereby comprehensively capturing dependencies between entities and relations. Moreover, the semantic similarity between the query quadruples and the extracted paths is evaluated to simultaneously optimize the representations of entities and relations. Extensive experiments on entity and relation prediction tasks are conducted to evaluate the performance of TPNet. The experimental results on four benchmark datasets demonstrate the superiority of TPNet over state-of-the-art TKGR methods, achieving improvements of 14.35%, 23.08%, 6.75% and 5.38% on MRR, respectively.</div></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":"62 1","pages":"Article 103903"},"PeriodicalIF":7.4,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142418002","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The joint extraction of fact-condition statement and super relation in scientific text with table filling method 用表格填充法联合提取科学文本中的事实条件陈述和超级关系
IF 7.4 1区 管理学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-10-01 DOI: 10.1016/j.ipm.2024.103906
Qizhi Chen , Hong Yao , Diange Zhou
The fact-condition statements are of great significance in scientific text, via which the natural phenomenon and its precondition are detailly recorded. In previous study, the extraction of fact-condition statement and their relation (super relation) from scientific text is designed as a pipeline that the fact-condition statement and super relation are extracted successively, which leads to the error propagation and lowers the accuracy. To solve this problem, the table filling method is firstly adopted for joint extraction of fact-condition statement and super relation, and the Biaffine Convolution Neural Network model (BCNN) is proposed to complete the task. In the BCNN, the pretrained language model and Biaffine Neural Network work as the encoder, while the Convolution Neural Network is added into the model as the decoder that enhances the local semantic information. Benefiting from the local semantic enhancement, the BCNN achieves the best F1 score with different pretrained language models in comparison with other baselines. Its F1 scores in GeothCF (geological text) reach 73.17% and 71.04% with BERT and SciBERT as pretrained language model, respectively. Moreover, the local semantic enhancement also increases its training efficiency, via which the tags’ distribution can be more easily learned by the model. Besides, the BCNN trained with GeothCF also exhibits the best performance in BioCF (biomedical text), which indicates that it can be widely applied for the information extraction in all scientific domains. Finally, the geological fact-condition knowledge graph is built with BCNN, showing a new pipeline for construction of scientific fact-condition knowledge graph.
事实条件语句在科学文本中具有重要意义,通过它可以详细记录自然现象及其前提条件。在以往的研究中,从科学文本中提取事实条件语句及其关系(超关系)被设计成一个流水线,即先后提取事实条件语句和超关系,这导致了错误的传播,降低了准确性。为解决这一问题,首先采用表格填充法对事实条件语句和超级关系进行联合提取,并提出了双峰卷积神经网络(Biaffine Convolution Neural Network,BCNN)模型来完成这一任务。在 BCNN 中,预训练的语言模型和 Biaffine 神经网络作为编码器工作,而卷积神经网络则作为解码器加入到模型中,以增强局部语义信息。得益于局部语义增强,与其他基线相比,BCNN 在不同的预训练语言模型中取得了最好的 F1 分数。在使用 BERT 和 SciBERT 作为预训练语言模型时,BCNN 在 GeothCF(地质文本)中的 F1 分数分别达到 73.17% 和 71.04%。此外,局部语义增强也提高了其训练效率,通过这种方法,模型可以更容易地学习标签的分布。此外,用 GeothCF 训练的 BCNN 在 BioCF(生物医学文本)中也表现出了最佳性能,这表明它可以广泛应用于所有科学领域的信息提取。最后,利用 BCNN 构建了地质事实条件知识图谱,为科学事实条件知识图谱的构建提供了新的管道。
{"title":"The joint extraction of fact-condition statement and super relation in scientific text with table filling method","authors":"Qizhi Chen ,&nbsp;Hong Yao ,&nbsp;Diange Zhou","doi":"10.1016/j.ipm.2024.103906","DOIUrl":"10.1016/j.ipm.2024.103906","url":null,"abstract":"<div><div>The fact-condition statements are of great significance in scientific text, via which the natural phenomenon and its precondition are detailly recorded. In previous study, the extraction of fact-condition statement and their relation (super relation) from scientific text is designed as a pipeline that the fact-condition statement and super relation are extracted successively, which leads to the error propagation and lowers the accuracy. To solve this problem, the table filling method is firstly adopted for joint extraction of fact-condition statement and super relation, and the Biaffine Convolution Neural Network model (BCNN) is proposed to complete the task. In the BCNN, the pretrained language model and Biaffine Neural Network work as the encoder, while the Convolution Neural Network is added into the model as the decoder that enhances the local semantic information. Benefiting from the local semantic enhancement, the BCNN achieves the best F1 score with different pretrained language models in comparison with other baselines. Its F1 scores in GeothCF (geological text) reach 73.17% and 71.04% with BERT and SciBERT as pretrained language model, respectively. Moreover, the local semantic enhancement also increases its training efficiency, via which the tags’ distribution can be more easily learned by the model. Besides, the BCNN trained with GeothCF also exhibits the best performance in BioCF (biomedical text), which indicates that it can be widely applied for the information extraction in all scientific domains. Finally, the geological fact-condition knowledge graph is built with BCNN, showing a new pipeline for construction of scientific fact-condition knowledge graph.</div></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":"62 1","pages":"Article 103906"},"PeriodicalIF":7.4,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142417752","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Crowdsourced auction-based framework for time-critical and budget-constrained last mile delivery 基于众包拍卖的时间紧迫、预算有限的最后一英里交付框架
IF 7.4 1区 管理学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-09-30 DOI: 10.1016/j.ipm.2024.103888
Esraa Odeh , Shakti Singh , Rabeb Mizouni , Hadi Otrok
This work addresses the problem of Last Mile Delivery (LMD) under time-critical and budget-constrained environments. Given the rapid growth of e-commerce worldwide, LMD has become a primary bottleneck to the efficiency of delivery services due to several factors, including travelling distance, service cost, and delivery time. Existing works mainly target optimizing travelled distance and maximizing gained profit; however, they do not consider time-critical and budget-limited tasks. The deployment of UAVs and the development of crowdsourcing platforms have provided a range of solutions to advance performance in LMD frameworks, as they offer many crowdworkers at varying locations ready to perform tasks instead of having a single point of departure. This work proposes a Hybrid, Crowdsourced, Auction-based LMD (HCA-LMD) framework with a dynamic allocation mechanism for optimized delivery of time-sensitive and budget-limited tasks. The proposed framework allocates tasks to workers as soon as they are submitted, given their urgency level and dropoff location, while considering the price, rating, and location of available workers. This work was compared against two benchmarks to assess the framework’s performance in dynamic environments in terms of on-time deliveries, average delay, and profit. Extensive simulation results showed an outstanding performance of the proposed state-of-the-art LMD framework by accomplishing almost 92% on-time deliveries under varying time- and budget-constrained scenarios, outperforming the first benchmark in the on-time allocation rate by fulfiling an additional 24% of the tasks the benchmark failed, with around 50% drop in average delay time and up to x5.8 gained profit when compared against the second benchmark.
这项研究解决的是时间紧迫和预算有限环境下的最后一英里配送(LMD)问题。鉴于全球电子商务的迅猛发展,受旅行距离、服务成本和交付时间等多种因素的影响,最后一英里配送已成为影响配送服务效率的主要瓶颈。现有研究主要以优化运输距离和最大化收益为目标,但没有考虑时间紧迫和预算有限的任务。无人机的部署和众包平台的发展为提高 LMD 框架的性能提供了一系列解决方案,因为它们在不同地点提供了许多准备执行任务的众包者,而不是只有一个出发点。本研究提出了一种基于拍卖的混合众包 LMD(HCA-LMD)框架,该框架具有动态分配机制,可优化时间敏感型和预算有限型任务的交付。根据任务的紧急程度和下达地点,提议的框架会在任务提交后立即将任务分配给工人,同时考虑可用工人的价格、等级和地点。这项工作与两个基准进行了比较,从准时交货、平均延迟和利润方面评估了该框架在动态环境中的性能。广泛的仿真结果表明,所提出的最先进的 LMD 框架表现出色,在时间和预算受限的不同情况下,完成了近 92% 的准时交付,在准时分配率方面优于第一个基准,额外完成了 24% 基准未能完成的任务,与第二个基准相比,平均延迟时间减少了约 50%,利润增加了高达 x5.8。
{"title":"Crowdsourced auction-based framework for time-critical and budget-constrained last mile delivery","authors":"Esraa Odeh ,&nbsp;Shakti Singh ,&nbsp;Rabeb Mizouni ,&nbsp;Hadi Otrok","doi":"10.1016/j.ipm.2024.103888","DOIUrl":"10.1016/j.ipm.2024.103888","url":null,"abstract":"<div><div>This work addresses the problem of Last Mile Delivery (LMD) under time-critical and budget-constrained environments. Given the rapid growth of e-commerce worldwide, LMD has become a primary bottleneck to the efficiency of delivery services due to several factors, including travelling distance, service cost, and delivery time. Existing works mainly target optimizing travelled distance and maximizing gained profit; however, they do not consider time-critical and budget-limited tasks. The deployment of UAVs and the development of crowdsourcing platforms have provided a range of solutions to advance performance in LMD frameworks, as they offer many crowdworkers at varying locations ready to perform tasks instead of having a single point of departure. This work proposes a Hybrid, Crowdsourced, Auction-based LMD (HCA-LMD) framework with a dynamic allocation mechanism for optimized delivery of time-sensitive and budget-limited tasks. The proposed framework allocates tasks to workers as soon as they are submitted, given their urgency level and dropoff location, while considering the price, rating, and location of available workers. This work was compared against two benchmarks to assess the framework’s performance in dynamic environments in terms of on-time deliveries, average delay, and profit. Extensive simulation results showed an outstanding performance of the proposed state-of-the-art LMD framework by accomplishing almost 92% on-time deliveries under varying time- and budget-constrained scenarios, outperforming the first benchmark in the on-time allocation rate by fulfiling an additional 24% of the tasks the benchmark failed, with around 50% drop in average delay time and up to x5.8 gained profit when compared against the second benchmark.</div></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":"62 1","pages":"Article 103888"},"PeriodicalIF":7.4,"publicationDate":"2024-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142356984","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An interpretable polytomous cognitive diagnosis framework for predicting examinee performance 用于预测考生成绩的可解释多项式认知诊断框架
IF 7.4 1区 管理学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-09-29 DOI: 10.1016/j.ipm.2024.103913
Xiaoyu Li , Shaoyang Guo , Jin Wu , Chanjin Zheng
As a fundamental task of intelligent education, deep learning-based cognitive diagnostic models (CDMs) have been introduced to effectively model dichotomous testing data. However, it remains a challenge to model the polytomous data within the deep-learning framework. This paper proposed a novel Polytomous Cognitive Diagnosis Framework (PCDF), which employs Cumulative Category Response Function (CCRF) theory to partition and consolidate data, thereby enabling existing cognitive diagnostic models to seamlessly analyze graded response data. By combining the proposed PCDF with IRT, MIRT, NCDM, KaNCD, and ICDM, extensive experiments were complemented by data re-encoding techniques on the four real-world graded scoring datasets, along with baseline methods such as linear-split, one-vs-all, and random. The results suggest that when combined with existing CDMs, PCDF outperforms the baseline models in terms of prediction. Additionally, we showcase the interpretability of examinee ability and item parameters through the utilization of PCDF.
作为智能教育的一项基本任务,基于深度学习的认知诊断模型(CDM)已被引入,以有效地对二分测试数据进行建模。然而,如何在深度学习框架内对多态数据建模仍是一个挑战。本文提出了一种新颖的多态认知诊断框架(PCDF),它采用累积类别响应函数(CCRF)理论来分割和整合数据,从而使现有的认知诊断模型能够无缝地分析分级响应数据。通过将所提出的 PCDF 与 IRT、MIRT、NCDM、KaNCD 和 ICDM 相结合,在四个真实世界的分级评分数据集上进行了广泛的实验,并辅以数据重新编码技术,以及线性拆分、one-vs-all 和随机等基线方法。结果表明,当与现有的 CDM 相结合时,PCDF 在预测方面优于基线模型。此外,我们还展示了利用 PCDF 对考生能力和项目参数的可解释性。
{"title":"An interpretable polytomous cognitive diagnosis framework for predicting examinee performance","authors":"Xiaoyu Li ,&nbsp;Shaoyang Guo ,&nbsp;Jin Wu ,&nbsp;Chanjin Zheng","doi":"10.1016/j.ipm.2024.103913","DOIUrl":"10.1016/j.ipm.2024.103913","url":null,"abstract":"<div><div>As a fundamental task of intelligent education, deep learning-based cognitive diagnostic models (CDMs) have been introduced to effectively model dichotomous testing data. However, it remains a challenge to model the polytomous data within the deep-learning framework. This paper proposed a novel <strong>P</strong>olytomous <strong>C</strong>ognitive <strong>D</strong>iagnosis <strong>F</strong>ramework (PCDF), which employs <strong>C</strong>umulative <strong>C</strong>ategory <strong>R</strong>esponse <strong>F</strong>unction (CCRF) theory to partition and consolidate data, thereby enabling existing cognitive diagnostic models to seamlessly analyze graded response data. By combining the proposed PCDF with IRT, MIRT, NCDM, KaNCD, and ICDM, extensive experiments were complemented by data re-encoding techniques on the four real-world graded scoring datasets, along with baseline methods such as linear-split, one-vs-all, and random. The results suggest that when combined with existing CDMs, PCDF outperforms the baseline models in terms of prediction. Additionally, we showcase the interpretability of examinee ability and item parameters through the utilization of PCDF.</div></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":"62 1","pages":"Article 103913"},"PeriodicalIF":7.4,"publicationDate":"2024-09-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142356982","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A theoretical framework for human-centered intelligent information services: A systematic review 以人为本的智能信息服务理论框架:系统回顾
IF 7.4 1区 管理学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-09-29 DOI: 10.1016/j.ipm.2024.103891
Qiao Li, Yuelin Li, Shuhan Zhang, Xin Zhou, Zhengyuan Pan
Intelligent Information Services (IIS) employ Artificial Intelligence (AI)-based systems to provide information that matches the user's needs in diverse and evolving environments. Acknowledging the importance of users in AI-empowered IIS success, a growing number of researchers are investigating AI-empowered IIS from a user-centric perspective, establishing the foundation for a new research domain called “Human-Centered Intelligent Information Services” (HCIIS). Nonetheless, a review of user studies in AI-empowered IIS is still lacking, impeding the development of a clear definition and research framework for the HCIIS field. To fill this gap, this study conducts a systematic review of 116 user studies in AI-empowered IIS. Results reveal two primary research themes in user studies in AI-empowered IIS: human-IIS interaction (including user experience, system quality, user attitude, intention and behavior, information quality, and individual task performance) and IIS ethics (e.g., explainability and interpretability, privacy and safety, and inclusivity). Analyzing research gaps within these topics, this study formulates an HCIIS research framework consisting of three interconnected elements: human values and needs, environment, and service. The interconnections between each pair of elements identify three key research domains in HCIIS: interaction, ethics, and evolution. Interaction pertains to the facilitation of human-IIS interaction to meet human needs, encompassing topics including human-centered theory, evaluation, and the design of AI-empowered IIS interaction. Ethics emphasize ensuring AI-empowered IIS alignment with human values and norms within specific environments, covering topics like general and context-specific AI-empowered IIS ethical principles, risk assessment, and governance strategies. Evolution focuses on addressing the fulfillment of human needs in diverse and dynamic environments by continually evolving intelligence, involving the enhancement of AI-empowered IIS environmental sensitivity and adaptability within an intelligent ecosystem driven by technology integration. Central to HCIIS is co-creation, situated at the intersection of interaction, evolution, and ethics, emphasizing collaborative information creation between IIS and humans through hybrid intelligence. In conclusion, HCIIS is defined as a field centered on information co-creation between IIS and humans, distinguishing it from IIS, which focuses on providing information to humans.
智能信息服务(IIS)采用基于人工智能(AI)的系统,在多样化和不断发展的环境中提供符合用户需求的信息。越来越多的研究人员认识到用户对人工智能智能信息服务成功的重要性,正在从以用户为中心的角度研究人工智能智能信息服务,为 "以人为本的智能信息服务"(HCIIS)这一新的研究领域奠定了基础。然而,目前仍缺乏对人工智能赋能的智能信息服务的用户研究的综述,这阻碍了为人机交互智能信息服务领域制定明确的定义和研究框架。为了填补这一空白,本研究对人工智能赋能的智能信息系统中的 116 项用户研究进行了系统回顾。研究结果揭示了人工智能赋能 IIS 用户研究的两个主要研究主题:人-IIS 交互(包括用户体验、系统质量、用户态度、意图和行为、信息质量和个人任务绩效)和 IIS 伦理(如可解释性和可解释性、隐私和安全性以及包容性)。通过分析这些主题中的研究空白,本研究制定了人机交互信息系统研究框架,该框架由三个相互关联的要素组成:人类价值与需求、环境和服务。每对元素之间的相互联系确定了人机交互信息系统的三个关键研究领域:交互、伦理和进化。交互涉及促进人与信息系统之间的交互,以满足人类的需求,包括以人为本的理论、评估和人工智能赋能的 IIS 交互设计等主题。伦理强调确保人工智能赋能的IIS在特定环境中符合人类的价值观和规范,涵盖的主题包括一般和特定环境下人工智能赋能的IIS伦理原则、风险评估和管理策略。进化侧重于通过不断进化的智能来满足人类在多样化动态环境中的需求,包括在技术集成驱动的智能生态系统中提高人工智能赋能的智能信息系统对环境的敏感性和适应性。HCIIS 的核心是共同创造,它位于交互、进化和伦理的交叉点,强调 IIS 与人类通过混合智能协同创造信息。总之,HCIIS 被定义为一个以 IIS 与人类共同创造信息为中心的领域,有别于侧重于向人类提供信息的 IIS。
{"title":"A theoretical framework for human-centered intelligent information services: A systematic review","authors":"Qiao Li,&nbsp;Yuelin Li,&nbsp;Shuhan Zhang,&nbsp;Xin Zhou,&nbsp;Zhengyuan Pan","doi":"10.1016/j.ipm.2024.103891","DOIUrl":"10.1016/j.ipm.2024.103891","url":null,"abstract":"<div><div>Intelligent Information Services (IIS) employ Artificial Intelligence (AI)-based systems to provide information that matches the user's needs in diverse and evolving environments. Acknowledging the importance of users in AI-empowered IIS success, a growing number of researchers are investigating AI-empowered IIS from a user-centric perspective, establishing the foundation for a new research domain called “Human-Centered Intelligent Information Services” (HCIIS). Nonetheless, a review of user studies in AI-empowered IIS is still lacking, impeding the development of a clear definition and research framework for the HCIIS field. To fill this gap, this study conducts a systematic review of 116 user studies in AI-empowered IIS. Results reveal two primary research themes in user studies in AI-empowered IIS: human-IIS interaction (including user experience, system quality, user attitude, intention and behavior, information quality, and individual task performance) and IIS ethics (e.g., explainability and interpretability, privacy and safety, and inclusivity). Analyzing research gaps within these topics, this study formulates an HCIIS research framework consisting of three interconnected elements: human values and needs, environment, and service. The interconnections between each pair of elements identify three key research domains in HCIIS: interaction, ethics, and evolution. Interaction pertains to the facilitation of human-IIS interaction to meet human needs, encompassing topics including human-centered theory, evaluation, and the design of AI-empowered IIS interaction. Ethics emphasize ensuring AI-empowered IIS alignment with human values and norms within specific environments, covering topics like general and context-specific AI-empowered IIS ethical principles, risk assessment, and governance strategies. Evolution focuses on addressing the fulfillment of human needs in diverse and dynamic environments by continually evolving intelligence, involving the enhancement of AI-empowered IIS environmental sensitivity and adaptability within an intelligent ecosystem driven by technology integration. Central to HCIIS is co-creation, situated at the intersection of interaction, evolution, and ethics, emphasizing collaborative information creation between IIS and humans through hybrid intelligence. In conclusion, HCIIS is defined as a field centered on information co-creation between IIS and humans, distinguishing it from IIS, which focuses on providing information to humans.</div></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":"62 1","pages":"Article 103891"},"PeriodicalIF":7.4,"publicationDate":"2024-09-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142356983","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
DualFLAT: Dual Flat-Lattice Transformer for domain-specific Chinese named entity recognition DualFLAT:用于特定领域中文命名实体识别的双平面-网格变换器
IF 7.4 1区 管理学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-09-28 DOI: 10.1016/j.ipm.2024.103902
Yinlong Xiao , Zongcheng Ji , Jianqiang Li , Qing Zhu
Recently, lexicon-enhanced methods for Chinese Named Entity Recognition (NER) have achieved great success which requires a high-quality lexicon. However, for the domain-specific Chinese NER, it is challenging to obtain such a high-quality lexicon due to the different distribution between the general lexicon and domain-specific data, and the high construction cost of the domain lexicon. To address these challenges, we introduce dual-source lexicons (i.e., a general lexicon and a domain lexicon) to acquire enriched lexical knowledge. Considering that the general lexicon often contains more noise compared to its domain counterparts, we further propose a dual-stream model, Dual Flat-LAttice Transformer (DualFLAT), designed to mitigate the impact of noise originating from the general lexicon while comprehensively harnessing the knowledge contained within the dual-source lexicons. Experimental results on three public domain-specific Chinese NER datasets (i.e., News, Novel and E-commerce) demonstrate that our method consistently outperforms the single-source lexicon-enhanced approaches, achieving state-of-the-art results. Specifically, our proposed DualFLAT model consistently outperforms the baseline FLAT, with an increase of up to 1.52%, 4.84% and 1.34% in F1 score for the News, Novel and E-commerce datasets, respectively.
最近,用于中文命名实体识别(NER)的词典增强方法取得了巨大成功,这需要高质量的词典。然而,对于特定领域的中文 NER,由于通用词库和特定领域数据的分布不同,以及领域词库的构建成本较高,要获得这样一个高质量的词库具有挑战性。为了应对这些挑战,我们引入了双源词典(即通用词典和领域词典)来获取丰富的词汇知识。考虑到与领域词库相比,通用词库通常包含更多噪声,我们进一步提出了一种双流模型--双扁平阶梯转换器(Dual Flat-LAttice Transformer,DualFLAT),旨在减轻来自通用词库的噪声的影响,同时全面利用双源词库中包含的知识。在三个公共领域特定中文 NER 数据集(即新闻、小说和电子商务)上的实验结果表明,我们的方法始终优于单源词典增强方法,取得了最先进的结果。具体来说,我们提出的 DualFLAT 模型始终优于基线 FLAT,在新闻、小说和电子商务数据集上的 F1 分数分别提高了 1.52%、4.84% 和 1.34%。
{"title":"DualFLAT: Dual Flat-Lattice Transformer for domain-specific Chinese named entity recognition","authors":"Yinlong Xiao ,&nbsp;Zongcheng Ji ,&nbsp;Jianqiang Li ,&nbsp;Qing Zhu","doi":"10.1016/j.ipm.2024.103902","DOIUrl":"10.1016/j.ipm.2024.103902","url":null,"abstract":"<div><div>Recently, lexicon-enhanced methods for Chinese Named Entity Recognition (NER) have achieved great success which requires a high-quality lexicon. However, for the domain-specific Chinese NER, it is challenging to obtain such a high-quality lexicon due to the different distribution between the general lexicon and domain-specific data, and the high construction cost of the domain lexicon. To address these challenges, we introduce dual-source lexicons (<em>i.e.,</em> a general lexicon and a domain lexicon) to acquire enriched lexical knowledge. Considering that the general lexicon often contains more noise compared to its domain counterparts, we further propose a dual-stream model, Dual Flat-LAttice Transformer (DualFLAT), designed to mitigate the impact of noise originating from the general lexicon while comprehensively harnessing the knowledge contained within the dual-source lexicons. Experimental results on three public domain-specific Chinese NER datasets (<em>i.e.,</em> News, Novel and E-commerce) demonstrate that our method consistently outperforms the single-source lexicon-enhanced approaches, achieving state-of-the-art results. Specifically, our proposed DualFLAT model consistently outperforms the baseline FLAT, with an increase of up to 1.52%, 4.84% and 1.34% in F1 score for the News, Novel and E-commerce datasets, respectively.</div></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":"62 1","pages":"Article 103902"},"PeriodicalIF":7.4,"publicationDate":"2024-09-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142356981","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Information Processing & Management
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1