Data Technologies and Applications最新文献_第9页

Analyzing the structure of tourism destination network based on digital footprints: taking Guilin, China as a case 基于数字足迹的旅游目的地网络结构分析——以桂林为例

IF 1.6 4区计算机科学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS

Data Technologies and Applications

Pub Date : 2022-05-23 DOI: 10.1108/dta-09-2021-0240

Caihua Yu, Tonghui Lian, Hongbao Geng, Sixin Li

PurposeThis paper gathers tourism digital footprint from online travel platforms, choosing social network analysis method to learn the structure of destination networks and to probe into the features of tourist flow network structure and flow characteristics in Guilin of China.Design/methodology/approachThe digital footprint of tourists can be applied to study the behaviors and laws of digital footprint. This research contributes to improving the understanding of demand-driven network relationships among tourist attractions in a destination.Findings(1) Yulong River, Yangshuo West Street, Longji Terraced Fields, Silver Rock and Four Lakes are the divergent and agglomerative centers of tourist flow, which are the top tourist attractions for transiting tourists. (2) The core-periphery structure of the network is clearly stratified. More specifically, the core nodes in the network are prominent and the core area of the network has weak interaction with the peripheral area. (3) There are eight cohesive subgroups in the network structure, which contains certain differences in the radiation effects.Originality/valueThis research aims at exploring the spatial network structure characteristics of tourism flows in Guilin by analyzing the online footprints of tourists. It takes a good try to analyze the application of network footprint with the research of tourism flow characteristics, and also provides a theoretical reference for the design of tourist routes and the cooperative marketing among various attractions.

目的收集在线旅游平台的旅游数字足迹，采用社会网络分析方法了解目的地网络结构，探讨桂林市旅游流网络结构特征和流特征。设计/方法/途径游客的数字足迹可以用来研究数字足迹的行为和规律。研究结果表明:(1)遇龙河、阳朔西街、龙基梯田、银岩和四湖是旅游流的发散和集聚中心，是游客中转的首选旅游景点。(2)网络的核心-外围结构分层明显。更具体地说，网络中的核心节点突出，网络核心区与外围区域的相互作用弱。(3)网络结构中存在8个内聚亚群，其辐射效应存在一定差异。原创性/价值本研究旨在通过对游客在线足迹的分析，探索桂林市旅游流的空间网络结构特征。通过对旅游流特征的研究来分析网络足迹的应用，为旅游线路的设计和各景点之间的合作营销提供理论参考。

{"title":"Analyzing the structure of tourism destination network based on digital footprints: taking Guilin, China as a case","authors":"Caihua Yu, Tonghui Lian, Hongbao Geng, Sixin Li","doi":"10.1108/dta-09-2021-0240","DOIUrl":"https://doi.org/10.1108/dta-09-2021-0240","url":null,"abstract":"PurposeThis paper gathers tourism digital footprint from online travel platforms, choosing social network analysis method to learn the structure of destination networks and to probe into the features of tourist flow network structure and flow characteristics in Guilin of China.Design/methodology/approachThe digital footprint of tourists can be applied to study the behaviors and laws of digital footprint. This research contributes to improving the understanding of demand-driven network relationships among tourist attractions in a destination.Findings(1) Yulong River, Yangshuo West Street, Longji Terraced Fields, Silver Rock and Four Lakes are the divergent and agglomerative centers of tourist flow, which are the top tourist attractions for transiting tourists. (2) The core-periphery structure of the network is clearly stratified. More specifically, the core nodes in the network are prominent and the core area of the network has weak interaction with the peripheral area. (3) There are eight cohesive subgroups in the network structure, which contains certain differences in the radiation effects.Originality/valueThis research aims at exploring the spatial network structure characteristics of tourism flows in Guilin by analyzing the online footprints of tourists. It takes a good try to analyze the application of network footprint with the research of tourism flow characteristics, and also provides a theoretical reference for the design of tourist routes and the cooperative marketing among various attractions.","PeriodicalId":56156,"journal":{"name":"Data Technologies and Applications","volume":"1 1","pages":"56-83"},"PeriodicalIF":1.6,"publicationDate":"2022-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89735602","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2

Privacy-preserving techniques in recommender systems: state-of-the-art review and future research agenda 推荐系统中的隐私保护技术:最新的审查和未来的研究议程

IF 1.6 4区计算机科学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS

Data Technologies and Applications

Pub Date : 2022-05-04 DOI: 10.1108/dta-02-2022-0083

Dhanya Pramod

PurposeThis study explores privacy challenges in recommender systems (RSs) and how they have leveraged privacy-preserving technology for risk mitigation. The study also elucidates the extent of adopting privacy-preserving RSs and postulates the future direction of research in RS security.Design/methodology/approachThe study gathered articles from well-known databases such as SCOPUS, Web of Science and Google scholar. A systematic literature review using PRISMA was carried out on the 41 papers that are shortlisted for study. Two research questions were framed to carry out the review.FindingsIt is evident from this study that privacy issues in the RS have been addressed with various techniques. However, many more challenges are expected while leveraging technology advancements for fine-tuning recommenders, and a research agenda has been devised by postulating future directions.Originality/valueThe study unveils a new comprehensive perspective regarding privacy preservation in recommenders. There is no promising study found that gathers techniques used for privacy protection. The study summarizes the research agenda, and it will be a good reference article for those who develop privacy-preserving RSs.

本研究探讨了推荐系统(RSs)中的隐私挑战，以及它们如何利用隐私保护技术来降低风险。该研究还阐明了采用隐私保护RSs的程度，并对RS安全的未来研究方向进行了展望。设计/方法/方法本研究从SCOPUS、Web of Science和Google scholar等知名数据库中收集文章。采用PRISMA对入选的41篇论文进行系统的文献综述。为了进行审查，我们提出了两个研究问题。从这项研究中可以明显看出，RS中的隐私问题已经通过各种技术得到了解决。然而，在利用技术进步进行微调推荐时，预计会遇到更多挑战，并且通过假设未来的方向设计了一个研究议程。独创性/价值该研究揭示了关于推荐人隐私保护的一个新的综合视角。没有一项有希望的研究发现收集了用于隐私保护的技术。该研究总结了研究议程，对于开发保护隐私RSs的人来说是一篇很好的参考文章。

{"title":"Privacy-preserving techniques in recommender systems: state-of-the-art review and future research agenda","authors":"Dhanya Pramod","doi":"10.1108/dta-02-2022-0083","DOIUrl":"https://doi.org/10.1108/dta-02-2022-0083","url":null,"abstract":"PurposeThis study explores privacy challenges in recommender systems (RSs) and how they have leveraged privacy-preserving technology for risk mitigation. The study also elucidates the extent of adopting privacy-preserving RSs and postulates the future direction of research in RS security.Design/methodology/approachThe study gathered articles from well-known databases such as SCOPUS, Web of Science and Google scholar. A systematic literature review using PRISMA was carried out on the 41 papers that are shortlisted for study. Two research questions were framed to carry out the review.FindingsIt is evident from this study that privacy issues in the RS have been addressed with various techniques. However, many more challenges are expected while leveraging technology advancements for fine-tuning recommenders, and a research agenda has been devised by postulating future directions.Originality/valueThe study unveils a new comprehensive perspective regarding privacy preservation in recommenders. There is no promising study found that gathers techniques used for privacy protection. The study summarizes the research agenda, and it will be a good reference article for those who develop privacy-preserving RSs.","PeriodicalId":56156,"journal":{"name":"Data Technologies and Applications","volume":"21 1","pages":"32-55"},"PeriodicalIF":1.6,"publicationDate":"2022-05-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87140677","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 11

A deep neural networks-based fusion model for COVID-19 rumor detection from online social media 基于深度神经网络的新型冠状病毒谣言检测融合模型

IF 1.6 4区计算机科学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS

Data Technologies and Applications

Pub Date : 2022-04-22 DOI: 10.1108/dta-06-2021-0160

Heng-yang Lu, Jun Yang, Wei Fang, Xiaoning Song, Chongjun Wang

PurposeThe COVID-19 has become a global pandemic, which has caused large number of deaths and huge economic losses. These losses are not only caused by the virus but also by the related rumors. Nowadays, online social media are quite popular, where billions of people express their opinions and propagate information. Rumors about COVID-19 posted on online social media usually spread rapidly; it is hard to analyze and detect rumors only by artificial processing. The purpose of this paper is to propose a novel model called the Topic-Comment-based Rumor Detection model (TopCom) to detect rumors as soon as possible.Design/methodology/approachThe authors conducted COVID-19 rumor detection from Sina Weibo, one of the most widely used Chinese online social media. The authors constructed a dataset about COVID-19 from January 1 to June 30, 2020 with a web crawler, including both rumor and non-rumors. The rumor detection task is regarded as a binary classification problem. The proposed TopCom model exploits the topical memory networks to fuse latent topic information with original microblogs, which solves the sparsity problems brought by short-text microblogs. In addition, TopCom fuses comments with corresponding microblogs to further improve the performance.FindingsExperimental results on a publicly available dataset and the proposed COVID dataset have shown superiority and efficiency compared with baselines. The authors further randomly selected microblogs posted from July 1–31, 2020 for the case study, which also shows the effectiveness and application prospects for detecting rumors about COVID-19 automatically.Originality/valueThe originality of TopCom lies in the fusion of latent topic information of original microblogs and corresponding comments with DNNs-based models for the COVID-19 rumor detection task, whose value is to help detect rumors automatically in a short time.

新冠肺炎疫情已成为全球性流行病，造成大量人员死亡和巨大经济损失。这些损失不仅是由病毒造成的，而且是由相关谣言造成的。如今，在线社交媒体非常受欢迎，数十亿人在这里表达自己的观点和传播信息。在网络社交媒体上发布的有关新冠肺炎的谣言通常传播迅速;仅靠人工处理很难分析和发现谣言。本文的目的是提出一种新的模型，即基于topic - comment的谣言检测模型(TopCom)，以尽快检测谣言。设计/方法/方法作者在中国最广泛使用的在线社交媒体之一新浪微博上进行了COVID-19谣言检测。作者使用网络爬虫构建了2020年1月1日至6月30日的COVID-19数据集，包括谣言和非谣言。将谣言检测任务视为一个二元分类问题。提出的TopCom模型利用主题记忆网络将潜在话题信息与原始微博融合，解决了短文本微博带来的稀疏性问题。此外，TopCom还将评论与相应的微博进行融合，进一步提升性能。与基线相比，在公开可用数据集和本文提出的COVID数据集上的实验结果显示出优越性和效率。作者进一步随机选取2020年7月1日至31日发布的微博进行案例研究，也显示了自动检测COVID-19谣言的有效性和应用前景。TopCom的独创性/价值TopCom的独创性在于将原创微博的潜在话题信息和相应评论与基于dnns的模型融合在一起进行COVID-19谣言检测任务，其价值在于帮助在短时间内自动检测谣言。

{"title":"A deep neural networks-based fusion model for COVID-19 rumor detection from online social media","authors":"Heng-yang Lu, Jun Yang, Wei Fang, Xiaoning Song, Chongjun Wang","doi":"10.1108/dta-06-2021-0160","DOIUrl":"https://doi.org/10.1108/dta-06-2021-0160","url":null,"abstract":"PurposeThe COVID-19 has become a global pandemic, which has caused large number of deaths and huge economic losses. These losses are not only caused by the virus but also by the related rumors. Nowadays, online social media are quite popular, where billions of people express their opinions and propagate information. Rumors about COVID-19 posted on online social media usually spread rapidly; it is hard to analyze and detect rumors only by artificial processing. The purpose of this paper is to propose a novel model called the Topic-Comment-based Rumor Detection model (TopCom) to detect rumors as soon as possible.Design/methodology/approachThe authors conducted COVID-19 rumor detection from Sina Weibo, one of the most widely used Chinese online social media. The authors constructed a dataset about COVID-19 from January 1 to June 30, 2020 with a web crawler, including both rumor and non-rumors. The rumor detection task is regarded as a binary classification problem. The proposed TopCom model exploits the topical memory networks to fuse latent topic information with original microblogs, which solves the sparsity problems brought by short-text microblogs. In addition, TopCom fuses comments with corresponding microblogs to further improve the performance.FindingsExperimental results on a publicly available dataset and the proposed COVID dataset have shown superiority and efficiency compared with baselines. The authors further randomly selected microblogs posted from July 1–31, 2020 for the case study, which also shows the effectiveness and application prospects for detecting rumors about COVID-19 automatically.Originality/valueThe originality of TopCom lies in the fusion of latent topic information of original microblogs and corresponding comments with DNNs-based models for the COVID-19 rumor detection task, whose value is to help detect rumors automatically in a short time.","PeriodicalId":56156,"journal":{"name":"Data Technologies and Applications","volume":"60 1","pages":"806-824"},"PeriodicalIF":1.6,"publicationDate":"2022-04-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74968915","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Hybrid data analytic technique for grading fairness 公平评分的混合数据分析技术

IF 1.6 4区计算机科学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS

Data Technologies and Applications

Pub Date : 2022-04-20 DOI: 10.1108/dta-01-2022-0047

T. Banditwattanawong, A. Jankasem, Masawee Masdisornchote

PurposeFair grading produces learning ability levels that are understandable and acceptable to both learners and instructors. Norm-referenced grading can be achieved by several means such as z score, K-means and a heuristic. However, these methods typically deliver the varied degrees of grading fairness depending on input score data.Design/methodology/approachTo attain the fairest grading, this paper proposes a hybrid algorithm that integrates z score, K-means and heuristic methods with a novel fairness objective function as a decision function.FindingsDepending on an experimented data set, each of the algorithm's constituent methods could deliver the fairest grading results with fairness degrees ranging from 0.110 to 0.646. We also pointed out key factors in the fairness improvement of norm-referenced achievement grading.Originality/valueThe main contributions of this paper are four folds: the definition of fair norm-referenced grading requirements, a hybrid algorithm for fair norm-referenced grading, a fairness metric for norm-referenced grading and the fairness performance results of the statistical, heuristic and machine learning methods.

目的:公平的评分产生学习者和教师都能理解和接受的学习能力水平。标准参照评分可以通过z分数、k均值和启发式等几种方法来实现。然而，这些方法通常根据输入的分数数据提供不同程度的评分公平性。设计/方法/方法为了实现最公平的评分，本文提出了一种混合算法，该算法将z分数、k均值和启发式方法相结合，并以一种新的公平目标函数作为决策函数。根据实验数据集的不同，每个算法的组成方法都能给出最公平的评分结果，公平度在0.110到0.646之间。本文还指出了提高标准参照成绩评分公平性的关键因素。本文的主要贡献有四个方面:公平标准参考评分要求的定义、公平标准参考评分的混合算法、公平标准参考评分的度量以及统计、启发式和机器学习方法的公平性能结果。

{"title":"Hybrid data analytic technique for grading fairness","authors":"T. Banditwattanawong, A. Jankasem, Masawee Masdisornchote","doi":"10.1108/dta-01-2022-0047","DOIUrl":"https://doi.org/10.1108/dta-01-2022-0047","url":null,"abstract":"PurposeFair grading produces learning ability levels that are understandable and acceptable to both learners and instructors. Norm-referenced grading can be achieved by several means such as z score, K-means and a heuristic. However, these methods typically deliver the varied degrees of grading fairness depending on input score data.Design/methodology/approachTo attain the fairest grading, this paper proposes a hybrid algorithm that integrates z score, K-means and heuristic methods with a novel fairness objective function as a decision function.FindingsDepending on an experimented data set, each of the algorithm's constituent methods could deliver the fairest grading results with fairness degrees ranging from 0.110 to 0.646. We also pointed out key factors in the fairness improvement of norm-referenced achievement grading.Originality/valueThe main contributions of this paper are four folds: the definition of fair norm-referenced grading requirements, a hybrid algorithm for fair norm-referenced grading, a fairness metric for norm-referenced grading and the fairness performance results of the statistical, heuristic and machine learning methods.","PeriodicalId":56156,"journal":{"name":"Data Technologies and Applications","volume":"34 1","pages":"18-31"},"PeriodicalIF":1.6,"publicationDate":"2022-04-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83654519","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Utility optimization-based multi-stakeholder personalized recommendation system 基于效用优化的多利益相关者个性化推荐系统

IF 1.6 4区计算机科学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS

Data Technologies and Applications

Pub Date : 2022-04-15 DOI: 10.1108/dta-07-2021-0182

Rahul Shrivastava, Dilip Singh Sisodia, N. K. Nagwani

PurposeIn a multi-stakeholder recommender system (MSRS), stakeholders are the multiple entities (consumer, producer, system, etc.) benefited by the generated recommendations. Traditionally, the exclusive focus on only a single stakeholders' (for example, only consumer or end-user) preferences obscured the welfare of the others. Two major challenges are encountered while incorporating the multiple stakeholders' perspectives in MSRS: designing a dedicated utility function for each stakeholder and optimizing their utility without hurting others. This paper proposes multiple utility functions for different stakeholders and optimizes these functions for generating balanced, personalized recommendations for each stakeholder.Design/methodology/approachThe proposed methodology considers four valid stakeholders user, producer, cast and recommender system from the multi-stakeholder recommender setting and builds dedicated utility functions. The utility function for users incorporates enhanced side-information-based similarity computation for utility count. Similarly, to improve the utility gain, the authors design new utility functions for producer, star-cast and system to incorporate long-tail and diverse items in the recommendation list. Next, to balance the utility gain and generate the trade-off recommendation solution, the authors perform the evolutionary optimization of the conflicting utility functions using NSGA-II. Experimental evaluation and comparison are conducted over three benchmark data sets.FindingsThe authors observed 19.70% of average enhancement in utility gain with improved mean precision, diversity and novelty. Exposure, hit, reach and target reach metrics are substantially improved.Originality/valueA new approach considers four stakeholders simultaneously with their respective utility functions and establishes the trade-off recommendation solution between conflicting utilities of the stakeholders.

在多利益相关者推荐系统(MSRS)中，利益相关者是从生成的推荐中受益的多个实体(消费者、生产者、系统等)。传统上，只关注单个利益相关者(例如，只关注消费者或最终用户)的偏好，掩盖了其他人的福利。在MSRS中纳入多个利益相关者的观点时，遇到了两个主要挑战:为每个利益相关者设计一个专用的效用函数，并在不损害他人的情况下优化他们的效用。本文提出了针对不同利益相关者的多个效用函数，并对这些函数进行了优化，以便为每个利益相关者生成平衡的、个性化的建议。设计/方法/方法提出的方法从多利益相关者推荐设置中考虑了四个有效的利益相关者用户、生产者、演员和推荐系统，并构建了专用的实用函数。用户效用函数包含增强的基于侧信息的相似性计算，用于效用计数。同样，为了提高效用增益，作者为制作人、演员和系统设计了新的效用函数，在推荐列表中加入了长尾和多样化的项目。其次，为了平衡效用增益并生成权衡推荐方案，作者使用NSGA-II对冲突效用函数进行进化优化。在三个基准数据集上进行了实验评估和比较。作者观察到，在平均精度、多样性和新颖性方面，效用增益平均提高了19.70%。曝光率、点击率、覆盖面和目标覆盖率指标都得到了显著改善。原创性/价值一种新的方法同时考虑四个利益相关者及其各自的效用函数，并在利益相关者的冲突效用之间建立权衡推荐解决方案。

{"title":"Utility optimization-based multi-stakeholder personalized recommendation system","authors":"Rahul Shrivastava, Dilip Singh Sisodia, N. K. Nagwani","doi":"10.1108/dta-07-2021-0182","DOIUrl":"https://doi.org/10.1108/dta-07-2021-0182","url":null,"abstract":"PurposeIn a multi-stakeholder recommender system (MSRS), stakeholders are the multiple entities (consumer, producer, system, etc.) benefited by the generated recommendations. Traditionally, the exclusive focus on only a single stakeholders' (for example, only consumer or end-user) preferences obscured the welfare of the others. Two major challenges are encountered while incorporating the multiple stakeholders' perspectives in MSRS: designing a dedicated utility function for each stakeholder and optimizing their utility without hurting others. This paper proposes multiple utility functions for different stakeholders and optimizes these functions for generating balanced, personalized recommendations for each stakeholder.Design/methodology/approachThe proposed methodology considers four valid stakeholders user, producer, cast and recommender system from the multi-stakeholder recommender setting and builds dedicated utility functions. The utility function for users incorporates enhanced side-information-based similarity computation for utility count. Similarly, to improve the utility gain, the authors design new utility functions for producer, star-cast and system to incorporate long-tail and diverse items in the recommendation list. Next, to balance the utility gain and generate the trade-off recommendation solution, the authors perform the evolutionary optimization of the conflicting utility functions using NSGA-II. Experimental evaluation and comparison are conducted over three benchmark data sets.FindingsThe authors observed 19.70% of average enhancement in utility gain with improved mean precision, diversity and novelty. Exposure, hit, reach and target reach metrics are substantially improved.Originality/valueA new approach considers four stakeholders simultaneously with their respective utility functions and establishes the trade-off recommendation solution between conflicting utilities of the stakeholders.","PeriodicalId":56156,"journal":{"name":"Data Technologies and Applications","volume":"45 1","pages":"782-805"},"PeriodicalIF":1.6,"publicationDate":"2022-04-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81241359","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

An Argentine ant system algorithm for partial set covering problem 部分集覆盖问题的阿根廷蚁系统算法

IF 1.6 4区计算机科学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS

Data Technologies and Applications

Pub Date : 2022-04-13 DOI: 10.1108/dta-08-2021-0205

Xiaofan Liu, Yupeng Zhou, Minghao Yin, Shuai Lv

PurposeThe paper aims to provide an efficient meta-heuristic algorithm to solve the partial set covering problem (PSCP). With rich application scenarios, the PSCP is a fascinating and well-known non-deterministic polynomial (NP)-hard problem whose goal is to cover at least k elements with as few subsets as possible.Design/methodology/approachIn this work, the authors present a novel variant of the ant colony optimization (ACO) algorithm, called Argentine ant system (AAS), to deal with the PSCP. The developed AAS is an integrated system of different populations that use the same pheromone to communicate. Moreover, an effective local search framework with the relaxed configuration checking (RCC) and the volatilization-fixed weight mechanism is proposed to improve the exploitation of the algorithm.FindingsA detailed experimental evaluation of 75 instances reveals that the proposed algorithm outperforms the competitors in terms of the quality of the optimal solutions. Also, the performance of AAS gradually improves with the growing instance size, which shows the potential in handling complex practical scenarios. Finally, the designed components of AAS are experimentally proved to be beneficial to the whole framework. Finally, the key components in AAS have been demonstrated.Originality/valueAt present, there is no heuristic method to solve this problem. The authors present the first implementation of heuristic algorithm for solving PSCP and provide competitive solutions.

目的提供一种有效的元启发式算法来解决部分集覆盖问题(PSCP)。PSCP具有丰富的应用场景，是一个引人入胜且众所周知的非确定性多项式(NP)难题，其目标是用尽可能少的子集覆盖至少k个元素。在这项工作中，作者提出了蚁群优化(ACO)算法的一种新变体，称为阿根廷蚂蚁系统(AAS)，用于处理PSCP。发达的AAS是不同种群使用同一信息素进行交流的综合系统。在此基础上，提出了一种有效的局部搜索框架，结合松弛配置检查(RCC)和挥发固定权机制，提高了算法的可开发性。对75个实例的详细实验评估表明，所提出的算法在最优解的质量方面优于竞争对手。此外，随着实例大小的增加，AAS的性能逐渐提高，这显示了处理复杂实际场景的潜力。最后，实验证明了所设计的AAS组件对整个框架是有益的。最后，对原子吸收系统的关键部件进行了演示。目前，还没有启发式的方法来解决这个问题。作者提出了求解PSCP的启发式算法的第一个实现，并提供了竞争性的解决方案。

{"title":"An Argentine ant system algorithm for partial set covering problem","authors":"Xiaofan Liu, Yupeng Zhou, Minghao Yin, Shuai Lv","doi":"10.1108/dta-08-2021-0205","DOIUrl":"https://doi.org/10.1108/dta-08-2021-0205","url":null,"abstract":"PurposeThe paper aims to provide an efficient meta-heuristic algorithm to solve the partial set covering problem (PSCP). With rich application scenarios, the PSCP is a fascinating and well-known non-deterministic polynomial (NP)-hard problem whose goal is to cover at least k elements with as few subsets as possible.Design/methodology/approachIn this work, the authors present a novel variant of the ant colony optimization (ACO) algorithm, called Argentine ant system (AAS), to deal with the PSCP. The developed AAS is an integrated system of different populations that use the same pheromone to communicate. Moreover, an effective local search framework with the relaxed configuration checking (RCC) and the volatilization-fixed weight mechanism is proposed to improve the exploitation of the algorithm.FindingsA detailed experimental evaluation of 75 instances reveals that the proposed algorithm outperforms the competitors in terms of the quality of the optimal solutions. Also, the performance of AAS gradually improves with the growing instance size, which shows the potential in handling complex practical scenarios. Finally, the designed components of AAS are experimentally proved to be beneficial to the whole framework. Finally, the key components in AAS have been demonstrated.Originality/valueAt present, there is no heuristic method to solve this problem. The authors present the first implementation of heuristic algorithm for solving PSCP and provide competitive solutions.","PeriodicalId":56156,"journal":{"name":"Data Technologies and Applications","volume":"19 1","pages":"762-781"},"PeriodicalIF":1.6,"publicationDate":"2022-04-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85384157","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A collaborative trend prediction method using the crowdsourced wisdom of web search engines 基于网络搜索引擎众包智慧的协同趋势预测方法

IF 1.6 4区计算机科学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS

Data Technologies and Applications

Pub Date : 2022-03-28 DOI: 10.1108/dta-08-2021-0209

Ze-Han Fang, C. Chen

PurposeThe purpose of this paper is to propose a novel collaborative trend prediction method to estimate the status of trending topics by crowdsourcing the wisdom in web search engines. Government officials and decision makers can take advantage of the proposed method to effectively analyze various trending topics and make appropriate decisions in response to fast-changing national and international situations or popular opinions.Design/methodology/approachIn this study, a crowdsourced-wisdom-based feature selection method was designed to select representative indicators showing trending topics and concerns of the general public. The authors also designed a novel prediction method to estimate the trending topic statuses by crowdsourcing public opinion in web search engines.FindingsThe authors’ proposed method achieved better results than traditional trend prediction methods and successfully predict trending topic statuses by using the crowdsourced wisdom of web search engines.Originality/valueThis paper proposes a novel collaborative trend prediction method and applied it to various trending topics. The experimental results show that the authors’ method can successfully estimate the trending topic statuses and outperform other baseline methods. To the best of the authors’ knowledge, this is the first such attempt to predict trending topic statuses by using the crowdsourced wisdom of web search engines.

本文的目的是提出一种新的协同趋势预测方法，通过众包网络搜索引擎中的智慧来估计趋势话题的状态。政府官员和决策者可以利用所提出的方法有效地分析各种趋势话题，并根据快速变化的国内和国际形势或民意做出适当的决策。设计/方法/方法本研究设计了一种基于众包智慧的特征选择方法，以选择具有代表性的指标来显示趋势话题和公众关注的问题。作者还设计了一种新的预测方法，通过在网络搜索引擎中众包民意来估计趋势话题的状态。作者提出的方法比传统的趋势预测方法取得了更好的结果，并利用网络搜索引擎的众包智慧成功地预测了趋势话题状态。本文提出了一种新颖的协同趋势预测方法，并将其应用于各种趋势话题。实验结果表明，该方法可以成功地估计趋势话题状态，优于其他基线方法。据作者所知，这是第一次尝试利用网络搜索引擎的众包智慧来预测热门话题的状态。

{"title":"A collaborative trend prediction method using the crowdsourced wisdom of web search engines","authors":"Ze-Han Fang, C. Chen","doi":"10.1108/dta-08-2021-0209","DOIUrl":"https://doi.org/10.1108/dta-08-2021-0209","url":null,"abstract":"PurposeThe purpose of this paper is to propose a novel collaborative trend prediction method to estimate the status of trending topics by crowdsourcing the wisdom in web search engines. Government officials and decision makers can take advantage of the proposed method to effectively analyze various trending topics and make appropriate decisions in response to fast-changing national and international situations or popular opinions.Design/methodology/approachIn this study, a crowdsourced-wisdom-based feature selection method was designed to select representative indicators showing trending topics and concerns of the general public. The authors also designed a novel prediction method to estimate the trending topic statuses by crowdsourcing public opinion in web search engines.FindingsThe authors’ proposed method achieved better results than traditional trend prediction methods and successfully predict trending topic statuses by using the crowdsourced wisdom of web search engines.Originality/valueThis paper proposes a novel collaborative trend prediction method and applied it to various trending topics. The experimental results show that the authors’ method can successfully estimate the trending topic statuses and outperform other baseline methods. To the best of the authors’ knowledge, this is the first such attempt to predict trending topic statuses by using the crowdsourced wisdom of web search engines.","PeriodicalId":56156,"journal":{"name":"Data Technologies and Applications","volume":"13 1","pages":"741-761"},"PeriodicalIF":1.6,"publicationDate":"2022-03-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80768703","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Ranking the ontology development methodologies using the weighted decision matrix 使用加权决策矩阵对本体开发方法进行排序

IF 1.6 4区计算机科学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS

Data Technologies and Applications

Pub Date : 2022-03-18 DOI: 10.1108/dta-05-2021-0123

P. K. Sinha, Biswanath Dutta, Udaya Varadarajan

PurposeThe current work provides a framework for the ranking of ontology development methodologies (ODMs).Design/methodology/approachThe framework is a step-by-step approach reinforced by an array of ranking features and a quantitative tool, weighted decision matrix. An extensive literature investigation revealed a set of aspects that regulate ODMs. The aspects and existing state-of-the-art estimates facilitated in extracting the features. To determine weight to each of the features, an online survey was implemented to secure evidence from the Semantic Web community. To demonstrate the framework, the authors perform a pilot study, where a collection of domain ODMs, reported in 2000–2019, is used.FindingsState-of-the-art research revealed that ODMs have been accumulated, surveyed and assessed to prescribe the best probable ODM for ontology development. But none of the prevailing studies provide a ranking mechanism for ODMs. The recommended framework overcomes this limitation and gives a systematic and uniform way of ranking the ODMs. The pilot study yielded NeOn as the top-ranked ODM in the recent two decades.Originality/valueThere is no work in the literature that has investigated ranking the ODMs. Hence, this is a first of its kind work in the area of ODM research. The framework supports identifying the topmost ODMs from the literature possessing a substantial amount of features for ontology development. It also enables the selection of the best possible ODM for the ontology development.

目的本研究为本体开发方法(odm)的排序提供了一个框架。设计/方法/方法该框架是一种循序渐进的方法，由一系列排名特征和量化工具加权决策矩阵加强。一项广泛的文献调查揭示了调节odm的一系列方面。方面和现有的最先进的估计有助于提取特征。为了确定每个特征的权重，我们实施了一项在线调查，以确保来自语义Web社区的证据。为了演示该框架，作者进行了一项试点研究，其中使用了2000-2019年报告的一系列领域odm。最新的研究表明，已经积累、调查和评估了ODM，以规定本体开发的最佳ODM。但是，没有一项主流研究提供odm的排名机制。推荐的框架克服了这一限制，并提供了对odm进行排序的系统和统一的方法。初步研究表明，NeOn是近二十年来排名第一的ODM。原创性/价值文献中没有研究odm排名的工作。因此，这是ODM研究领域的首次此类工作。该框架支持从具有大量本体开发功能的文献中识别最顶级的odm。它还支持为本体开发选择最好的ODM。

{"title":"Ranking the ontology development methodologies using the weighted decision matrix","authors":"P. K. Sinha, Biswanath Dutta, Udaya Varadarajan","doi":"10.1108/dta-05-2021-0123","DOIUrl":"https://doi.org/10.1108/dta-05-2021-0123","url":null,"abstract":"PurposeThe current work provides a framework for the ranking of ontology development methodologies (ODMs).Design/methodology/approachThe framework is a step-by-step approach reinforced by an array of ranking features and a quantitative tool, weighted decision matrix. An extensive literature investigation revealed a set of aspects that regulate ODMs. The aspects and existing state-of-the-art estimates facilitated in extracting the features. To determine weight to each of the features, an online survey was implemented to secure evidence from the Semantic Web community. To demonstrate the framework, the authors perform a pilot study, where a collection of domain ODMs, reported in 2000–2019, is used.FindingsState-of-the-art research revealed that ODMs have been accumulated, surveyed and assessed to prescribe the best probable ODM for ontology development. But none of the prevailing studies provide a ranking mechanism for ODMs. The recommended framework overcomes this limitation and gives a systematic and uniform way of ranking the ODMs. The pilot study yielded NeOn as the top-ranked ODM in the recent two decades.Originality/valueThere is no work in the literature that has investigated ranking the ODMs. Hence, this is a first of its kind work in the area of ODM research. The framework supports identifying the topmost ODMs from the literature possessing a substantial amount of features for ontology development. It also enables the selection of the best possible ODM for the ontology development.","PeriodicalId":56156,"journal":{"name":"Data Technologies and Applications","volume":"54 1","pages":"686-719"},"PeriodicalIF":1.6,"publicationDate":"2022-03-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86877468","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

3MO-AHP: an inconsistency reduction approach through mono-, multi- or many-objective quality measures 3MO-AHP:通过单目标、多目标或多目标质量度量来减少不一致的方法

IF 1.6 4区计算机科学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS

Data Technologies and Applications

Pub Date : 2022-02-18 DOI: 10.1108/dta-11-2021-0315

C. Floriano, Valdecy Pereira, Brunno e Souza Rodrigues

PurposeAlthough the multi-criteria technique analytic hierarchy process (AHP) has successfully been applied in many areas, either selecting or ranking alternatives or to derive priority vector (weights) for a set of criteria, there is a significant drawback in using this technique if the pairwise comparison matrix (PCM) has inconsistent comparisons, in other words, a consistency ratio (CR) above the value of 0.1, the final solution cannot be validated. Many studies have been developed to treat the inconsistency problem, but few of them tried to satisfy different quality measures, which are minimum inconsistency (fMI), the total number of adjusted pairwise comparisons (fNC), original rank preservation (fKT), minimum average weights adjustment (fWA) and finally, minimum L1 matrix norm between the original PCM and the adjusted PCM (fLM).Design/methodology/approachThe approach is defined in four steps: first, the decision-maker should choose which quality measures she/he wishes to use, ranging from one to all quality measures. In the second step, the authors encode the PCM to be used in a many-objective optimization algorithm (MOOA), and each pairwise comparison can be adjusted individually. The authors generate consistent solutions from the obtained Pareto optimal front that carry the desired quality measures in the third step. Lastly, the decision-maker selects the most suitable solution for her/his problem. Remarkably, as the decision-maker can choose one (mono-objective), two (multi-objective), three or more (many-objectives) quality measures, not all MOOAs can handle or perform well in mono- or multi-objective problems. The unified non-sorting algorithm III (U-NSGA III) is the most appropriate MOOA for this type of scenario because it was specially designed to handle mono-, multi- and many-objective problems.FindingsThe use of two quality measures should not guarantee that the adjusted PCM is similar to the original PCM; hence, the decision-maker should consider using more quality measures if the objective is to preserve the original PCM characteristics.Originality/valueFor the first time, a many-objective approach reduces the CR to consistent levels with the ability to consider one or more quality measures and allows the decision-maker to adjust each pairwise comparison individually.

虽然多准则技术层次分析法(AHP)已经成功地应用于许多领域，无论是选择或排序备选方案，还是为一组标准推导优先向量(权重)，但如果配对比较矩阵(PCM)具有不一致的比较，即一致性比(CR)大于0.1，则使用该技术存在一个显著的缺点，即无法验证最终解决方案。针对不一致性问题的研究已经有很多，但很少有研究试图满足不同的质量指标，即最小不一致性(fMI)、调整后的两两比较总数(fNC)、原始秩保持(fKT)、最小平均权值调整(fWA)以及原始PCM与调整后的PCM之间的最小L1矩阵范数(fLM)。设计/方法论/方法方法定义为四个步骤:首先，决策者应该选择她/他希望使用的质量度量，范围从一个到所有的质量度量。第二步，对PCM进行编码，用于多目标优化算法(MOOA)，每对比较都可以单独调整。在第三步中，作者从得到的帕累托最优前沿生成了具有期望质量度量的一致解。最后，决策者选择最适合自己问题的解决方案。值得注意的是，由于决策者可以选择一个(单目标)，两个(多目标)，三个或更多(多目标)质量度量，并非所有mooa都可以处理或在单目标或多目标问题中表现良好。统一非排序算法III (U-NSGA III)是最适合这种场景的MOOA，因为它是专门为处理单目标、多目标和多目标问题而设计的。两种质量措施的使用不能保证调整后的PCM与原PCM相似;因此，如果目标是保持原有的PCM特性，决策者应该考虑使用更多的质量度量。原创性/价值多目标方法第一次将CR降低到具有考虑一个或多个质量度量的能力的一致水平，并允许决策者单独调整每个两两比较。

{"title":"3MO-AHP: an inconsistency reduction approach through mono-, multi- or many-objective quality measures","authors":"C. Floriano, Valdecy Pereira, Brunno e Souza Rodrigues","doi":"10.1108/dta-11-2021-0315","DOIUrl":"https://doi.org/10.1108/dta-11-2021-0315","url":null,"abstract":"PurposeAlthough the multi-criteria technique analytic hierarchy process (AHP) has successfully been applied in many areas, either selecting or ranking alternatives or to derive priority vector (weights) for a set of criteria, there is a significant drawback in using this technique if the pairwise comparison matrix (PCM) has inconsistent comparisons, in other words, a consistency ratio (CR) above the value of 0.1, the final solution cannot be validated. Many studies have been developed to treat the inconsistency problem, but few of them tried to satisfy different quality measures, which are minimum inconsistency (fMI), the total number of adjusted pairwise comparisons (fNC), original rank preservation (fKT), minimum average weights adjustment (fWA) and finally, minimum L1 matrix norm between the original PCM and the adjusted PCM (fLM).Design/methodology/approachThe approach is defined in four steps: first, the decision-maker should choose which quality measures she/he wishes to use, ranging from one to all quality measures. In the second step, the authors encode the PCM to be used in a many-objective optimization algorithm (MOOA), and each pairwise comparison can be adjusted individually. The authors generate consistent solutions from the obtained Pareto optimal front that carry the desired quality measures in the third step. Lastly, the decision-maker selects the most suitable solution for her/his problem. Remarkably, as the decision-maker can choose one (mono-objective), two (multi-objective), three or more (many-objectives) quality measures, not all MOOAs can handle or perform well in mono- or multi-objective problems. The unified non-sorting algorithm III (U-NSGA III) is the most appropriate MOOA for this type of scenario because it was specially designed to handle mono-, multi- and many-objective problems.FindingsThe use of two quality measures should not guarantee that the adjusted PCM is similar to the original PCM; hence, the decision-maker should consider using more quality measures if the objective is to preserve the original PCM characteristics.Originality/valueFor the first time, a many-objective approach reduces the CR to consistent levels with the ability to consider one or more quality measures and allows the decision-maker to adjust each pairwise comparison individually.","PeriodicalId":56156,"journal":{"name":"Data Technologies and Applications","volume":"35 1","pages":"645-670"},"PeriodicalIF":1.6,"publicationDate":"2022-02-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77746376","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 5

Text mining the mission statements of the most ethical companies 文本挖掘最有道德的公司的使命宣言

IF 1.6 4区计算机科学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS

Data Technologies and Applications

Pub Date : 2022-02-15 DOI: 10.1108/dta-10-2021-0280

T. Bayrak

PurposeThis paper explores and examines the mission statements of the most ethical companies across the globe in terms of their main purposes, values, goals, and objective, and what they say about their vision and goals.Design/methodology/approachThis study is based on the data published by the Ethisphere Institute, the global leader in defining and advancing the standards of ethical business practices. Having compiled the mission statements into a text file, the authors conducted text mining using a commercially available text mining tool SAS Enterprise Miner to survey if the most ethical companies have valued the same vision and mission such as social responsibility and ethics.FindingsA review of their mission statements indicated that some of the most ethical companies surveyed in this study such as 3M and Voya strive to be “socially responsible and ethical,” support their “societies” and respect and protect the “nature,” “planet” and “environment.” The world's most ethical companies that stress these weighted terms in their mission statements may do so to show their commitment by being socially responsible and ethical, and delivering sustainable business solutions to their customers.Originality/valueThis study provides a systematic and comprehensive exploration of mission statements of the most ethical companies in an attempt to identify patterns of differences and similarities within these statements.

本文探讨并考察了全球最具道德公司的使命宣言，包括其主要目的、价值观、目标和目的，以及他们对自己的愿景和目标的看法。设计/方法/方法本研究基于道德村研究所(Ethisphere Institute)发布的数据，该研究所是定义和推进道德商业实践标准的全球领导者。在将使命声明汇编成文本文件后，作者使用商业上可用的文本挖掘工具SAS Enterprise Miner进行文本挖掘，以调查最具道德的公司是否重视相同的愿景和使命，如社会责任和道德。对3M和Voya等公司的使命声明的回顾表明，在这项研究中，一些最具道德的公司努力做到“对社会负责和道德”，支持他们的“社会”，尊重和保护“自然”、“地球”和“环境”。世界上最具道德的公司在其使命声明中强调这些加权条款，可能会通过对社会负责和道德，并为客户提供可持续的商业解决方案来展示他们的承诺。原创性/价值本研究对最具道德的公司的使命宣言进行了系统和全面的探索，试图找出这些宣言中的差异和相似之处。

{"title":"Text mining the mission statements of the most ethical companies","authors":"T. Bayrak","doi":"10.1108/dta-10-2021-0280","DOIUrl":"https://doi.org/10.1108/dta-10-2021-0280","url":null,"abstract":"PurposeThis paper explores and examines the mission statements of the most ethical companies across the globe in terms of their main purposes, values, goals, and objective, and what they say about their vision and goals.Design/methodology/approachThis study is based on the data published by the Ethisphere Institute, the global leader in defining and advancing the standards of ethical business practices. Having compiled the mission statements into a text file, the authors conducted text mining using a commercially available text mining tool SAS Enterprise Miner to survey if the most ethical companies have valued the same vision and mission such as social responsibility and ethics.FindingsA review of their mission statements indicated that some of the most ethical companies surveyed in this study such as 3M and Voya strive to be “socially responsible and ethical,” support their “societies” and respect and protect the “nature,” “planet” and “environment.” The world's most ethical companies that stress these weighted terms in their mission statements may do so to show their commitment by being socially responsible and ethical, and delivering sustainable business solutions to their customers.Originality/valueThis study provides a systematic and comprehensive exploration of mission statements of the most ethical companies in an attempt to identify patterns of differences and similarities within these statements.","PeriodicalId":56156,"journal":{"name":"Data Technologies and Applications","volume":"25 1","pages":"671-685"},"PeriodicalIF":1.6,"publicationDate":"2022-02-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80594930","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0