首页 > 最新文献

International Journal on Semantic Web and Information Systems最新文献

英文 中文
NIR Spectroscopy Oranges Origin Identification Framework Based on Machine Learning 基于机器学习的近红外光谱橙子来源识别框架
IF 3.2 4区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2022-01-01 DOI: 10.4018/ijswis.297039
Songjian Dan
Research on the identification model of orange origin based on machine learning in Near infrared (NIR) spectroscopy. According to the characteristics of NIR spectral data, a complete general framework for origin identification is proposed. It includes steps such as data preprocessing, feature selection, model building and cross validation. Compare multiple preprocessing algorithms and multiple machine learning algorithms under the framework. Based on NIR spectroscopy to identify the origin of orange, a good identification result was obtained. Improve the accuracy of orange origin identification and obtained the best origin identification accuracy of 92.8%.
近红外光谱中基于机器学习的橙子产地识别模型研究。根据近红外光谱数据的特点,提出了一个完整的产地识别总体框架。它包括数据预处理、特征选择、模型构建和交叉验证等步骤。比较框架下的多种预处理算法和多种机器学习算法。采用近红外光谱法对橙子进行产地鉴别,取得了较好的鉴别结果。提高了橙源鉴定的准确度,获得了最佳的橙源鉴定准确率为92.8%。
{"title":"NIR Spectroscopy Oranges Origin Identification Framework Based on Machine Learning","authors":"Songjian Dan","doi":"10.4018/ijswis.297039","DOIUrl":"https://doi.org/10.4018/ijswis.297039","url":null,"abstract":"Research on the identification model of orange origin based on machine learning in Near infrared (NIR) spectroscopy. According to the characteristics of NIR spectral data, a complete general framework for origin identification is proposed. It includes steps such as data preprocessing, feature selection, model building and cross validation. Compare multiple preprocessing algorithms and multiple machine learning algorithms under the framework. Based on NIR spectroscopy to identify the origin of orange, a good identification result was obtained. Improve the accuracy of orange origin identification and obtained the best origin identification accuracy of 92.8%.","PeriodicalId":54934,"journal":{"name":"International Journal on Semantic Web and Information Systems","volume":"26 1","pages":"1-16"},"PeriodicalIF":3.2,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82245416","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Chaotic Whale Crow Optimization Algorithm for Secure Routing In Iot Environment 物联网环境下安全路由的混沌鲸鱼乌鸦优化算法
IF 3.2 4区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2022-01-01 DOI: 10.4018/ijswis.300824
Meghana G. Raj
This paper solves the Internet of Things (IoT) security issues by introducing a Chaotic Whale Crow (CWC) optimization, which is the integration of Chaotic Whale Optimization Algorithm (CWOA) in Crow Search Algorithm (CSA). The framework operates on two crucial aspects: one is to select the secure nodes, and the other is to implement secure routing using the selected trusted nodes. First, the selection of trusted nodes is performed based on trust factors like direct, indirect, forwarding rate, integrity, and availability factors. Then, the selected trusted nodes are adapted for trust-based secure routing, which is optimally performed using the proposed CWC, based on the fitness parameters trust and energy. Finally, the proposed CWC is evaluated, which revealed high performance with a minimal delay of 191.46ms, which shows 14.87%, 7.35%, 6.82%, 4.19%, and 5.74% improved performance comapred to existing LaSeR, PM Ipv6, secTrust-RPL RISA, and LSDAR techniques. Similarly, the proposed method obtained the maximal energy of 71.25J, and maximal throughput of 129.77kbps.
本文通过引入混沌鲸鱼乌鸦算法(CWC)来解决物联网安全问题,该算法是混沌鲸鱼优化算法(CWOA)与乌鸦搜索算法(CSA)的集成。该框架在两个关键方面进行操作:一个是选择安全节点,另一个是使用所选的可信节点实现安全路由。首先,基于直接、间接、转发率、完整性和可用性等信任因素进行可信节点的选择。然后,将选择的可信节点适配到基于信任的安全路由中,并基于适应度参数信任和能量,使用所提出的CWC进行最优执行。最后,对CWC进行了评估,结果表明,该CWC具有较高的性能,最小延迟为191.46ms,与现有的LaSeR、PM Ipv6、secTrust-RPL RISA和LSDAR技术相比,性能分别提高了14.87%、7.35%、6.82%、4.19%和5.74%。同样,该方法的最大能量为71.25J,最大吞吐量为129.77kbps。
{"title":"Chaotic Whale Crow Optimization Algorithm for Secure Routing In Iot Environment","authors":"Meghana G. Raj","doi":"10.4018/ijswis.300824","DOIUrl":"https://doi.org/10.4018/ijswis.300824","url":null,"abstract":"This paper solves the Internet of Things (IoT) security issues by introducing a Chaotic Whale Crow (CWC) optimization, which is the integration of Chaotic Whale Optimization Algorithm (CWOA) in Crow Search Algorithm (CSA). The framework operates on two crucial aspects: one is to select the secure nodes, and the other is to implement secure routing using the selected trusted nodes. First, the selection of trusted nodes is performed based on trust factors like direct, indirect, forwarding rate, integrity, and availability factors. Then, the selected trusted nodes are adapted for trust-based secure routing, which is optimally performed using the proposed CWC, based on the fitness parameters trust and energy. Finally, the proposed CWC is evaluated, which revealed high performance with a minimal delay of 191.46ms, which shows 14.87%, 7.35%, 6.82%, 4.19%, and 5.74% improved performance comapred to existing LaSeR, PM Ipv6, secTrust-RPL RISA, and LSDAR techniques. Similarly, the proposed method obtained the maximal energy of 71.25J, and maximal throughput of 129.77kbps.","PeriodicalId":54934,"journal":{"name":"International Journal on Semantic Web and Information Systems","volume":"42 1","pages":""},"PeriodicalIF":3.2,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84594625","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
A Differential Epidemic Model for Information, Misinformation and Disinformation in Online Social Networks 在线社交网络中信息、错误信息和虚假信息的差异流行模型
IF 3.2 4区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2022-01-01 DOI: 10.4018/ijswis.300827
N. Narayan, R. Jha, A. Singh
These days the online social network has become a huge source of data. People are actively sharing information on these platforms. The data on online social networks can be misinformation, information, and disinformation. Because online social network has become an important part of our life, so the information on online social networks makes a great impact on us. Here a differential epidemic model for information, misinformation, and disinformation on online social networks is proposed. The expression for basic reproduction number has been developed. Again, the stability condition for the system at both infection-free and endemic equilibriums points has been discussed. The Numerical simulation has been performed to validate our theoretical results. Again, with the help of data available on twitter related to COVID-19 vaccination is used to perform the experiment. Finally, discuss about the control strategy to minimize the misinformation and disinformation related to vaccination.
如今,这个在线社交网络已经成为一个巨大的数据来源。人们在这些平台上积极地分享信息。在线社交网络上的数据可以是错误信息、信息和虚假信息。因为在线社交网络已经成为我们生活的重要组成部分,所以在线社交网络上的信息对我们产生了很大的影响。本文提出了在线社交网络上信息、错误信息和虚假信息的差异流行模型。提出了基本繁殖数的表达式。再次,讨论了系统在无感染平衡点和地方病平衡点的稳定性条件。通过数值模拟验证了理论结果。同样,在推特上与COVID-19疫苗接种相关的数据的帮助下,进行了实验。最后,讨论了控制策略,以尽量减少与疫苗接种有关的错误信息和虚假信息。
{"title":"A Differential Epidemic Model for Information, Misinformation and Disinformation in Online Social Networks","authors":"N. Narayan, R. Jha, A. Singh","doi":"10.4018/ijswis.300827","DOIUrl":"https://doi.org/10.4018/ijswis.300827","url":null,"abstract":"These days the online social network has become a huge source of data. People are actively sharing information on these platforms. The data on online social networks can be misinformation, information, and disinformation. Because online social network has become an important part of our life, so the information on online social networks makes a great impact on us. Here a differential epidemic model for information, misinformation, and disinformation on online social networks is proposed. The expression for basic reproduction number has been developed. Again, the stability condition for the system at both infection-free and endemic equilibriums points has been discussed. The Numerical simulation has been performed to validate our theoretical results. Again, with the help of data available on twitter related to COVID-19 vaccination is used to perform the experiment. Finally, discuss about the control strategy to minimize the misinformation and disinformation related to vaccination.","PeriodicalId":54934,"journal":{"name":"International Journal on Semantic Web and Information Systems","volume":"24 1","pages":""},"PeriodicalIF":3.2,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81392102","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Analyzing the Sociodemographic Factors Impacting the Use of Virtual Reality for Controlling Obesity 影响使用虚拟现实控制肥胖的社会人口因素分析
IF 3.2 4区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2022-01-01 DOI: 10.4018/ijswis.300819
Mona A. Alduailij, W. Alhalabi, Mai A. Alduailij, Amal Al-Rashee, Eatedal Alabdulkareem, Seham Saad Alharb
Obesity is one of the most pressing issues in society today. Virtual reality has been used in the design of tools that promotes obesity control. However, the design of current VR tools lacks the involvement of prospective users and health practitioners. Such engagement is crucial in gathering semantic information that identifies stakeholders’ needs and ensures that all aspects of health are considered. Therefore, this paper aims to study the sociodemographic factors and individual-level characteristics and preferences that make the design of any obesity-control VR tool effective and satisfactory for a wide range of users. The paper also aims to solicit opinions of health practitioners to identify best health aspects that should be available in the design of any VR tool for obesity control. Organizations, businesses, and people will be able to readily augment such VR technologies on the semantic web, as well as on personal and mobile devices.
肥胖是当今社会最紧迫的问题之一。虚拟现实已被用于设计促进肥胖控制的工具。然而,当前虚拟现实工具的设计缺乏潜在用户和卫生从业人员的参与。这种参与对于收集语义信息至关重要,这些信息可确定利益攸关方的需求,并确保考虑到卫生的所有方面。因此,本文旨在研究社会人口学因素和个人层面的特征和偏好,这些因素使任何肥胖控制VR工具的设计都能有效地满足广大用户的需求。本文还旨在征求健康从业者的意见,以确定在设计任何用于肥胖控制的VR工具时应该提供的最佳健康方面。组织、企业和个人将能够很容易地在语义网以及个人和移动设备上增强这种VR技术。
{"title":"Analyzing the Sociodemographic Factors Impacting the Use of Virtual Reality for Controlling Obesity","authors":"Mona A. Alduailij, W. Alhalabi, Mai A. Alduailij, Amal Al-Rashee, Eatedal Alabdulkareem, Seham Saad Alharb","doi":"10.4018/ijswis.300819","DOIUrl":"https://doi.org/10.4018/ijswis.300819","url":null,"abstract":"Obesity is one of the most pressing issues in society today. Virtual reality has been used in the design of tools that promotes obesity control. However, the design of current VR tools lacks the involvement of prospective users and health practitioners. Such engagement is crucial in gathering semantic information that identifies stakeholders’ needs and ensures that all aspects of health are considered. Therefore, this paper aims to study the sociodemographic factors and individual-level characteristics and preferences that make the design of any obesity-control VR tool effective and satisfactory for a wide range of users. The paper also aims to solicit opinions of health practitioners to identify best health aspects that should be available in the design of any VR tool for obesity control. Organizations, businesses, and people will be able to readily augment such VR technologies on the semantic web, as well as on personal and mobile devices.","PeriodicalId":54934,"journal":{"name":"International Journal on Semantic Web and Information Systems","volume":"86 1","pages":"1-38"},"PeriodicalIF":3.2,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"90940371","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Intrusion Detection Using Normalized Mutual Information Feature Selection and Parallel Quantum Genetic Algorithm 基于归一化互信息特征选择和并行量子遗传算法的入侵检测
IF 3.2 4区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2022-01-01 DOI: 10.4018/ijswis.307324
Zhang Ling, Zhang Jia Hao
This paper presents a detection algorithm using normalized mutual information feature selection and cooperative evolution of multiple operators based on adaptive parallel quantum genetic algorithm (NMIFS MOP- AQGA). The proposed algorithm is to address the problems that the intrusion detection system (IDS) has lower the detection speed, less adaptability and lower detection accuracy. In order to achieve an effective reduction for high-dimensional feature data, the NMIFS method is used to select the best feature combination. The best features are sent to the MOP- AQGA classifier for learning and training, and the intrusion detectors are obtained. The data are fed into the detection algorithm to ultimately generate accurate detection results. The experimental results on real abnormal data demonstrate that the NMIFS MOP- AQGA method has higher detection accuracy, lower false negative rate and higher adaptive performance than the existing detection methods, especially for small samples sets.
提出了一种基于自适应并行量子遗传算法(NMIFS MOP- AQGA)的基于归一化互信息特征选择和多算子协同进化的检测算法。该算法是针对入侵检测系统检测速度慢、适应性差、检测精度低等问题而提出的。为了实现对高维特征数据的有效约简,采用NMIFS方法选择最佳特征组合。将最佳特征发送给MOP- AQGA分类器进行学习和训练,得到入侵检测器。这些数据被输入到检测算法中,最终产生准确的检测结果。在真实异常数据上的实验结果表明,与现有的检测方法相比,NMIFS MOP- AQGA方法具有更高的检测精度、更低的假阴性率和更高的自适应性能,特别是对于小样本集。
{"title":"Intrusion Detection Using Normalized Mutual Information Feature Selection and Parallel Quantum Genetic Algorithm","authors":"Zhang Ling, Zhang Jia Hao","doi":"10.4018/ijswis.307324","DOIUrl":"https://doi.org/10.4018/ijswis.307324","url":null,"abstract":"This paper presents a detection algorithm using normalized mutual information feature selection and cooperative evolution of multiple operators based on adaptive parallel quantum genetic algorithm (NMIFS MOP- AQGA). The proposed algorithm is to address the problems that the intrusion detection system (IDS) has lower the detection speed, less adaptability and lower detection accuracy. In order to achieve an effective reduction for high-dimensional feature data, the NMIFS method is used to select the best feature combination. The best features are sent to the MOP- AQGA classifier for learning and training, and the intrusion detectors are obtained. The data are fed into the detection algorithm to ultimately generate accurate detection results. The experimental results on real abnormal data demonstrate that the NMIFS MOP- AQGA method has higher detection accuracy, lower false negative rate and higher adaptive performance than the existing detection methods, especially for small samples sets.","PeriodicalId":54934,"journal":{"name":"International Journal on Semantic Web and Information Systems","volume":"96 1","pages":"1-24"},"PeriodicalIF":3.2,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76864848","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Predictive Model Using a Machine Learning Approach for Enhancing the Retention Rate of Students At-Risk 使用机器学习方法提高风险学生保留率的预测模型
IF 3.2 4区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2022-01-01 DOI: 10.4018/ijswis.299859
H. Brdesee, W. Alsaggaf, N. Aljohani, Saeed-Ul Hassan
Student retention is a widely recognized challenge in the educational community to assist the institutes in the formation of appropriate and effective pedagogical interventions. This study intends to predict the students at-risk of low performances during an on-going course, those at-risk of graduating late than the tentative timeline and predicting the capacity of students in a campus. The data constitutes of demographics, learning, academic and educational related attributes which are suitable to deploy various machine learning algorithms for the prediction of at-risk students. For class balancing, Synthetic Minority Over Sampling Technique, is also applied to eliminate the imbalance in the academic award-gap performances and late/timely graduates. Results reveal the effectiveness of the deployed techniques with Long short-term Memory (LSTM) outperforming other models for early prediction of at-risk students. The main contribution of this work is a machine learning approach capable of enhancing the academic decision making related to student performance.
在教育界,学生保留是一个广泛认可的挑战,它有助于学院形成适当和有效的教学干预。本研究旨在预测在校期间学业表现不佳的学生、比预期时间晚毕业的学生,并预测校园内学生的能力。这些数据包括人口统计、学习、学术和教育相关属性,适合部署各种机器学习算法来预测有风险的学生。在班级平衡方面,还采用了合成少数派过采样技术,以消除学业奖差表现和晚/及时毕业生之间的不平衡。结果表明,长短期记忆(LSTM)技术的有效性优于其他模型对有风险学生的早期预测。这项工作的主要贡献是一种能够增强与学生表现相关的学术决策的机器学习方法。
{"title":"Predictive Model Using a Machine Learning Approach for Enhancing the Retention Rate of Students At-Risk","authors":"H. Brdesee, W. Alsaggaf, N. Aljohani, Saeed-Ul Hassan","doi":"10.4018/ijswis.299859","DOIUrl":"https://doi.org/10.4018/ijswis.299859","url":null,"abstract":"Student retention is a widely recognized challenge in the educational community to assist the institutes in the formation of appropriate and effective pedagogical interventions. This study intends to predict the students at-risk of low performances during an on-going course, those at-risk of graduating late than the tentative timeline and predicting the capacity of students in a campus. The data constitutes of demographics, learning, academic and educational related attributes which are suitable to deploy various machine learning algorithms for the prediction of at-risk students. For class balancing, Synthetic Minority Over Sampling Technique, is also applied to eliminate the imbalance in the academic award-gap performances and late/timely graduates. Results reveal the effectiveness of the deployed techniques with Long short-term Memory (LSTM) outperforming other models for early prediction of at-risk students. The main contribution of this work is a machine learning approach capable of enhancing the academic decision making related to student performance.","PeriodicalId":54934,"journal":{"name":"International Journal on Semantic Web and Information Systems","volume":"20 1","pages":"1-21"},"PeriodicalIF":3.2,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74194474","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
A Model of Semantic-Based Image Retrieval Using C-Tree and Neighbor Graph 基于c树和邻居图的语义图像检索模型
IF 3.2 4区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2022-01-01 DOI: 10.4018/ijswis.295551
Nguyen Vu Uyen Nhi, T. Le, Thanh The Van
The problems of image mining and semantic image retrieval play an important role in many areas of life. In this paper, a semantic-based image retrieval system is proposed that relies on the combination of C-Tree, which was built in our previous work, and a neighbor graph (called Graph-CTree) to improve accuracy. The k-Nearest Neighbor (k-NN) algorithm is used to classify a set of similar images that are retrieved on Graph-CTree to create a set of visual words. An ontology framework for images is created semi-automatically. SPARQL query is automatically generated from visual words and retrieve on ontology for semantics image. The experiment was performed on image datasets, such as COREL, WANG, ImageCLEF, and Stanford Dogs, with precision values of 0.888473, 0.766473, 0.839814, and 0.826416, respectively. These results are compared with related works on the same image dataset, showing the effectiveness of the methods proposed here.
图像挖掘和语义图像检索问题在生活的许多领域发挥着重要作用。本文提出了一种基于语义的图像检索系统,该系统依赖于我们之前工作中构建的C-Tree和邻居图(称为graph - ctree)的组合来提高准确性。k-最近邻(k-NN)算法用于对Graph-CTree上检索的一组相似图像进行分类,以创建一组视觉单词。图像本体框架是半自动创建的。SPARQL查询由视觉词自动生成,并在本体上检索语义图像。实验在COREL、WANG、ImageCLEF、Stanford Dogs等图像数据集上进行,精度值分别为0.888473、0.766473、0.839814、0.826416。这些结果与相同图像数据集上的相关工作进行了比较,表明了本文提出的方法的有效性。
{"title":"A Model of Semantic-Based Image Retrieval Using C-Tree and Neighbor Graph","authors":"Nguyen Vu Uyen Nhi, T. Le, Thanh The Van","doi":"10.4018/ijswis.295551","DOIUrl":"https://doi.org/10.4018/ijswis.295551","url":null,"abstract":"The problems of image mining and semantic image retrieval play an important role in many areas of life. In this paper, a semantic-based image retrieval system is proposed that relies on the combination of C-Tree, which was built in our previous work, and a neighbor graph (called Graph-CTree) to improve accuracy. The k-Nearest Neighbor (k-NN) algorithm is used to classify a set of similar images that are retrieved on Graph-CTree to create a set of visual words. An ontology framework for images is created semi-automatically. SPARQL query is automatically generated from visual words and retrieve on ontology for semantics image. The experiment was performed on image datasets, such as COREL, WANG, ImageCLEF, and Stanford Dogs, with precision values of 0.888473, 0.766473, 0.839814, and 0.826416, respectively. These results are compared with related works on the same image dataset, showing the effectiveness of the methods proposed here.","PeriodicalId":54934,"journal":{"name":"International Journal on Semantic Web and Information Systems","volume":"37 1","pages":"1-23"},"PeriodicalIF":3.2,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86506923","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
A Context-Independent Ontological Linked Data Alignment Approach to Instance Matching 实例匹配的上下文独立本体关联数据对齐方法
IF 3.2 4区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2022-01-01 DOI: 10.4018/ijswis.295977
Armando Barbosa, I. Bittencourt, S. Siqueira, Diego Dermeval, Nicholas J. T. Cruz
Linking data by finding matching instances in different datasets requires considering many characteristics, such as structural heterogeneity, implicit knowledge, and URI (Uniform Resource Identifier)-oriented identification. The authors propose a context-independent approach to align Linked data through an alignment process based on the ontological model’s components and considering data’s multidimensionality. The researchers experimented with the proposed approach against two methods for aligning linked data in two datasets and evaluated precision, recall, and f-measure metrics. The authors also conducted a case study in a real scenario considering a Brazilian publication dataset on computers and education. This study’s results indicate that the proposed approach overcomes the other methods (regarding the precision, recall, and f-measure metrics), requiring less work when changing the dataset domain. This work’s main contributions include enabling real datasets to be semi-automatically linked, presenting an approach capable of calculating resource similarity.
通过在不同数据集中查找匹配实例来链接数据需要考虑许多特征,例如结构异构性、隐式知识和面向URI(统一资源标识符)的标识。作者提出了一种上下文无关的方法,通过基于本体论模型的组件并考虑数据的多维度的对齐过程来对齐关联数据。研究人员针对两种方法对两个数据集中的关联数据进行了实验,并评估了精度、召回率和f-measure指标。作者还在一个真实的场景中进行了一个案例研究,考虑了巴西关于计算机和教育的出版物数据集。本研究的结果表明,该方法克服了其他方法(关于精度,召回率和f-measure指标),在改变数据集域时需要更少的工作。这项工作的主要贡献包括实现真实数据集的半自动链接,提出了一种能够计算资源相似度的方法。
{"title":"A Context-Independent Ontological Linked Data Alignment Approach to Instance Matching","authors":"Armando Barbosa, I. Bittencourt, S. Siqueira, Diego Dermeval, Nicholas J. T. Cruz","doi":"10.4018/ijswis.295977","DOIUrl":"https://doi.org/10.4018/ijswis.295977","url":null,"abstract":"Linking data by finding matching instances in different datasets requires considering many characteristics, such as structural heterogeneity, implicit knowledge, and URI (Uniform Resource Identifier)-oriented identification. The authors propose a context-independent approach to align Linked data through an alignment process based on the ontological model’s components and considering data’s multidimensionality. The researchers experimented with the proposed approach against two methods for aligning linked data in two datasets and evaluated precision, recall, and f-measure metrics. The authors also conducted a case study in a real scenario considering a Brazilian publication dataset on computers and education. This study’s results indicate that the proposed approach overcomes the other methods (regarding the precision, recall, and f-measure metrics), requiring less work when changing the dataset domain. This work’s main contributions include enabling real datasets to be semi-automatically linked, presenting an approach capable of calculating resource similarity.","PeriodicalId":54934,"journal":{"name":"International Journal on Semantic Web and Information Systems","volume":"16 1","pages":"1-29"},"PeriodicalIF":3.2,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87770707","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
A Path-Clustering Driving Travel-Route Excavation 路径聚类驱动的旅行路线挖掘
IF 3.2 4区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2022-01-01 DOI: 10.4018/ijswis.306750
Can Yang
The refueling trajectory of self-driving tourists is sparse, and it is difficult to restore the real travel route. A sparse trajectory clustering algorithm is proposed based on semantic representation to mine popular self-driving travel routes. Different from the traditional trajectory clustering algorithm based on trajectory point matching, the semantic relationship between different trajectory points is researched in this algorithm, and the low-dimensional vector representation of the trajectory is learned. First, the neural network language model is used to learn the distributed vector representation of the fueling station; then, the average of all the station vectors in each trajectory is taken as the vector representation of the trajectory. Finally, the classic k-means algorithm is used to cluster the trajectory vectors. The final visualization results show that the proposed algorithm effectively mines two popular self-driving travel routes.
自驾游游客的加油轨迹稀疏,难以还原真实的出行路线。提出了一种基于语义表示的稀疏轨迹聚类算法来挖掘热门自驾游路线。与传统的基于轨迹点匹配的轨迹聚类算法不同,该算法研究了不同轨迹点之间的语义关系,学习了轨迹的低维向量表示。首先,利用神经网络语言模型学习加油站的分布式向量表示;然后,取每条轨迹中所有站向量的平均值作为轨迹的向量表示。最后,采用经典的k-means算法对轨迹向量进行聚类。最终的可视化结果表明,该算法有效地挖掘了两条流行的自驾车路线。
{"title":"A Path-Clustering Driving Travel-Route Excavation","authors":"Can Yang","doi":"10.4018/ijswis.306750","DOIUrl":"https://doi.org/10.4018/ijswis.306750","url":null,"abstract":"The refueling trajectory of self-driving tourists is sparse, and it is difficult to restore the real travel route. A sparse trajectory clustering algorithm is proposed based on semantic representation to mine popular self-driving travel routes. Different from the traditional trajectory clustering algorithm based on trajectory point matching, the semantic relationship between different trajectory points is researched in this algorithm, and the low-dimensional vector representation of the trajectory is learned. First, the neural network language model is used to learn the distributed vector representation of the fueling station; then, the average of all the station vectors in each trajectory is taken as the vector representation of the trajectory. Finally, the classic k-means algorithm is used to cluster the trajectory vectors. The final visualization results show that the proposed algorithm effectively mines two popular self-driving travel routes.","PeriodicalId":54934,"journal":{"name":"International Journal on Semantic Web and Information Systems","volume":"4 1","pages":""},"PeriodicalIF":3.2,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80248168","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A Semantic Framework Supporting Multilayer Networks Analysis for Rare Diseases 支持罕见病多层网络分析的语义框架
IF 3.2 4区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2022-01-01 DOI: 10.4018/ijswis.297141
N. Capuano, P. Foggia, L. Greco, Pierluigi Ritrovato
Understanding the role played by genetic variations in diseases, exploring genomic variants and discovering disease-associated loci are among the most pressing challenges of genomic medicine. A huge and ever-increasing amount of information is available to researchers to address these challenges. Unfortunately, it is stored in fragmented ontologies and databases, which use heterogeneous formats and poorly integrated schemas. To overcome these limitations, we propose a linked data approach, based on the formalism of multilayer networks, able to integrate and harmonize biomedical information from multiple sources into a single dense network covering different aspects on Neuroendocrine Neoplasms (NENs). The proposed integration schema consists of three interconnected layers representing, respectively, information on the disease, on the affected genes, on the related biological processes and molecular functions. An easy-to-use client-server application was also developed to browse and search for information on the model supporting multilayer network analysis.
了解遗传变异在疾病中的作用,探索基因组变异和发现与疾病相关的基因座是基因组医学最紧迫的挑战之一。研究人员可以获得大量且不断增加的信息来应对这些挑战。不幸的是,它存储在碎片化的本体和数据库中,这些本体和数据库使用异构格式和集成不良的模式。为了克服这些限制,我们提出了一种基于多层网络形式的关联数据方法,能够将来自多个来源的生物医学信息整合和协调成一个覆盖神经内分泌肿瘤(NENs)不同方面的单一密集网络。所提出的整合模式由三个相互关联的层组成,分别代表疾病信息、受影响基因信息、相关生物过程信息和分子功能信息。还开发了一个易于使用的客户机-服务器应用程序,用于浏览和搜索有关支持多层网络分析的模型的信息。
{"title":"A Semantic Framework Supporting Multilayer Networks Analysis for Rare Diseases","authors":"N. Capuano, P. Foggia, L. Greco, Pierluigi Ritrovato","doi":"10.4018/ijswis.297141","DOIUrl":"https://doi.org/10.4018/ijswis.297141","url":null,"abstract":"Understanding the role played by genetic variations in diseases, exploring genomic variants and discovering disease-associated loci are among the most pressing challenges of genomic medicine. A huge and ever-increasing amount of information is available to researchers to address these challenges. Unfortunately, it is stored in fragmented ontologies and databases, which use heterogeneous formats and poorly integrated schemas. To overcome these limitations, we propose a linked data approach, based on the formalism of multilayer networks, able to integrate and harmonize biomedical information from multiple sources into a single dense network covering different aspects on Neuroendocrine Neoplasms (NENs). The proposed integration schema consists of three interconnected layers representing, respectively, information on the disease, on the affected genes, on the related biological processes and molecular functions. An easy-to-use client-server application was also developed to browse and search for information on the model supporting multilayer network analysis.","PeriodicalId":54934,"journal":{"name":"International Journal on Semantic Web and Information Systems","volume":"30 1","pages":"1-22"},"PeriodicalIF":3.2,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78638691","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
期刊
International Journal on Semantic Web and Information Systems
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1