首页 > 最新文献

Big Data and Cognitive Computing最新文献

英文 中文
Privacy-Enhancing Digital Contact Tracing with Machine Learning for Pandemic Response: A Comprehensive Review 用机器学习增强隐私的数字接触追踪用于流行病应对:全面综述
IF 3.7 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-06-01 DOI: 10.3390/bdcc7020108
C. Hang, Yi-Zhen Tsai, Pei-Duo Yu, Jiasi Chen, C. Tan
The rapid global spread of the coronavirus disease (COVID-19) has severely impacted daily life worldwide. As potential solutions, various digital contact tracing (DCT) strategies have emerged to mitigate the virus’s spread while maintaining economic and social activities. The computational epidemiology problems of DCT often involve parameter optimization through learning processes, making it crucial to understand how to apply machine learning techniques for effective DCT optimization. While numerous research studies on DCT have emerged recently, most existing reviews primarily focus on DCT application design and implementation. This paper offers a comprehensive overview of privacy-preserving machine learning-based DCT in preparation for future pandemics. We propose a new taxonomy to classify existing DCT strategies into forward, backward, and proactive contact tracing. We then categorize several DCT apps developed during the COVID-19 pandemic based on their tracing strategies. Furthermore, we derive three research questions related to computational epidemiology for DCT and provide a detailed description of machine learning techniques to address these problems. We discuss the challenges of learning-based DCT and suggest potential solutions. Additionally, we include a case study demonstrating the review’s insights into the pandemic response. Finally, we summarize the study’s limitations and highlight promising future research directions in DCT.
冠状病毒疾病(新冠肺炎)在全球的迅速传播严重影响了全球的日常生活。作为潜在的解决方案,各种数字接触者追踪(DCT)策略已经出现,以缓解病毒的传播,同时保持经济和社会活动。DCT的计算流行病学问题通常涉及通过学习过程进行参数优化,因此了解如何应用机器学习技术进行有效的DCT优化至关重要。虽然最近出现了许多关于DCT的研究,但大多数现有的综述主要集中在DCT应用程序的设计和实现上。本文全面概述了基于隐私保护机器学习的DCT,为未来的流行病做准备。我们提出了一种新的分类法,将现有的DCT策略分为前向、后向和主动联系人追踪。然后,我们根据新冠肺炎大流行期间开发的几个DCT应用程序的追踪策略对其进行分类。此外,我们推导了三个与DCT计算流行病学相关的研究问题,并详细描述了解决这些问题的机器学习技术。我们讨论了基于学习的DCT的挑战,并提出了潜在的解决方案。此外,我们还包括一个案例研究,展示了该综述对疫情应对的见解。最后,我们总结了本研究的局限性,并强调了DCT未来的研究方向。
{"title":"Privacy-Enhancing Digital Contact Tracing with Machine Learning for Pandemic Response: A Comprehensive Review","authors":"C. Hang, Yi-Zhen Tsai, Pei-Duo Yu, Jiasi Chen, C. Tan","doi":"10.3390/bdcc7020108","DOIUrl":"https://doi.org/10.3390/bdcc7020108","url":null,"abstract":"The rapid global spread of the coronavirus disease (COVID-19) has severely impacted daily life worldwide. As potential solutions, various digital contact tracing (DCT) strategies have emerged to mitigate the virus’s spread while maintaining economic and social activities. The computational epidemiology problems of DCT often involve parameter optimization through learning processes, making it crucial to understand how to apply machine learning techniques for effective DCT optimization. While numerous research studies on DCT have emerged recently, most existing reviews primarily focus on DCT application design and implementation. This paper offers a comprehensive overview of privacy-preserving machine learning-based DCT in preparation for future pandemics. We propose a new taxonomy to classify existing DCT strategies into forward, backward, and proactive contact tracing. We then categorize several DCT apps developed during the COVID-19 pandemic based on their tracing strategies. Furthermore, we derive three research questions related to computational epidemiology for DCT and provide a detailed description of machine learning techniques to address these problems. We discuss the challenges of learning-based DCT and suggest potential solutions. Additionally, we include a case study demonstrating the review’s insights into the pandemic response. Finally, we summarize the study’s limitations and highlight promising future research directions in DCT.","PeriodicalId":36397,"journal":{"name":"Big Data and Cognitive Computing","volume":"1 1","pages":""},"PeriodicalIF":3.7,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41383338","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Semantic Hierarchical Indexing for Online Video Lessons Using Natural Language Processing 使用自然语言处理的在线视频课程的语义层次索引
IF 3.7 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-05-31 DOI: 10.3390/bdcc7020107
Marco Arazzi, M. Ferretti, Antonino Nocera
Huge quantities of audio and video material are available at universities and teaching institutions, but their use can be limited because of the lack of intelligent search tools. This paper describes a possible way to set up an indexing scheme that offers a smart search modality, that combines semantic analysis of video/audio transcripts with the exact time positioning of uttered words. The proposal leverages NLP methods for topic modeling with lexical analysis of lessons’ transcripts and builds a semantic hierarchical index into the corpus of lessons analyzed. Moreover, using abstracting summarization, the system can offer short summaries on the subject semantically implied by the search carried out.
在大学和教学机构中有大量的音频和视频材料,但由于缺乏智能搜索工具,它们的使用可能受到限制。本文描述了一种建立索引方案的可能方法,该方案提供了一种智能搜索模式,将视频/音频文本的语义分析与发出单词的精确时间定位相结合。该建议利用NLP方法对课程文本进行词法分析进行主题建模,并在所分析的课程语料库中构建语义层次索引。此外,使用抽象摘要,系统可以对所进行的搜索所隐含的主题提供简短的摘要。
{"title":"Semantic Hierarchical Indexing for Online Video Lessons Using Natural Language Processing","authors":"Marco Arazzi, M. Ferretti, Antonino Nocera","doi":"10.3390/bdcc7020107","DOIUrl":"https://doi.org/10.3390/bdcc7020107","url":null,"abstract":"Huge quantities of audio and video material are available at universities and teaching institutions, but their use can be limited because of the lack of intelligent search tools. This paper describes a possible way to set up an indexing scheme that offers a smart search modality, that combines semantic analysis of video/audio transcripts with the exact time positioning of uttered words. The proposal leverages NLP methods for topic modeling with lexical analysis of lessons’ transcripts and builds a semantic hierarchical index into the corpus of lessons analyzed. Moreover, using abstracting summarization, the system can offer short summaries on the subject semantically implied by the search carried out.","PeriodicalId":36397,"journal":{"name":"Big Data and Cognitive Computing","volume":" ","pages":""},"PeriodicalIF":3.7,"publicationDate":"2023-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48274921","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Adaptive KNN-Based Extended Collaborative Filtering Recommendation Services 基于自适应KNN的扩展协同过滤推荐服务
IF 3.7 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-05-31 DOI: 10.3390/bdcc7020106
Luong Vuong Nguyen, Quoc-Trinh Vo, Tri-Hai Nguyen
In the current era of e-commerce, users are overwhelmed with countless products, making it difficult to find relevant items. Recommendation systems generate suggestions based on user preferences, to avoid information overload. Collaborative filtering is a widely used model in modern recommendation systems. Despite its popularity, collaborative filtering has limitations that researchers aim to overcome. In this paper, we enhance the K-nearest neighbor (KNN)-based collaborative filtering algorithm for a recommendation system, by considering the similarity of user cognition. This enhancement aimed to improve the accuracy in grouping users and generating more relevant recommendations for the active user. The experimental results showed that the proposed model outperformed benchmark models, in terms of MAE, RMSE, MAP, and NDCG metrics.
在当前的电子商务时代,用户被无数的产品淹没,很难找到相关的商品。推荐系统根据用户偏好生成建议,以避免信息过载。协同过滤是现代推荐系统中广泛使用的一种模型。尽管协作过滤很受欢迎,但它也有研究人员想要克服的局限性。在本文中,我们通过考虑用户认知的相似性,增强了推荐系统中基于K近邻(KNN)的协同过滤算法。这种增强旨在提高对用户进行分组的准确性,并为活动用户生成更相关的推荐。实验结果表明,在MAE、RMSE、MAP和NDCG指标方面,所提出的模型优于基准模型。
{"title":"Adaptive KNN-Based Extended Collaborative Filtering Recommendation Services","authors":"Luong Vuong Nguyen, Quoc-Trinh Vo, Tri-Hai Nguyen","doi":"10.3390/bdcc7020106","DOIUrl":"https://doi.org/10.3390/bdcc7020106","url":null,"abstract":"In the current era of e-commerce, users are overwhelmed with countless products, making it difficult to find relevant items. Recommendation systems generate suggestions based on user preferences, to avoid information overload. Collaborative filtering is a widely used model in modern recommendation systems. Despite its popularity, collaborative filtering has limitations that researchers aim to overcome. In this paper, we enhance the K-nearest neighbor (KNN)-based collaborative filtering algorithm for a recommendation system, by considering the similarity of user cognition. This enhancement aimed to improve the accuracy in grouping users and generating more relevant recommendations for the active user. The experimental results showed that the proposed model outperformed benchmark models, in terms of MAE, RMSE, MAP, and NDCG metrics.","PeriodicalId":36397,"journal":{"name":"Big Data and Cognitive Computing","volume":" ","pages":""},"PeriodicalIF":3.7,"publicationDate":"2023-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48269570","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Perspectives on Big Data, Cloud-Based Data Analysis and Machine Learning Systems 大数据、基于云的数据分析和机器学习系统展望
Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-05-30 DOI: 10.3390/bdcc7020104
Fabrizio Marozzo, Domenico Talia
Huge amounts of digital data are continuously generated and collected from different sources, such as sensors, cameras, in-vehicle infotainment, smart meters, mobile devices, social media platforms, and web applications and services [...]
从传感器、摄像头、车载信息娱乐、智能电表、移动设备、社交媒体平台、web应用程序和服务等不同来源不断生成和收集大量数字数据。
{"title":"Perspectives on Big Data, Cloud-Based Data Analysis and Machine Learning Systems","authors":"Fabrizio Marozzo, Domenico Talia","doi":"10.3390/bdcc7020104","DOIUrl":"https://doi.org/10.3390/bdcc7020104","url":null,"abstract":"Huge amounts of digital data are continuously generated and collected from different sources, such as sensors, cameras, in-vehicle infotainment, smart meters, mobile devices, social media platforms, and web applications and services [...]","PeriodicalId":36397,"journal":{"name":"Big Data and Cognitive Computing","volume":"41 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-05-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135693122","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Breaking Barriers: Unveiling Factors Influencing the Adoption of Artificial Intelligence by Healthcare Providers 打破障碍:揭示影响医疗保健提供者采用人工智能的因素
IF 3.7 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-05-30 DOI: 10.3390/bdcc7020105
B. Hameed, Nithesh Naik, Sufyan Ibrahim, Nisha S. Tatkar, M. Shah, D. Prasad, P. Hegde, P. Chłosta, B. Rai, B. Somani
Artificial intelligence (AI) is an emerging technological system that provides a platform to manage and analyze data by emulating human cognitive functions with greater accuracy, revolutionizing patient care and introducing a paradigm shift to the healthcare industry. The purpose of this study is to identify the underlying factors that affect the adoption of artificial intelligence in healthcare (AIH) by healthcare providers and to understand “What are the factors that influence healthcare providers’ behavioral intentions to adopt AIH in their routine practice?” An integrated survey was conducted among healthcare providers, including consultants, residents/students, and nurses. The survey included items related to performance expectancy, effort expectancy, initial trust, personal innovativeness, task complexity, and technology characteristics. The collected data were analyzed using structural equation modeling. A total of 392 healthcare professionals participated in the survey, with 72.4% being male and 50.7% being 30 years old or younger. The results showed that performance expectancy, effort expectancy, and initial trust have a positive influence on the behavioral intentions of healthcare providers to use AIH. Personal innovativeness was found to have a positive influence on effort expectancy, while task complexity and technology characteristics have a positive influence on effort expectancy for AIH. The study’s empirically validated model sheds light on healthcare providers’ intention to adopt AIH, while the study’s findings can be used to develop strategies to encourage this adoption. However, further investigation is necessary to understand the individual factors affecting the adoption of AIH by healthcare providers.
人工智能(AI)是一种新兴的技术系统,它提供了一个平台,通过更准确地模拟人类的认知功能来管理和分析数据,从而彻底改变了患者护理并为医疗保健行业引入了范式转变。本研究的目的是确定影响医疗保健提供者在医疗保健中采用人工智能(AIH)的潜在因素,并了解“影响医疗保健提供者在日常实践中采用人工智能的行为意图的因素是什么?”在医疗保健提供者中进行了一项综合调查,包括咨询师、住院医师/学生和护士。调查项目包括绩效预期、努力预期、初始信任、个人创新能力、任务复杂性和技术特征。采用结构方程模型对收集到的数据进行分析。共有392名医护专业人员参与调查,其中72.4%为男性,50.7%为30岁或以下。结果表明,绩效期望、努力期望和初始信任对医疗服务提供者使用AIH的行为意向有正向影响。研究发现,个人创新对努力期望有正向影响,而任务复杂性和技术特征对努力期望有正向影响。该研究的经验验证模型揭示了医疗保健提供者采用AIH的意图,而研究结果可用于制定鼓励采用AIH的策略。然而,需要进一步的调查来了解影响医疗保健提供者采用AIH的个体因素。
{"title":"Breaking Barriers: Unveiling Factors Influencing the Adoption of Artificial Intelligence by Healthcare Providers","authors":"B. Hameed, Nithesh Naik, Sufyan Ibrahim, Nisha S. Tatkar, M. Shah, D. Prasad, P. Hegde, P. Chłosta, B. Rai, B. Somani","doi":"10.3390/bdcc7020105","DOIUrl":"https://doi.org/10.3390/bdcc7020105","url":null,"abstract":"Artificial intelligence (AI) is an emerging technological system that provides a platform to manage and analyze data by emulating human cognitive functions with greater accuracy, revolutionizing patient care and introducing a paradigm shift to the healthcare industry. The purpose of this study is to identify the underlying factors that affect the adoption of artificial intelligence in healthcare (AIH) by healthcare providers and to understand “What are the factors that influence healthcare providers’ behavioral intentions to adopt AIH in their routine practice?” An integrated survey was conducted among healthcare providers, including consultants, residents/students, and nurses. The survey included items related to performance expectancy, effort expectancy, initial trust, personal innovativeness, task complexity, and technology characteristics. The collected data were analyzed using structural equation modeling. A total of 392 healthcare professionals participated in the survey, with 72.4% being male and 50.7% being 30 years old or younger. The results showed that performance expectancy, effort expectancy, and initial trust have a positive influence on the behavioral intentions of healthcare providers to use AIH. Personal innovativeness was found to have a positive influence on effort expectancy, while task complexity and technology characteristics have a positive influence on effort expectancy for AIH. The study’s empirically validated model sheds light on healthcare providers’ intention to adopt AIH, while the study’s findings can be used to develop strategies to encourage this adoption. However, further investigation is necessary to understand the individual factors affecting the adoption of AIH by healthcare providers.","PeriodicalId":36397,"journal":{"name":"Big Data and Cognitive Computing","volume":" ","pages":""},"PeriodicalIF":3.7,"publicationDate":"2023-05-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44139862","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
On-Shore Plastic Waste Detection with YOLOv5 and RGB-Near-Infrared Fusion: A State-of-the-Art Solution for Accurate and Efficient Environmental Monitoring YOLOv5和RGB近红外融合的岸上塑料垃圾检测:实现准确高效环境监测的最新解决方案
IF 3.7 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-05-29 DOI: 10.3390/bdcc7020103
Owen Tamin, E. Moung, J. Dargham, Farashazillah Yahya, A. Farzamnia, F. Sia, Nur Faraha Mohd Naim, L. Angeline
Plastic waste is a growing environmental concern that poses a significant threat to onshore ecosystems, human health, and wildlife. The accumulation of plastic waste in oceans has reached a staggering estimate of over eight million tons annually, leading to hazardous outcomes in marine life and the food chain. Plastic waste is prevalent in urban areas, posing risks to animals that may ingest it or become entangled in it, and negatively impacting the economy and tourism industry. Effective plastic waste management requires a comprehensive approach that includes reducing consumption, promoting recycling, and developing innovative technologies such as automated plastic detection systems. The development of accurate and efficient plastic detection methods is therefore essential for effective waste management. To address this challenge, machine learning techniques such as the YOLOv5 model have emerged as promising tools for developing automated plastic detection systems. Furthermore, there is a need to study both visible light (RGB) and near-infrared (RGNIR) as part of plastic waste detection due to the unique properties of plastic waste in different environmental settings. To this end, two plastic waste datasets, comprising RGB and RGNIR images, were utilized to train the proposed model, YOLOv5m. The performance of the model was then evaluated using a 10-fold cross-validation method on both datasets. The experiment was extended by adding background images into the training dataset to reduce false positives. An additional experiment was carried out to fuse both the RGB and RGNIR datasets. A performance-metric score called the Weighted Metric Score (WMS) was proposed, where the WMS equaled the sum of the mean average precision at the intersection over union (IoU) threshold of 0.5 (mAP@0.5) × 0.1 and the mean average precision averaged over different IoU thresholds ranging from 0.5 to 0.95 (mAP@0.5:0.95) × 0.9. In addition, a 10-fold cross-validation procedure was implemented. Based on the results, the proposed model achieved the best performance using the fusion of the RGB and RGNIR datasets when evaluated on the testing dataset with a mean of mAP@0.5, mAP@0.5:0.95, and a WMS of 92.96% ± 2.63%, 69.47% ± 3.11%, and 71.82% ± 3.04%, respectively. These findings indicate that utilizing both normal visible light and the near-infrared spectrum as feature representations in machine learning could lead to improved performance in plastic waste detection. This opens new opportunities in the development of automated plastic detection systems for use in fields such as automation, environmental management, and resource management.
塑料垃圾是一个日益严重的环境问题,对陆上生态系统、人类健康和野生动物构成了重大威胁。据估计,海洋中塑料垃圾的累积量每年超过800万吨,给海洋生物和食物链带来了危险。塑料垃圾在城市地区普遍存在,给可能摄入或卷入其中的动物带来风险,并对经济和旅游业产生负面影响。有效的塑料废物管理需要一种全面的方法,包括减少消费、促进回收和开发创新技术,如自动塑料检测系统。因此,开发准确高效的塑料检测方法对于有效的废物管理至关重要。为了应对这一挑战,YOLOv5模型等机器学习技术已成为开发自动塑料检测系统的有前途的工具。此外,由于塑料垃圾在不同环境中的独特特性,有必要将可见光(RGB)和近红外(RGNIR)作为塑料垃圾检测的一部分进行研究。为此,利用包括RGB和RGNIR图像的两个塑料垃圾数据集来训练所提出的模型YOLOv5m。然后在两个数据集上使用10倍交叉验证方法评估模型的性能。通过在训练数据集中添加背景图像来减少误报,对实验进行了扩展。进行了额外的实验来融合RGB和RGNIR数据集。提出了一种称为加权度量分数(WMS)的性能度量分数,其中WMS等于0.5的交集(IoU)阈值的平均精度之和(mAP@0.5)×0.1,并且在0.5到0.95的不同IoU阈值上平均的平均精度(mAP@0.5:0.95)×0.9。此外,还实施了10倍交叉验证程序。基于这些结果,当在测试数据集上进行评估时,所提出的模型使用RGB和RGNIR数据集的融合实现了最佳性能,平均值为mAP@0.5,mAP@0.5:0.95,WMS分别为92.96%±2.63%、69.47%±3.11%和71.82%±3.04%。这些发现表明,在机器学习中利用正常可见光和近红外光谱作为特征表示,可以提高塑料垃圾检测的性能。这为开发用于自动化、环境管理和资源管理等领域的自动塑料检测系统开辟了新的机会。
{"title":"On-Shore Plastic Waste Detection with YOLOv5 and RGB-Near-Infrared Fusion: A State-of-the-Art Solution for Accurate and Efficient Environmental Monitoring","authors":"Owen Tamin, E. Moung, J. Dargham, Farashazillah Yahya, A. Farzamnia, F. Sia, Nur Faraha Mohd Naim, L. Angeline","doi":"10.3390/bdcc7020103","DOIUrl":"https://doi.org/10.3390/bdcc7020103","url":null,"abstract":"Plastic waste is a growing environmental concern that poses a significant threat to onshore ecosystems, human health, and wildlife. The accumulation of plastic waste in oceans has reached a staggering estimate of over eight million tons annually, leading to hazardous outcomes in marine life and the food chain. Plastic waste is prevalent in urban areas, posing risks to animals that may ingest it or become entangled in it, and negatively impacting the economy and tourism industry. Effective plastic waste management requires a comprehensive approach that includes reducing consumption, promoting recycling, and developing innovative technologies such as automated plastic detection systems. The development of accurate and efficient plastic detection methods is therefore essential for effective waste management. To address this challenge, machine learning techniques such as the YOLOv5 model have emerged as promising tools for developing automated plastic detection systems. Furthermore, there is a need to study both visible light (RGB) and near-infrared (RGNIR) as part of plastic waste detection due to the unique properties of plastic waste in different environmental settings. To this end, two plastic waste datasets, comprising RGB and RGNIR images, were utilized to train the proposed model, YOLOv5m. The performance of the model was then evaluated using a 10-fold cross-validation method on both datasets. The experiment was extended by adding background images into the training dataset to reduce false positives. An additional experiment was carried out to fuse both the RGB and RGNIR datasets. A performance-metric score called the Weighted Metric Score (WMS) was proposed, where the WMS equaled the sum of the mean average precision at the intersection over union (IoU) threshold of 0.5 (mAP@0.5) × 0.1 and the mean average precision averaged over different IoU thresholds ranging from 0.5 to 0.95 (mAP@0.5:0.95) × 0.9. In addition, a 10-fold cross-validation procedure was implemented. Based on the results, the proposed model achieved the best performance using the fusion of the RGB and RGNIR datasets when evaluated on the testing dataset with a mean of mAP@0.5, mAP@0.5:0.95, and a WMS of 92.96% ± 2.63%, 69.47% ± 3.11%, and 71.82% ± 3.04%, respectively. These findings indicate that utilizing both normal visible light and the near-infrared spectrum as feature representations in machine learning could lead to improved performance in plastic waste detection. This opens new opportunities in the development of automated plastic detection systems for use in fields such as automation, environmental management, and resource management.","PeriodicalId":36397,"journal":{"name":"Big Data and Cognitive Computing","volume":" ","pages":""},"PeriodicalIF":3.7,"publicationDate":"2023-05-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49503171","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Hand Gesture Recognition Using Automatic Feature Extraction and Deep Learning Algorithms with Memory 基于自动特征提取和记忆深度学习算法的手势识别
IF 3.7 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-05-23 DOI: 10.3390/bdcc7020102
Rubén E. Nogales, Marco E. Benalcázar
Gesture recognition is widely used to express emotions or to communicate with other people or machines. Hand gesture recognition is a problem of great interest to researchers because it is a high-dimensional pattern recognition problem. The high dimensionality of the problem is directly related to the performance of machine learning models. The dimensionality problem can be addressed through feature selection and feature extraction. In this sense, the evaluation of a model with manual feature extraction and automatic feature extraction was proposed. The manual feature extraction was performed using the statistical functions of central tendency, while the automatic extraction was performed by means of a CNN and BiLSTM. These features were also evaluated in classifiers such as Softmax, ANN, and SVM. The best-performing model was the combination of BiLSTM and ANN (BiLSTM-ANN), with an accuracy of 99.9912%.
手势识别被广泛用于表达情感或与他人或机器进行交流。手势识别是一个高维模式识别问题,一直是研究人员非常感兴趣的问题。问题的高维度直接关系到机器学习模型的性能。维数问题可以通过特征选择和特征提取来解决。在此基础上,提出了人工特征提取与自动特征提取相结合的模型评价方法。采用集中趋势统计函数进行人工特征提取,采用CNN和BiLSTM进行自动特征提取。这些特征也在分类器(如Softmax、ANN和SVM)中进行了评估。效果最好的模型是BiLSTM和ANN的组合模型(BiLSTM-ANN),准确率为99.9912%。
{"title":"Hand Gesture Recognition Using Automatic Feature Extraction and Deep Learning Algorithms with Memory","authors":"Rubén E. Nogales, Marco E. Benalcázar","doi":"10.3390/bdcc7020102","DOIUrl":"https://doi.org/10.3390/bdcc7020102","url":null,"abstract":"Gesture recognition is widely used to express emotions or to communicate with other people or machines. Hand gesture recognition is a problem of great interest to researchers because it is a high-dimensional pattern recognition problem. The high dimensionality of the problem is directly related to the performance of machine learning models. The dimensionality problem can be addressed through feature selection and feature extraction. In this sense, the evaluation of a model with manual feature extraction and automatic feature extraction was proposed. The manual feature extraction was performed using the statistical functions of central tendency, while the automatic extraction was performed by means of a CNN and BiLSTM. These features were also evaluated in classifiers such as Softmax, ANN, and SVM. The best-performing model was the combination of BiLSTM and ANN (BiLSTM-ANN), with an accuracy of 99.9912%.","PeriodicalId":36397,"journal":{"name":"Big Data and Cognitive Computing","volume":" ","pages":""},"PeriodicalIF":3.7,"publicationDate":"2023-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47489227","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
An Ontology Development Methodology Based on Ontology-Driven Conceptual Modeling and Natural Language Processing: Tourism Case Study 基于本体驱动的概念建模和自然语言处理的本体开发方法——以旅游业为例
IF 3.7 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-05-21 DOI: 10.3390/bdcc7020101
S. Haridy, R. Ismail, N. Badr, M. Hashem
Ontologies provide a powerful method for representing, reusing, and sharing domain knowledge. They are extensively used in a wide range of disciplines, including artificial intelligence, knowledge engineering, biomedical informatics, and many more. For several reasons, developing domain ontologies is a challenging task. One of these reasons is that it is a complicated and time-consuming process. Multiple ontology development methodologies have already been proposed. However, there is room for improvement in terms of covering more activities during development (such as enrichment) and enhancing others (such as conceptualization). In this research, an enhanced ontology development methodology (ON-ODM) is proposed. Ontology-driven conceptual modeling (ODCM) and natural language processing (NLP) serve as the foundation of the proposed methodology. ODCM is defined as the utilization of ontological ideas from various areas to build engineering artifacts that improve conceptual modeling. NLP refers to the scientific discipline that employs computer techniques to analyze human language. The proposed ON-ODM is applied to build a tourism ontology that will be beneficial for a variety of applications, including e-tourism. The produced ontology is evaluated based on competency questions (CQs) and quality metrics. It is verified that the ontology answers SPARQL queries covering all CQ groups specified by domain experts. Quality metrics are used to compare the produced ontology with four existing tourism ontologies. For instance, according to the metrics related to conciseness, the produced ontology received a first place ranking when compared to the others, whereas it received a second place ranking regarding understandability. These results show that utilizing ODCM and NLP could facilitate and improve the development process, respectively.
本体为表示、重用和共享领域知识提供了一种强大的方法。它们被广泛应用于各种学科,包括人工智能、知识工程、生物医学信息学等等。由于几个原因,开发领域本体是一项具有挑战性的任务。其中一个原因是,这是一个复杂而耗时的过程。多种本体开发方法已经被提出。但是,在发展过程中包括更多的活动(如浓缩)和加强其他活动(如概念化)方面还有改进的余地。本文提出了一种增强的本体开发方法(ON-ODM)。本体驱动的概念建模(ODCM)和自然语言处理(NLP)是提出的方法的基础。ODCM被定义为利用来自不同领域的本体论思想来构建改进概念建模的工程工件。自然语言处理是指使用计算机技术来分析人类语言的科学学科。将提出的ON-ODM应用于构建旅游本体,该本体将有利于包括电子旅游在内的各种应用。生成的本体基于能力问题(CQs)和质量指标进行评估。验证了本体回答涵盖领域专家指定的所有CQ组的SPARQL查询。使用质量度量将生成的本体与四个现有的旅游本体进行比较。例如,根据与简洁性相关的度量,生成的本体在与其他本体相比获得第一名的排名,而在可理解性方面获得第二名的排名。这些结果表明,利用ODCM和NLP分别可以促进和改进开发过程。
{"title":"An Ontology Development Methodology Based on Ontology-Driven Conceptual Modeling and Natural Language Processing: Tourism Case Study","authors":"S. Haridy, R. Ismail, N. Badr, M. Hashem","doi":"10.3390/bdcc7020101","DOIUrl":"https://doi.org/10.3390/bdcc7020101","url":null,"abstract":"Ontologies provide a powerful method for representing, reusing, and sharing domain knowledge. They are extensively used in a wide range of disciplines, including artificial intelligence, knowledge engineering, biomedical informatics, and many more. For several reasons, developing domain ontologies is a challenging task. One of these reasons is that it is a complicated and time-consuming process. Multiple ontology development methodologies have already been proposed. However, there is room for improvement in terms of covering more activities during development (such as enrichment) and enhancing others (such as conceptualization). In this research, an enhanced ontology development methodology (ON-ODM) is proposed. Ontology-driven conceptual modeling (ODCM) and natural language processing (NLP) serve as the foundation of the proposed methodology. ODCM is defined as the utilization of ontological ideas from various areas to build engineering artifacts that improve conceptual modeling. NLP refers to the scientific discipline that employs computer techniques to analyze human language. The proposed ON-ODM is applied to build a tourism ontology that will be beneficial for a variety of applications, including e-tourism. The produced ontology is evaluated based on competency questions (CQs) and quality metrics. It is verified that the ontology answers SPARQL queries covering all CQ groups specified by domain experts. Quality metrics are used to compare the produced ontology with four existing tourism ontologies. For instance, according to the metrics related to conciseness, the produced ontology received a first place ranking when compared to the others, whereas it received a second place ranking regarding understandability. These results show that utilizing ODCM and NLP could facilitate and improve the development process, respectively.","PeriodicalId":36397,"journal":{"name":"Big Data and Cognitive Computing","volume":" ","pages":""},"PeriodicalIF":3.7,"publicationDate":"2023-05-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45384326","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Investigating the Accuracy of Autoregressive Recurrent Networks Using Hierarchical Aggregation Structure-Based Data Partitioning 基于分层聚合结构的数据划分研究自回归递归网络的精度
IF 3.7 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-05-18 DOI: 10.3390/bdcc7020100
J. Oliveira, Patrícia Ramos
Global models have been developed to tackle the challenge of forecasting sets of series that are related or share similarities, but they have not been developed for heterogeneous datasets. Various methods of partitioning by relatedness have been introduced to enhance the similarities of sets, resulting in improved forecasting accuracy but often at the cost of a reduced sample size, which could be harmful. To shed light on how the relatedness between series impacts the effectiveness of global models in real-world demand-forecasting problems, we perform an extensive empirical study using the M5 competition dataset. We examine cross-learning scenarios driven by the product hierarchy commonly employed in retail planning to allow global models to capture interdependencies across products and regions more effectively. Our findings show that global models outperform state-of-the-art local benchmarks by a considerable margin, indicating that they are not inherently more limited than local models and can handle unrelated time-series data effectively. The accuracy of data-partitioning approaches increases as the sizes of the data pools and the models’ complexity decrease. However, there is a trade-off between data availability and data relatedness. Smaller data pools lead to increased similarity among time series, making it easier to capture cross-product and cross-region dependencies, but this comes at the cost of a reduced sample, which may not be beneficial. Finally, it is worth noting that the successful implementation of global models for heterogeneous datasets can significantly impact forecasting practice.
全球模型已经开发出来,以解决预测相关或具有相似性的系列集的挑战,但它们还没有为异构数据集开发。已经引入了各种通过相关性划分的方法来增强集合的相似性,从而提高预测准确性,但通常以减少样本量为代价,这可能是有害的。为了阐明系列之间的相关性如何影响全球模型在现实世界需求预测问题中的有效性,我们使用M5竞争数据集进行了广泛的实证研究。我们研究了由零售规划中常用的产品层次结构驱动的交叉学习场景,以允许全球模型更有效地捕获产品和地区之间的相互依赖性。我们的研究结果表明,全球模型在相当程度上优于最先进的本地基准,表明它们本身并不比本地模型更受限制,并且可以有效地处理不相关的时间序列数据。随着数据池规模的增大和模型复杂度的降低,数据划分方法的准确性也随之提高。然而,在数据可用性和数据相关性之间存在权衡。较小的数据池导致时间序列之间的相似性增加,从而更容易捕获跨产品和跨区域的依赖关系,但这是以减少样本为代价的,这可能不是有益的。最后,值得注意的是,异构数据集的全局模型的成功实施可以显著影响预测实践。
{"title":"Investigating the Accuracy of Autoregressive Recurrent Networks Using Hierarchical Aggregation Structure-Based Data Partitioning","authors":"J. Oliveira, Patrícia Ramos","doi":"10.3390/bdcc7020100","DOIUrl":"https://doi.org/10.3390/bdcc7020100","url":null,"abstract":"Global models have been developed to tackle the challenge of forecasting sets of series that are related or share similarities, but they have not been developed for heterogeneous datasets. Various methods of partitioning by relatedness have been introduced to enhance the similarities of sets, resulting in improved forecasting accuracy but often at the cost of a reduced sample size, which could be harmful. To shed light on how the relatedness between series impacts the effectiveness of global models in real-world demand-forecasting problems, we perform an extensive empirical study using the M5 competition dataset. We examine cross-learning scenarios driven by the product hierarchy commonly employed in retail planning to allow global models to capture interdependencies across products and regions more effectively. Our findings show that global models outperform state-of-the-art local benchmarks by a considerable margin, indicating that they are not inherently more limited than local models and can handle unrelated time-series data effectively. The accuracy of data-partitioning approaches increases as the sizes of the data pools and the models’ complexity decrease. However, there is a trade-off between data availability and data relatedness. Smaller data pools lead to increased similarity among time series, making it easier to capture cross-product and cross-region dependencies, but this comes at the cost of a reduced sample, which may not be beneficial. Finally, it is worth noting that the successful implementation of global models for heterogeneous datasets can significantly impact forecasting practice.","PeriodicalId":36397,"journal":{"name":"Big Data and Cognitive Computing","volume":" ","pages":""},"PeriodicalIF":3.7,"publicationDate":"2023-05-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44709678","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Unsupervised Deep Learning for Structural Health Monitoring 结构健康监测的无监督深度学习
IF 3.7 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-05-17 DOI: 10.3390/bdcc7020099
R. Boccagna, M. Bottini, M. Petracca, Alessia Amelio, G. Camata
In the last few decades, structural health monitoring has gained relevance in the context of civil engineering, and much effort has been made to automate the process of data acquisition and analysis through the use of data-driven methods. Currently, the main issues arising in automated monitoring processing regard the establishment of a robust approach that covers all intermediate steps from data acquisition to output production and interpretation. To overcome this limitation, we introduce a dedicated artificial-intelligence-based monitoring approach for the assessment of the health conditions of structures in near-real time. The proposed approach is based on the construction of an unsupervised deep learning algorithm, with the aim of establishing a reliable method of anomaly detection for data acquired from sensors positioned on buildings. After preprocessing, the data are fed into various types of artificial neural network autoencoders, which are trained to produce outputs as close as possible to the inputs. We tested the proposed approach on data generated from an OpenSees numerical model of a railway bridge and data acquired from physical sensors positioned on the Historical Tower of Ravenna (Italy). The results show that the approach actually flags the data produced when damage scenarios are activated in the OpenSees model as coming from a damaged structure. The proposed method is also able to reliably detect anomalous structural behaviors of the tower, preventing critical scenarios. Compared to other state-of-the-art methods for anomaly detection, the proposed approach shows very promising results.
在过去的几十年里,结构健康监测在土木工程领域变得越来越重要,并通过使用数据驱动的方法,努力实现数据采集和分析过程的自动化。目前,自动化监测处理中出现的主要问题是建立一种稳健的方法,涵盖从数据采集到产出生产和解释的所有中间步骤。为了克服这一限制,我们引入了一种专门的基于人工智能的监测方法,用于近实时评估结构的健康状况。所提出的方法基于无监督深度学习算法的构建,目的是为从建筑物上的传感器获取的数据建立一种可靠的异常检测方法。预处理后,数据被输入到各种类型的人工神经网络自动编码器中,这些编码器经过训练以产生尽可能接近输入的输出。我们在铁路桥的OpenSees数值模型生成的数据和位于拉文纳历史塔(意大利)的物理传感器获取的数据上测试了所提出的方法。结果表明,该方法实际上将OpenSees模型中激活损坏场景时产生的数据标记为来自损坏的结构。所提出的方法还能够可靠地检测塔架的异常结构行为,防止出现关键情况。与其他最先进的异常检测方法相比,所提出的方法显示出非常有希望的结果。
{"title":"Unsupervised Deep Learning for Structural Health Monitoring","authors":"R. Boccagna, M. Bottini, M. Petracca, Alessia Amelio, G. Camata","doi":"10.3390/bdcc7020099","DOIUrl":"https://doi.org/10.3390/bdcc7020099","url":null,"abstract":"In the last few decades, structural health monitoring has gained relevance in the context of civil engineering, and much effort has been made to automate the process of data acquisition and analysis through the use of data-driven methods. Currently, the main issues arising in automated monitoring processing regard the establishment of a robust approach that covers all intermediate steps from data acquisition to output production and interpretation. To overcome this limitation, we introduce a dedicated artificial-intelligence-based monitoring approach for the assessment of the health conditions of structures in near-real time. The proposed approach is based on the construction of an unsupervised deep learning algorithm, with the aim of establishing a reliable method of anomaly detection for data acquired from sensors positioned on buildings. After preprocessing, the data are fed into various types of artificial neural network autoencoders, which are trained to produce outputs as close as possible to the inputs. We tested the proposed approach on data generated from an OpenSees numerical model of a railway bridge and data acquired from physical sensors positioned on the Historical Tower of Ravenna (Italy). The results show that the approach actually flags the data produced when damage scenarios are activated in the OpenSees model as coming from a damaged structure. The proposed method is also able to reliably detect anomalous structural behaviors of the tower, preventing critical scenarios. Compared to other state-of-the-art methods for anomaly detection, the proposed approach shows very promising results.","PeriodicalId":36397,"journal":{"name":"Big Data and Cognitive Computing","volume":" ","pages":""},"PeriodicalIF":3.7,"publicationDate":"2023-05-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43515202","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
Big Data and Cognitive Computing
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1