B. Pavitra, D. Singh, S. Sharma, Mohammad Farukh Hashmi
In recent decades, advances in health care have substantially raised the average age of the world's population. This improvement has been accompanied by a rise in diseases of the elderly such as dementia, of which Alzheimer's disease is the most common form. The present study aims to design and implement a medical system that improves the lives of people with Alzheimer's disease and eases the burden on their caregivers. AI is transforming the practice of medicine: it helps doctors diagnose patients more accurately, predict patients' future health, and recommend better treatments. Applying AI to medical use cases goes beyond the foundations of deep learning and involves its own nuances. Diagnosis is about identifying disease: one algorithm can examine a chest X-ray and determine whether it shows disease; another can examine brain MRIs and locate tumours within them; a third can combine a patient's lab values and demographics to predict the risk of an adverse event. A Smart IoT Interactive Assistant is a technological device that continuously monitors the stability of an Alzheimer's patient, indicates their position on a map, automatically reminds them to take their prescriptions, and provides a call button for any emergency they may experience during the day. The system has two components: a wearable worn by the patient and an IoT platform application used by the caregiver. The wearable device was built from a motion processing unit sensor, GPS, a heart rate sensor with microcontrollers, and an LCD display. An Internet of Things (IoT) platform supports this device, allowing the caregiver to communicate with the patient from any location.
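The monitoring loop such a wearable might run can be sketched in a few lines. The geofence coordinates, heart-rate thresholds, and field names below are illustrative assumptions, not values taken from the paper.

```python
# Hedged sketch of caregiver-alert logic for the wearable described above.
# All thresholds and the safe-zone rectangle are assumptions for illustration.

SAFE_ZONE = {"lat": (17.95, 18.05), "lon": (79.55, 79.65)}  # assumed geofence

def check_reading(reading):
    """Return the alerts the IoT platform would push to the caregiver."""
    alerts = []
    in_zone = (SAFE_ZONE["lat"][0] <= reading["lat"] <= SAFE_ZONE["lat"][1]
               and SAFE_ZONE["lon"][0] <= reading["lon"] <= SAFE_ZONE["lon"][1])
    if not in_zone:
        alerts.append("patient left safe zone")
    if reading["heart_rate"] < 50 or reading["heart_rate"] > 120:
        alerts.append("abnormal heart rate")
    if reading["sos_pressed"]:
        alerts.append("emergency call button pressed")
    return alerts

# A normal reading produces no alerts; a distressed one produces several.
print(check_reading({"lat": 18.0, "lon": 79.6, "heart_rate": 72, "sos_pressed": False}))
print(check_reading({"lat": 18.2, "lon": 79.6, "heart_rate": 135, "sos_pressed": True}))
```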
{"title":"Dementia prediction using novel IOTM (Internet of Things in Medical) architecture framework","authors":"B. Pavitra, D. Singh, S. Sharma, Mohammad Farukh Hashmi","doi":"10.3233/ida-237431","DOIUrl":"https://doi.org/10.3233/ida-237431","url":null,"abstract":"In the last decades the health care developments highly rise the level of ages of world population. This improvement was accompanied by increasing the diseases related with elder like Dementia, which Alzheimer’s disease represents the most common form. The present studies aim to design and implementation a medical system for improving the life of Alzheimer’s disease persons and ease the burden of their caregivers. AI is transforming the practice of medicine. It’s helping doctors diagnose patients more accurately, make predictions about patient’s future health, and recommend better treatments. AI goes beyond the foundations of deep learning to give you insight into the nuances of applying AI to medical use cases. Diagnosis is about identifying disease. By building an algorithm we can diagnosis chest X-ray and determine whether it contains disease, another algorithm that will look at brain MRIs and identify the location of tumours in those brain MRIs health of the patients lab values and their demographics and use those to predict the risk of an event. A Smart IOT Interactive Assistance is a technological device that continuously monitors the stability of an Alzheimer’s patient, indicates their position on a map, automatically reminds them to take their prescriptions, and has a call button for any emergencies they could experience during the day. The system has two components, one of which the patient wears and the other of which is an IoT platform application utilized by the caregiver. The motion processing unit sensor, GPS, heart rate sensor with microcontrollers, and LCD display were used to construct the wearable device. 
An Internet of Things (IoT) platform supports this device, allowing the caregiver to communicate with the patient from any location.","PeriodicalId":50355,"journal":{"name":"Intelligent Data Analysis","volume":" ","pages":""},"PeriodicalIF":1.7,"publicationDate":"2023-08-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44070221","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Rami Baazeem, P. Maheshwary, D. Binjawhar, K. Gulati, Shubham Joshi, Stephen Ojo, P. Pareek, Prashant Kumar Shukla
Nanomaterials are finding increasingly diverse medical uses as technology advances. Researchers are constantly introducing new and improved methods, and these applications see widespread use for both diagnostic and therapeutic purposes. Early disease detection, efficient drug delivery, cosmetics and health care products, biosensors, miniaturisation techniques, surface improvement in implantable biomaterials, and improved nanofibers in medical textiles are all examples of how biomedical nanotechnology has made a difference in the medical field. Whether nanoparticles are introduced deliberately for therapeutic purposes or accidentally from the environment, they eventually reach and penetrate the human body. The exposed nanoparticles interact with human blood, which carries them to various tissues. An essential aspect of blood rheology in the microcirculation is the malleability of erythrocytes; nanomaterials may therefore cause structural abnormalities in these cells. Echinocyte development is a typical example of such an induced morphological alteration. The length of time these side effects take to disappear after a nano-medication is administered also matters, and haemolysis can result at dangerous concentrations. In this experiment, human blood was exposed to varying concentrations of selected nanomaterials with potential medical applications. The induced morphological modifications were studied from images of erythrocyte cells acquired with a digital optical microscope. MATLAB, an image-analysis programme, was used to study the morphometric features. Human lymphocyte cells were used in the cytotoxicity analysis.
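The morphometric analysis above was done in MATLAB; as a language-agnostic illustration, one classic shape descriptor for flagging echinocyte-like deformation is circularity (4·pi·area / perimeter², which is 1.0 for a perfect disc and lower for spiculated outlines). The descriptor choice and the threshold below are assumptions for illustration, not the paper's published features.

```python
import math

# Assumed morphometric descriptor: circularity drops as a cell's outline
# grows spicules, so a low value suggests echinocyte-like deformation.

def circularity(area, perimeter):
    return 4 * math.pi * area / perimeter ** 2

def looks_deformed(area, perimeter, threshold=0.85):
    """Flag cells whose outline deviates strongly from a disc (assumed cutoff)."""
    return circularity(area, perimeter) < threshold

# A circle of radius 10 vs. a cell of equal area with a 40% longer, jagged outline.
r = 10.0
print(looks_deformed(math.pi * r * r, 2 * math.pi * r))        # smooth disc
print(looks_deformed(math.pi * r * r, 2 * math.pi * r * 1.4))  # spiculated
```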
{"title":"Digital image processing for evaluating the impact of designated nanoparticles in biomedical applications","authors":"Rami Baazeem, P. Maheshwary, D. Binjawhar, K. Gulati, Shubham Joshi, Stephen Ojo, P. Pareek, Prashant Kumar Shukla","doi":"10.3233/ida-237435","DOIUrl":"https://doi.org/10.3233/ida-237435","url":null,"abstract":"Nanomaterials are finding increasingly diverse medical uses as technology advances. Researchers are constantly being introduced to new and improved methods, and these applications see widespread use for both diagnostic and therapeutic purposes. Early disease detection, efficient drug delivery, cosmetics and health care products, biosensors, miniaturisation techniques, surface improvement in implantable biomaterials, improved nanofibers in medical textiles, etc. are all examples of how biomedical nanotechnology has made a difference in the medical field. The nanoparticles are introduced deliberately for therapeutic purposes or accidentally from the environment; they will eventually reach and penetrate the human body. The exposed nanoparticles interact with human blood, which carries them to various tissues. An essential aspect of blood rheology in the microcirculation is its malleability. As a result, nanomaterial may cause structural abnormalities in erythrocytes. Echinocyte development is a typical example of an induced morphological alteration. The length of time it takes for these side effects to disappear after taking a nano medication also matters. Haemolyses could result from the dangerous concentration. In this experiment, human blood is exposed to varying concentrations of chosen nanomaterial with potential medical applications. The morphological modifications induced were studied by looking at images of erythrocyte cells. That’s a picture of a cell taken using a digital optical microscope, by the way. We used MATLAB, an image-analysis programme, to study the morphometric features. 
Human lymphocyte cells were used in the cytotoxic analysis.","PeriodicalId":50355,"journal":{"name":"Intelligent Data Analysis","volume":" ","pages":""},"PeriodicalIF":1.7,"publicationDate":"2023-08-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48956273","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Editorial","authors":"","doi":"10.3233/ida-239006","DOIUrl":"https://doi.org/10.3233/ida-239006","url":null,"abstract":"","PeriodicalId":50355,"journal":{"name":"Intelligent Data Analysis","volume":" ","pages":""},"PeriodicalIF":1.7,"publicationDate":"2023-08-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42676519","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
K. Prasanna, Chinna Babu Jyothi, M. S. Kumar, J. Prabhu, A. Saif, Dinesh Jackson Samuel
Cephalometric analysis is used to identify problems in the development of the skull, evaluate their treatment, and plan possible surgical interventions. This paper aims to develop a Convolutional Neural Network that analyzes the head position in an X-ray image: it recognizes whether the image is suitable and, if not, suggests a change in head position for correction. The paper addresses the exact rotation of the head over a range of a few degrees. The objective is to predict the correct head position for taking an X-ray image for subsequent cephalometric analysis. The changes in degree of rotation were categorized into 5 classes. Deep learning models predict the correct head position for cephalometric analysis. An X-ray image dataset of the head was generated from CT scan images, and the generated images were categorized into the 5 rotation classes. Four CNN-based networks were then trained on this dataset to predict the accurate head position in the generated X-ray images: two VGG-Net variants, one U-Net, and one ResNet. The experimental analysis shows that VGG-4 outperformed VGG-3, U-Net, and ResNet in estimating the head position on a test dataset, with a measured accuracy of 98%. This is because the incorrectly classified images fall into classes directly adjacent to the correct ones, so the misclassification rate is significantly reduced.
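The correction step described above, turning a predicted rotation class into a repositioning suggestion, might look like the following sketch. The five class labels and their degree offsets are assumed for illustration; the abstract does not list the exact bins.

```python
# Hypothetical mapping from a classifier's rotation class to a head-position
# suggestion. Assumed classes: 0 = correct, 1/2 = slight left/right rotation,
# 3/4 = stronger left/right rotation (offsets in degrees are illustrative).

CLASS_TO_OFFSET_DEG = {0: 0, 1: -2, 2: 2, 3: -4, 4: 4}

def suggest_correction(predicted_class: int) -> str:
    """Turn a predicted class into an instruction for repositioning the head."""
    offset = CLASS_TO_OFFSET_DEG[predicted_class]
    if offset == 0:
        return "position acceptable: acquire X-ray"
    direction = "right" if offset < 0 else "left"  # rotate back toward neutral
    return f"rotate head {abs(offset)} degrees to the {direction}"

print(suggest_correction(0))
print(suggest_correction(3))
```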
{"title":"Deep learning models for predicting the position of the head on an X-ray image for Cephalometric analysis","authors":"K. Prasanna, Chinna Babu Jyothi, M. S. Kumar, J. Prabhu, A. Saif, Dinesh Jackson Samuel","doi":"10.3233/ida-237430","DOIUrl":"https://doi.org/10.3233/ida-237430","url":null,"abstract":"Cephalometric analysis is used to identify problems in the development of the skull, evaluate their treatment, and plan for possible surgical interventions. The paper aims to develop a Convolutional Neural Network that will analyze the head position on an X-ray image. It takes place in such a way that it recognizes whether the image is suitable and, if not, suggests a change in the position of the head for correction. This paper addresses the exact rotation of the head with a change in the range of a few degrees of rotation. The objective is to predict the correct head position to take an X-ray image for further Cephalometric analysis. The changes in the degree of rotations were categorized into 5 classes. Deep learning models predict the correct head position for Cephalometric analysis. An X-ray image dataset on the head is generated using CT scan images. The generated images are categorized into 5 classes based on a few degrees of rotations. A set of four deep-learning models were then used to generate the generated X-Ray images for analysis. This research work makes use of four CNN-based networks. These networks are trained on a dataset to predict the accurate head position on generated X-Ray images for analysis. Two networks of VGG-Net, one is the U-Net and the last is of the ResNet type. The experimental analysis ascertains that VGG-4 outperformed the VGG-3, U-Net, and ResNet in estimating the head position to take an X-ray on a test dataset with a measured accuracy of 98%. 
It is due to the incorrectly classified images are classified that are directly adjacent to the correct ones at intervals and the misclassification rate is significantly reduced.","PeriodicalId":50355,"journal":{"name":"Intelligent Data Analysis","volume":" ","pages":""},"PeriodicalIF":1.7,"publicationDate":"2023-08-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48116235","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Ana Vieira, João Carneiro, Paulo Novais, Juan Corchado, Goreti Marreiros
A large percentage of the worldwide population is affected by chronic diseases, placing a burden on patients and national healthcare systems. Recommendation systems are used to personalize healthcare because of their capacity to perform predictive analyses based on patients' clinical data. This systematic literature review poses four research questions to provide an overall state of the art of recommendation systems applied to the healthcare of patients with chronic diseases. Disease management was identified as the main purpose of the systems proposed in the literature; however, few solutions support physicians in clinical decision-making. Ontologies and rule-based systems were the artificial intelligence techniques most used, since they can easily implement clinical guidelines. Current challenges of these systems include low adherence, data sparsity, heterogeneous data, and explainability, all of which affect the success of a recommendation system. The results also show that few systems support patients with multiple chronic conditions. The findings of this literature review should be considered in the development of future recommendation systems that aim to support the management of chronic diseases.
{"title":"A systematic review on recommendation systems applied to chronic diseases","authors":"Ana Vieira, João Carneiro, Paulo Novais, Juan Corchado, Goreti Marreiros","doi":"10.3233/ida-220394","DOIUrl":"https://doi.org/10.3233/ida-220394","url":null,"abstract":"A large percentage of the worldwide population is affected by chronic diseases, leading to a burden of the patient and the national healthcare systems. Recommendation systems are used for the personalization of healthcare due to their capacity of performing predictive analyses based on the patient’s clinical data. This systematic literature review presents four research questions to provide an overall state of the art of the use of recommendation systems applied to the healthcare of patients with chronic diseases. Disease management was identified as the main purpose of the systems proposed in the literature. However, few solutions provide support to physicians in the clinical decision-making. Ontologies and rule-based systems were the artificial intelligence techniques most used in the systems since they can easily implement clinical guidelines. Current challenges of these systems include the low adherence, data sparsity, heterogeneous data, and explainability, that affect the success of the recommendation system. The results also show that there are few systems that provide support to patients with multiple chronic conditions. 
The findings of this literature review should be considered in the development of future recommendation systems that aim to support the management of chronic diseases.","PeriodicalId":50355,"journal":{"name":"Intelligent Data Analysis","volume":"310 2","pages":""},"PeriodicalIF":1.7,"publicationDate":"2023-08-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138527040","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Wan-Shu Cheng, Pengchi Huang, Jheng-Yu Huang, Ju-Chin Chen, K. W. Lin
The amount of information nowadays is growing rapidly. Alongside valuable information, information that is unrelated to a target or meaningless is also growing. Big data and broader digital technologies are considered the primary components of smart city governance and planning, and big data analysis is considered to define a new era in urban planning, research, and policy. Effective data mining and pattern detection techniques are therefore becoming very important. Processing such a large amount of data entails data mining, a technique that clarifies the associations within valid information and excludes irrelevant data in order to build a practical decision tree. A large amount of data affects processing time and I/O costs during mining. This study proposes to distribute the data among multiple clients so that the heavy computation is shared equally, improving the resource cost of exploration. The main server then consolidates the clients' computation results and generates the final result. Experimental results show that the proposed algorithm is superior, allowing a larger amount of data to be processed while producing high-quality results.
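The distribution idea above, clients computing locally while the server consolidates, can be sketched minimally: each client returns class counts over its partition and the server merges them to evaluate the entropy C4.5 needs for a split, so raw records never travel to one node. This is an illustration of the pattern, not the authors' implementation.

```python
import math
from collections import Counter

def local_counts(partition):
    """Client side: count class labels in the local data partition."""
    return Counter(label for _, label in partition)

def merged_entropy(counts_list):
    """Server side: merge per-client counts, then compute Shannon entropy,
    the quantity C4.5 uses when scoring candidate splits."""
    total = Counter()
    for counts in counts_list:
        total.update(counts)
    n = sum(total.values())
    return -sum((v / n) * math.log2(v / n) for v in total.values())

# Two clients holding disjoint partitions of a toy labelled dataset.
client_a = [({"x": 1}, "yes"), ({"x": 2}, "yes")]
client_b = [({"x": 3}, "no"), ({"x": 4}, "no")]
print(merged_entropy([local_counts(client_a), local_counts(client_b)]))  # 1.0
```

Only the (attribute value, class) counts cross the network, which is what lets the computation scale with the number of clients.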
{"title":"A fast and distributed C4.5 algorithm for urban big data","authors":"Wan-Shu Cheng, Pengchi Huang, Jheng-Yu Huang, Ju-Chin Chen, K. W. Lin","doi":"10.3233/ida-220753","DOIUrl":"https://doi.org/10.3233/ida-220753","url":null,"abstract":"The amount of information nowadays is rapidly growing. Aside from valuable information, information that is unrelated to a target or is meaningless is also growing. Big data and broader digital technologies are considered the primary components of smart city governance and planning. Big data analysis is considered to define a new era in urban planning, research, and policy. Effective data mining and pattern detection techniques are becoming very important these days. Processing such a large amount of data entails the use of data mining, a technique that clarifies the association between valid information and excludes irrelevant data to implement a practical decision tree. A large amount of data affects processing time and I/O costs during data mining. This study proposes to distribute data among multiple clients and distribute a large amount of data computation equally to improve the resource cost problem of exploration. Following that, the main server consolidates the computation results and generates the survey results. 
Experiment results show that the proposed algorithm is superior, thus allowing a larger amount of data to be processed while producing high-quality results.","PeriodicalId":50355,"journal":{"name":"Intelligent Data Analysis","volume":" ","pages":""},"PeriodicalIF":1.7,"publicationDate":"2023-08-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44015817","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
As an important part of digital building, the building internet of things (BIoT) plays a positive role in promoting the construction of smart cities. Existing schemes use blockchain to achieve trusted data storage in BIoT. However, the full-copy storage mechanism of blockchain and the management requirements of massive data bring computing and storage challenges to edge nodes with limited resources. Therefore, a data management scheme for BIoT based on blockchain sharding is proposed. The scheme introduces a hybrid storage mechanism that uses the inter-planetary file system (IPFS) to ensure the integrity and availability of off-chain data and reduces the storage overhead of edge nodes. On top of the hybrid storage mechanism, a sharding algorithm divides the blockchain into multiple shards and offloads the storage and computing overhead to each shard, effectively balancing the load on edge nodes. Finally, a comparative analysis with existing schemes verified the effectiveness of the proposed scheme from the perspectives of storage overhead, computation overhead, access delay, and throughput. The results show that the proposed scheme can effectively reduce the storage and computing overhead of edge nodes in the BIoT scenario.
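The hybrid storage mechanism described above can be sketched minimally: bulky payloads go to content-addressed off-chain storage while the chain keeps only the hash, so any shard node can verify integrity without holding the full record. Here a dict stands in for IPFS and a list for one shard's ledger; both stand-ins are assumptions for illustration.

```python
import hashlib

off_chain = {}  # stand-in for IPFS: content hash -> payload
chain = []      # stand-in for one shard's ledger of content hashes

def store(payload: bytes) -> str:
    """Put the payload off-chain and record only its SHA-256 hash on chain."""
    digest = hashlib.sha256(payload).hexdigest()
    off_chain[digest] = payload
    chain.append(digest)
    return digest

def verify(digest: str) -> bool:
    """Integrity check: recompute the hash of the off-chain payload."""
    return hashlib.sha256(off_chain[digest]).hexdigest() == digest

h = store(b'{"sensor": "hvac-12", "temp": 21.5}')
print(verify(h))  # True
```

Because the hash is content-derived, tampering with the off-chain copy is detectable by any node that holds the on-chain digest.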
{"title":"Data management scheme for building internet of things based on blockchain sharding","authors":"Xu Wang, Wenhu Zheng, Jinlong Wang, Xiaoyun Xiong, Yumin Shen, Wei Mu, Zengliang Fan","doi":"10.3233/ida-220757","DOIUrl":"https://doi.org/10.3233/ida-220757","url":null,"abstract":"As an important part of digital building, building internet of things (BIoT) plays a positive role in promoting the construction of smart cities. Existing schemes utilize blockchain to achieve trusted data storage in BIoT. However, the full-copy storage mechanism of blockchain and the management requirements of massive data have brought computing and storage challenges to edge nodes with limited resources. Therefore, a data management scheme for BIoT based on blockchain sharding is proposed. The scheme proposes a hybrid storage mechanism, which uses inter-planetary file system (IPFS) to ensure the integrity and availability of data outside the chain, and reduces the storage overhead of edge nodes. Based on the hybrid storage mechanism, the sharding algorithm is designed to divide the blockchain into multiple shards, and the storage overhead and computing overhead are offloaded to each shard, which effectively balances the computing and storage overhead of edge nodes. Finally, comparative analysis was made with existing schemes, and effectiveness of proposed scheme was verified from the perspectives of storage overhead, computation overhead, access delay and throughput. 
Results show that proposed scheme can effectively reduce storage overhead and computing overhead of edge nodes in BIoT scenario.","PeriodicalId":50355,"journal":{"name":"Intelligent Data Analysis","volume":"264 9","pages":""},"PeriodicalIF":1.7,"publicationDate":"2023-08-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41290409","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Code search, which locates code snippets in large code repositories based on natural language queries entered by developers, has become increasingly popular in the software development process, as it has the potential to improve developer efficiency. Recent studies have demonstrated the effectiveness of deep learning techniques for representing queries and code accurately in code search; in particular, pre-trained models of programming languages have recently achieved significant progress. However, we argue that aligning programming and natural languages is crucial, as they are two different modalities. Existing pre-trained-model-based approaches for code search do not effectively consider implicit alignments of representations across modalities (inter-modal representation). Moreover, existing methods do not take into account the consistency constraint on intra-modal representations, which limits their effectiveness. We therefore propose a novel code search method that optimizes both intra-modal and inter-modal representation learning. The alignment of representations between the two modalities is achieved by introducing contrastive learning, and the consistency of intra-modal feature representations is constrained by KL-divergence. Our experimental results confirm the model's effectiveness on seven different test datasets, showing that the proposed method significantly improves on existing methods. Our source code is publicly available on GitHub.1
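The two objectives named above can be illustrated with standard formulations: an InfoNCE-style contrastive term for inter-modal alignment and a KL-divergence term for intra-modal consistency. These are assumed, textbook forms, not necessarily the paper's exact losses.

```python
import math

def cosine(u, v):
    """Cosine similarity between two embedding vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def info_nce(query, positive, negatives, temperature=0.1):
    """Inter-modal alignment: negative log softmax weight of the positive pair."""
    sims = [cosine(query, positive)] + [cosine(query, n) for n in negatives]
    exps = [math.exp(s / temperature) for s in sims]
    return -math.log(exps[0] / sum(exps))

def kl_divergence(p, q):
    """Intra-modal consistency: divergence between two feature distributions."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q))

# A query aligned with its paired code embedding incurs a much smaller
# contrastive loss than a misaligned one.
aligned = info_nce([1.0, 0.0], [0.9, 0.1], negatives=[[0.0, 1.0]])
misaligned = info_nce([1.0, 0.0], [0.0, 1.0], negatives=[[0.9, 0.1]])
print(aligned < misaligned)  # True
```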
{"title":"I2R: Intra and inter-modal representation learning for code search","authors":"Xu Zhang, Yanzheng Xiang, Zejie Liu, Xiaoyu Hu, Deyu Zhou","doi":"10.3233/ida-230082","DOIUrl":"https://doi.org/10.3233/ida-230082","url":null,"abstract":"Code search, which locates code snippets in large code repositories based on natural language queries entered by developers, has become increasingly popular in the software development process. It has the potential to improve the efficiency of software developers. Recent studies have demonstrated the effectiveness of using deep learning techniques to represent queries and codes accurately for code search. In specific, pre-trained models of programming languages have recently achieved significant progress in code searching. However, we argue that aligning programming and natural languages are crucial as there are two different modalities. Existing pre-train models based approaches for code search do not effectively consider implicit alignments of representations across modalities (inter-modal representation). Moreover, the existing methods do not take into account the consistency constraint of intra-modal representations, making the model ineffective. As a result, we propose a novel code search method that optimizes both intra-modal and inter-modal representation learning. The alignment of the representation between the two modalities is achieved by introducing contrastive learning. Furthermore, the consistency of intra-modal feature representation is constrained by KL-divergence. Our experimental results confirm the model’s effectiveness on seven different test datasets. This paper proposes a code search method that significantly improves existing methods. 
Our source code is publicly available on GitHub.1","PeriodicalId":50355,"journal":{"name":"Intelligent Data Analysis","volume":" ","pages":""},"PeriodicalIF":1.7,"publicationDate":"2023-08-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45514729","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Jie Yang, Zhenhao Jiang, Tingting Pan, Yueqi Chen, W. Pedrycz
Data-imbalanced problems are present in many applications. A large gap in the number of samples across classes induces classifiers to skew toward the majority class, diminishing both learning performance and the quality of the results. Most data-level imbalanced learning approaches generate new samples using only the information associated with the minority samples, through linear generation or data-distribution fitting. In contrast to these algorithms, we propose a novel oversampling method based on generative adversarial networks (GANs), named OS-GAN. In this method, the GAN learns the distribution characteristics of the minority class from selected majority samples rather than from random noise. As a result, samples released by the trained generator carry information from both the majority and minority classes. Furthermore, a central regularization keeps the distribution of the synthetic samples from being restricted to the domain of the minority class, which can improve the generalization of learning models. Experimental results on 14 datasets and one high-dimensional dataset show that OS-GAN outperforms 14 commonly used resampling techniques in terms of G-mean, accuracy, and F1-score.
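One of the evaluation metrics above, G-mean, is the geometric mean of per-class recall, which is why it is favoured for imbalanced data: a classifier that ignores the minority class scores zero. A minimal binary-case sketch:

```python
import math

def g_mean(y_true, y_pred):
    """Geometric mean of sensitivity (recall on class 1) and specificity
    (recall on class 0) for binary labels."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    pos = sum(1 for t in y_true if t == 1)
    neg = len(y_true) - pos
    sensitivity = tp / pos
    specificity = tn / neg
    return math.sqrt(sensitivity * specificity)

# Imbalanced toy labels: 2 minority positives, 4 majority negatives.
y_true = [1, 1, 0, 0, 0, 0]
y_pred = [1, 0, 0, 0, 0, 1]
print(round(g_mean(y_true, y_pred), 3))  # sqrt(0.5 * 0.75) = 0.612
```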
{"title":"Oversampling method based on GAN for tabular binary classification problems","authors":"Jie Yang, Zhenhao Jiang, Tingting Pan, Yueqi Chen, W. Pedrycz","doi":"10.3233/ida-220383","DOIUrl":"https://doi.org/10.3233/ida-220383","url":null,"abstract":"Data-imbalanced problems are present in many applications. A big gap in the number of samples in different classes induces classifiers to skew to the majority class and thus diminish the performance of learning and quality of obtained results. Most data level imbalanced learning approaches generate new samples only using the information associated with the minority samples through linearly generating or data distribution fitting. Different from these algorithms, we propose a novel oversampling method based on generative adversarial networks (GANs), named OS-GAN. In this method, GAN is assigned to learn the distribution characteristics of the minority class from some selected majority samples but not random noise. As a result, samples released by the trained generator carry information of both majority and minority classes. Furthermore, the central regularization makes the distribution of all synthetic samples not restricted to the domain of the minority class, which can improve the generalization of learning models or algorithms. 
Experimental results reported on 14 datasets and one high-dimensional dataset show that OS-GAN outperforms 14 commonly used resampling techniques in terms of G-mean, accuracy and F1-score.","PeriodicalId":50355,"journal":{"name":"Intelligent Data Analysis","volume":" ","pages":""},"PeriodicalIF":1.7,"publicationDate":"2023-08-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43542303","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
N. Biswas, Sourav Banerjee, Uttam Ghosh, U. Biswas
Smart cities in urban areas are becoming more intelligent day by day. Massive storage and high computational resources are required to provide smart services in urban areas, and they can be provided through intelligent cloud computing. The establishment of large-scale cloud data centres is rapidly increasing to provide utility-based services in urban areas. The enormous energy consumption of data centres has a destructive effect on the environment: because of it, a massive amount of greenhouse gases (GHG) is emitted. Virtual Machine (VM) consolidation can improve energy efficiency and reduce the energy consumption of cloud data centres; however, reducing energy consumption can increase Service Level Agreement (SLA) violations. Therefore, in this research, an energy-efficient dynamic VM consolidation model has been proposed to reduce the energy consumption of cloud data centres and curb SLA violations. Novel algorithms have been proposed to accomplish the VM consolidation. A new host status, the almost-overloaded host, has been introduced; it is determined by a novel algorithm based on the Naive Bayes classifier machine learning (ML) model. A new algorithm based on exponential binary search is proposed to perform the VM selection. Finally, a new Modified Power-Aware Best Fit Decreasing (MPABFD) VM allocation policy is proposed to allocate all VMs. The proposed model has been compared with certain well-known baseline algorithms. The comparison shows that the proposed model reduces energy consumption by 25% and SLA violations by 87%.
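The VM-selection primitive named above, exponential (galloping) binary search, can be sketched as follows. The selection criterion here, picking the first VM whose CPU demand covers the capacity that must be freed, is an illustrative assumption, not the authors' exact policy.

```python
import bisect

def exponential_search(sorted_vals, target):
    """Return the index of the first element >= target by doubling a probe
    bound, then binary-searching only the bracketed range. This is O(log i)
    where i is the answer's index, cheaper than a full binary search when
    the match lies near the front of a long list."""
    if not sorted_vals or sorted_vals[0] >= target:
        return 0
    bound = 1
    while bound < len(sorted_vals) and sorted_vals[bound] < target:
        bound *= 2
    # The previous bound (bound // 2) is known to be < target.
    return bisect.bisect_left(sorted_vals, target,
                              bound // 2, min(bound + 1, len(sorted_vals)))

# VMs on an almost-overloaded host, sorted by CPU demand (assumed MIPS figures).
vm_cpu = [120, 250, 400, 780, 900, 1500]
overload = 700  # capacity that must be freed (assumed figure)
print(vm_cpu[exponential_search(vm_cpu, overload)])  # 780
```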
{"title":"Design of an energy efficient dynamic virtual machine consolidation model for smart cities in urban areas","authors":"N. Biswas, Sourav Banerjee, Uttam Ghosh, U. Biswas","doi":"10.3233/ida-220754","DOIUrl":"https://doi.org/10.3233/ida-220754","url":null,"abstract":"The growing smart cities in urban areas are becoming more intelligent day by day. Providing smart services in urban areas requires massive storage and high computational resources, which intelligent cloud computing can supply. The establishment of large-scale cloud data centres is rapidly increasing to provide utility-based services in urban areas. The enormous energy consumption of these data centres has a destructive effect on the environment, as it causes a massive amount of greenhouse gases (GHG) to be emitted. Virtual Machine (VM) consolidation can improve energy efficiency and reduce the energy consumption of cloud data centres. However, reducing energy consumption can increase Service Level Agreement (SLA) violations. Therefore, in this research, an energy-efficient dynamic VM consolidation model has been proposed to reduce the energy consumption of cloud data centres while curbing SLA violations. Novel algorithms have been proposed to accomplish the VM consolidation. A new host status, the almost-overloaded host, has been introduced; it is determined by a novel algorithm based on the Naive Bayes Classifier Machine Learning (ML) model. A new algorithm based on exponential binary search is proposed to perform VM selection. Finally, a new Modified Power-Aware Best Fit Decreasing (MPABFD) VM allocation policy is proposed to allocate all VMs. The proposed model has been compared with well-known baseline algorithms. The comparison shows that the proposed model reduces energy consumption by 25% and SLA violations by 87%.","PeriodicalId":50355,"journal":{"name":"Intelligent Data Analysis","volume":" ","pages":""},"PeriodicalIF":1.7,"publicationDate":"2023-08-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46587638","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}