Pub Date : 2023-11-06eCollection Date: 2023-12-01DOI: 10.1007/s13755-023-00250-x
Liyu Huang, Qingfeng Chen, Wei Lan
Predicting drug-drug interactions (DDIs) has become a major concern in the drug research field because it helps explore the pharmacological function of drugs and enables the development of new therapeutic drugs. Existing prediction methods simply integrate multiple drug attributes or perform tasks on a biomedical knowledge graph (KG). Though effective, few methods can fully utilize multi-source drug data information. In this paper, a multi-view and multichannel attention deep learning (MMADL) model is proposed, which not only extracts rich drug features containing both drug attributes and drug-related entity information from multi-source databases, but also considers the consistency and complementarity of different drug feature representation learning approaches to improve the effectiveness and accuracy of DDI prediction. A single-layer perceptron encoder is applied to encode multi-source drug information to obtain multi-view drug representation vectors in the same linear space. Then, the multichannel attention mechanism is introduced to obtain the attention weight by adaptively learning the importance of drug features according to their contributions to DDI prediction. Further, the representation vectors of multi-view drug pairs with attention weights are used as inputs of the deep neural network to predict potential DDI. The accuracy and precision-recall curves of MMADL are 93.05 and 95.94, respectively. The results indicate that the proposed method outperforms other state-of-the-art methods.
{"title":"Predicting drug-drug interactions based on multi-view and multichannel attention deep learning.","authors":"Liyu Huang, Qingfeng Chen, Wei Lan","doi":"10.1007/s13755-023-00250-x","DOIUrl":"10.1007/s13755-023-00250-x","url":null,"abstract":"<p><p>Predicting drug-drug interactions (DDIs) has become a major concern in the drug research field because it helps explore the pharmacological function of drugs and enables the development of new therapeutic drugs. Existing prediction methods simply integrate multiple drug attributes or perform tasks on a biomedical knowledge graph (KG). Though effective, few methods can fully utilize multi-source drug data information. In this paper, a multi-view and multichannel attention deep learning (MMADL) model is proposed, which not only extracts rich drug features containing both drug attributes and drug-related entity information from multi-source databases, but also considers the consistency and complementarity of different drug feature representation learning approaches to improve the effectiveness and accuracy of DDI prediction. A single-layer perceptron encoder is applied to encode multi-source drug information to obtain multi-view drug representation vectors in the same linear space. Then, the multichannel attention mechanism is introduced to obtain the attention weight by adaptively learning the importance of drug features according to their contributions to DDI prediction. Further, the representation vectors of multi-view drug pairs with attention weights are used as inputs of the deep neural network to predict potential DDI. The accuracy and precision-recall curves of MMADL are 93.05 and 95.94, respectively. The results indicate that the proposed method outperforms other state-of-the-art methods.</p>","PeriodicalId":46312,"journal":{"name":"Health Information Science and Systems","volume":"11 1","pages":"50"},"PeriodicalIF":6.0,"publicationDate":"2023-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10628064/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"71522917","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-10-17eCollection Date: 2023-12-01DOI: 10.1007/s13755-023-00251-w
Jing Zhang, Jianhua Li, Yi Zhu, Yu Fu, Lixia Chen
Thyroid diseases, especially thyroid tumors, have a huge population in China. The postoperative patients, under China's incomplete tertiary diagnosis and treatment system, will frequently go to tertiary hospitals for follow-up and medication adjustment, resulting in heavy burdens on both specialists and patients. To help postoperative patients recover better against the above adverse conditions, a novel mobile application ThyroidKeeper is proposed as a collaborative AI-based platform that benefits both patients and doctors. In addition to routine health records and management functions, ThyroidKeeper has achieved several innovative points. First, it can automatically adjust medication dosage for patients during their rehabilitation based on their medical history, laboratory indicators, physical health status, and current medication. Second, it can comprehensively predict the possible complications based on the patient's health status and the health status of similar groups utilizing graph neural networks. Finally, the employing of graph neural network models can improve the efficiency of online communication between doctors and patients, help doctors obtain medical information for patients more quickly and precisely, and make more accurate diagnoses. The preliminary evaluation in both laboratory and real-world environments shows the advantages of the proposed ThyroidKeeper system.
{"title":"Thyroidkeeper: a healthcare management system for patients with thyroid diseases.","authors":"Jing Zhang, Jianhua Li, Yi Zhu, Yu Fu, Lixia Chen","doi":"10.1007/s13755-023-00251-w","DOIUrl":"10.1007/s13755-023-00251-w","url":null,"abstract":"<p><p>Thyroid diseases, especially thyroid tumors, have a huge population in China. The postoperative patients, under China's incomplete tertiary diagnosis and treatment system, will frequently go to tertiary hospitals for follow-up and medication adjustment, resulting in heavy burdens on both specialists and patients. To help postoperative patients recover better against the above adverse conditions, a novel mobile application ThyroidKeeper is proposed as a collaborative AI-based platform that benefits both patients and doctors. In addition to routine health records and management functions, ThyroidKeeper has achieved several innovative points. First, it can automatically adjust medication dosage for patients during their rehabilitation based on their medical history, laboratory indicators, physical health status, and current medication. Second, it can comprehensively predict the possible complications based on the patient's health status and the health status of similar groups utilizing graph neural networks. Finally, the employing of graph neural network models can improve the efficiency of online communication between doctors and patients, help doctors obtain medical information for patients more quickly and precisely, and make more accurate diagnoses. The preliminary evaluation in both laboratory and real-world environments shows the advantages of the proposed ThyroidKeeper system.</p>","PeriodicalId":46312,"journal":{"name":"Health Information Science and Systems","volume":"11 1","pages":"49"},"PeriodicalIF":6.0,"publicationDate":"2023-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10582002/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49683477","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Purpose: To address the contentious data sharing across hospitals, this study adopted a novel approach, federated learning (FL), to establish an aggregate model for acute kidney injury (AKI) prediction in critically ill patients in Taiwan.
Methods: This study used data from the Critical Care Database of Taichung Veterans General Hospital (TCVGH) from 2015 to 2020 and electrical medical records of the intensive care units (ICUs) between 2018 and 2020 of four referral centers in different areas across Taiwan. AKI prediction models were trained and validated thereupon. An FL-based prediction model across hospitals was then established.
Results: The study included 16,732 ICU admissions from the TCVGH and 38,424 ICU admissions from the other four hospitals. The complete model with 60 features and the parsimonious model with 21 features demonstrated comparable accuracies using extreme gradient boosting, neural network (NN), and random forest, with an area under the receiver-operating characteristic (AUROC) curve of approximately 0.90. The Shapley Additive Explanations plot demonstrated that the selected features were the key clinical components of AKI for critically ill patients. The AUROC curve of the established parsimonious model for external validation at the four hospitals ranged from 0.760 to 0.865. NN-based FL slightly improved the model performance at the four centers.
Conclusion: A reliable prediction model for AKI in ICU patients was developed with a lead time of 24 h, and it performed better when the novel FL platform across hospitals was implemented.
Supplementary information: The online version contains supplementary material available at 10.1007/s13755-023-00248-5.
{"title":"Federated machine learning for predicting acute kidney injury in critically ill patients: a multicenter study in Taiwan.","authors":"Chun-Te Huang, Tsai-Jung Wang, Li-Kuo Kuo, Ming-Ju Tsai, Cong-Tat Cia, Dung-Hung Chiang, Po-Jen Chang, Inn-Wen Chong, Yi-Shan Tsai, Yuan-Chia Chu, Chia-Jen Liu, Cheng-Hsu Chen, Kai-Chih Pai, Chieh-Liang Wu","doi":"10.1007/s13755-023-00248-5","DOIUrl":"10.1007/s13755-023-00248-5","url":null,"abstract":"<p><strong>Purpose: </strong>To address the contentious data sharing across hospitals, this study adopted a novel approach, federated learning (FL), to establish an aggregate model for acute kidney injury (AKI) prediction in critically ill patients in Taiwan.</p><p><strong>Methods: </strong>This study used data from the Critical Care Database of Taichung Veterans General Hospital (TCVGH) from 2015 to 2020 and electrical medical records of the intensive care units (ICUs) between 2018 and 2020 of four referral centers in different areas across Taiwan. AKI prediction models were trained and validated thereupon. An FL-based prediction model across hospitals was then established.</p><p><strong>Results: </strong>The study included 16,732 ICU admissions from the TCVGH and 38,424 ICU admissions from the other four hospitals. The complete model with 60 features and the parsimonious model with 21 features demonstrated comparable accuracies using extreme gradient boosting, neural network (NN), and random forest, with an area under the receiver-operating characteristic (AUROC) curve of approximately 0.90. The Shapley Additive Explanations plot demonstrated that the selected features were the key clinical components of AKI for critically ill patients. The AUROC curve of the established parsimonious model for external validation at the four hospitals ranged from 0.760 to 0.865. NN-based FL slightly improved the model performance at the four centers.</p><p><strong>Conclusion: </strong>A reliable prediction model for AKI in ICU patients was developed with a lead time of 24 h, and it performed better when the novel FL platform across hospitals was implemented.</p><p><strong>Supplementary information: </strong>The online version contains supplementary material available at 10.1007/s13755-023-00248-5.</p>","PeriodicalId":46312,"journal":{"name":"Health Information Science and Systems","volume":"11 1","pages":"48"},"PeriodicalIF":6.0,"publicationDate":"2023-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10562351/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41215739","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Accurate differentiation between pulmonary arteries and veins (A/V) holds pivotal importance in the realm of diagnosing and treating pulmonary ailments. This study presents a new approach that leverages grayscale differences between A/V. Distinctions are measured using median and mean grayscale values within the vessel area. Initially, adherent regions are removed based on vessel structure. The trunk regions are segmented using gray level information near the heart region of the lung boundary. Incorrectly segmented vessels are corrected based on connectivity. For distal lung vessels, a similar distance field is established using a graph-cut method. Experimental results show the algorithm's superior segmentation accuracy, achieving 97.26% compared to the CNN-based average accuracy of 91.67%. Error branches are more concentrated, aiding subsequent manual and automatic correction. This demonstrates the algorithm's effective segmentation of pulmonary A/V.
{"title":"A new segment method for pulmonary artery and vein.","authors":"Qinghua Zhou, Wenjun Tan, Qingya Li, Baoting Li, Luyu Zhou, Xin Liu, Jinzhu Yang, Dazhe Zhao","doi":"10.1007/s13755-023-00245-8","DOIUrl":"10.1007/s13755-023-00245-8","url":null,"abstract":"<p><p>Accurate differentiation between pulmonary arteries and veins (A/V) holds pivotal importance in the realm of diagnosing and treating pulmonary ailments. This study presents a new approach that leverages grayscale differences between A/V. Distinctions are measured using median and mean grayscale values within the vessel area. Initially, adherent regions are removed based on vessel structure. The trunk regions are segmented using gray level information near the heart region of the lung boundary. Incorrectly segmented vessels are corrected based on connectivity. For distal lung vessels, a similar distance field is established using a graph-cut method. Experimental results show the algorithm's superior segmentation accuracy, achieving 97.26% compared to the CNN-based average accuracy of 91.67%. Error branches are more concentrated, aiding subsequent manual and automatic correction. This demonstrates the algorithm's effective segmentation of pulmonary A/V.</p>","PeriodicalId":46312,"journal":{"name":"Health Information Science and Systems","volume":"11 1","pages":"47"},"PeriodicalIF":4.7,"publicationDate":"2023-10-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10558422/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41178706","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-09-28eCollection Date: 2023-12-01DOI: 10.1007/s13755-023-00247-6
Zhicheng Wang, Hongqing Zhu, Bingcang Huang, Ziying Wang, Weiping Lu, Ning Chen, Ying Wang
Due to the unavailability of source domain data encountered in unsupervised domain adaptation, there has been an increasing number of studies on source-free domain adaptation (SFDA) in recent years. To better solve the SFDA problem and effectively leverage the multi-modal information in medical images, this paper presents a novel SFDA method for multi-modal stroke lesion segmentation in which evidential deep learning instead of convolutional neural network. Specifically, for multi-modal stroke images, we design a multi-modal opinion fusion module which uses Dempster-Shafer evidence theory for decision fusion of different modalities. Besides, for the SFDA problem, we use the pseudo label learning method, which obtains pseudo labels from the pre-trained source model to perform the adaptation process. To solve the unreliability of pseudo label caused by domain shift, we propose a pseudo label filtering scheme using shadowed sets theory and a pseudo label refining scheme using evidential uncertainty. These two schemes can automatically extract unreliable parts in pseudo labels and jointly improve the quality of pseudo labels with low computational costs. Experiments on two multi-modal stroke lesion datasets demonstrate the superiority of our method over other state-of-the-art SFDA methods.
{"title":"M-MSSEU: source-free domain adaptation for multi-modal stroke lesion segmentation using shadowed sets and evidential uncertainty.","authors":"Zhicheng Wang, Hongqing Zhu, Bingcang Huang, Ziying Wang, Weiping Lu, Ning Chen, Ying Wang","doi":"10.1007/s13755-023-00247-6","DOIUrl":"10.1007/s13755-023-00247-6","url":null,"abstract":"<p><p>Due to the unavailability of source domain data encountered in unsupervised domain adaptation, there has been an increasing number of studies on source-free domain adaptation (SFDA) in recent years. To better solve the SFDA problem and effectively leverage the multi-modal information in medical images, this paper presents a novel SFDA method for multi-modal stroke lesion segmentation in which evidential deep learning instead of convolutional neural network. Specifically, for multi-modal stroke images, we design a multi-modal opinion fusion module which uses Dempster-Shafer evidence theory for decision fusion of different modalities. Besides, for the SFDA problem, we use the pseudo label learning method, which obtains pseudo labels from the pre-trained source model to perform the adaptation process. To solve the unreliability of pseudo label caused by domain shift, we propose a pseudo label filtering scheme using shadowed sets theory and a pseudo label refining scheme using evidential uncertainty. These two schemes can automatically extract unreliable parts in pseudo labels and jointly improve the quality of pseudo labels with low computational costs. Experiments on two multi-modal stroke lesion datasets demonstrate the superiority of our method over other state-of-the-art SFDA methods.</p>","PeriodicalId":46312,"journal":{"name":"Health Information Science and Systems","volume":"11 1","pages":"46"},"PeriodicalIF":4.7,"publicationDate":"2023-09-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10539264/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41162618","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-09-26eCollection Date: 2023-12-01DOI: 10.1007/s13755-023-00246-7
Linchen Liu, Yiyang Zhang, Le Sun
Efficient and accurate medical image classification (MIC) methods face two major challenges: (1) high similarity between images of different disease classes; and (2) generating large medical image datasets for training deep neural networks is challenging due to privacy restrictions and the need for expert ground truth annotations. In this paper, we introduce a novel deep learning method called pre-training grayscale images with supervised learning for MIC (MediMatrix). Instead of pre-training on color ImageNet, our approach uses MediMatrix on grayscale ImageNet. To improve the performance of the network, we introduce ShuffleAttention (SA), a self-attention mechanism. By combining SA with the multiple residual structure (ResSA block) and replacing short-cut connections with dense residual connections between corresponding layers (densepath), our network can dynamically adjust channel attention weights and receive image inputs of different sizes, resulting in improved feature representation and better discrimination of similarities between different categories. MediMatrix effectively classifies X-ray images of rheumatoid arthritis (RA), enabling efficient screening without the need for expert analysis or invasive testing. Through extensive experiments, we demonstrate the superiority of MediMatrix over state-of-the-art methods and that color is not critical for rich natural image classification. Our results highlight the potential of computer-aided diagnosis combined with MediMatrix as a valuable screening tool for early detection and intervention in RA.
{"title":"Medimatrix: innovative pre-training of grayscale images for rheumatoid arthritis diagnosis revolutionises medical image classification.","authors":"Linchen Liu, Yiyang Zhang, Le Sun","doi":"10.1007/s13755-023-00246-7","DOIUrl":"10.1007/s13755-023-00246-7","url":null,"abstract":"<p><p>Efficient and accurate medical image classification (MIC) methods face two major challenges: (1) high similarity between images of different disease classes; and (2) generating large medical image datasets for training deep neural networks is challenging due to privacy restrictions and the need for expert ground truth annotations. In this paper, we introduce a novel deep learning method called pre-training grayscale images with supervised learning for MIC (MediMatrix). Instead of pre-training on color ImageNet, our approach uses MediMatrix on grayscale ImageNet. To improve the performance of the network, we introduce ShuffleAttention (SA), a self-attention mechanism. By combining SA with the multiple residual structure (ResSA block) and replacing short-cut connections with dense residual connections between corresponding layers (densepath), our network can dynamically adjust channel attention weights and receive image inputs of different sizes, resulting in improved feature representation and better discrimination of similarities between different categories. MediMatrix effectively classifies X-ray images of rheumatoid arthritis (RA), enabling efficient screening without the need for expert analysis or invasive testing. Through extensive experiments, we demonstrate the superiority of MediMatrix over state-of-the-art methods and that color is not critical for rich natural image classification. Our results highlight the potential of computer-aided diagnosis combined with MediMatrix as a valuable screening tool for early detection and intervention in RA.</p>","PeriodicalId":46312,"journal":{"name":"Health Information Science and Systems","volume":"11 1","pages":"44"},"PeriodicalIF":6.0,"publicationDate":"2023-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10522544/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41147866","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-09-26eCollection Date: 2023-12-01DOI: 10.1007/s13755-023-00235-w
Huanyu Zhao, Tongliang Li, Jian Yang, Chaoyi Pang
The baseline wander (BLW) in electrocardiogram (ECG) is a common disturbance that has a significant influence on the ECG wave pattern recognition. Many methods, such as IIR filter, mean filter, etc., can be used to correct BLW; However, most of them work on the original ECG signals. Compressed ECG data are economic for data storage and transmission, and if the baseline correction can be processed on them, it will be more efficient than we decompress them first and then do such correction. In this paper, we propose a new type of median filter CM_Filter, which works on the synopses of straight lines achieved from ECG by piecewise linear approximation (PLA) under maximum error bound. In CM_Filter, a heuristic strategy "Quick-Finding" is deduced by a property of straight lines in order to get the quality-assured median values from the synopses. The extended experimental tests demonstrate that the proposed filter is very efficient in execution time, and effective for correcting both slow and abrupt ECG baseline wander.
{"title":"An error-bounded median filter for correcting ECG baseline wander.","authors":"Huanyu Zhao, Tongliang Li, Jian Yang, Chaoyi Pang","doi":"10.1007/s13755-023-00235-w","DOIUrl":"10.1007/s13755-023-00235-w","url":null,"abstract":"<p><p>The baseline wander (BLW) in electrocardiogram (ECG) is a common disturbance that has a significant influence on the ECG wave pattern recognition. Many methods, such as IIR filter, mean filter, etc., can be used to correct BLW; However, most of them work on the original ECG signals. Compressed ECG data are economic for data storage and transmission, and if the baseline correction can be processed on them, it will be more efficient than we decompress them first and then do such correction. In this paper, we propose a new type of median filter <i>CM_Filter</i>, which works on the synopses of straight lines achieved from ECG by piecewise linear approximation (PLA) under maximum error bound. In <i>CM_Filter</i>, a heuristic strategy \"Quick-Finding\" is deduced by a property of straight lines in order to get the quality-assured median values from the synopses. The extended experimental tests demonstrate that the proposed filter is very efficient in execution time, and effective for correcting both slow and abrupt ECG baseline wander.</p>","PeriodicalId":46312,"journal":{"name":"Health Information Science and Systems","volume":"11 1","pages":"45"},"PeriodicalIF":6.0,"publicationDate":"2023-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10522562/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41173208","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The clinical manifestations of ischemic cardiomyopathy (ICM) bear resemblance to dilated cardiomyopathy (DCM). The definitive diagnosis of DCM necessitates the identification of invasive, costly, and contraindicated coronary angiography. Many diagnostic studies of cardiovascular disease have tried modal decomposition based on electrocardiogram (ECG) signals. However, these studies ignored the connection between modes and other fields, thus limiting the interpretability of modes to ECG signals and the classification performance of models. This study proposes a classification algorithm based on variational mode decomposition (VMD) and high order spectra, which decomposes the preprocessed ECG signal and extracts its first five modes obtained through VMD. After that, these modes are estimated for their corresponding bispectrums, and the feature vector is composed of fifteen features including bispectral, frequency, and nonlinear features based on this. Finally, a dataset containing 75 subjects (38 DCM, 37 ICM) is classified and compared using random forest (RF), decision tree, support vector machine, and K-nearest neighbor. The results show that, in comparison to previous approaches, the technique proposed provides a better categorization for DCM and ICM of ECG signals, which delivers 98.21% classification accuracy, 98.22% sensitivity, and 98.19% specificity. And mode 3 always has the best performance among single mode. The proposed computerized framework significantly improves automatic diagnostic performance, which can help relieve the working pressure on doctors, possible economic burden and health threaten.
{"title":"Differential diagnosis between dilated cardiomyopathy and ischemic cardiomyopathy based on variational mode decomposition and high order spectra analysis.","authors":"Yuduan Han, Yunyue Zhao, Zhuochen Lin, Zichao Liang, Siyang Chen, Jinxin Zhang","doi":"10.1007/s13755-023-00244-9","DOIUrl":"10.1007/s13755-023-00244-9","url":null,"abstract":"<p><p>The clinical manifestations of ischemic cardiomyopathy (ICM) bear resemblance to dilated cardiomyopathy (DCM). The definitive diagnosis of DCM necessitates the identification of invasive, costly, and contraindicated coronary angiography. Many diagnostic studies of cardiovascular disease have tried modal decomposition based on electrocardiogram (ECG) signals. However, these studies ignored the connection between modes and other fields, thus limiting the interpretability of modes to ECG signals and the classification performance of models. This study proposes a classification algorithm based on variational mode decomposition (VMD) and high order spectra, which decomposes the preprocessed ECG signal and extracts its first five modes obtained through VMD. After that, these modes are estimated for their corresponding bispectrums, and the feature vector is composed of fifteen features including bispectral, frequency, and nonlinear features based on this. Finally, a dataset containing 75 subjects (38 DCM, 37 ICM) is classified and compared using random forest (RF), decision tree, support vector machine, and K-nearest neighbor. The results show that, in comparison to previous approaches, the technique proposed provides a better categorization for DCM and ICM of ECG signals, which delivers 98.21% classification accuracy, 98.22% sensitivity, and 98.19% specificity. And mode 3 always has the best performance among single mode. The proposed computerized framework significantly improves automatic diagnostic performance, which can help relieve the working pressure on doctors, possible economic burden and health threaten.</p>","PeriodicalId":46312,"journal":{"name":"Health Information Science and Systems","volume":"11 1","pages":"43"},"PeriodicalIF":6.0,"publicationDate":"2023-09-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10511396/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41173209","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-09-02eCollection Date: 2023-12-01DOI: 10.1007/s13755-023-00243-w
Yang Wang, Zuxian Zhang, Chenghong Piao, Ying Huang, Yihan Zhang, Chi Zhang, Yu-Jing Lu, Dongning Liu
Background: Drug-target interaction (DTI) is a vital drug design strategy that plays a significant role in many processes of complex diseases and cellular events. In the face of challenges such as extensive protein data and experimental costs, it is suggested to apply bioinformatics approaches to exploit potential interactions to design new targeted medications. Different data and interaction types bring difficulties to study involving incompatible and heterology formats. The analysis of drug-target interactions in a comprehensive and unified model is a significant challenge.
Method: Here, we propose a general method for predicting interactions between small-molecule drugs and protein targets, Large-scale Drug target Screening Convolutional Neural Network (LDS-CNN), which used unified encoding to achieve the calculation of the different data formats in an integrated model to realize feature abstraction and potential object prediction.
Result: On 898,412 interaction data involving 1683 small-molecule compounds and 14,350 human proteins from 8.8 billion records, the proposed method achieved an area under the curve (AUC) of 0.96, an area under the precision-recall curve (AUPRC) of 0.95, and an accuracy of 90.13%. The experimental results illustrated that the proposed method attained high accuracy on the test set, indicating its high predictive ability in drug-target interaction prediction. LDS-CNN is effective for the prediction of large-scale datasets and datasets composed of data with different formats.
Conclusion: In this study, we propose a DTI prediction method to solve the problems of unified encoding of large-scale data in multiple formats. It provides a feasible way to efficiently abstract the features among different types of drug-related data, thus reducing experimental costs and time consumption. The proposed method can be used to identify potential drug targets and candidates for the treatment of complex diseases. This work provides a reference for DTI to process large-scale data and different formats with deep learning methods and provides certain suggestions for future research.
{"title":"LDS-CNN: a deep learning framework for drug-target interactions prediction based on large-scale drug screening.","authors":"Yang Wang, Zuxian Zhang, Chenghong Piao, Ying Huang, Yihan Zhang, Chi Zhang, Yu-Jing Lu, Dongning Liu","doi":"10.1007/s13755-023-00243-w","DOIUrl":"10.1007/s13755-023-00243-w","url":null,"abstract":"<p><strong>Background: </strong>Drug-target interaction (DTI) is a vital drug design strategy that plays a significant role in many processes of complex diseases and cellular events. In the face of challenges such as extensive protein data and experimental costs, it is suggested to apply bioinformatics approaches to exploit potential interactions to design new targeted medications. Different data and interaction types bring difficulties to study involving incompatible and heterology formats. The analysis of drug-target interactions in a comprehensive and unified model is a significant challenge.</p><p><strong>Method: </strong>Here, we propose a general method for predicting interactions between small-molecule drugs and protein targets, Large-scale Drug target Screening Convolutional Neural Network (LDS-CNN), which used unified encoding to achieve the calculation of the different data formats in an integrated model to realize feature abstraction and potential object prediction.</p><p><strong>Result: </strong>On 898,412 interaction data involving 1683 small-molecule compounds and 14,350 human proteins from 8.8 billion records, the proposed method achieved an area under the curve (AUC) of 0.96, an area under the precision-recall curve (AUPRC) of 0.95, and an accuracy of 90.13%. The experimental results illustrated that the proposed method attained high accuracy on the test set, indicating its high predictive ability in drug-target interaction prediction. LDS-CNN is effective for the prediction of large-scale datasets and datasets composed of data with different formats.</p><p><strong>Conclusion: </strong>In this study, we propose a DTI prediction method to solve the problems of unified encoding of large-scale data in multiple formats. It provides a feasible way to efficiently abstract the features among different types of drug-related data, thus reducing experimental costs and time consumption. The proposed method can be used to identify potential drug targets and candidates for the treatment of complex diseases. This work provides a reference for DTI to process large-scale data and different formats with deep learning methods and provides certain suggestions for future research.</p>","PeriodicalId":46312,"journal":{"name":"Health Information Science and Systems","volume":"11 1","pages":"42"},"PeriodicalIF":6.0,"publicationDate":"2023-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10475000/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10533336","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-08-30eCollection Date: 2023-12-01DOI: 10.1007/s13755-023-00241-y
Hakje Yoo, Jose Moon, Jong-Ho Kim, Hyung Joon Joo
Purpose: The purpose of this study is to construct a synthetic dataset of ECG signal that overcomes the sensitivity of personal information and the complexity of disclosure policies.
Methods: The public dataset was constructed by generating synthetic data based on the deep learning model using a convolution neural network (CNN) and bi-directional long short-term memory (Bi-LSTM), and the effectiveness of the dataset was verified by developing classification models for ECG diagnoses.
Results: The synthetic 12-lead ECG dataset generated consists of a total of 6000 ECGs, with normal and 5 abnormal groups. The synthetic ECG signal has a waveform pattern similar to the original ECG signal, the average RMSE between the two signals is 0.042 µV, and the average cosine similarity is 0.993. In addition, five classification models were developed to verify the effect of the synthetic dataset and showed performance similar to that of the model made with the actual dataset. In particular, even when the real dataset was applied as a test set to the classification model trained with the synthetic dataset, the classification performance of all models showed high accuracy (average accuracy 93.41%).
Conclusion: The synthetic 12-lead ECG dataset was confirmed to perform similarly to the real-world 12-lead ECG in the classification model. This implies that a synthetic dataset can perform similarly to a real dataset in clinical research using AI. The synthetic dataset generation process in this study provides a way to overcome the medical data disclosure challenges constrained by privacy rights, a way to encourage open data policies, and contribute significantly to promoting cardiovascular disease research.
{"title":"Design and technical validation to generate a synthetic 12-lead electrocardiogram dataset to promote artificial intelligence research.","authors":"Hakje Yoo, Jose Moon, Jong-Ho Kim, Hyung Joon Joo","doi":"10.1007/s13755-023-00241-y","DOIUrl":"10.1007/s13755-023-00241-y","url":null,"abstract":"<p><strong>Purpose: </strong>The purpose of this study is to construct a synthetic dataset of ECG signal that overcomes the sensitivity of personal information and the complexity of disclosure policies.</p><p><strong>Methods: </strong>The public dataset was constructed by generating synthetic data based on the deep learning model using a convolution neural network (CNN) and bi-directional long short-term memory (Bi-LSTM), and the effectiveness of the dataset was verified by developing classification models for ECG diagnoses.</p><p><strong>Results: </strong>The synthetic 12-lead ECG dataset generated consists of a total of 6000 ECGs, with normal and 5 abnormal groups. The synthetic ECG signal has a waveform pattern similar to the original ECG signal, the average RMSE between the two signals is 0.042 µV, and the average cosine similarity is 0.993. In addition, five classification models were developed to verify the effect of the synthetic dataset and showed performance similar to that of the model made with the actual dataset. In particular, even when the real dataset was applied as a test set to the classification model trained with the synthetic dataset, the classification performance of all models showed high accuracy (average accuracy 93.41%).</p><p><strong>Conclusion: </strong>The synthetic 12-lead ECG dataset was confirmed to perform similarly to the real-world 12-lead ECG in the classification model. This implies that a synthetic dataset can perform similarly to a real dataset in clinical research using AI. The synthetic dataset generation process in this study provides a way to overcome the medical data disclosure challenges constrained by privacy rights, a way to encourage open data policies, and contribute significantly to promoting cardiovascular disease research.</p>","PeriodicalId":46312,"journal":{"name":"Health Information Science and Systems","volume":"11 1","pages":"41"},"PeriodicalIF":6.0,"publicationDate":"2023-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10468461/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10149351","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}