Open Bioinformatics Journal最新文献

Decision-making Support System for Predicting and Eliminating Malnutrition and Anemia 预测和消除营养不良和贫血的决策支持系统

Q3 Computer Science

Open Bioinformatics Journal

Pub Date : 2023-10-27 DOI: 10.2174/0118750362246898230921054021

Manasvi Jagadeesh Maasthi, Harinahalli Lokesh Gururaj, Vinayakumar Ravi, Basavesha D, Meshari Almeshari, Yasser Alzamil

Aims: This study investigates predicting and eliminating malnutrition and anemia using ML Algorithms and comparing all the methods with various malnutrition-based parameters. Background: The health of the nation is more important than the wealth of the nation. Malnutrition and anemia are serious health issues but the least importance is given to eradicate them. Objective: Proper nutrition is an essential component for the survival, growth, and development of infants, children, and women who in turn give birth to infants. Methods: In the proposed system, machine learning approaches are utilized to predict the malnutrition status of children under five years of age and anemia in men and women using old datasets and further providing a suitable diet recommendation to overcome the disease. Classification techniques are being used for malnutrition as well as anemia prediction. Results: Algorithms such as Naïve Bayes classifier (NBC), Decision Tree (DT) algorithm, Random Forest (RF) algorithm, and K-Nearest Neighbor (k-NN) algorithm are utilized for prediction. The results obtained by these algorithms are 94.47%, 85%, 95.49%, and 63.15%. When compared, Naïve Bayes and random forest algorithm provided efficient results for malnutrition and anemia, respectively. Conclusion: By testing and validating, preventive actions can be taken with the help of medical experts and dieticians to reduce malnutrition and anemia conditions among children and elders, respectively.

目的:研究利用ML算法预测和消除营养不良和贫血，并将所有方法与各种基于营养不良的参数进行比较。背景:国家的健康比国家的财富更重要。营养不良和贫血是严重的健康问题，但对根除这些问题的重视程度最低。目的:适当的营养是婴儿、儿童和生育婴儿的妇女生存、生长和发育的重要组成部分。方法:在提出的系统中，利用机器学习方法利用旧数据集预测5岁以下儿童的营养不良状况和男性和女性的贫血，并进一步提供合适的饮食建议来克服疾病。分类技术被用于营养不良和贫血预测。结果:利用Naïve贝叶斯分类器(NBC)、决策树(DT)算法、随机森林(RF)算法和k-最近邻(k-NN)算法进行预测。这些算法得到的结果分别为94.47%、85%、95.49%和63.15%。相比之下，Naïve贝叶斯算法和随机森林算法分别对营养不良和贫血提供了有效的结果。结论:通过测试和验证，可以在医学专家和营养学家的帮助下采取预防措施，分别减少儿童和老年人的营养不良和贫血状况。

{"title":"Decision-making Support System for Predicting and Eliminating Malnutrition and Anemia","authors":"Manasvi Jagadeesh Maasthi, Harinahalli Lokesh Gururaj, Vinayakumar Ravi, Basavesha D, Meshari Almeshari, Yasser Alzamil","doi":"10.2174/0118750362246898230921054021","DOIUrl":"https://doi.org/10.2174/0118750362246898230921054021","url":null,"abstract":"Aims: This study investigates predicting and eliminating malnutrition and anemia using ML Algorithms and comparing all the methods with various malnutrition-based parameters. Background: The health of the nation is more important than the wealth of the nation. Malnutrition and anemia are serious health issues but the least importance is given to eradicate them. Objective: Proper nutrition is an essential component for the survival, growth, and development of infants, children, and women who in turn give birth to infants. Methods: In the proposed system, machine learning approaches are utilized to predict the malnutrition status of children under five years of age and anemia in men and women using old datasets and further providing a suitable diet recommendation to overcome the disease. Classification techniques are being used for malnutrition as well as anemia prediction. Results: Algorithms such as Naïve Bayes classifier (NBC), Decision Tree (DT) algorithm, Random Forest (RF) algorithm, and K-Nearest Neighbor (k-NN) algorithm are utilized for prediction. The results obtained by these algorithms are 94.47%, 85%, 95.49%, and 63.15%. When compared, Naïve Bayes and random forest algorithm provided efficient results for malnutrition and anemia, respectively. Conclusion: By testing and validating, preventive actions can be taken with the help of medical experts and dieticians to reduce malnutrition and anemia conditions among children and elders, respectively.","PeriodicalId":38956,"journal":{"name":"Open Bioinformatics Journal","volume":"52 5","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136318383","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Immunoinformatics Approach for the Design of Chimeric Vaccine Against Whitmore Disease 免疫信息学方法设计抗惠特莫尔病嵌合疫苗

Q3 Computer Science

Open Bioinformatics Journal

Pub Date : 2023-10-20 DOI: 10.2174/0118750362253383230922100803

Shalini Maurya, Salman Akhtar, Mohammad Kalim Ahmad Khan

Purpose: Multidrug-resistant Burkholderia pseudomallei is associated with significant morbidity and mortality. Hence, there is a requirement for a vaccine for this pathogen. Using subtractive proteomics and reverse vaccinology approaches, we have designed a chimeric multiepitope vaccine against the pathogen in the present study. Methods: Twenty-one non-redundant pathogen proteomes were mined using a subtractive proteomics strategy. Out of these, by various analyses, we found proteins that were non-homologous to humans, essential, and virulent. BLASTp against the PDB database and Pocket druggability analysis yielded nine proteins whose 3D structure is available and are druggable. Four proteins that could be candidates for vaccines were identified by subcellular localization and antigenicity prediction, and they could be used in reverse vaccinology methods to create a chimeric multiepitope vaccine. Results: Using online resources and servers, MHC class I, II, and B cell epitopes were identified. The predicted epitopes were selected based on analysis of toxicity, solubility, allergenicity, and hydrophilicity. These predicted epitopes, which were immunogenic, were used for the construction of a multivalent chimeric vaccine. The epitopes, adjuvants, linkers, and PADRE amino acid sequences were employed to create the vaccine. Shortlisted vaccine constructs also interact with the HLA allele and TLR4, as evident from docking and molecular dynamics simulation. Thus, vaccine construct V1 can elicit an immune response against Burkholderia pseudomallei . Conclusion: The availability of the proteome of B. pseudomallei has made this study possible through the usage of various in silico approaches. We could shortlist vaccine targets using subtractive proteomics and then construct chimeric vaccines using reverse vaccinology and immunoinformatics approaches.

目的:耐多药假马利氏伯克氏菌具有显著的发病率和死亡率。因此，需要一种针对这种病原体的疫苗。利用减法蛋白质组学和反向疫苗学方法，我们设计了一种针对病原体的嵌合多表位疫苗。方法:采用减法蛋白质组学方法提取21个非冗余病原体蛋白质组。在这些蛋白质中，通过各种分析，我们发现了与人类非同源的、必需的、有毒的蛋白质。针对PDB数据库的BLASTp和Pocket药物分析获得了9个具有3D结构且可药物的蛋白质。通过亚细胞定位和抗原性预测鉴定出4种可能作为疫苗候选蛋白，它们可用于反向疫苗学方法来制造嵌合多表位疫苗。结果:利用在线资源和服务器，确定了MHC I类、II类和B细胞表位。预测的表位是根据毒性、溶解度、过敏原性和亲水性分析选择的。这些预测的表位具有免疫原性，用于构建多价嵌合疫苗。抗原表位、佐剂、连接体和PADRE氨基酸序列被用来制造疫苗。从对接和分子动力学模拟中可以看出，入围的疫苗构建体也与HLA等位基因和TLR4相互作用。因此，疫苗构建体V1可以引发针对假马利氏伯克氏菌的免疫应答。结论:假假芽孢杆菌蛋白质组的可用性使本研究通过各种计算机方法成为可能。我们可以利用减法蛋白质组学筛选疫苗靶点，然后利用反向疫苗学和免疫信息学方法构建嵌合疫苗。

{"title":"Immunoinformatics Approach for the Design of Chimeric Vaccine Against Whitmore Disease","authors":"Shalini Maurya, Salman Akhtar, Mohammad Kalim Ahmad Khan","doi":"10.2174/0118750362253383230922100803","DOIUrl":"https://doi.org/10.2174/0118750362253383230922100803","url":null,"abstract":"Purpose: Multidrug-resistant Burkholderia pseudomallei is associated with significant morbidity and mortality. Hence, there is a requirement for a vaccine for this pathogen. Using subtractive proteomics and reverse vaccinology approaches, we have designed a chimeric multiepitope vaccine against the pathogen in the present study. Methods: Twenty-one non-redundant pathogen proteomes were mined using a subtractive proteomics strategy. Out of these, by various analyses, we found proteins that were non-homologous to humans, essential, and virulent. BLASTp against the PDB database and Pocket druggability analysis yielded nine proteins whose 3D structure is available and are druggable. Four proteins that could be candidates for vaccines were identified by subcellular localization and antigenicity prediction, and they could be used in reverse vaccinology methods to create a chimeric multiepitope vaccine. Results: Using online resources and servers, MHC class I, II, and B cell epitopes were identified. The predicted epitopes were selected based on analysis of toxicity, solubility, allergenicity, and hydrophilicity. These predicted epitopes, which were immunogenic, were used for the construction of a multivalent chimeric vaccine. The epitopes, adjuvants, linkers, and PADRE amino acid sequences were employed to create the vaccine. Shortlisted vaccine constructs also interact with the HLA allele and TLR4, as evident from docking and molecular dynamics simulation. Thus, vaccine construct V1 can elicit an immune response against Burkholderia pseudomallei . Conclusion: The availability of the proteome of B. pseudomallei has made this study possible through the usage of various in silico approaches. We could shortlist vaccine targets using subtractive proteomics and then construct chimeric vaccines using reverse vaccinology and immunoinformatics approaches.","PeriodicalId":38956,"journal":{"name":"Open Bioinformatics Journal","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135666306","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A New Deep Learning Model based on Neuroimaging for Predicting Alzheimer's Disease 一种新的基于神经影像学的深度学习模型预测阿尔茨海默病

Q3 Computer Science

Open Bioinformatics Journal

Pub Date : 2023-10-16 DOI: 10.2174/0118750362260635230922051326

Kiran P., Sudheesh K. V., Vinayakumar Ravi, Meshari Almeshari, Yasser Alzamil, Sunil Kumar D. S., Harshitha R.

Background: The psychological aspects of the brain in Alzheimer's disease (AD) are significantly affected. These alterations in brain anatomy take place due to a variety of reasons, including the shrinking of grey and white matter in the brain. Magnetic resonance imaging (MRI) scans can be used to measure it, and these scans offer a chance for early identification of AD utilizing classification methods, like convolutional neural network (CNN). The majority of AD-related tests are now constrained by the test measures. It is, thus, crucial to find an affordable method for image categorization using minimal information. Because of developments in machine learning and medical imaging, the field of computerized health care has evolved rapidly. Recent developments in deep learning, in particular, herald a new era of clinical decision-making that is heavily reliant on multimedia systems. Methods: In the proposed work, we have investigated various CNN-based transfer-learning strategies for predicting AD using MRI scans of the brain's structural organization. According to an analysis of the data, the suggested model makes use of a number of sites related to Alzheimer's disease. In order to interpret structural brain pictures in both 2D and 3D, the Alzheimer's Disease Neuroimaging Initiative (ADNI) dataset includes straightforward CNN designs based on 2D and 3D convolutions. Results: According to these results, deep neural networks may be able to automatically learn which imaging biomarkers are indicative of Alzheimer's disease and exploit them for precise early disease detection. The proposed techniques have been found to achieve an accuracy of 93.24%. Conclusion: This research aimed to classify Alzheimer's disease (AD) using transfer learning. We have used strict pre-processing steps on raw MRI data from the ADNI dataset and used the AlexNet, i.e ., Alzheimer's disease has been categorized using pre-processed data and the CNN classifier.

背景:阿尔茨海默病(AD)患者的大脑心理方面受到显著影响。大脑解剖结构的这些变化是由多种原因引起的，包括大脑中灰质和白质的萎缩。磁共振成像(MRI)扫描可以用来测量它，这些扫描为利用分类方法(如卷积神经网络(CNN))早期识别AD提供了机会。大多数与ad相关的测试现在受到测试措施的限制。因此，找到一种使用最小信息的可负担的图像分类方法至关重要。由于机器学习和医学成像的发展，计算机化医疗保健领域发展迅速。特别是深度学习的最新发展，预示着一个严重依赖多媒体系统的临床决策的新时代。方法:在提出的工作中，我们研究了各种基于cnn的迁移学习策略，利用大脑结构组织的MRI扫描来预测AD。根据对数据的分析，建议的模型利用了与阿尔茨海默病相关的一些位点。为了解释二维和三维的大脑结构图像，阿尔茨海默病神经成像倡议(ADNI)数据集包括基于二维和三维卷积的简单CNN设计。结果:根据这些结果，深度神经网络可能能够自动学习哪些成像生物标志物是阿尔茨海默病的指示，并利用它们进行精确的早期疾病检测。所提出的方法的准确率达到了93.24%。结论:本研究旨在利用迁移学习对阿尔茨海默病(AD)进行分类。我们对来自ADNI数据集的原始MRI数据进行了严格的预处理步骤，并使用了AlexNet，即使用预处理数据和CNN分类器对阿尔茨海默病进行了分类。

{"title":"A New Deep Learning Model based on Neuroimaging for Predicting Alzheimer's Disease","authors":"Kiran P., Sudheesh K. V., Vinayakumar Ravi, Meshari Almeshari, Yasser Alzamil, Sunil Kumar D. S., Harshitha R.","doi":"10.2174/0118750362260635230922051326","DOIUrl":"https://doi.org/10.2174/0118750362260635230922051326","url":null,"abstract":"Background: The psychological aspects of the brain in Alzheimer's disease (AD) are significantly affected. These alterations in brain anatomy take place due to a variety of reasons, including the shrinking of grey and white matter in the brain. Magnetic resonance imaging (MRI) scans can be used to measure it, and these scans offer a chance for early identification of AD utilizing classification methods, like convolutional neural network (CNN). The majority of AD-related tests are now constrained by the test measures. It is, thus, crucial to find an affordable method for image categorization using minimal information. Because of developments in machine learning and medical imaging, the field of computerized health care has evolved rapidly. Recent developments in deep learning, in particular, herald a new era of clinical decision-making that is heavily reliant on multimedia systems. Methods: In the proposed work, we have investigated various CNN-based transfer-learning strategies for predicting AD using MRI scans of the brain's structural organization. According to an analysis of the data, the suggested model makes use of a number of sites related to Alzheimer's disease. In order to interpret structural brain pictures in both 2D and 3D, the Alzheimer's Disease Neuroimaging Initiative (ADNI) dataset includes straightforward CNN designs based on 2D and 3D convolutions. Results: According to these results, deep neural networks may be able to automatically learn which imaging biomarkers are indicative of Alzheimer's disease and exploit them for precise early disease detection. The proposed techniques have been found to achieve an accuracy of 93.24%. Conclusion: This research aimed to classify Alzheimer's disease (AD) using transfer learning. We have used strict pre-processing steps on raw MRI data from the ADNI dataset and used the AlexNet, i.e ., Alzheimer's disease has been categorized using pre-processed data and the CNN classifier.","PeriodicalId":38956,"journal":{"name":"Open Bioinformatics Journal","volume":"52 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136183087","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

Early Prediction of Covid-19 Samples from Chest X-ray Images using Deep Learning Approach 基于深度学习方法的胸部x线图像Covid-19样本早期预测

Q3 Computer Science

Open Bioinformatics Journal

Pub Date : 2023-10-13 DOI: 10.2174/18750362-v16-231005-2023-5

K V Sudheesh, None Kiran, Harinahalli Lokesh Gururaj, Vinayakumar Ravi, Meshari Almeshari, Yasser Alzamil

Aims: In this study, chest X-ray (CXR) and computed tomography (CT) images are used to analyse and detect COVID-19 using an unsupervised deep learning-based feature fusion approach. Background: The reverse transcription-polymerase chain reaction (RT-PCR) test, which has a reduced viral load, sampling error, etc., is used to detect COVID-19, which has sickened millions of people worldwide. It is possible to check chest X-rays and computed tomography scans because the majority of infected persons have lung infections. The COVID-19 diagnosis can be made early using both CT and CXR imaging modalities, which is an alternative to the RT-PCR test. Objective: The manual diagnosis of CXR pictures and CT scans is labor and time-intensive. Many AI-based solutions are being investigated to tackle this problem, including deep learning-based detection models, which can be utilized to assist the radiologist in making a more accurate diagnosis. However, because of the demand for specialized knowledge and high annotation costs, the amount of annotated data available for COVID-19 identification is constrained. Additionally, the majority of current cutting-edge deep learning-based detection models use supervised learning techniques. Because a tagged dataset is not required, we have investigated various unsupervised learning models for COVID-19 identification in this work. Methods: In this study, we suggest a COVID-19 detection method based on unsupervised deep learning that makes use of the feature fusion technique to improve performance. Based on this an automated CNN model is built for the detection of COVID-19 samples from healthy and pneumonic cases using chest X-ray images. Results: This model has scored an accuracy of about 99% for the classification between covid positive and covid negative. Based on this result further classification will be done for pneumonic and non-pneumonic which has scored an accuracy of 94%. Conclusion: On both datasets, the COVID-19 detection method based on feature fusion and deep unsupervised learning showed promising results. Additionally, it outperforms four well-known unsupervised methods already in use.

目的:在本研究中，使用基于无监督深度学习的特征融合方法，使用胸部x射线(CXR)和计算机断层扫描(CT)图像来分析和检测COVID-19。背景:逆转录聚合酶链反应(RT-PCR)检测具有病毒载量低、采样误差小等优点，被用于检测全球数百万人患病的COVID-19。可以检查胸部x光片和计算机断层扫描，因为大多数感染者都有肺部感染。使用CT和CXR成像方式可以早期诊断COVID-19，这是RT-PCR检测的替代方法。目的:人工诊断CXR图像和CT扫描费时费力。许多基于人工智能的解决方案正在被研究来解决这个问题，包括基于深度学习的检测模型，它可以用来帮助放射科医生做出更准确的诊断。然而，由于对专业知识的需求和高昂的标注成本，可用于COVID-19识别的标注数据数量受到限制。此外，目前大多数基于深度学习的前沿检测模型都使用监督学习技术。由于不需要标记数据集，我们在这项工作中研究了用于COVID-19识别的各种无监督学习模型。方法:在本研究中，我们提出了一种基于无监督深度学习的COVID-19检测方法，利用特征融合技术来提高性能。在此基础上，建立了利用胸部x线图像检测健康病例和肺炎病例COVID-19样本的自动化CNN模型。结果:该模型对covid阳性和covid阴性的分类准确率约为99%。基于此结果，将对肺炎和非肺炎进行进一步分类，准确率达到94%。结论:基于特征融合和深度无监督学习的COVID-19检测方法在两个数据集上都取得了很好的效果。此外，它比已经使用的四种众所周知的无监督方法要好。

{"title":"Early Prediction of Covid-19 Samples from Chest X-ray Images using Deep Learning Approach","authors":"K V Sudheesh, None Kiran, Harinahalli Lokesh Gururaj, Vinayakumar Ravi, Meshari Almeshari, Yasser Alzamil","doi":"10.2174/18750362-v16-231005-2023-5","DOIUrl":"https://doi.org/10.2174/18750362-v16-231005-2023-5","url":null,"abstract":"Aims: In this study, chest X-ray (CXR) and computed tomography (CT) images are used to analyse and detect COVID-19 using an unsupervised deep learning-based feature fusion approach. Background: The reverse transcription-polymerase chain reaction (RT-PCR) test, which has a reduced viral load, sampling error, etc., is used to detect COVID-19, which has sickened millions of people worldwide. It is possible to check chest X-rays and computed tomography scans because the majority of infected persons have lung infections. The COVID-19 diagnosis can be made early using both CT and CXR imaging modalities, which is an alternative to the RT-PCR test. Objective: The manual diagnosis of CXR pictures and CT scans is labor and time-intensive. Many AI-based solutions are being investigated to tackle this problem, including deep learning-based detection models, which can be utilized to assist the radiologist in making a more accurate diagnosis. However, because of the demand for specialized knowledge and high annotation costs, the amount of annotated data available for COVID-19 identification is constrained. Additionally, the majority of current cutting-edge deep learning-based detection models use supervised learning techniques. Because a tagged dataset is not required, we have investigated various unsupervised learning models for COVID-19 identification in this work. Methods: In this study, we suggest a COVID-19 detection method based on unsupervised deep learning that makes use of the feature fusion technique to improve performance. Based on this an automated CNN model is built for the detection of COVID-19 samples from healthy and pneumonic cases using chest X-ray images. Results: This model has scored an accuracy of about 99% for the classification between covid positive and covid negative. Based on this result further classification will be done for pneumonic and non-pneumonic which has scored an accuracy of 94%. Conclusion: On both datasets, the COVID-19 detection method based on feature fusion and deep unsupervised learning showed promising results. Additionally, it outperforms four well-known unsupervised methods already in use.","PeriodicalId":38956,"journal":{"name":"Open Bioinformatics Journal","volume":"36 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135922821","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Electronic Health Record (EHR) System Development for Study on EHR Data-based Early Prediction of Diabetes Using Machine Learning Algorithms 基于EHR数据的糖尿病早期预测机器学习算法的电子健康记录(EHR)系统开发

Q3 Computer Science

Open Bioinformatics Journal

Pub Date : 2023-10-05 DOI: 10.2174/18750362-v16-e230906-2023-15

Jagadamba G, Shashidhar R, Gururaj H L, Vinayakumar Ravi, Meshari Almeshari, Yasser Alzamil

Aims: This research work aims to develop an interoperable electronic health record (EHR) system to aid the early detection of diabetes by the use of Machine Learning (ML) algorithms. A decision support system developed using many ML algorithms results in optimizing the decision in preventive care in the health information system. Methods: The proposed system consisted of two models. The first model included interoperable EHR system development using a precise database structure. The second module comprised of data extraction from the EHR system, data cleaning, and data processing and prediction. For testing and training, about 1080 patients’ health record was considered. Among 1080, 1000 records were from the Kaggle dataset, and 80 records were demographic information from patients who visited our health center of Siddaganga organization for a regular checkup or during emergencies. The demographic information was collected from the proposed EHR system. Results: The proposed system was tested for the interoperability nature of the EHR system and accuracy in diabetic disease prediction using the proposed decision support system. The proposed EHR system development was tested for interoperability by random updations from various systems maintained in the laboratory. Each system acted like the admin system of different hospitals. The EHR system was tested for handling the load and interoperability by considering user view status, system matching with the real world, consistency in data updations, security etc . However, in the prediction phase, diabetes prediction was concentrated. The features considered were not randomly chosen; however, the features were those prescribed by a doctor who insisted that the features were sufficient for initial prediction. The reports collected from the doctors revealed several features they considered before giving the test details. The proposed system dataset was split into test and train datasets with eight proper features taken as input and one set as a target variable where the result was present. After this, the model was imported using standard “sklearn” libraries, and it fit with the required number of estimators, that is, the number of decision trees. The features included pregnancies, glucose level, blood pressure, skin thickness, insulin level, bone marrow index, diabetic pedigree function, age, weight, etc . At the outset, the research work concentrated on developing an interoperable EHR system, identifying the expectation of diabetic and non-diabetic conditions and demonstrating the accuracy of the system. Conclusion: In this study, the first aim was to design an interoperable EHR system that could help in accumulating, storing, and sharing patients' timely health records over a lifetime. The second aim was to use EHR data for early prediction of diabetes in the user. To confirm the accuracy of the system, the system was tested regarding interoperability to support early prediction through a decision supp

目的:本研究工作旨在开发一个可互操作的电子健康记录(EHR)系统，通过使用机器学习(ML)算法来帮助早期发现糖尿病。一个决策支持系统开发使用许多机器学习算法的结果在卫生信息系统中的预防保健决策优化。方法:该系统由两个模型组成。第一个模型包括使用精确的数据库结构开发可互操作的EHR系统。第二个模块包括从EHR系统中提取数据、清理数据、处理和预测数据。为了测试和培训，大约1080名患者的健康记录被考虑在内。在1080条记录中，有1000条记录来自Kaggle数据集，80条记录是访问Siddaganga组织健康中心进行定期检查或紧急情况的患者的人口统计信息。从拟议的电子病历系统中收集人口统计信息。结果:所提出的系统被用于测试EHR系统的互操作性和使用所提出的决策支持系统预测糖尿病疾病的准确性。通过实验室维护的各种系统的随机更新，对拟议的EHR系统开发进行了互操作性测试。每个系统就像不同医院的管理系统。从用户视图状态、系统与现实世界的匹配、数据更新的一致性、安全性等方面对EHR系统的负载处理和互操作性进行了测试。但在预测阶段，糖尿病预测较为集中。所考虑的特征不是随机选择的;然而，这些特征是由医生规定的，医生坚持认为这些特征足以进行初步预测。从医生那里收集的报告揭示了他们在给出测试细节之前考虑的几个特征。提出的系统数据集被分为测试和训练数据集，其中八个适当的特征作为输入，一个集作为目标变量，其中结果存在。在此之后，使用标准的“sklearn”库导入模型，它与所需的估计器数量(即决策树的数量)相匹配。这些特征包括怀孕、血糖水平、血压、皮肤厚度、胰岛素水平、骨髓指数、糖尿病谱系功能、年龄、体重等。一开始，研究工作集中于开发一个可互操作的电子病历系统，确定糖尿病和非糖尿病疾病的预期，并证明该系统的准确性。结论:在本研究中，第一个目标是设计一个可互操作的电子病历系统，以帮助积累、存储和共享患者一生中及时的健康记录。第二个目的是利用电子病历数据对用户的糖尿病进行早期预测。为了确认系统的准确性，对系统进行了互操作性测试，以通过决策支持系统支持早期预测。

{"title":"Electronic Health Record (EHR) System Development for Study on EHR Data-based Early Prediction of Diabetes Using Machine Learning Algorithms","authors":"Jagadamba G, Shashidhar R, Gururaj H L, Vinayakumar Ravi, Meshari Almeshari, Yasser Alzamil","doi":"10.2174/18750362-v16-e230906-2023-15","DOIUrl":"https://doi.org/10.2174/18750362-v16-e230906-2023-15","url":null,"abstract":"Aims: This research work aims to develop an interoperable electronic health record (EHR) system to aid the early detection of diabetes by the use of Machine Learning (ML) algorithms. A decision support system developed using many ML algorithms results in optimizing the decision in preventive care in the health information system. Methods: The proposed system consisted of two models. The first model included interoperable EHR system development using a precise database structure. The second module comprised of data extraction from the EHR system, data cleaning, and data processing and prediction. For testing and training, about 1080 patients’ health record was considered. Among 1080, 1000 records were from the Kaggle dataset, and 80 records were demographic information from patients who visited our health center of Siddaganga organization for a regular checkup or during emergencies. The demographic information was collected from the proposed EHR system. Results: The proposed system was tested for the interoperability nature of the EHR system and accuracy in diabetic disease prediction using the proposed decision support system. The proposed EHR system development was tested for interoperability by random updations from various systems maintained in the laboratory. Each system acted like the admin system of different hospitals. The EHR system was tested for handling the load and interoperability by considering user view status, system matching with the real world, consistency in data updations, security etc . However, in the prediction phase, diabetes prediction was concentrated. The features considered were not randomly chosen; however, the features were those prescribed by a doctor who insisted that the features were sufficient for initial prediction. The reports collected from the doctors revealed several features they considered before giving the test details. The proposed system dataset was split into test and train datasets with eight proper features taken as input and one set as a target variable where the result was present. After this, the model was imported using standard “sklearn” libraries, and it fit with the required number of estimators, that is, the number of decision trees. The features included pregnancies, glucose level, blood pressure, skin thickness, insulin level, bone marrow index, diabetic pedigree function, age, weight, etc . At the outset, the research work concentrated on developing an interoperable EHR system, identifying the expectation of diabetic and non-diabetic conditions and demonstrating the accuracy of the system. Conclusion: In this study, the first aim was to design an interoperable EHR system that could help in accumulating, storing, and sharing patients' timely health records over a lifetime. The second aim was to use EHR data for early prediction of diabetes in the user. To confirm the accuracy of the system, the system was tested regarding interoperability to support early prediction through a decision supp","PeriodicalId":38956,"journal":{"name":"Open Bioinformatics Journal","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135547188","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Comparative Functional Genomics Studies for Understanding the Hypothetical Proteins in Mycobacterium Tuberculosis Variant Microti 12 比较功能基因组学研究以了解结核分枝杆菌变异株Microti 12中的假设蛋白质

Q3 Computer Science

Open Bioinformatics Journal

Pub Date : 2023-07-26 DOI: 10.2174/18750362-v16-e230711-2023-2

Tejaswini Vijay Shinde, Tejas Gajanan Shinde, V. V. Chougule, Anagha Rajendra Ghorpade, Geeta Vikas Utekar, Amol S Jadhav, Bandu Shamlal Pawar, S. Sanmukh

The Mycobacterium tuberculosis complex (MTBC) bacteria include the slowly growing, host-associated bacteria Mycobacterium tuberculosis, Mycobacterium Bovis, Mycobacterium microti, Mycobacterium africanum, Mycobacterium pinnipedii. Comparative Functional Genomics Studies for understanding the Hypothetical Proteins in Mycobacterium tuberculosis variant microti 12. A computational genomics study was performed to understand the 247 hypothetical protein genes. Functional annotation of virtual proteins was performed on different servers to maximize confidence level. Sequence Retrieval. The whole genome sequences for the Mycobacterium tuberculosis micro variant 12 were retrieved from the KEGG database ( http://www.genome.jp/kegg/) and were used for screening 247 hypothetical proteins (Fig. 1 ). Functional Annotation and Sub-cellular localization. The Mycobacterium tuberculosis micro variant 12 hypothetical proteins were screened and sorted out from the genome and were individually analyzed for the presence of conserved functional domains by using computational biology tools like CDD-BLAST ( https://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi) ;Pfam ( http://pfam.xfam.org/ncbiseq/398365647); The subcellular localization of hypothetical proteins was determined by CELLO2GO ( http://cello.life.nctu.edu.tw). These web tools can search the defined conserved domains in the sequences available in the online servers or databases and assist in the classification of proteins in the appropriate families. Protein Structure Prediction. The in-silico structure predictions of the hypothetical protein sequences showing functional properties were carried out by using the PS2 Protein Structure Prediction Server ( http://www.ps2.life.nctu.edu.tw/). The online server helps to generate the 3D structures of the hypothetical proteins. The server accepts the sequences in FASTA format as a query to generate resultant proteins 3D structures. The structure determination is completely based on the conserved template regions detected during functional annotations. Protein-protein interaction through String database: The interaction of each hypothetical protein analyzed for functional characteristics was subjected to a protein-protein interaction server for the prediction of a possible functional role in interaction amongst the available known proteins ( https://string-db.org/). This information can help us to further validated the functional role of such hypothetical proteins and their possible role in the Mycobacterium Tuberculosis micro variant. Protein secondary structure prediction through JPred4: The secondary structure prediction of all the hypothetical proteins was determined through JPred4 ( http://www.compbio.dundee.ac.uk/jpred4/index.html) and served to identify the available secondary structures in the unknown hypothetical protein sequences. These further help us to understand the available templates in the uncharacterized protein seq

结核分枝杆菌复合体（MTBC）细菌包括缓慢生长的宿主相关细菌结核分枝杆菌、牛分枝杆菌、微小分枝杆菌、非洲分枝杆菌、鳍足分枝杆菌。比较功能基因组学研究以了解结核分枝杆菌变异株microti中的假设蛋白质12。进行了一项计算基因组学研究，以了解247个假设的蛋白质基因。在不同的服务器上进行虚拟蛋白质的功能注释，以最大限度地提高置信水平。序列检索。结核分枝杆菌微变体12的全基因组序列从KEGG数据库中检索（http://www.genome.jp/kegg/)并用于筛选247种假设蛋白质（图1）。功能注释和亚细胞定位。从基因组中筛选和分选结核分枝杆菌微变体12种假设蛋白质，并通过使用计算生物学工具如CDD-BLAST（https://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi)；Pfam（http://pfam.xfam.org/ncbiseq/398365647)；假设蛋白质的亚细胞定位通过CELLO2GO（http://cello.life.nctu.edu.tw)。这些网络工具可以搜索在线服务器或数据库中可用序列中定义的保守结构域，并帮助将蛋白质分类到适当的家族中。蛋白质结构预测。通过使用PS2蛋白质结构预测服务器（http://www.ps2.life.nctu.edu.tw/)。在线服务器帮助生成假设蛋白质的3D结构。服务器接受FASTA格式的序列作为查询，以生成生成的蛋白质3D结构。结构的确定完全基于在功能注释期间检测到的保守模板区域。通过字符串数据库进行的蛋白质-蛋白质相互作用：对功能特征分析的每个假设蛋白质的相互作用进行蛋白质-蛋白质交互服务器，以预测可用已知蛋白质之间的相互作用中可能的功能作用（https://string-db.org/)。这些信息可以帮助我们进一步验证这些假设蛋白质的功能作用及其在结核分枝杆菌微变体中的可能作用。通过JPred4预测蛋白质二级结构：所有假设蛋白质的二级结构预测通过JPred4http://www.compbio.dundee.ac.uk/jpred4/index.html)并用于鉴定未知假设蛋白质序列中的可用二级结构。这些进一步帮助我们了解未表征蛋白质序列中可用于预测与这些蛋白质相关的新功能的模板。通过Phyre2服务器对预测进行进一步表征，用于基于保守结构域的比较分析的模板的结构建模和预测。通过Phyre2。通过Phyre2服务器（http://www.sbg.bio.ic.ac.uk/phyre2)用于基于保守结构域的比较分析的模板结构建模和预测。进行了一项计算基因组学研究，以了解247个假设的蛋白质基因虚拟蛋白质的功能注释，并在不同的服务器上进行，以最大限度地提高置信水平。通过CDD-Blast和Pfam进行功能预测。蛋白质的基因序列可能已经成功地进行了功能注释和表征，并且已经通过计算预测了它们的亚细胞定位和三维结构预测。在线自动化生物信息学工具，如CDD-Blast、Pfam、CELLO2GO和PS2-Server，用于筛选的假设蛋白质的结构和功能表征。已经获得并展示了来自结核分枝杆菌变体microti 12的假设蛋白质的结构、功能和亚细胞定位（图2）。此外，在使用具有最高得分的模板之后生成的三维结构被显示为各个假设蛋白质的结构栏中的模板ID。然而，由于系统生物学否认了假设的蛋白质功能，因此可以通过生物过程和实验来测试这些蛋白质的结构，使其适合理解其在生命周期、发病机制和药物开发中的作用。我们可以在药物和其他临床相关研究中进一步探索这些预测可能性。

{"title":"Comparative Functional Genomics Studies for Understanding the Hypothetical Proteins in\u0000 Mycobacterium Tuberculosis Variant Microti 12","authors":"Tejaswini Vijay Shinde, Tejas Gajanan Shinde, V. V. Chougule, Anagha Rajendra Ghorpade, Geeta Vikas Utekar, Amol S Jadhav, Bandu Shamlal Pawar, S. Sanmukh","doi":"10.2174/18750362-v16-e230711-2023-2","DOIUrl":"https://doi.org/10.2174/18750362-v16-e230711-2023-2","url":null,"abstract":"\u0000 \u0000 The Mycobacterium tuberculosis complex (MTBC) bacteria include the slowly growing, host-associated bacteria Mycobacterium tuberculosis, Mycobacterium Bovis, Mycobacterium microti, Mycobacterium africanum, Mycobacterium pinnipedii.\u0000 \u0000 \u0000 \u0000 Comparative Functional Genomics Studies for understanding the Hypothetical Proteins in Mycobacterium tuberculosis variant microti 12.\u0000 \u0000 \u0000 \u0000 A computational genomics study was performed to understand the 247 hypothetical protein genes. Functional annotation of virtual proteins was performed on different servers to maximize confidence level.\u0000 \u0000 \u0000 \u0000 Sequence Retrieval. The whole genome sequences for the Mycobacterium tuberculosis micro variant 12 were retrieved from the KEGG database (\u0000 http://www.genome.jp/kegg/) and were used for screening 247 hypothetical proteins (Fig.\u0000 \u0000 1\u0000 ). Functional Annotation and Sub-cellular localization. The Mycobacterium tuberculosis micro variant 12 hypothetical proteins were screened and sorted out from the genome and were individually analyzed for the presence of conserved functional domains by using computational biology tools like CDD-BLAST (\u0000 https://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi) ;Pfam (\u0000 http://pfam.xfam.org/ncbiseq/398365647); The subcellular localization of hypothetical proteins was determined by CELLO2GO (\u0000 http://cello.life.nctu.edu.tw). These web tools can search the defined conserved domains in the sequences available in the online servers or databases and assist in the classification of proteins in the appropriate families. Protein Structure Prediction. The\u0000 in-silico structure predictions of the hypothetical protein sequences showing functional properties were carried out by using the PS2 Protein Structure Prediction Server (\u0000 http://www.ps2.life.nctu.edu.tw/). The online server helps to generate the 3D structures of the hypothetical proteins. The server accepts the sequences in FASTA format as a query to generate resultant proteins 3D structures. The structure determination is completely based on the conserved template regions detected during functional annotations. Protein-protein interaction through String database: The interaction of each hypothetical protein analyzed for functional characteristics was subjected to a protein-protein interaction server for the prediction of a possible functional role in interaction amongst the available known proteins (\u0000 https://string-db.org/). This information can help us to further validated the functional role of such hypothetical proteins and their possible role in the Mycobacterium Tuberculosis micro variant. Protein secondary structure prediction through JPred4: The secondary structure prediction of all the hypothetical proteins was determined through JPred4 (\u0000 http://www.compbio.dundee.ac.uk/jpred4/index.html) and served to identify the available secondary structures in the unknown hypothetical protein sequences. These further help us to understand the available templates in the uncharacterized protein seq","PeriodicalId":38956,"journal":{"name":"Open Bioinformatics Journal","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47867946","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

Integrated Bioinformatics Approach for Disclosing Autophagy Pathway as a Therapeutic Target in Advanced KRAS Mutated/Positive Lung Adenocarcinoma 综合生物信息学方法揭示自噬途径作为晚期KRAS突变/阳性肺腺癌治疗靶点

Q3 Computer Science

Open Bioinformatics Journal

Pub Date : 2023-05-24 DOI: 10.2174/18750362-v16-2305230-2022-18

Yasmeen I Dodin

Lung cancer is the leading cause of cancer-related deaths, accounting for 1.8 million deaths (18%). Nearly 80%-85% of lung cancer cases are non-small cell lung cancers (NSCLC). One of the most frequent genetic mutations in NSCLC is the Kirsten Rat Sarcoma Oncogene Homolog (KRAS) gene mutation. In recent years, autophagy has drawn substantial attention as a potential pathway that can be targeted in cancer driven by KRAS gene mutation to efficiently improve the therapeutic profile of different treatments. In this study, we have investigated the potential of targeting the autophagy pathway as a treatment approach in advanced KRAS-mutated lung adenocarcinoma using gene expression data from The Cancer Genome Atlas Lung Adenocarcinoma (TCGA-LUAD) project. Compared to KRAS wild-type lung adenocarcinoma, there were found 11 differentially expressed autophagy-related genes (DEARGs), with 5 upregulated and 6 downregulated DEARGs (threshold of adjusted p-value <0.05). These DEARGs can be investigated as potential genes that can be targeted by different autophagy inhibitors.

癌症是癌症相关死亡的主要原因，占180万人死亡（18%）。近80%-85%的癌症病例是非小细胞肺癌（NSCLC）。NSCLC中最常见的基因突变之一是Kirsten大鼠肉瘤癌基因同源性（KRAS）基因突变。近年来，自噬作为一种潜在的途径，在KRAS基因突变的驱动下，可以靶向癌症，以有效改善不同治疗方法的治疗效果，引起了人们的广泛关注。在这项研究中，我们利用癌症基因组图谱肺腺癌（TCGA-LUAD）项目的基因表达数据，研究了靶向自噬途径作为晚期KRAS-突变肺腺癌治疗方法的潜力。与KRAS野生型肺腺癌相比，共发现11个差异表达的自噬相关基因（DEARGs），其中5个上调，6个下调（调整p值阈值<0.05）。这些DEARG可作为不同自噬抑制剂靶向的潜在基因进行研究。

{"title":"Integrated Bioinformatics Approach for Disclosing Autophagy Pathway as a Therapeutic Target in Advanced KRAS Mutated/Positive Lung Adenocarcinoma","authors":"Yasmeen I Dodin","doi":"10.2174/18750362-v16-2305230-2022-18","DOIUrl":"https://doi.org/10.2174/18750362-v16-2305230-2022-18","url":null,"abstract":"\u0000 \u0000 Lung cancer is the leading cause of cancer-related deaths, accounting for 1.8 million deaths (18%). Nearly 80%-85% of lung cancer cases are non-small cell lung cancers (NSCLC). One of the most frequent genetic mutations in NSCLC is the Kirsten Rat Sarcoma Oncogene Homolog (KRAS) gene mutation. In recent years, autophagy has drawn substantial attention as a potential pathway that can be targeted in cancer driven by KRAS gene mutation to efficiently improve the therapeutic profile of different treatments.\u0000 \u0000 \u0000 \u0000 In this study, we have investigated the potential of targeting the autophagy pathway as a treatment approach in advanced KRAS-mutated lung adenocarcinoma using gene expression data from The Cancer Genome Atlas Lung Adenocarcinoma (TCGA-LUAD) project.\u0000 \u0000 \u0000 \u0000 Compared to KRAS wild-type lung adenocarcinoma, there were found 11 differentially expressed autophagy-related genes (DEARGs), with 5 upregulated and 6 downregulated DEARGs (threshold of adjusted p-value <0.05).\u0000 \u0000 \u0000 \u0000 These DEARGs can be investigated as potential genes that can be targeted by different autophagy inhibitors.\u0000","PeriodicalId":38956,"journal":{"name":"Open Bioinformatics Journal","volume":"1 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-05-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46535451","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2

Computational Analysis, In silico Functional Annotation, and Expression of Recombinant PE_PGRS Protein Biomarkers Found in Mycobacterium tuberculosis 结核分枝杆菌中发现的重组PE_PGRS蛋白生物标志物的计算分析、计算机功能注释和表达

Q3 Computer Science

Open Bioinformatics Journal

Pub Date : 2023-04-05 DOI: 10.2174/18750362-v16-e230306-2022-6

Avanthi Moodley-Reddy, Thamsanqa E. Chiliza, O. Pooe

Over the years, there have been many advances made within the treatment and diagnosis of Mycobacterium Tuberculosis (Mtb). In recent times, the rise of drug resistance has led to higher mortality rates, specifically in poorer countries. There is an urgent need for novel treatment regimens to work against Mtb. Previous studies have identified a gene family within Mtb, known as PE_PGRS proteins, which has shown potential as a drug target. Functional annotations can assist with identifying the role these proteins may play within Mtb. Previous studies indicated PE_PGRS to have potential for further research. The protein biomarkers that showed the most promise were identified as PE_PGRS17, PE_PGRS31, PE_PGRS50, and PEPGRS54. The sequences of these proteins were searched on the Mycobrowser software. Results were designed by entering these sequences into various computational algorithms. PE_PGRS17 showed characteristics of a potential vaccine candidate. Considering this result, expression profiling and purification were conducted on the recombinant PE_PGRS17 Mtb protein biomarker. The results were calculated using various online software algorithms. Many characteristics were predicted to understand the stability, localization, and function of these proteins. All the proteins have been estimated to produce an immune response or be involved in the process of immunity. The recombinantPE_PGRS17 protein was chosen to be optimally expressed and purified using E.coli as a host cell. These findings specifically on PE_PGRS17, can be expanded in future scientific studies. The predicted structures, protein-protein interaction, and antigenic properties of the proteins estimate whether a protein can be used for further studies, specifically as drug/vaccine targets. Ultimately, PE_PGRS17 is seen as the most stable according to its predicted structure, which holds promise as a key factor in future tuberculosis studies.

多年来，在治疗和诊断结核分枝杆菌(Mtb)方面取得了许多进展。近年来，耐药性的上升导致死亡率上升，特别是在较贫穷的国家。迫切需要新的治疗方案来对抗结核分枝杆菌。先前的研究已经确定了结核分枝杆菌中的一个基因家族，称为PE_PGRS蛋白，它已经显示出作为药物靶点的潜力。功能注释可以帮助鉴定这些蛋白质在结核分枝杆菌中可能发挥的作用。以往的研究表明，PE_PGRS具有进一步研究的潜力。鉴定出最有希望的蛋白生物标志物为PE_PGRS17、PE_PGRS31、PE_PGRS50和PEPGRS54。在Mycobrowser软件上检索这些蛋白的序列。通过将这些序列输入各种计算算法来设计结果。PE_PGRS17具有潜在候选疫苗的特性。考虑到这一结果，我们对重组PE_PGRS17 Mtb蛋白生物标志物进行了表达谱分析和纯化。使用各种在线软件算法计算结果。预测了许多特征，以了解这些蛋白质的稳定性，定位和功能。据估计，所有这些蛋白质都能产生免疫反应或参与免疫过程。选择重组pe_pgrs17蛋白，以大肠杆菌为宿主细胞进行最佳表达和纯化。这些关于PE_PGRS17的发现可以在未来的科学研究中扩展。蛋白质的预测结构、蛋白-蛋白相互作用和抗原特性可评估蛋白质是否可用于进一步研究，特别是作为药物/疫苗靶点。最终，根据其预测的结构，PE_PGRS17被认为是最稳定的，这有望成为未来结核病研究的关键因素。

{"title":"Computational Analysis, In silico Functional Annotation, and Expression of Recombinant PE_PGRS Protein Biomarkers Found in Mycobacterium tuberculosis","authors":"Avanthi Moodley-Reddy, Thamsanqa E. Chiliza, O. Pooe","doi":"10.2174/18750362-v16-e230306-2022-6","DOIUrl":"https://doi.org/10.2174/18750362-v16-e230306-2022-6","url":null,"abstract":"\u0000 \u0000 Over the years, there have been many advances made within the treatment and diagnosis of Mycobacterium Tuberculosis (Mtb). In recent times, the rise of drug resistance has led to higher mortality rates, specifically in poorer countries. There is an urgent need for novel treatment regimens to work against Mtb. Previous studies have identified a gene family within Mtb, known as PE_PGRS proteins, which has shown potential as a drug target. Functional annotations can assist with identifying the role these proteins may play within Mtb.\u0000 \u0000 \u0000 \u0000 Previous studies indicated PE_PGRS to have potential for further research. The protein biomarkers that showed the most promise were identified as PE_PGRS17, PE_PGRS31, PE_PGRS50, and PEPGRS54. The sequences of these proteins were searched on the Mycobrowser software. Results were designed by entering these sequences into various computational algorithms. PE_PGRS17 showed characteristics of a potential vaccine candidate. Considering this result, expression profiling and purification were conducted on the recombinant PE_PGRS17 Mtb protein biomarker.\u0000 \u0000 \u0000 \u0000 The results were calculated using various online software algorithms. Many characteristics were predicted to understand the stability, localization, and function of these proteins. All the proteins have been estimated to produce an immune response or be involved in the process of immunity. The recombinantPE_PGRS17 protein was chosen to be optimally expressed and purified using E.coli as a host cell. These findings specifically on PE_PGRS17, can be expanded in future scientific studies.\u0000 \u0000 \u0000 \u0000 The predicted structures, protein-protein interaction, and antigenic properties of the proteins estimate whether a protein can be used for further studies, specifically as drug/vaccine targets. Ultimately, PE_PGRS17 is seen as the most stable according to its predicted structure, which holds promise as a key factor in future tuberculosis studies.\u0000","PeriodicalId":38956,"journal":{"name":"Open Bioinformatics Journal","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48588996","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

The Development and Progress in Machine Learning for Protein Subcellular Localization Prediction 机器学习在蛋白质亚细胞定位预测中的发展与进展

Q3 Computer Science

Open Bioinformatics Journal

Pub Date : 2022-10-06 DOI: 10.2174/18750362-v15-e2208110

Le He, Xiyu Liu

Protein subcellular localization is a novel and promising area and is defined as searching for the specific location of proteins inside the cell, such as in the nucleus, in the cytoplasm or on the cell membrane. With the rapid development of next-generation sequencing technology, more and more new protein sequences have been continuously discovered. It is no longer sufficient to merely use traditional wet experimental methods to predict the subcellular localization of these new proteins. Therefore, it is urgent to develop high-throughput computational methods to achieve quick and precise protein subcellular localization predictions. This review summarizes the development of prediction methods for protein subcellular localization over the past decades, expounds on the application of various machine learning methods in this field, and compares the properties and performance of various well-known predictors. The narrative of this review mainly revolves around three main types of methods, namely, the sequence-based methods, the knowledge-based methods, and the fusion methods. A special focus is on the gene ontology (GO)-based methods and the PLoc series methods. Finally, this review looks forward to the future development directions of protein subcellular localization prediction.

蛋白质亚细胞定位是一个新的和有前途的领域，它被定义为寻找蛋白质在细胞内的特定位置，如在细胞核、细胞质或细胞膜上。随着新一代测序技术的迅速发展，越来越多新的蛋白质序列被不断发现。仅仅用传统的湿式实验方法来预测这些新蛋白的亚细胞定位已经不够了。因此，迫切需要开发高通量的计算方法来实现快速准确的蛋白质亚细胞定位预测。本文综述了近几十年来蛋白质亚细胞定位预测方法的发展，阐述了各种机器学习方法在该领域的应用，并比较了各种知名预测方法的性质和性能。本文的叙述主要围绕三种主要的方法展开，即基于序列的方法、基于知识的方法和融合方法。特别关注的是基于基因本体(GO)的方法和PLoc系列方法。最后，对蛋白质亚细胞定位预测的未来发展方向进行了展望。

{"title":"The Development and Progress in Machine Learning for Protein Subcellular Localization Prediction","authors":"Le He, Xiyu Liu","doi":"10.2174/18750362-v15-e2208110","DOIUrl":"https://doi.org/10.2174/18750362-v15-e2208110","url":null,"abstract":"Protein subcellular localization is a novel and promising area and is defined as searching for the specific location of proteins inside the cell, such as in the nucleus, in the cytoplasm or on the cell membrane. With the rapid development of next-generation sequencing technology, more and more new protein sequences have been continuously discovered. It is no longer sufficient to merely use traditional wet experimental methods to predict the subcellular localization of these new proteins. Therefore, it is urgent to develop high-throughput computational methods to achieve quick and precise protein subcellular localization predictions. This review summarizes the development of prediction methods for protein subcellular localization over the past decades, expounds on the application of various machine learning methods in this field, and compares the properties and performance of various well-known predictors. The narrative of this review mainly revolves around three main types of methods, namely, the sequence-based methods, the knowledge-based methods, and the fusion methods. A special focus is on the gene ontology (GO)-based methods and the PLoc series methods. Finally, this review looks forward to the future development directions of protein subcellular localization prediction.","PeriodicalId":38956,"journal":{"name":"Open Bioinformatics Journal","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-10-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41748283","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Unsupervised Deep learning-based Feature Fusion Approach for Detection and Analysis of COVID-19 using X-ray and CT Images 基于无监督深度学习的新冠肺炎x射线和CT图像检测与分析方法

Q3 Computer Science

Open Bioinformatics Journal

Pub Date : 2022-09-21 DOI: 10.2174/18750362-v15-e2207290

Vinayakumar Ravi, T. Pham

This study investigates an unsupervised deep learning-based feature fusion approach for the detection and analysis of COVID-19 using chest X-ray (CXR) and Computed tomography (CT) images. The outbreak of COVID-19 has affected millions of people all around the world and the disease is diagnosed by the reverse transcription-polymerase chain reaction (RT-PCR) test which suffers from a lower viral load, and sampling error, etc. Computed tomography (CT) and chest X-ray (CXR) scans can be examined as most infected people suffer from lungs infection. Both CT and CXR imaging techniques are useful for the COVID-19 diagnosis at an early stage and it is an alternative to the RT-PCR test. The manual diagnosis of CT scans and CXR images are labour-intensive and consumes a lot of time. To handle this situation, many AI-based solutions are researched including deep learning-based detection models, which can be used to help the radiologist to make a better diagnosis. However, the availability of annotated data for COVID-19 detection is limited due to the need for domain expertise and expensive annotation cost. Also, most existing state-of-the-art deep learning-based detection models follow a supervised learning approach. Therefore, in this work, we have explored various unsupervised learning models for COVID-19 detection which does not need a labelled dataset. In this work, we propose an unsupervised deep learning-based COVID-19 detection approach that incorporates the feature fusion method for performance enhancement. Four different sets of experiments are run on both CT and CXR scan datasets where convolutional autoencoders, pre-trained CNNs, hybrid, and PCA-based models are used for feature extraction and K-means and GMM techniques are used for clustering. The maximum accuracy of 84% is achieved by the model Autoencoder3-ResNet50 (GMM) on the CT dataset and for the CXR dataset, both Autoencoder1-VGG16 (KMeans and GMM) models achieved 70% accuracy. Our proposed deep unsupervised learning, feature fusion-based COVID-19 detection approach achieved promising results on both datasets. It also outperforms four well-known existing unsupervised approaches.

本研究研究了一种基于无监督深度学习的特征融合方法，用于使用胸部x射线(CXR)和计算机断层扫描(CT)图像检测和分析COVID-19。2019冠状病毒病(COVID-19)的爆发影响了全球数百万人，该疾病的诊断是通过逆转录聚合酶链反应(RT-PCR)检测，该检测具有病毒载量较低，采样误差等缺点。计算机断层扫描(CT)和胸部x光扫描(CXR)可以检查，因为大多数感染者患有肺部感染。CT和CXR成像技术对COVID-19的早期诊断都很有用，是RT-PCR检测的替代方法。CT扫描和CXR图像的人工诊断是劳动密集型的，耗费大量时间。为了处理这种情况，研究了许多基于人工智能的解决方案，包括基于深度学习的检测模型，可以用来帮助放射科医生做出更好的诊断。然而，由于需要领域专业知识和昂贵的注释成本，用于COVID-19检测的注释数据的可用性受到限制。此外，大多数现有的最先进的基于深度学习的检测模型都遵循监督学习方法。因此，在这项工作中，我们探索了各种用于COVID-19检测的无监督学习模型，这些模型不需要标记数据集。在这项工作中，我们提出了一种基于无监督深度学习的COVID-19检测方法，该方法结合了特征融合方法来增强性能。在CT和CXR扫描数据集上运行四组不同的实验，其中使用卷积自编码器、预训练cnn、混合模型和基于pca的模型进行特征提取，使用K-means和GMM技术进行聚类。模型Autoencoder3-ResNet50 (GMM)在CT数据集上达到了84%的最大准确率，对于CXR数据集，Autoencoder1-VGG16 (KMeans和GMM)模型都达到了70%的准确率。我们提出的基于深度无监督学习、特征融合的COVID-19检测方法在两个数据集上都取得了很好的结果。它也优于现有的四种众所周知的无监督方法。

{"title":"Unsupervised Deep learning-based Feature Fusion Approach for Detection and Analysis of COVID-19 using X-ray and CT Images","authors":"Vinayakumar Ravi, T. Pham","doi":"10.2174/18750362-v15-e2207290","DOIUrl":"https://doi.org/10.2174/18750362-v15-e2207290","url":null,"abstract":"\u0000 \u0000 This study investigates an unsupervised deep learning-based feature fusion approach for the detection and analysis of COVID-19 using chest X-ray (CXR) and Computed tomography (CT) images.\u0000 \u0000 \u0000 \u0000 The outbreak of COVID-19 has affected millions of people all around the world and the disease is diagnosed by the reverse transcription-polymerase chain reaction (RT-PCR) test which suffers from a lower viral load, and sampling error, etc. Computed tomography (CT) and chest X-ray (CXR) scans can be examined as most infected people suffer from lungs infection. Both CT and CXR imaging techniques are useful for the COVID-19 diagnosis at an early stage and it is an alternative to the RT-PCR test.\u0000 \u0000 \u0000 \u0000 The manual diagnosis of CT scans and CXR images are labour-intensive and consumes a lot of time. To handle this situation, many AI-based solutions are researched including deep learning-based detection models, which can be used to help the radiologist to make a better diagnosis. However, the availability of annotated data for COVID-19 detection is limited due to the need for domain expertise and expensive annotation cost. Also, most existing state-of-the-art deep learning-based detection models follow a supervised learning approach. Therefore, in this work, we have explored various unsupervised learning models for COVID-19 detection which does not need a labelled dataset.\u0000 \u0000 \u0000 \u0000 In this work, we propose an unsupervised deep learning-based COVID-19 detection approach that incorporates the feature fusion method for performance enhancement. Four different sets of experiments are run on both CT and CXR scan datasets where convolutional autoencoders, pre-trained CNNs, hybrid, and PCA-based models are used for feature extraction and K-means and GMM techniques are used for clustering.\u0000 \u0000 \u0000 \u0000 The maximum accuracy of 84% is achieved by the model Autoencoder3-ResNet50 (GMM) on the CT dataset and for the CXR dataset, both Autoencoder1-VGG16 (KMeans and GMM) models achieved 70% accuracy.\u0000 \u0000 \u0000 \u0000 Our proposed deep unsupervised learning, feature fusion-based COVID-19 detection approach achieved promising results on both datasets. It also outperforms four well-known existing unsupervised approaches.\u0000","PeriodicalId":38956,"journal":{"name":"Open Bioinformatics Journal","volume":"1 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-09-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41661571","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0