Aims: This study investigates predicting and eliminating malnutrition and anemia using ML Algorithms and comparing all the methods with various malnutrition-based parameters. Background: The health of the nation is more important than the wealth of the nation. Malnutrition and anemia are serious health issues but the least importance is given to eradicate them. Objective: Proper nutrition is an essential component for the survival, growth, and development of infants, children, and women who in turn give birth to infants. Methods: In the proposed system, machine learning approaches are utilized to predict the malnutrition status of children under five years of age and anemia in men and women using old datasets and further providing a suitable diet recommendation to overcome the disease. Classification techniques are being used for malnutrition as well as anemia prediction. Results: Algorithms such as Naïve Bayes classifier (NBC), Decision Tree (DT) algorithm, Random Forest (RF) algorithm, and K-Nearest Neighbor (k-NN) algorithm are utilized for prediction. The results obtained by these algorithms are 94.47%, 85%, 95.49%, and 63.15%. When compared, Naïve Bayes and random forest algorithm provided efficient results for malnutrition and anemia, respectively. Conclusion: By testing and validating, preventive actions can be taken with the help of medical experts and dieticians to reduce malnutrition and anemia conditions among children and elders, respectively.
{"title":"Decision-making Support System for Predicting and Eliminating Malnutrition and Anemia","authors":"Manasvi Jagadeesh Maasthi, Harinahalli Lokesh Gururaj, Vinayakumar Ravi, Basavesha D, Meshari Almeshari, Yasser Alzamil","doi":"10.2174/0118750362246898230921054021","DOIUrl":"https://doi.org/10.2174/0118750362246898230921054021","url":null,"abstract":"Aims: This study investigates predicting and eliminating malnutrition and anemia using ML Algorithms and comparing all the methods with various malnutrition-based parameters. Background: The health of the nation is more important than the wealth of the nation. Malnutrition and anemia are serious health issues but the least importance is given to eradicate them. Objective: Proper nutrition is an essential component for the survival, growth, and development of infants, children, and women who in turn give birth to infants. Methods: In the proposed system, machine learning approaches are utilized to predict the malnutrition status of children under five years of age and anemia in men and women using old datasets and further providing a suitable diet recommendation to overcome the disease. Classification techniques are being used for malnutrition as well as anemia prediction. Results: Algorithms such as Naïve Bayes classifier (NBC), Decision Tree (DT) algorithm, Random Forest (RF) algorithm, and K-Nearest Neighbor (k-NN) algorithm are utilized for prediction. The results obtained by these algorithms are 94.47%, 85%, 95.49%, and 63.15%. When compared, Naïve Bayes and random forest algorithm provided efficient results for malnutrition and anemia, respectively. Conclusion: By testing and validating, preventive actions can be taken with the help of medical experts and dieticians to reduce malnutrition and anemia conditions among children and elders, respectively.","PeriodicalId":38956,"journal":{"name":"Open Bioinformatics Journal","volume":"52 5","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136318383","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-10-20DOI: 10.2174/0118750362253383230922100803
Shalini Maurya, Salman Akhtar, Mohammad Kalim Ahmad Khan
Purpose: Multidrug-resistant Burkholderia pseudomallei is associated with significant morbidity and mortality. Hence, there is a requirement for a vaccine for this pathogen. Using subtractive proteomics and reverse vaccinology approaches, we have designed a chimeric multiepitope vaccine against the pathogen in the present study. Methods: Twenty-one non-redundant pathogen proteomes were mined using a subtractive proteomics strategy. Out of these, by various analyses, we found proteins that were non-homologous to humans, essential, and virulent. BLASTp against the PDB database and Pocket druggability analysis yielded nine proteins whose 3D structure is available and are druggable. Four proteins that could be candidates for vaccines were identified by subcellular localization and antigenicity prediction, and they could be used in reverse vaccinology methods to create a chimeric multiepitope vaccine. Results: Using online resources and servers, MHC class I, II, and B cell epitopes were identified. The predicted epitopes were selected based on analysis of toxicity, solubility, allergenicity, and hydrophilicity. These predicted epitopes, which were immunogenic, were used for the construction of a multivalent chimeric vaccine. The epitopes, adjuvants, linkers, and PADRE amino acid sequences were employed to create the vaccine. Shortlisted vaccine constructs also interact with the HLA allele and TLR4, as evident from docking and molecular dynamics simulation. Thus, vaccine construct V1 can elicit an immune response against Burkholderia pseudomallei . Conclusion: The availability of the proteome of B. pseudomallei has made this study possible through the usage of various in silico approaches. We could shortlist vaccine targets using subtractive proteomics and then construct chimeric vaccines using reverse vaccinology and immunoinformatics approaches.
{"title":"Immunoinformatics Approach for the Design of Chimeric Vaccine Against Whitmore Disease","authors":"Shalini Maurya, Salman Akhtar, Mohammad Kalim Ahmad Khan","doi":"10.2174/0118750362253383230922100803","DOIUrl":"https://doi.org/10.2174/0118750362253383230922100803","url":null,"abstract":"Purpose: Multidrug-resistant Burkholderia pseudomallei is associated with significant morbidity and mortality. Hence, there is a requirement for a vaccine for this pathogen. Using subtractive proteomics and reverse vaccinology approaches, we have designed a chimeric multiepitope vaccine against the pathogen in the present study. Methods: Twenty-one non-redundant pathogen proteomes were mined using a subtractive proteomics strategy. Out of these, by various analyses, we found proteins that were non-homologous to humans, essential, and virulent. BLASTp against the PDB database and Pocket druggability analysis yielded nine proteins whose 3D structure is available and are druggable. Four proteins that could be candidates for vaccines were identified by subcellular localization and antigenicity prediction, and they could be used in reverse vaccinology methods to create a chimeric multiepitope vaccine. Results: Using online resources and servers, MHC class I, II, and B cell epitopes were identified. The predicted epitopes were selected based on analysis of toxicity, solubility, allergenicity, and hydrophilicity. These predicted epitopes, which were immunogenic, were used for the construction of a multivalent chimeric vaccine. The epitopes, adjuvants, linkers, and PADRE amino acid sequences were employed to create the vaccine. Shortlisted vaccine constructs also interact with the HLA allele and TLR4, as evident from docking and molecular dynamics simulation. Thus, vaccine construct V1 can elicit an immune response against Burkholderia pseudomallei . Conclusion: The availability of the proteome of B. pseudomallei has made this study possible through the usage of various in silico approaches. We could shortlist vaccine targets using subtractive proteomics and then construct chimeric vaccines using reverse vaccinology and immunoinformatics approaches.","PeriodicalId":38956,"journal":{"name":"Open Bioinformatics Journal","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135666306","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-10-16DOI: 10.2174/0118750362260635230922051326
Kiran P., Sudheesh K. V., Vinayakumar Ravi, Meshari Almeshari, Yasser Alzamil, Sunil Kumar D. S., Harshitha R.
Background: The psychological aspects of the brain in Alzheimer's disease (AD) are significantly affected. These alterations in brain anatomy take place due to a variety of reasons, including the shrinking of grey and white matter in the brain. Magnetic resonance imaging (MRI) scans can be used to measure it, and these scans offer a chance for early identification of AD utilizing classification methods, like convolutional neural network (CNN). The majority of AD-related tests are now constrained by the test measures. It is, thus, crucial to find an affordable method for image categorization using minimal information. Because of developments in machine learning and medical imaging, the field of computerized health care has evolved rapidly. Recent developments in deep learning, in particular, herald a new era of clinical decision-making that is heavily reliant on multimedia systems. Methods: In the proposed work, we have investigated various CNN-based transfer-learning strategies for predicting AD using MRI scans of the brain's structural organization. According to an analysis of the data, the suggested model makes use of a number of sites related to Alzheimer's disease. In order to interpret structural brain pictures in both 2D and 3D, the Alzheimer's Disease Neuroimaging Initiative (ADNI) dataset includes straightforward CNN designs based on 2D and 3D convolutions. Results: According to these results, deep neural networks may be able to automatically learn which imaging biomarkers are indicative of Alzheimer's disease and exploit them for precise early disease detection. The proposed techniques have been found to achieve an accuracy of 93.24%. Conclusion: This research aimed to classify Alzheimer's disease (AD) using transfer learning. We have used strict pre-processing steps on raw MRI data from the ADNI dataset and used the AlexNet, i.e ., Alzheimer's disease has been categorized using pre-processed data and the CNN classifier.
{"title":"A New Deep Learning Model based on Neuroimaging for Predicting Alzheimer's Disease","authors":"Kiran P., Sudheesh K. V., Vinayakumar Ravi, Meshari Almeshari, Yasser Alzamil, Sunil Kumar D. S., Harshitha R.","doi":"10.2174/0118750362260635230922051326","DOIUrl":"https://doi.org/10.2174/0118750362260635230922051326","url":null,"abstract":"Background: The psychological aspects of the brain in Alzheimer's disease (AD) are significantly affected. These alterations in brain anatomy take place due to a variety of reasons, including the shrinking of grey and white matter in the brain. Magnetic resonance imaging (MRI) scans can be used to measure it, and these scans offer a chance for early identification of AD utilizing classification methods, like convolutional neural network (CNN). The majority of AD-related tests are now constrained by the test measures. It is, thus, crucial to find an affordable method for image categorization using minimal information. Because of developments in machine learning and medical imaging, the field of computerized health care has evolved rapidly. Recent developments in deep learning, in particular, herald a new era of clinical decision-making that is heavily reliant on multimedia systems. Methods: In the proposed work, we have investigated various CNN-based transfer-learning strategies for predicting AD using MRI scans of the brain's structural organization. According to an analysis of the data, the suggested model makes use of a number of sites related to Alzheimer's disease. In order to interpret structural brain pictures in both 2D and 3D, the Alzheimer's Disease Neuroimaging Initiative (ADNI) dataset includes straightforward CNN designs based on 2D and 3D convolutions. Results: According to these results, deep neural networks may be able to automatically learn which imaging biomarkers are indicative of Alzheimer's disease and exploit them for precise early disease detection. The proposed techniques have been found to achieve an accuracy of 93.24%. Conclusion: This research aimed to classify Alzheimer's disease (AD) using transfer learning. We have used strict pre-processing steps on raw MRI data from the ADNI dataset and used the AlexNet, i.e ., Alzheimer's disease has been categorized using pre-processed data and the CNN classifier.","PeriodicalId":38956,"journal":{"name":"Open Bioinformatics Journal","volume":"52 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136183087","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-10-13DOI: 10.2174/18750362-v16-231005-2023-5
K V Sudheesh, None Kiran, Harinahalli Lokesh Gururaj, Vinayakumar Ravi, Meshari Almeshari, Yasser Alzamil
Aims: In this study, chest X-ray (CXR) and computed tomography (CT) images are used to analyse and detect COVID-19 using an unsupervised deep learning-based feature fusion approach. Background: The reverse transcription-polymerase chain reaction (RT-PCR) test, which has a reduced viral load, sampling error, etc., is used to detect COVID-19, which has sickened millions of people worldwide. It is possible to check chest X-rays and computed tomography scans because the majority of infected persons have lung infections. The COVID-19 diagnosis can be made early using both CT and CXR imaging modalities, which is an alternative to the RT-PCR test. Objective: The manual diagnosis of CXR pictures and CT scans is labor and time-intensive. Many AI-based solutions are being investigated to tackle this problem, including deep learning-based detection models, which can be utilized to assist the radiologist in making a more accurate diagnosis. However, because of the demand for specialized knowledge and high annotation costs, the amount of annotated data available for COVID-19 identification is constrained. Additionally, the majority of current cutting-edge deep learning-based detection models use supervised learning techniques. Because a tagged dataset is not required, we have investigated various unsupervised learning models for COVID-19 identification in this work. Methods: In this study, we suggest a COVID-19 detection method based on unsupervised deep learning that makes use of the feature fusion technique to improve performance. Based on this an automated CNN model is built for the detection of COVID-19 samples from healthy and pneumonic cases using chest X-ray images. Results: This model has scored an accuracy of about 99% for the classification between covid positive and covid negative. Based on this result further classification will be done for pneumonic and non-pneumonic which has scored an accuracy of 94%. Conclusion: On both datasets, the COVID-19 detection method based on feature fusion and deep unsupervised learning showed promising results. Additionally, it outperforms four well-known unsupervised methods already in use.
{"title":"Early Prediction of Covid-19 Samples from Chest X-ray Images using Deep Learning Approach","authors":"K V Sudheesh, None Kiran, Harinahalli Lokesh Gururaj, Vinayakumar Ravi, Meshari Almeshari, Yasser Alzamil","doi":"10.2174/18750362-v16-231005-2023-5","DOIUrl":"https://doi.org/10.2174/18750362-v16-231005-2023-5","url":null,"abstract":"Aims: In this study, chest X-ray (CXR) and computed tomography (CT) images are used to analyse and detect COVID-19 using an unsupervised deep learning-based feature fusion approach. Background: The reverse transcription-polymerase chain reaction (RT-PCR) test, which has a reduced viral load, sampling error, etc., is used to detect COVID-19, which has sickened millions of people worldwide. It is possible to check chest X-rays and computed tomography scans because the majority of infected persons have lung infections. The COVID-19 diagnosis can be made early using both CT and CXR imaging modalities, which is an alternative to the RT-PCR test. Objective: The manual diagnosis of CXR pictures and CT scans is labor and time-intensive. Many AI-based solutions are being investigated to tackle this problem, including deep learning-based detection models, which can be utilized to assist the radiologist in making a more accurate diagnosis. However, because of the demand for specialized knowledge and high annotation costs, the amount of annotated data available for COVID-19 identification is constrained. Additionally, the majority of current cutting-edge deep learning-based detection models use supervised learning techniques. Because a tagged dataset is not required, we have investigated various unsupervised learning models for COVID-19 identification in this work. Methods: In this study, we suggest a COVID-19 detection method based on unsupervised deep learning that makes use of the feature fusion technique to improve performance. Based on this an automated CNN model is built for the detection of COVID-19 samples from healthy and pneumonic cases using chest X-ray images. Results: This model has scored an accuracy of about 99% for the classification between covid positive and covid negative. Based on this result further classification will be done for pneumonic and non-pneumonic which has scored an accuracy of 94%. Conclusion: On both datasets, the COVID-19 detection method based on feature fusion and deep unsupervised learning showed promising results. Additionally, it outperforms four well-known unsupervised methods already in use.","PeriodicalId":38956,"journal":{"name":"Open Bioinformatics Journal","volume":"36 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135922821","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Aims: This research work aims to develop an interoperable electronic health record (EHR) system to aid the early detection of diabetes by the use of Machine Learning (ML) algorithms. A decision support system developed using many ML algorithms results in optimizing the decision in preventive care in the health information system. Methods: The proposed system consisted of two models. The first model included interoperable EHR system development using a precise database structure. The second module comprised of data extraction from the EHR system, data cleaning, and data processing and prediction. For testing and training, about 1080 patients’ health record was considered. Among 1080, 1000 records were from the Kaggle dataset, and 80 records were demographic information from patients who visited our health center of Siddaganga organization for a regular checkup or during emergencies. The demographic information was collected from the proposed EHR system. Results: The proposed system was tested for the interoperability nature of the EHR system and accuracy in diabetic disease prediction using the proposed decision support system. The proposed EHR system development was tested for interoperability by random updations from various systems maintained in the laboratory. Each system acted like the admin system of different hospitals. The EHR system was tested for handling the load and interoperability by considering user view status, system matching with the real world, consistency in data updations, security etc . However, in the prediction phase, diabetes prediction was concentrated. The features considered were not randomly chosen; however, the features were those prescribed by a doctor who insisted that the features were sufficient for initial prediction. The reports collected from the doctors revealed several features they considered before giving the test details. The proposed system dataset was split into test and train datasets with eight proper features taken as input and one set as a target variable where the result was present. After this, the model was imported using standard “sklearn” libraries, and it fit with the required number of estimators, that is, the number of decision trees. The features included pregnancies, glucose level, blood pressure, skin thickness, insulin level, bone marrow index, diabetic pedigree function, age, weight, etc . At the outset, the research work concentrated on developing an interoperable EHR system, identifying the expectation of diabetic and non-diabetic conditions and demonstrating the accuracy of the system. Conclusion: In this study, the first aim was to design an interoperable EHR system that could help in accumulating, storing, and sharing patients' timely health records over a lifetime. The second aim was to use EHR data for early prediction of diabetes in the user. To confirm the accuracy of the system, the system was tested regarding interoperability to support early prediction through a decision supp
{"title":"Electronic Health Record (EHR) System Development for Study on EHR Data-based Early Prediction of Diabetes Using Machine Learning Algorithms","authors":"Jagadamba G, Shashidhar R, Gururaj H L, Vinayakumar Ravi, Meshari Almeshari, Yasser Alzamil","doi":"10.2174/18750362-v16-e230906-2023-15","DOIUrl":"https://doi.org/10.2174/18750362-v16-e230906-2023-15","url":null,"abstract":"Aims: This research work aims to develop an interoperable electronic health record (EHR) system to aid the early detection of diabetes by the use of Machine Learning (ML) algorithms. A decision support system developed using many ML algorithms results in optimizing the decision in preventive care in the health information system. Methods: The proposed system consisted of two models. The first model included interoperable EHR system development using a precise database structure. The second module comprised of data extraction from the EHR system, data cleaning, and data processing and prediction. For testing and training, about 1080 patients’ health record was considered. Among 1080, 1000 records were from the Kaggle dataset, and 80 records were demographic information from patients who visited our health center of Siddaganga organization for a regular checkup or during emergencies. The demographic information was collected from the proposed EHR system. Results: The proposed system was tested for the interoperability nature of the EHR system and accuracy in diabetic disease prediction using the proposed decision support system. The proposed EHR system development was tested for interoperability by random updations from various systems maintained in the laboratory. Each system acted like the admin system of different hospitals. The EHR system was tested for handling the load and interoperability by considering user view status, system matching with the real world, consistency in data updations, security etc . However, in the prediction phase, diabetes prediction was concentrated. The features considered were not randomly chosen; however, the features were those prescribed by a doctor who insisted that the features were sufficient for initial prediction. The reports collected from the doctors revealed several features they considered before giving the test details. The proposed system dataset was split into test and train datasets with eight proper features taken as input and one set as a target variable where the result was present. After this, the model was imported using standard “sklearn” libraries, and it fit with the required number of estimators, that is, the number of decision trees. The features included pregnancies, glucose level, blood pressure, skin thickness, insulin level, bone marrow index, diabetic pedigree function, age, weight, etc . At the outset, the research work concentrated on developing an interoperable EHR system, identifying the expectation of diabetic and non-diabetic conditions and demonstrating the accuracy of the system. Conclusion: In this study, the first aim was to design an interoperable EHR system that could help in accumulating, storing, and sharing patients' timely health records over a lifetime. The second aim was to use EHR data for early prediction of diabetes in the user. To confirm the accuracy of the system, the system was tested regarding interoperability to support early prediction through a decision supp","PeriodicalId":38956,"journal":{"name":"Open Bioinformatics Journal","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135547188","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-07-26DOI: 10.2174/18750362-v16-e230711-2023-2
Tejaswini Vijay Shinde, Tejas Gajanan Shinde, V. V. Chougule, Anagha Rajendra Ghorpade, Geeta Vikas Utekar, Amol S Jadhav, Bandu Shamlal Pawar, S. Sanmukh
The Mycobacterium tuberculosis complex (MTBC) bacteria include the slowly growing, host-associated bacteria Mycobacterium tuberculosis, Mycobacterium Bovis, Mycobacterium microti, Mycobacterium africanum, Mycobacterium pinnipedii. Comparative Functional Genomics Studies for understanding the Hypothetical Proteins in Mycobacterium tuberculosis variant microti 12. A computational genomics study was performed to understand the 247 hypothetical protein genes. Functional annotation of virtual proteins was performed on different servers to maximize confidence level. Sequence Retrieval. The whole genome sequences for the Mycobacterium tuberculosis micro variant 12 were retrieved from the KEGG database ( http://www.genome.jp/kegg/) and were used for screening 247 hypothetical proteins (Fig. 1 ). Functional Annotation and Sub-cellular localization. The Mycobacterium tuberculosis micro variant 12 hypothetical proteins were screened and sorted out from the genome and were individually analyzed for the presence of conserved functional domains by using computational biology tools like CDD-BLAST ( https://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi) ;Pfam ( http://pfam.xfam.org/ncbiseq/398365647); The subcellular localization of hypothetical proteins was determined by CELLO2GO ( http://cello.life.nctu.edu.tw). These web tools can search the defined conserved domains in the sequences available in the online servers or databases and assist in the classification of proteins in the appropriate families. Protein Structure Prediction. The in-silico structure predictions of the hypothetical protein sequences showing functional properties were carried out by using the PS2 Protein Structure Prediction Server ( http://www.ps2.life.nctu.edu.tw/). The online server helps to generate the 3D structures of the hypothetical proteins. The server accepts the sequences in FASTA format as a query to generate resultant proteins 3D structures. The structure determination is completely based on the conserved template regions detected during functional annotations. Protein-protein interaction through String database: The interaction of each hypothetical protein analyzed for functional characteristics was subjected to a protein-protein interaction server for the prediction of a possible functional role in interaction amongst the available known proteins ( https://string-db.org/). This information can help us to further validated the functional role of such hypothetical proteins and their possible role in the Mycobacterium Tuberculosis micro variant. Protein secondary structure prediction through JPred4: The secondary structure prediction of all the hypothetical proteins was determined through JPred4 ( http://www.compbio.dundee.ac.uk/jpred4/index.html) and served to identify the available secondary structures in the unknown hypothetical protein sequences. These further help us to understand the available templates in the uncharacterized protein seq
{"title":"Comparative Functional Genomics Studies for Understanding the Hypothetical Proteins in\u0000 Mycobacterium Tuberculosis Variant Microti 12","authors":"Tejaswini Vijay Shinde, Tejas Gajanan Shinde, V. V. Chougule, Anagha Rajendra Ghorpade, Geeta Vikas Utekar, Amol S Jadhav, Bandu Shamlal Pawar, S. Sanmukh","doi":"10.2174/18750362-v16-e230711-2023-2","DOIUrl":"https://doi.org/10.2174/18750362-v16-e230711-2023-2","url":null,"abstract":"\u0000 \u0000 The Mycobacterium tuberculosis complex (MTBC) bacteria include the slowly growing, host-associated bacteria Mycobacterium tuberculosis, Mycobacterium Bovis, Mycobacterium microti, Mycobacterium africanum, Mycobacterium pinnipedii.\u0000 \u0000 \u0000 \u0000 Comparative Functional Genomics Studies for understanding the Hypothetical Proteins in Mycobacterium tuberculosis variant microti 12.\u0000 \u0000 \u0000 \u0000 A computational genomics study was performed to understand the 247 hypothetical protein genes. Functional annotation of virtual proteins was performed on different servers to maximize confidence level.\u0000 \u0000 \u0000 \u0000 Sequence Retrieval. The whole genome sequences for the Mycobacterium tuberculosis micro variant 12 were retrieved from the KEGG database (\u0000 http://www.genome.jp/kegg/) and were used for screening 247 hypothetical proteins (Fig.\u0000 \u0000 1\u0000 ). Functional Annotation and Sub-cellular localization. The Mycobacterium tuberculosis micro variant 12 hypothetical proteins were screened and sorted out from the genome and were individually analyzed for the presence of conserved functional domains by using computational biology tools like CDD-BLAST (\u0000 https://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi) ;Pfam (\u0000 http://pfam.xfam.org/ncbiseq/398365647); The subcellular localization of hypothetical proteins was determined by CELLO2GO (\u0000 http://cello.life.nctu.edu.tw). These web tools can search the defined conserved domains in the sequences available in the online servers or databases and assist in the classification of proteins in the appropriate families. Protein Structure Prediction. The\u0000 in-silico structure predictions of the hypothetical protein sequences showing functional properties were carried out by using the PS2 Protein Structure Prediction Server (\u0000 http://www.ps2.life.nctu.edu.tw/). The online server helps to generate the 3D structures of the hypothetical proteins. The server accepts the sequences in FASTA format as a query to generate resultant proteins 3D structures. The structure determination is completely based on the conserved template regions detected during functional annotations. Protein-protein interaction through String database: The interaction of each hypothetical protein analyzed for functional characteristics was subjected to a protein-protein interaction server for the prediction of a possible functional role in interaction amongst the available known proteins (\u0000 https://string-db.org/). This information can help us to further validated the functional role of such hypothetical proteins and their possible role in the Mycobacterium Tuberculosis micro variant. Protein secondary structure prediction through JPred4: The secondary structure prediction of all the hypothetical proteins was determined through JPred4 (\u0000 http://www.compbio.dundee.ac.uk/jpred4/index.html) and served to identify the available secondary structures in the unknown hypothetical protein sequences. These further help us to understand the available templates in the uncharacterized protein seq","PeriodicalId":38956,"journal":{"name":"Open Bioinformatics Journal","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47867946","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}