{"title":"Prediction of Crohn's disease based on deep feature recognition","authors":"Hui Tian , Ran Tang","doi":"10.1016/j.compbiolchem.2024.108231","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><div>Crohn's disease is a complex genetic disease that involves chronic gastrointestinal inflammation and results from a complex set of genetic, environmental, and immunological factors. By analyzing data from the human microbiome, genetic information can be used to predict Crohn's disease. Recent advances in deep learning have demonstrated its effectiveness in feature extraction and the use of deep learning to decode genetic information for disease prediction.</div></div><div><h3>Methods</h3><div>In this paper, we present a deep learning-based model that utilizes a sequential convolutional attention network (SCAN) for feature extraction, incorporates adaptive additive interval losses to enhance these features, and employs support vector machines (SVM) for classification. To address the challenge of unbalanced Crohn's disease samples, we propose a random noise one-hot encoding data augmentation method.</div></div><div><h3>Results</h3><div>Data augmentation with random noise accelerates training convergence, while SCAN-SVM effectively extracts features with adaptive additive interval loss enhancing differentiation. Our approach outperforms benchmark methods, achieving an average accuracy of 0.80 and a kappa value of 0.76, and we validate the effectiveness of feature enhancement.</div></div><div><h3>Conclusions</h3><div>In summary, we use deep feature recognition to effectively analyze the potential information in genes, which has a good application potential for gene analysis and prediction of Crohn's disease.</div></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":"113 ","pages":"Article 108231"},"PeriodicalIF":2.6000,"publicationDate":"2024-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computational Biology and Chemistry","FirstCategoryId":"99","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1476927124002196","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Background
Crohn's disease is a complex genetic disease that involves chronic gastrointestinal inflammation and results from a complex set of genetic, environmental, and immunological factors. By analyzing data from the human microbiome, genetic information can be used to predict Crohn's disease. Recent advances in deep learning have demonstrated its effectiveness in feature extraction and the use of deep learning to decode genetic information for disease prediction.
Methods
In this paper, we present a deep learning-based model that utilizes a sequential convolutional attention network (SCAN) for feature extraction, incorporates adaptive additive interval losses to enhance these features, and employs support vector machines (SVM) for classification. To address the challenge of unbalanced Crohn's disease samples, we propose a random noise one-hot encoding data augmentation method.
Results
Data augmentation with random noise accelerates training convergence, while SCAN-SVM effectively extracts features with adaptive additive interval loss enhancing differentiation. Our approach outperforms benchmark methods, achieving an average accuracy of 0.80 and a kappa value of 0.76, and we validate the effectiveness of feature enhancement.
Conclusions
In summary, we use deep feature recognition to effectively analyze the potential information in genes, which has a good application potential for gene analysis and prediction of Crohn's disease.
期刊介绍:
Computational Biology and Chemistry publishes original research papers and review articles in all areas of computational life sciences. High quality research contributions with a major computational component in the areas of nucleic acid and protein sequence research, molecular evolution, molecular genetics (functional genomics and proteomics), theory and practice of either biology-specific or chemical-biology-specific modeling, and structural biology of nucleic acids and proteins are particularly welcome. Exceptionally high quality research work in bioinformatics, systems biology, ecology, computational pharmacology, metabolism, biomedical engineering, epidemiology, and statistical genetics will also be considered.
Given their inherent uncertainty, protein modeling and molecular docking studies should be thoroughly validated. In the absence of experimental results for validation, the use of molecular dynamics simulations along with detailed free energy calculations, for example, should be used as complementary techniques to support the major conclusions. Submissions of premature modeling exercises without additional biological insights will not be considered.
Review articles will generally be commissioned by the editors and should not be submitted to the journal without explicit invitation. However prospective authors are welcome to send a brief (one to three pages) synopsis, which will be evaluated by the editors.