QingLan Ma, Mei Meng, XianChao Zhou, Wei Guo, KaiYan Feng, Tao Huang, Yu-Dong Cai
{"title":"Identification of Key Genes in Fetal Gut Development at Single-Cell Level by Exploiting Machine Learning Techniques.","authors":"QingLan Ma, Mei Meng, XianChao Zhou, Wei Guo, KaiYan Feng, Tao Huang, Yu-Dong Cai","doi":"10.1002/pmic.202400104","DOIUrl":null,"url":null,"abstract":"<p><p>The study of fetal gut development is critical due to its substantial influence on immediate neonatal and long-term adult health. Current research largely focuses on microbiome colonization, gut immunity, and barrier function, alongside the impact of external factors on these phenomena. Limited research has been dedicated to the categorization of developing fetal gut cells. Our study aimed to enhance our understanding of fetal gut development by employing advanced machine-learning techniques on single-cell sequencing data. This dataset consisted of 62,849 samples, each characterized by 33,694 distinct gene features. Four feature ranking algorithms were utilized to sort features according to their significance, resulting in four feature lists. Then, these lists were fed into an incremental feature selection method to extract essential genes, classification rules, and build efficient classifiers. Several important genes were recognized by multiple feature ranking algorithms, such as FGG, MDK, RBP1, RBP2, IGFBP7, and SPON2. These features were key in differentiating specific developing intestinal cells, including epithelial, immune, mesenchymal, and vasculature cells of the colon, duo jejunum, and ileum cells. The classification rules showed special gene expression patterns on some intestinal cell types and the efficient classifiers can be useful tools for identifying intestinal cells.</p>","PeriodicalId":224,"journal":{"name":"Proteomics","volume":null,"pages":null},"PeriodicalIF":3.4000,"publicationDate":"2024-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proteomics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1002/pmic.202400104","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0
Abstract
The study of fetal gut development is critical due to its substantial influence on immediate neonatal and long-term adult health. Current research largely focuses on microbiome colonization, gut immunity, and barrier function, alongside the impact of external factors on these phenomena. Limited research has been dedicated to the categorization of developing fetal gut cells. Our study aimed to enhance our understanding of fetal gut development by employing advanced machine-learning techniques on single-cell sequencing data. This dataset consisted of 62,849 samples, each characterized by 33,694 distinct gene features. Four feature ranking algorithms were utilized to sort features according to their significance, resulting in four feature lists. Then, these lists were fed into an incremental feature selection method to extract essential genes, classification rules, and build efficient classifiers. Several important genes were recognized by multiple feature ranking algorithms, such as FGG, MDK, RBP1, RBP2, IGFBP7, and SPON2. These features were key in differentiating specific developing intestinal cells, including epithelial, immune, mesenchymal, and vasculature cells of the colon, duo jejunum, and ileum cells. The classification rules showed special gene expression patterns on some intestinal cell types and the efficient classifiers can be useful tools for identifying intestinal cells.
期刊介绍:
PROTEOMICS is the premier international source for information on all aspects of applications and technologies, including software, in proteomics and other "omics". The journal includes but is not limited to proteomics, genomics, transcriptomics, metabolomics and lipidomics, and systems biology approaches. Papers describing novel applications of proteomics and integration of multi-omics data and approaches are especially welcome.