Identification of Key Genes in Fetal Gut Development at Single-Cell Level by Exploiting Machine Learning Techniques

IF 3.9 4区生物学 Q2 BIOCHEMICAL RESEARCH METHODS Proteomics Pub Date : 2024-09-26 DOI:10.1002/pmic.202400104

QingLan Ma, Mei Meng, XianChao Zhou, Wei Guo, KaiYan Feng, Tao Huang, Yu-Dong Cai

{"title":"Identification of Key Genes in Fetal Gut Development at Single-Cell Level by Exploiting Machine Learning Techniques","authors":"QingLan Ma, Mei Meng, XianChao Zhou, Wei Guo, KaiYan Feng, Tao Huang, Yu-Dong Cai","doi":"10.1002/pmic.202400104","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>The study of fetal gut development is critical due to its substantial influence on immediate neonatal and long-term adult health. Current research largely focuses on microbiome colonization, gut immunity, and barrier function, alongside the impact of external factors on these phenomena. Limited research has been dedicated to the categorization of developing fetal gut cells. Our study aimed to enhance our understanding of fetal gut development by employing advanced machine-learning techniques on single-cell sequencing data. This dataset consisted of 62,849 samples, each characterized by 33,694 distinct gene features. Four feature ranking algorithms were utilized to sort features according to their significance, resulting in four feature lists. Then, these lists were fed into an incremental feature selection method to extract essential genes, classification rules, and build efficient classifiers. Several important genes were recognized by multiple feature ranking algorithms, such as FGG, MDK, RBP1, RBP2, IGFBP7, and SPON2. These features were key in differentiating specific developing intestinal cells, including epithelial, immune, mesenchymal, and vasculature cells of the colon, duo jejunum, and ileum cells. The classification rules showed special gene expression patterns on some intestinal cell types and the efficient classifiers can be useful tools for identifying intestinal cells.</p>\n </div>","PeriodicalId":224,"journal":{"name":"Proteomics","volume":"24 23-24","pages":""},"PeriodicalIF":3.9000,"publicationDate":"2024-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proteomics","FirstCategoryId":"99","ListUrlMain":"https://analyticalsciencejournals.onlinelibrary.wiley.com/doi/10.1002/pmic.202400104","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}

引用次数: 0

Abstract

The study of fetal gut development is critical due to its substantial influence on immediate neonatal and long-term adult health. Current research largely focuses on microbiome colonization, gut immunity, and barrier function, alongside the impact of external factors on these phenomena. Limited research has been dedicated to the categorization of developing fetal gut cells. Our study aimed to enhance our understanding of fetal gut development by employing advanced machine-learning techniques on single-cell sequencing data. This dataset consisted of 62,849 samples, each characterized by 33,694 distinct gene features. Four feature ranking algorithms were utilized to sort features according to their significance, resulting in four feature lists. Then, these lists were fed into an incremental feature selection method to extract essential genes, classification rules, and build efficient classifiers. Several important genes were recognized by multiple feature ranking algorithms, such as FGG, MDK, RBP1, RBP2, IGFBP7, and SPON2. These features were key in differentiating specific developing intestinal cells, including epithelial, immune, mesenchymal, and vasculature cells of the colon, duo jejunum, and ileum cells. The classification rules showed special gene expression patterns on some intestinal cell types and the efficient classifiers can be useful tools for identifying intestinal cells.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

利用机器学习技术在单细胞水平鉴定胎儿肠道发育过程中的关键基因

胎儿肠道发育对新生儿的近期健康和成年后的长期健康有着重大影响，因此对胎儿肠道发育的研究至关重要。目前的研究主要集中在微生物组定植、肠道免疫和屏障功能，以及外部因素对这些现象的影响。对发育中的胎儿肠道细胞进行分类的研究十分有限。我们的研究旨在通过对单细胞测序数据采用先进的机器学习技术，加深我们对胎儿肠道发育的了解。该数据集包括 62,849 个样本，每个样本都有 33,694 个不同的基因特征。研究人员利用四种特征排序算法根据特征的重要性对其进行排序，最终得出四个特征列表。然后，将这些列表输入增量特征选择方法，以提取重要基因和分类规则，并建立高效的分类器。多个特征排序算法识别出了几个重要基因，如 FGG、MDK、RBP1、RBP2、IGFBP7 和 SPON2。这些特征是区分特定发育中肠细胞的关键，包括结肠、空肠和回肠细胞的上皮细胞、免疫细胞、间质细胞和血管细胞。分类规则显示了某些肠细胞类型的特殊基因表达模式，高效的分类器可作为识别肠细胞的有用工具。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Proteomics 生物-生化研究方法

CiteScore

6.30

自引率

5.90%

发文量

193

审稿时长

3 months

期刊介绍： PROTEOMICS is the premier international source for information on all aspects of applications and technologies, including software, in proteomics and other "omics". The journal includes but is not limited to proteomics, genomics, transcriptomics, metabolomics and lipidomics, and systems biology approaches. Papers describing novel applications of proteomics and integration of multi-omics data and approaches are especially welcome.