生物信息学工具在单细胞RNA-seq数据细胞类型分类中的应用。

Shah Tania Akter Sujana, Md Shahjaman, Atul Chandra Singha
{"title":"生物信息学工具在单细胞RNA-seq数据细胞类型分类中的应用。","authors":"Shah Tania Akter Sujana, Md Shahjaman, Atul Chandra Singha","doi":"10.1016/j.compbiolchem.2024.108332","DOIUrl":null,"url":null,"abstract":"<p><p>The advancements in single-cell RNA sequencing (scRNAseq) technology have significantly transformed genomics research, enabling the handling of thousands of cells in each experiment. As of now, 32,068 research studies have been cataloged in the Pubmed database. The primary aim of scRNAseq investigations is to identify cell types, understand the antitumor immune response, and identify new and uncommon cell types. Traditional techniques for identifying cell types include microscopy, histology, and pathological characteristics. However, the complexity of instruments and the need for precise experimental design make it difficult to fully capture the overall heterogeneity. Unsupervised clustering and supervised classification methods have been used to solve this task. Supervised cell type classification methods have gained popularity as large-scale, high-quality, well-annotated and more robust results compared to clustering methods. A recent study showed that support vector machine (SVM) gives a high-quality classification performance in different scenarios. In this article, we compare and evaluate the performance of four different kernels (sigmoid, linear, radial, polynomial) of SVM. The results of the experiments on three standard scRNA-seq datasets indicate that SVM with linear and SVM with sigmoid kernel classify the cells more accurately (approx. 99 %) where SVM linear kernel method has remarkably fast computation time and we also evaluate the results using some single cell specific evaluation matrices F-1 score, MCC, AUC value. Additionally, it sheds light on the potential use of kernels of SVM to give underlying information of single-cell RNA-Seq data more effectively.</p>","PeriodicalId":93952,"journal":{"name":"Computational biology and chemistry","volume":"115 ","pages":"108332"},"PeriodicalIF":0.0000,"publicationDate":"2025-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Application of bioinformatic tools in cell type classification for single-cell RNA-seq data.\",\"authors\":\"Shah Tania Akter Sujana, Md Shahjaman, Atul Chandra Singha\",\"doi\":\"10.1016/j.compbiolchem.2024.108332\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>The advancements in single-cell RNA sequencing (scRNAseq) technology have significantly transformed genomics research, enabling the handling of thousands of cells in each experiment. As of now, 32,068 research studies have been cataloged in the Pubmed database. The primary aim of scRNAseq investigations is to identify cell types, understand the antitumor immune response, and identify new and uncommon cell types. Traditional techniques for identifying cell types include microscopy, histology, and pathological characteristics. However, the complexity of instruments and the need for precise experimental design make it difficult to fully capture the overall heterogeneity. Unsupervised clustering and supervised classification methods have been used to solve this task. Supervised cell type classification methods have gained popularity as large-scale, high-quality, well-annotated and more robust results compared to clustering methods. A recent study showed that support vector machine (SVM) gives a high-quality classification performance in different scenarios. In this article, we compare and evaluate the performance of four different kernels (sigmoid, linear, radial, polynomial) of SVM. The results of the experiments on three standard scRNA-seq datasets indicate that SVM with linear and SVM with sigmoid kernel classify the cells more accurately (approx. 99 %) where SVM linear kernel method has remarkably fast computation time and we also evaluate the results using some single cell specific evaluation matrices F-1 score, MCC, AUC value. Additionally, it sheds light on the potential use of kernels of SVM to give underlying information of single-cell RNA-Seq data more effectively.</p>\",\"PeriodicalId\":93952,\"journal\":{\"name\":\"Computational biology and chemistry\",\"volume\":\"115 \",\"pages\":\"108332\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-01-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computational biology and chemistry\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1016/j.compbiolchem.2024.108332\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computational biology and chemistry","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1016/j.compbiolchem.2024.108332","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

单细胞RNA测序(scRNAseq)技术的进步极大地改变了基因组学研究,使每次实验中处理数千个细胞成为可能。截至目前,Pubmed数据库已收录了32,068项研究。scRNAseq研究的主要目的是鉴定细胞类型,了解抗肿瘤免疫反应,并鉴定新的和不常见的细胞类型。鉴定细胞类型的传统技术包括显微镜、组织学和病理特征。然而,由于仪器的复杂性和对精确实验设计的需要,很难完全捕捉到整体的异质性。无监督聚类和监督分类方法被用来解决这个问题。与聚类方法相比,监督细胞类型分类方法因其大规模、高质量、注释良好、鲁棒性更强而广受欢迎。最近的一项研究表明,支持向量机(SVM)在不同的场景下都能给出高质量的分类性能。在本文中,我们比较和评价了支持向量机的四种不同核(s型核、线性核、径向核、多项式核)的性能。在三个标准scRNA-seq数据集上的实验结果表明,线性支持向量机和sigmoid核支持向量机对细胞的分类精度更高。99 %),其中SVM线性核方法具有非常快的计算时间,并且我们还使用一些单细胞特定的评估矩阵F-1分数,MCC, AUC值来评估结果。此外,它还揭示了支持向量机核的潜在用途,以更有效地提供单细胞RNA-Seq数据的底层信息。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Application of bioinformatic tools in cell type classification for single-cell RNA-seq data.

The advancements in single-cell RNA sequencing (scRNAseq) technology have significantly transformed genomics research, enabling the handling of thousands of cells in each experiment. As of now, 32,068 research studies have been cataloged in the Pubmed database. The primary aim of scRNAseq investigations is to identify cell types, understand the antitumor immune response, and identify new and uncommon cell types. Traditional techniques for identifying cell types include microscopy, histology, and pathological characteristics. However, the complexity of instruments and the need for precise experimental design make it difficult to fully capture the overall heterogeneity. Unsupervised clustering and supervised classification methods have been used to solve this task. Supervised cell type classification methods have gained popularity as large-scale, high-quality, well-annotated and more robust results compared to clustering methods. A recent study showed that support vector machine (SVM) gives a high-quality classification performance in different scenarios. In this article, we compare and evaluate the performance of four different kernels (sigmoid, linear, radial, polynomial) of SVM. The results of the experiments on three standard scRNA-seq datasets indicate that SVM with linear and SVM with sigmoid kernel classify the cells more accurately (approx. 99 %) where SVM linear kernel method has remarkably fast computation time and we also evaluate the results using some single cell specific evaluation matrices F-1 score, MCC, AUC value. Additionally, it sheds light on the potential use of kernels of SVM to give underlying information of single-cell RNA-Seq data more effectively.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Computational investigation of graphyne monolayer as a promising carrier for anticancer drug delivery. Machine learning and molecular subtyping reveal the impact of diverse patterns of cell death on the prognosis and treatment of hepatocellular carcinoma. In silico analysis of novel Triacontafluoropentadec-1-ene as a sustainable replacement for dodecane in fisheries microplastics: Molecular docking, dynamics simulation and pharmacophore studies of acetylcholinesterase activity. Relationship between structural properties and biological activity of (-)-menthol and some menthyl esters. Deciphering chondrocyte diversity in diabetic osteoarthritis through single-cell transcriptomics.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1