Systematic evaluation of B-cell clonal family inference approaches.

IF 2.9 4区 医学 Q3 IMMUNOLOGY BMC Immunology Pub Date : 2024-02-08 DOI:10.1186/s12865-024-00600-8
Daria Balashova, Barbera D C van Schaik, Maria Stratigopoulou, Jeroen E J Guikema, Tom G Caniels, Mathieu Claireaux, Marit J van Gils, Anne Musters, Dornatien C Anang, Niek de Vries, Victor Greiff, Antoine H C van Kampen
{"title":"Systematic evaluation of B-cell clonal family inference approaches.","authors":"Daria Balashova, Barbera D C van Schaik, Maria Stratigopoulou, Jeroen E J Guikema, Tom G Caniels, Mathieu Claireaux, Marit J van Gils, Anne Musters, Dornatien C Anang, Niek de Vries, Victor Greiff, Antoine H C van Kampen","doi":"10.1186/s12865-024-00600-8","DOIUrl":null,"url":null,"abstract":"<p><p>The reconstruction of clonal families (CFs) in B-cell receptor (BCR) repertoire analysis is a crucial step to understand the adaptive immune system and how it responds to antigens. The BCR repertoire of an individual is formed throughout life and is diverse due to several factors such as gene recombination and somatic hypermutation. The use of Adaptive Immune Receptor Repertoire sequencing (AIRR-seq) using next generation sequencing enabled the generation of full BCR repertoires that also include rare CFs. The reconstruction of CFs from AIRR-seq data is challenging and several approaches have been developed to solve this problem. Currently, most methods use the heavy chain (HC) only, as it is more variable than the light chain (LC). CF reconstruction options include the definition of appropriate sequence similarity measures, the use of shared mutations among sequences, and the possibility of reconstruction without preliminary clustering based on V- and J-gene annotation. In this study, we aimed to systematically evaluate different approaches for CF reconstruction and to determine their impact on various outcome measures such as the number of CFs derived, the size of the CFs, and the accuracy of the reconstruction. The methods were compared to each other and to a method that groups sequences based on identical junction sequences and another method that only determines subclones. We found that after accounting for data set variability, in particular sequencing depth and mutation load, the reconstruction approach has an impact on part of the outcome measures, including the number of CFs. Simulations indicate that unique junctions and subclones should not be used as substitutes for CF and that more complex methods do not outperform simpler methods. Also, we conclude that different approaches differ in their ability to correctly reconstruct CFs when not considering the LC and to identify shared CFs. The results showed the effect of different approaches on the reconstruction of CFs and highlighted the importance of choosing an appropriate method.</p>","PeriodicalId":9040,"journal":{"name":"BMC Immunology","volume":null,"pages":null},"PeriodicalIF":2.9000,"publicationDate":"2024-02-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11370117/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Immunology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12865-024-00600-8","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"IMMUNOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

The reconstruction of clonal families (CFs) in B-cell receptor (BCR) repertoire analysis is a crucial step to understand the adaptive immune system and how it responds to antigens. The BCR repertoire of an individual is formed throughout life and is diverse due to several factors such as gene recombination and somatic hypermutation. The use of Adaptive Immune Receptor Repertoire sequencing (AIRR-seq) using next generation sequencing enabled the generation of full BCR repertoires that also include rare CFs. The reconstruction of CFs from AIRR-seq data is challenging and several approaches have been developed to solve this problem. Currently, most methods use the heavy chain (HC) only, as it is more variable than the light chain (LC). CF reconstruction options include the definition of appropriate sequence similarity measures, the use of shared mutations among sequences, and the possibility of reconstruction without preliminary clustering based on V- and J-gene annotation. In this study, we aimed to systematically evaluate different approaches for CF reconstruction and to determine their impact on various outcome measures such as the number of CFs derived, the size of the CFs, and the accuracy of the reconstruction. The methods were compared to each other and to a method that groups sequences based on identical junction sequences and another method that only determines subclones. We found that after accounting for data set variability, in particular sequencing depth and mutation load, the reconstruction approach has an impact on part of the outcome measures, including the number of CFs. Simulations indicate that unique junctions and subclones should not be used as substitutes for CF and that more complex methods do not outperform simpler methods. Also, we conclude that different approaches differ in their ability to correctly reconstruct CFs when not considering the LC and to identify shared CFs. The results showed the effect of different approaches on the reconstruction of CFs and highlighted the importance of choosing an appropriate method.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
对 B 细胞克隆家族推断方法进行系统评估。
在 B 细胞受体(BCR)谱系分析中重建克隆家族(CFs)是了解适应性免疫系统及其如何对抗原做出反应的关键步骤。个体的 B 细胞受体谱系是在一生中形成的,并且由于基因重组和体细胞超突变等多种因素而具有多样性。利用下一代测序技术进行的适应性免疫受体汇集测序(AIRR-seq)能够生成完整的 BCR 汇集,其中也包括罕见的 CFs。从 AIRR-seq 数据重建 CFs 具有挑战性,目前已开发出多种方法来解决这一问题。目前,大多数方法只使用重链(HC),因为它比轻链(LC)更易变。CF重建选项包括定义适当的序列相似性度量、使用序列间的共享突变,以及在没有基于V基因和J基因注释的初步聚类的情况下进行重建的可能性。在这项研究中,我们旨在系统地评估不同的 CF 重建方法,并确定它们对各种结果指标的影响,如衍生 CF 的数量、CF 的大小和重建的准确性。我们将这些方法相互进行了比较,并与一种根据相同的连接序列对序列进行分组的方法和另一种仅确定亚克隆的方法进行了比较。我们发现,在考虑了数据集的可变性,特别是测序深度和突变负荷后,重建方法对部分结果指标有影响,包括CF的数量。模拟结果表明,独特的连接点和亚克隆不应该被用来替代CF,更复杂的方法也不会优于更简单的方法。此外,我们还得出结论,不同的方法在不考虑 LC 时正确重建 CF 和识别共享 CF 的能力上存在差异。结果显示了不同方法对重建 CF 的影响,并强调了选择适当方法的重要性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
BMC Immunology
BMC Immunology 医学-免疫学
CiteScore
5.50
自引率
0.00%
发文量
54
审稿时长
1 months
期刊介绍: BMC Immunology is an open access journal publishing original peer-reviewed research articles in molecular, cellular, tissue-level, organismal, functional, and developmental aspects of the immune system as well as clinical studies and animal models of human diseases.
期刊最新文献
From bioinformatics to clinical applications: a novel prognostic model of cuproptosis-related genes based on single-cell RNA sequencing data in hepatocellular carcinoma Revised version with tracked changes oral Magnesium reduces levels of pathogenic autoantibodies and skin disease in murine lupus. Sepsis induced dysfunction of liver type 1 innate lymphoid cells. Antagonist anti-LIF antibody derived from naive human scFv phage library inhibited tumor growth in mice. Pre-treatment plasma retinol binding protein 4 level and its change after treatments predict systemic treatment response in psoriasis patients.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1