首页 > 最新文献

Journal of bioinformatics and systems biology : Open access最新文献

英文 中文
Abstract 243: Identification of novel epitopes of NY-ESO-1 for solid malignancies by Kiromic proprietary search engine Diamond 243:通过Kiromic专有搜索引擎Diamond鉴定NY-ESO-1实体恶性肿瘤的新表位
Pub Date : 2021-07-01 DOI: 10.1158/1538-7445.AM2021-243
L. Piccotti, L. Mirandola, M. Chiriva-Internati
Adoptive cell therapy has been proven a powerful approach for the cure of cancer and other diseases. In particular, the selection of appropriate immunogenic targets has been key to positive outcomes in clinical settings. The availability of RNA-Seq analysis, the accessibility to large data repositories such as TCGA and GTEx, and the creation of new bioinformatic tools have accelerated the process of neoantigen discovery. However, most of the current algorithms are encumbered by the intrinsic complexity of predicting antigen immunogenicity. Diamond™ is a novel artificial intelligence and cognitive machine and deep learning platform to predict peptide processing, HLA binding, and T cell activation. To validate the predictive value of DIAMOND algorithms, the meta-analyses of expression data of cancer-testis antigen New York Esophageal Squamous Cell Carcinoma 1 (NY-ESO-1) and predictions for the immunogenic peptides were compared to experimental data in the literature. In agreement with published clinical observations, DIAMOND metanalysis showed NY-ESO-1 genic overexpression in skin cutaneous melanoma, lung adenocarcinoma, and sarcoma. Moreover, DIAMOND predicted an MHC binding affinity of 0.289 with Supertype A2 for a new NY-ESO-1 peptide, which has been successfully targeted in clinical trials for patients with HLA-A*02:01, as well as it mirrored published data in its prediction of peptide affinity binding in NY-ESO-1–specific MHC II–restricted T cell receptors. Taken together these data support DIAMOND as a reliable platform for the discovery of new immunogenic targets for cancer therapy. Citation Format: Lucia Piccotti, Leonardo Mirandola, Maurizio Chiriva-Internati. Identification of novel epitopes of NY-ESO-1 for solid malignancies by Kiromic proprietary search engine Diamond [abstract]. In: Proceedings of the American Association for Cancer Research Annual Meeting 2021; 2021 Apr 10-15 and May 17-21. Philadelphia (PA): AACR; Cancer Res 2021;81(13_Suppl):Abstract nr 243.
过继细胞疗法已被证明是治疗癌症和其他疾病的有力方法。特别是,选择适当的免疫原性靶点是临床环境中取得积极结果的关键。RNA-Seq分析的可用性、TCGA和GTEx等大型数据库的可访问性以及新的生物信息学工具的创建加速了新抗原发现的过程。然而,目前大多数算法都受到预测抗原免疫原性的固有复杂性的阻碍。Diamond™是一种新型的人工智能、认知机器和深度学习平台,用于预测肽加工、HLA结合和T细胞活化。为了验证DIAMOND算法的预测价值,我们将癌症-睾丸抗原纽约食管鳞状细胞癌1 (NY-ESO-1)的表达数据和免疫原性肽的预测数据与文献中的实验数据进行了比较。与已发表的临床观察一致,DIAMOND meta分析显示NY-ESO-1基因在皮肤黑色素瘤、肺腺癌和肉瘤中过表达。此外,DIAMOND预测了一种新的NY-ESO-1肽与Supertype A2的MHC结合亲和力为0.289,该肽已在HLA-A*02:01患者的临床试验中成功靶向,并且在预测NY-ESO-1特异性MHC ii限制性T细胞受体的肽亲和力结合方面反映了已发表的数据。综上所述,这些数据支持DIAMOND作为发现新的癌症治疗免疫原性靶点的可靠平台。引文格式:Lucia Piccotti, Leonardo Mirandola, Maurizio Chiriva-Internati。通过Kiromic专有搜索引擎Diamond鉴定NY-ESO-1实体恶性肿瘤的新表位[摘要]。见:美国癌症研究协会2021年年会论文集;2021年4月10日至15日和5月17日至21日。费城(PA): AACR;癌症杂志,2021;81(13 -增刊):摘要nr 243。
{"title":"Abstract 243: Identification of novel epitopes of NY-ESO-1 for solid malignancies by Kiromic proprietary search engine Diamond","authors":"L. Piccotti, L. Mirandola, M. Chiriva-Internati","doi":"10.1158/1538-7445.AM2021-243","DOIUrl":"https://doi.org/10.1158/1538-7445.AM2021-243","url":null,"abstract":"Adoptive cell therapy has been proven a powerful approach for the cure of cancer and other diseases. In particular, the selection of appropriate immunogenic targets has been key to positive outcomes in clinical settings. The availability of RNA-Seq analysis, the accessibility to large data repositories such as TCGA and GTEx, and the creation of new bioinformatic tools have accelerated the process of neoantigen discovery. However, most of the current algorithms are encumbered by the intrinsic complexity of predicting antigen immunogenicity. Diamond™ is a novel artificial intelligence and cognitive machine and deep learning platform to predict peptide processing, HLA binding, and T cell activation. To validate the predictive value of DIAMOND algorithms, the meta-analyses of expression data of cancer-testis antigen New York Esophageal Squamous Cell Carcinoma 1 (NY-ESO-1) and predictions for the immunogenic peptides were compared to experimental data in the literature. In agreement with published clinical observations, DIAMOND metanalysis showed NY-ESO-1 genic overexpression in skin cutaneous melanoma, lung adenocarcinoma, and sarcoma. Moreover, DIAMOND predicted an MHC binding affinity of 0.289 with Supertype A2 for a new NY-ESO-1 peptide, which has been successfully targeted in clinical trials for patients with HLA-A*02:01, as well as it mirrored published data in its prediction of peptide affinity binding in NY-ESO-1–specific MHC II–restricted T cell receptors. Taken together these data support DIAMOND as a reliable platform for the discovery of new immunogenic targets for cancer therapy. Citation Format: Lucia Piccotti, Leonardo Mirandola, Maurizio Chiriva-Internati. Identification of novel epitopes of NY-ESO-1 for solid malignancies by Kiromic proprietary search engine Diamond [abstract]. In: Proceedings of the American Association for Cancer Research Annual Meeting 2021; 2021 Apr 10-15 and May 17-21. Philadelphia (PA): AACR; Cancer Res 2021;81(13_Suppl):Abstract nr 243.","PeriodicalId":73617,"journal":{"name":"Journal of bioinformatics and systems biology : Open access","volume":"174 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88031642","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Abstract 165: Enhanced processing of genomic sequencing data for pediatric cancers: GPUs and machine learning techniques for variant detection 165:儿童癌症基因组测序数据的强化处理:gpu和机器学习技术用于变异检测
Pub Date : 2021-07-01 DOI: 10.1158/1538-7445.AM2021-165
E. Crowgey, Pankaj Vats, Karl R. Franke, G. Burnett, Ankit Sethia, T. Harkins, T. Druley
{"title":"Abstract 165: Enhanced processing of genomic sequencing data for pediatric cancers: GPUs and machine learning techniques for variant detection","authors":"E. Crowgey, Pankaj Vats, Karl R. Franke, G. Burnett, Ankit Sethia, T. Harkins, T. Druley","doi":"10.1158/1538-7445.AM2021-165","DOIUrl":"https://doi.org/10.1158/1538-7445.AM2021-165","url":null,"abstract":"","PeriodicalId":73617,"journal":{"name":"Journal of bioinformatics and systems biology : Open access","volume":"17 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"90481225","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Abstract 240: Gene fusion calling from RNA panel sequencing data: An ensemble learning approach 来自RNA面板测序数据的基因融合调用:一种集成学习方法
Pub Date : 2021-07-01 DOI: 10.1158/1538-7445.AM2021-240
Kenneth B. Thomas, Y. Mou, C. Magnan, T. Gyuris, E. Shinbrot, Fernando Díaz, Steven Lau-Rivera, Segun Jung, V. Funari, L. Weiss
Introduction: Our goal is to improve gene fusion detection via RNA sequencing by combining multiple fusion callers through machine learning techniques. Background: Gene Fusion events are important drivers of malignancy. RNA sequencing (RNAseq) methods for detection of fusions have the advantage that multiple markers can be targeted at one time. Unlike DNA methods, in which it is challenging to capture fusion breakpoints, in RNA methods fusions are readily identified through chimeric transcripts. While many fusion calling algorithms exist for use on RNAseq data, sensitive fusion callers, needed for samples of low tumor content, often present high false positive rates - a result of aligning chimeric transcripts. Further, there currently is no single feature in NGS data that can be used to filter out false positive fusion calls. In order to achieve higher accuracy in fusion calls than can be achieved using individual fusion callers, we have weighted and combined the results of multiple fusion callers by systematic and objective means: an ensemble learning approach based on random forest models. Our method selects from data generated by three independent fusion callers supplemented by metrics obtained from in-house methods. It presents a metric that can be immediately interpreted as the probability that a candidate fusion call is a true fusion call. Methods: Random forest models were generated by use of the randomForest package in R, with tuning by the R caret package. Training data sets consisted of a balanced set of 394 fusion calls from clinical samples of solid tumors. For training, fusion calls with at least 10 supporting reads were deemed true or false based on manual review via IGV, and orthogonal methods including PCR with Sanger sequencing and the commercial Archer™ fusion CTL and Sarcoma panels. We present the results of training on data from the three well-known fusion callers Arriba, STAR-Fusion, and FusionCatcher, together with additional data from an in-house developed junction counting method, and fusion membership in a list of known fusions (a “white list”). Models were validated by 10-fold cross-validation. Results: In performance evaluations, false positive and false negative calls were presumed false based on orthogonal determinations. On that basis, our current best model has an accuracy of 94.9% (sensitivity 93.4%, specificity 96.7%). Currently, High Confidence fusion calls (calls with probability score greater than 70%) are the most common positive calls. These have been confirmed with 100% success. Conclusion: We have successfully integrated multiple fusion callers by means of random forest models. Our current model is validated for use on our solid tumor fusion calling pipeline. Citation Format: Kenneth B. Thomas, Yanglong Mou, Christophe Magnan, Tibor Gyuris, Eve Shinbrot, Fernando Lopez Diaz, Steven Lau-Rivera, Segun Jung, Vincent Funari, Lawrence M. Weiss. Gene fusion calling from RNA panel sequencing data: An ensemble lear
我们的目标是通过机器学习技术结合多个融合调用者,通过RNA测序改进基因融合检测。背景:基因融合事件是恶性肿瘤的重要驱动因素。RNA测序(RNAseq)检测融合物的方法具有一次检测多个标记物的优点。与DNA方法不同,在DNA方法中很难捕获融合断点,而RNA方法通过嵌合转录物很容易识别融合。虽然存在许多用于RNAseq数据的融合调用算法,但对于低肿瘤含量的样本来说,敏感的融合调用器通常会出现高假阳性率——这是嵌合转录物排列的结果。此外,目前在NGS数据中没有单一的特征可以用来过滤掉误报融合呼叫。为了获得比使用单个融合调用器更高的融合调用精度,我们通过系统和客观的方法对多个融合调用器的结果进行加权和组合:基于随机森林模型的集成学习方法。我们的方法从三个独立的融合调用程序生成的数据中进行选择,并辅以从内部方法获得的指标。它提出了一个度量,可以立即解释为候选融合调用是真正融合调用的概率。方法:使用R中的randomForest包生成随机森林模型,并使用R插入符号包进行调优。训练数据集包括来自实体瘤临床样本的394个融合呼叫的平衡集。对于训练,基于IGV和正交方法(包括PCR与Sanger测序和商业Archer™融合CTL和Sarcoma面板)的人工审查,具有至少10个支持读数的融合呼叫被认为是正确或错误的。我们介绍了三个著名的融合调用器Arriba、STAR-Fusion和FusionCatcher的数据训练结果,以及来自内部开发的结计数方法的额外数据,以及已知融合列表(“白名单”)中的融合成员。模型采用10倍交叉验证。结果:在绩效评估中,假阳性和假阴性呼叫被假定为基于正交确定的假。在此基础上,我们目前的最佳模型准确率为94.9%(灵敏度93.4%,特异性96.7%)。目前,高置信度融合呼叫(概率得分大于70%)是最常见的正面呼叫。这些已被证实100%成功。结论:我们利用随机森林模型成功地集成了多个融合调用者。我们目前的模型已被验证用于我们的实体肿瘤融合呼叫管道。引用格式:Kenneth B. Thomas, Yanglong Mou, Christophe Magnan, Tibor Gyuris, Eve Shinbrot, Fernando Lopez Diaz, Steven Lau-Rivera, Segun Jung, Vincent Funari, Lawrence M. Weiss来自RNA面板测序数据的基因融合调用:一种集成学习方法[摘要]。见:美国癌症研究协会2021年年会论文集;2021年4月10日至15日和5月17日至21日。费城(PA): AACR;癌症杂志,2021;81(13 -增刊):摘要第240期。
{"title":"Abstract 240: Gene fusion calling from RNA panel sequencing data: An ensemble learning approach","authors":"Kenneth B. Thomas, Y. Mou, C. Magnan, T. Gyuris, E. Shinbrot, Fernando Díaz, Steven Lau-Rivera, Segun Jung, V. Funari, L. Weiss","doi":"10.1158/1538-7445.AM2021-240","DOIUrl":"https://doi.org/10.1158/1538-7445.AM2021-240","url":null,"abstract":"Introduction: Our goal is to improve gene fusion detection via RNA sequencing by combining multiple fusion callers through machine learning techniques. Background: Gene Fusion events are important drivers of malignancy. RNA sequencing (RNAseq) methods for detection of fusions have the advantage that multiple markers can be targeted at one time. Unlike DNA methods, in which it is challenging to capture fusion breakpoints, in RNA methods fusions are readily identified through chimeric transcripts. While many fusion calling algorithms exist for use on RNAseq data, sensitive fusion callers, needed for samples of low tumor content, often present high false positive rates - a result of aligning chimeric transcripts. Further, there currently is no single feature in NGS data that can be used to filter out false positive fusion calls. In order to achieve higher accuracy in fusion calls than can be achieved using individual fusion callers, we have weighted and combined the results of multiple fusion callers by systematic and objective means: an ensemble learning approach based on random forest models. Our method selects from data generated by three independent fusion callers supplemented by metrics obtained from in-house methods. It presents a metric that can be immediately interpreted as the probability that a candidate fusion call is a true fusion call. Methods: Random forest models were generated by use of the randomForest package in R, with tuning by the R caret package. Training data sets consisted of a balanced set of 394 fusion calls from clinical samples of solid tumors. For training, fusion calls with at least 10 supporting reads were deemed true or false based on manual review via IGV, and orthogonal methods including PCR with Sanger sequencing and the commercial Archer™ fusion CTL and Sarcoma panels. We present the results of training on data from the three well-known fusion callers Arriba, STAR-Fusion, and FusionCatcher, together with additional data from an in-house developed junction counting method, and fusion membership in a list of known fusions (a “white list”). Models were validated by 10-fold cross-validation. Results: In performance evaluations, false positive and false negative calls were presumed false based on orthogonal determinations. On that basis, our current best model has an accuracy of 94.9% (sensitivity 93.4%, specificity 96.7%). Currently, High Confidence fusion calls (calls with probability score greater than 70%) are the most common positive calls. These have been confirmed with 100% success. Conclusion: We have successfully integrated multiple fusion callers by means of random forest models. Our current model is validated for use on our solid tumor fusion calling pipeline. Citation Format: Kenneth B. Thomas, Yanglong Mou, Christophe Magnan, Tibor Gyuris, Eve Shinbrot, Fernando Lopez Diaz, Steven Lau-Rivera, Segun Jung, Vincent Funari, Lawrence M. Weiss. Gene fusion calling from RNA panel sequencing data: An ensemble lear","PeriodicalId":73617,"journal":{"name":"Journal of bioinformatics and systems biology : Open access","volume":"24 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78213570","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Abstract 183: End-to-end training of convolutional network for breast cancer detection in two-view mammography 183:卷积网络的端到端训练在双视图乳房x光检查中的乳腺癌检测
Pub Date : 2021-07-01 DOI: 10.1158/1538-7445.AM2021-183
D. Petrini, C. Shimizu, G. Valente, Guilherme Folgueira, Guilherme Apolinario Silva Novaes, M. H. Katayama, P. Serio, R. A. Roela, T. Tucunduva, M. A. K. Folgueira, Hae Yong Kim
Background:Early computer-aided detection systems for mammography have failed to improve the performance of radiologists. With the remarkable success of deep learning, some recent studies have described computer systems with similar or even superior performance to that of human experts. Among them, Shen et al. (Nature Sci. Rep., 2019) present a promising “end-to-end” training approach. Instead of training a convolutional net with whole mammograms, they first train a “patch classifier” that recognizes lesions in small subimages. Then, they generalize the patch classifier to “whole image classifier” using the property of fully convolutional networks and the end-to-end approach. Using this strategy, the authors have obtained a per-image AUC of 0.87 [0.84, 0.90] in the CBIS-DDSM dataset. Standard mammography consists of two views for each breast: bilateral craniocaudal (CC) and mediolateral oblique (MLO). The algorithm proposed by Shen et al. processes only single-view mammography. We extend their work, presenting the end-to-end training of convolutional net for two-view mammography. Methods:First, we reproduced Shen et al.9s work, using the CBIS-DDSM dataset. We trained a ResNet50-based net for classifying patches with 224x224 pixels using segmented lesions. Then, the weights of the patch classifier were transferred to the whole image single-view classifier, obtained by removing the dense layers from the patch classifier and stacking one ResNet block at the top. This single-view classifier was trained using full images from the same dataset. Trying to replicate Shen et al.9s work, we obtained an AUC of 0.8524±0.0560, less than 0.87 reported in the original paper. We attribute this worsening to the fact that we are using only 2260 images with two views, instead of 2478 images from the original work. Finally, we built the two-view classifier that receives CC and MLO views as input. This classifier has inside two copies of the patch classifier, loaded with the weights from the single-view classifier. The features extracted by the two patch classifiers are concatenated and submitted to the ResNet block. The two-view classifier is end-to-end trained using full images, refining all its weights, including those inside the two patch classifiers. Results:The two-view classifier yielded an AUC of 0.9199±0.0623 in 5-fold cross-validation to classify mammographies into malignant/non-malignant, using single-model and without test-time data augmentation. This is better than the Shen et al.9s AUC (0.87), our single-view AUC (0.85). Zhang et al. (Plos One, 2020) present another two-view algorithm (without end-to-end training) with AUC of 0.95. However, this work cannot directly be compared with ours, as it was tested on a different set of images. Conclusions:We presented end-to-end training of convolutional net for two-view mammography. Our system9s AUC was 0.92, better than the 0.87 obtained by the previous single-view system. Citation Format: Daniel G. Petrini, C
背景:早期乳腺x线摄影的计算机辅助检测系统未能提高放射科医生的工作水平。随着深度学习的显著成功,最近的一些研究已经描述了与人类专家相似甚至优于人类专家的计算机系统。其中,Shen等(Nature Sci.;Rep., 2019)提出了一种有前途的“端到端”培训方法。他们不是用整个乳房x光照片训练卷积网络,而是首先训练一个“补丁分类器”,在小的子图像中识别病变。然后,他们利用全卷积网络的特性和端到端方法将patch分类器推广到“整幅图像分类器”。使用这种策略,作者在CBIS-DDSM数据集中获得了0.87[0.84,0.90]的单幅图像AUC。标准乳房x线照相术包括每个乳房的两个视图:双侧颅侧(CC)和中外侧斜位(MLO)。Shen等人提出的算法只处理单视图乳房x线检查。我们扩展了他们的工作,提出了卷积网络对双视图乳房x线检查的端到端训练。方法:首先,我们使用CBIS-DDSM数据集复制了Shen等人的工作。我们训练了一个基于resnet50的网络,用于使用分割的病灶对224x224像素的斑块进行分类。然后,将patch分类器的权重转移到整个图像的单视图分类器中,该分类器通过去除patch分类器中的密集层并在顶部堆叠一个ResNet块来获得。这个单视图分类器使用来自同一数据集的完整图像进行训练。我们试图复制Shen等人的工作,得到的AUC为0.8524±0.0560,小于原论文报道的0.87。我们将这种恶化归因于我们只使用了2260张带有两个视图的图像,而不是原始作品中的2478张图像。最后,我们构建了接收CC和MLO视图作为输入的双视图分类器。这个分类器有两个补丁分类器的副本,加载了来自单视图分类器的权重。两个补丁分类器提取的特征被连接并提交给ResNet块。双视图分类器使用完整图像进行端到端训练,精炼其所有权重,包括两个补丁分类器内的权重。结果:双视图分类器在单模型、无测试时间数据增强的情况下,5次交叉验证的AUC为0.9199±0.0623。这优于Shen等人的AUC(0.87)和我们的单视图AUC(0.85)。Zhang等人(Plos One, 2020)提出了另一种双视图算法(没有端到端训练),AUC为0.95。但是,这项工作不能直接与我们的工作进行比较,因为它是在不同的图像集上进行测试的。结论:我们提出了用于双视图乳房x线摄影的卷积网络的端到端训练。该系统的AUC为0.92,优于以往单视图系统的0.87。引用格式:Daniel G. Petrini, Carlos Shimizu, Gabriel V. Valente, Guilherme Folgueira, Guilherme A. Novaes, Maria L. Katayama, Pedro Serio, Rosimeire A. Roela, Tatiana C. Tucunduva, Maria aprecida A. Folgueira, Hae Y. Kim。基于卷积网络的乳腺癌双视图乳房x光检查端到端训练[摘要]。见:美国癌症研究协会2021年年会论文集;2021年4月10日至15日和5月17日至21日。费城(PA): AACR;癌症杂志,2021;81(13 -增刊):摘要第183期。
{"title":"Abstract 183: End-to-end training of convolutional network for breast cancer detection in two-view mammography","authors":"D. Petrini, C. Shimizu, G. Valente, Guilherme Folgueira, Guilherme Apolinario Silva Novaes, M. H. Katayama, P. Serio, R. A. Roela, T. Tucunduva, M. A. K. Folgueira, Hae Yong Kim","doi":"10.1158/1538-7445.AM2021-183","DOIUrl":"https://doi.org/10.1158/1538-7445.AM2021-183","url":null,"abstract":"Background:Early computer-aided detection systems for mammography have failed to improve the performance of radiologists. With the remarkable success of deep learning, some recent studies have described computer systems with similar or even superior performance to that of human experts. Among them, Shen et al. (Nature Sci. Rep., 2019) present a promising “end-to-end” training approach. Instead of training a convolutional net with whole mammograms, they first train a “patch classifier” that recognizes lesions in small subimages. Then, they generalize the patch classifier to “whole image classifier” using the property of fully convolutional networks and the end-to-end approach. Using this strategy, the authors have obtained a per-image AUC of 0.87 [0.84, 0.90] in the CBIS-DDSM dataset. Standard mammography consists of two views for each breast: bilateral craniocaudal (CC) and mediolateral oblique (MLO). The algorithm proposed by Shen et al. processes only single-view mammography. We extend their work, presenting the end-to-end training of convolutional net for two-view mammography. Methods:First, we reproduced Shen et al.9s work, using the CBIS-DDSM dataset. We trained a ResNet50-based net for classifying patches with 224x224 pixels using segmented lesions. Then, the weights of the patch classifier were transferred to the whole image single-view classifier, obtained by removing the dense layers from the patch classifier and stacking one ResNet block at the top. This single-view classifier was trained using full images from the same dataset. Trying to replicate Shen et al.9s work, we obtained an AUC of 0.8524±0.0560, less than 0.87 reported in the original paper. We attribute this worsening to the fact that we are using only 2260 images with two views, instead of 2478 images from the original work. Finally, we built the two-view classifier that receives CC and MLO views as input. This classifier has inside two copies of the patch classifier, loaded with the weights from the single-view classifier. The features extracted by the two patch classifiers are concatenated and submitted to the ResNet block. The two-view classifier is end-to-end trained using full images, refining all its weights, including those inside the two patch classifiers. Results:The two-view classifier yielded an AUC of 0.9199±0.0623 in 5-fold cross-validation to classify mammographies into malignant/non-malignant, using single-model and without test-time data augmentation. This is better than the Shen et al.9s AUC (0.87), our single-view AUC (0.85). Zhang et al. (Plos One, 2020) present another two-view algorithm (without end-to-end training) with AUC of 0.95. However, this work cannot directly be compared with ours, as it was tested on a different set of images. Conclusions:We presented end-to-end training of convolutional net for two-view mammography. Our system9s AUC was 0.92, better than the 0.87 obtained by the previous single-view system. Citation Format: Daniel G. Petrini, C","PeriodicalId":73617,"journal":{"name":"Journal of bioinformatics and systems biology : Open access","volume":"57 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76684620","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Abstract 153: Development of a workflow to handle the quality control and analysis of Olink protein biomarker data in early phase oncology clinical trials 153:在早期肿瘤临床试验中,开发一种处理Olink蛋白生物标志物数据质量控制和分析的工作流程
Pub Date : 2021-07-01 DOI: 10.1158/1538-7445.AM2021-153
Claire J. Guo, Mary Saltarelli, S. Lambert, H. Fang, Chun Zhang
{"title":"Abstract 153: Development of a workflow to handle the quality control and analysis of Olink protein biomarker data in early phase oncology clinical trials","authors":"Claire J. Guo, Mary Saltarelli, S. Lambert, H. Fang, Chun Zhang","doi":"10.1158/1538-7445.AM2021-153","DOIUrl":"https://doi.org/10.1158/1538-7445.AM2021-153","url":null,"abstract":"","PeriodicalId":73617,"journal":{"name":"Journal of bioinformatics and systems biology : Open access","volume":"29 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73607351","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Abstract 210: Advancing knowledgebase representation of pediatric cancer variants through ClinGen/CIViC collaboration 210:通过ClinGen/CIViC合作推进儿科癌症变异的知识库表示
Pub Date : 2021-07-01 DOI: 10.1158/1538-7445.AM2021-210
Arpad M. Danos, Wan-Hsin Lin, J. Saliba, Angshumoy Roy, A. Church, Shruti Rao, D. Ritter, Kilannin Krysiak, A. Wagner, Erica K. Barnell, Lana M. Sheta, Adam C. Coffman, S. Kiwala, Joshua F. McMichael, L. Corson, Kevin E. Fisher, H. Williams, Matthew C. Hiemenz, K. Janeway, J. Ji, Kesserwan A. Chimene, L. Fuqua, L. Dyer, Huiling Xu, Jeffrey Jean, L. Satgunaseelan, Liying Zhang, T. Laetsch, D. Parsons, Ryan J. Schmidt, L. Schriml, K. Sund, S. Kulkarni, Subha Madhavan, Xinjie Xu, R. Kanagal-Shamana, M. Harris, Y. Akkari, Nurit Paz Yacov, P. Terraf, M. Griffith, O. Griffith, G. Raca
Childhood cancers are driven by unique profiles of somatic genetic alterations, with a significant contribution from predisposing germline variants. Understanding the genomic landscape of pediatric cancers is complicated by their rarity, the heterogeneity of variation within a given disease, and the complex forms of structural variation they contain. Variants in childhood disease may differ from those in adult versions of the same cancer type, or may have different clinical significance. Currently, pediatric variants are underrepresented in cancer variant databases, and an urgent need exists for their publicly available expert curation. To address this, the Pediatric Cancer Taskforce (PCT) was formed within the Clinical Genome Resource (ClinGen) Somatic Cancer Clinical Domain Working Group (CDWG) (https://www.clinicalgenome.org/working-groups/somatic/). The PCT is a multi-institutional group of 39 members with broad experience in childhood cancer and variant curation, whose work consists of standardization and classification of genetic variants in pediatric cancers. The CIViC knowledgebase (www.civicdb.org) is a freely available resource for Clinical Interpretation of Variants in Cancer, which leverages public curation and expert moderation to address the problem of annotating the large volume of clinically actionable cancer variants. PCT curators work together with PCT expert members and the CIViC team on variant curation, and have submitted over 230 Evidence Items and over 10 Assertions to CIViC. To further address issues specific to pediatric curation, the PCT is working with CIViC to develop new pediatric-specific CIViC features and modifications of the data model that will aid in pediatric curation. A pediatric user interface, as well as representation of large scale structural and copy number variation are being developed for version two of CIViC, expected to be released in 1-2 years, which will enable curation of a new class of structural variants often encountered in pediatric cancer. A novel standard operating procedure for childhood cancer curation in CIViC is being developed by PCT experts, curators and the CIViC team. This SOP will cover topics including curation of structural variants, as well as pediatric-specific variant tiering guidelines which take into account the sparse nature of evidence in pediatric cases. A companion resource, CIViCmine (http://bionlp.bcgsc.ca/civicmine/), will be further developed to incorporate pediatric data. These and other joint efforts of the PCT and CIViC will significantly enhance pediatric variant representation for public use, to support the care of children with cancer. Citation Format: Arpad Danos, Wan-Hsin Lin, Jason Saliba, Angshumoy Roy, Alanna J. Church, Shruti Rao, Deborah Ritter, Kilannin Krysiak, Alex Wagner, Erica Barnell, Lana Sheta, Adam Coffman, Susanna Kiwala, Joshua F. McMichael, Laura Corson, Kevin Fisher, Heather E. Williams, Matthew Hiemenz, Katherine A. Janeway, Jianling Ji, Kess
儿童癌症是由体细胞遗传改变的独特特征驱动的,其中重要的贡献来自易感的种系变异。了解儿童癌症的基因组景观是复杂的,因为它们的罕见性,特定疾病内变异的异质性,以及它们所包含的复杂形式的结构变异。儿童疾病的变异可能与相同癌症类型的成人版本不同,或者可能具有不同的临床意义。目前,儿科癌症变体在癌症变体数据库中的代表性不足,迫切需要对其进行公开的专家管理。为了解决这个问题,临床基因组资源(ClinGen)体细胞癌临床领域工作组(CDWG) (https://www.clinicalgenome.org/working-groups/somatic/)成立了儿科癌症工作组(PCT)。PCT是一个由39名成员组成的多机构小组,在儿童癌症和变异管理方面具有广泛的经验,其工作包括儿童癌症遗传变异的标准化和分类。CIViC知识库(www.civicdb.org)是癌症变异临床解释的免费资源,它利用公共管理和专家审核来解决注释大量临床可操作的癌症变异的问题。PCT策展人与PCT专家成员和思域团队一起进行变体策展,并向思域提交了230多条证据项和10多条断言。为了进一步解决儿科护理的具体问题,PCT正在与CIViC合作开发新的儿科专用CIViC功能,并修改数据模型,以帮助儿科护理。CIViC的第二版正在开发儿科用户界面,以及大规模结构和拷贝数变异的表示,预计将在1-2年内发布,这将使儿科癌症中经常遇到的一类新的结构变异得以管理。PCT专家、策展人和CIViC团队正在为CIViC的儿童癌症策展制定一个新的标准操作程序。本SOP将涵盖的主题包括结构变异的管理,以及考虑到儿科病例证据稀疏性的儿科特异性变异分级指南。将进一步开发配套资源CIViCmine (http://bionlp.bcgsc.ca/civicmine/),以纳入儿科数据。PCT和CIViC的这些和其他共同努力将显著提高儿科变体的公共使用代表性,以支持癌症儿童的护理。引文格式:Arpad Danos, Lin Wan-Hsin, Jason Saliba, Angshumoy Roy, Alanna J. Church, Shruti Rao, Deborah Ritter, Kilannin Krysiak, Alex Wagner, Erica Barnell, Lana Sheta, Adam Coffman, Susanna Kiwala, Joshua F. McMichael, Laura Corson, Kevin Fisher, Heather E. Williams, Matthew Hiemenz, Katherine A. Janeway, Jianling Ji, Kesserwan A. Chimene, Laura Fuqua, Lisa Dyer,许惠玲,Jeffrey Jean, Laveniya Satgunaseelan, Liying Zhang, Ted W. Laetsch, Donald W. Parsons, Ryan Schmidt, Lynn M. Schriml,Kristen L. Sund, Shashikant Kulkarni, Subha Madhavan, Xinjie Xu, Rashmi Kanagal-Shamana, Marian Harris, Yasmine Akkari, Nurit Paz Yacov, Panieh Terraf, Malachi Griffith, Obi L. Griffith, Gordana Raca。通过ClinGen/CIViC合作推进儿科癌症变异的知识库表示[摘要]。见:美国癌症研究协会2021年年会论文集;2021年4月10日至15日和5月17日至21日。费城(PA): AACR;癌症杂志,2021;81(13 -增刊):摘要第210期。
{"title":"Abstract 210: Advancing knowledgebase representation of pediatric cancer variants through ClinGen/CIViC collaboration","authors":"Arpad M. Danos, Wan-Hsin Lin, J. Saliba, Angshumoy Roy, A. Church, Shruti Rao, D. Ritter, Kilannin Krysiak, A. Wagner, Erica K. Barnell, Lana M. Sheta, Adam C. Coffman, S. Kiwala, Joshua F. McMichael, L. Corson, Kevin E. Fisher, H. Williams, Matthew C. Hiemenz, K. Janeway, J. Ji, Kesserwan A. Chimene, L. Fuqua, L. Dyer, Huiling Xu, Jeffrey Jean, L. Satgunaseelan, Liying Zhang, T. Laetsch, D. Parsons, Ryan J. Schmidt, L. Schriml, K. Sund, S. Kulkarni, Subha Madhavan, Xinjie Xu, R. Kanagal-Shamana, M. Harris, Y. Akkari, Nurit Paz Yacov, P. Terraf, M. Griffith, O. Griffith, G. Raca","doi":"10.1158/1538-7445.AM2021-210","DOIUrl":"https://doi.org/10.1158/1538-7445.AM2021-210","url":null,"abstract":"Childhood cancers are driven by unique profiles of somatic genetic alterations, with a significant contribution from predisposing germline variants. Understanding the genomic landscape of pediatric cancers is complicated by their rarity, the heterogeneity of variation within a given disease, and the complex forms of structural variation they contain. Variants in childhood disease may differ from those in adult versions of the same cancer type, or may have different clinical significance. Currently, pediatric variants are underrepresented in cancer variant databases, and an urgent need exists for their publicly available expert curation. To address this, the Pediatric Cancer Taskforce (PCT) was formed within the Clinical Genome Resource (ClinGen) Somatic Cancer Clinical Domain Working Group (CDWG) (https://www.clinicalgenome.org/working-groups/somatic/). The PCT is a multi-institutional group of 39 members with broad experience in childhood cancer and variant curation, whose work consists of standardization and classification of genetic variants in pediatric cancers. The CIViC knowledgebase (www.civicdb.org) is a freely available resource for Clinical Interpretation of Variants in Cancer, which leverages public curation and expert moderation to address the problem of annotating the large volume of clinically actionable cancer variants. PCT curators work together with PCT expert members and the CIViC team on variant curation, and have submitted over 230 Evidence Items and over 10 Assertions to CIViC. To further address issues specific to pediatric curation, the PCT is working with CIViC to develop new pediatric-specific CIViC features and modifications of the data model that will aid in pediatric curation. A pediatric user interface, as well as representation of large scale structural and copy number variation are being developed for version two of CIViC, expected to be released in 1-2 years, which will enable curation of a new class of structural variants often encountered in pediatric cancer. A novel standard operating procedure for childhood cancer curation in CIViC is being developed by PCT experts, curators and the CIViC team. This SOP will cover topics including curation of structural variants, as well as pediatric-specific variant tiering guidelines which take into account the sparse nature of evidence in pediatric cases. A companion resource, CIViCmine (http://bionlp.bcgsc.ca/civicmine/), will be further developed to incorporate pediatric data. These and other joint efforts of the PCT and CIViC will significantly enhance pediatric variant representation for public use, to support the care of children with cancer. Citation Format: Arpad Danos, Wan-Hsin Lin, Jason Saliba, Angshumoy Roy, Alanna J. Church, Shruti Rao, Deborah Ritter, Kilannin Krysiak, Alex Wagner, Erica Barnell, Lana Sheta, Adam Coffman, Susanna Kiwala, Joshua F. McMichael, Laura Corson, Kevin Fisher, Heather E. Williams, Matthew Hiemenz, Katherine A. Janeway, Jianling Ji, Kess","PeriodicalId":73617,"journal":{"name":"Journal of bioinformatics and systems biology : Open access","volume":"23 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84487140","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Abstract 225: Computational analysis of 5-fluorouracil antitumor activity in colon cancer using a mechanistic pharmacokinetic/pharmacodynamic model 基于机制药代动力学/药效学模型的5-氟尿嘧啶结肠癌抗肿瘤活性计算分析
Pub Date : 2021-07-01 DOI: 10.1158/1538-7445.AM2021-225
Chenhui Ma, A. Almasan, Evren Gurkan-Cavusoglu
{"title":"Abstract 225: Computational analysis of 5-fluorouracil antitumor activity in colon cancer using a mechanistic pharmacokinetic/pharmacodynamic model","authors":"Chenhui Ma, A. Almasan, Evren Gurkan-Cavusoglu","doi":"10.1158/1538-7445.AM2021-225","DOIUrl":"https://doi.org/10.1158/1538-7445.AM2021-225","url":null,"abstract":"","PeriodicalId":73617,"journal":{"name":"Journal of bioinformatics and systems biology : Open access","volume":"17 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"72859961","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Abstract 220: Identifying novel oncology targets and positioning existing targets through the prediction of cancer dependencies 摘要220:通过预测肿瘤依赖性来识别新的肿瘤靶点和定位现有靶点
Pub Date : 2021-07-01 DOI: 10.1158/1538-7445.AM2021-220
M. Parikh, O. Elemento, Neel S. Madhukar, Coryandar Gilvary
{"title":"Abstract 220: Identifying novel oncology targets and positioning existing targets through the prediction of cancer dependencies","authors":"M. Parikh, O. Elemento, Neel S. Madhukar, Coryandar Gilvary","doi":"10.1158/1538-7445.AM2021-220","DOIUrl":"https://doi.org/10.1158/1538-7445.AM2021-220","url":null,"abstract":"","PeriodicalId":73617,"journal":{"name":"Journal of bioinformatics and systems biology : Open access","volume":"26 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83278925","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Abstract 207: The cBioPortal for Cancer Genomics 摘要207:癌症基因组学的基因门户网站
Pub Date : 2021-07-01 DOI: 10.1158/1538-7445.am2021-207
Jianjiong Gao, T. Mazor, Ino de Bruijn, Adam Abeshouse, Diana Baiceanu, Ziya Erkoç, Benjamin E. Gross, David M Higgins, P. Jagannathan, Karthik Kalletla, P. Kumari, Ritika Kundra, Xiang Li, James Lindsay, Aaron Lisman, Pieter Lukasse, Divya Madala, Ramyasree Madupuri, Angelica Ochoa, Oleguer Plantalech, Joyce Quach, Sander Y. A. Rodenburg, Anusha Satravada, F. Schaeffer, R. Sheridan, Lucas Sikina, S. O. Sumer, Yichao Sun, P. van Dijk, P. van Nierop, Avery Wang, Manda Wilson, Hongxin Zhang, Gaofei Zhao, Sjoerd van Hagen, K. van Bochove, U. Dogrusoz, Allison P. Heath, A. Resnick, Trevor J Pugh, C. Sander, E. Cerami, N. Schultz
207: The cBioPortal for Cancer Genomics   Author & Article Information Cancer Res (2021) 81 (13_Supplement): 207. https://doi.org/10.1158/1538-7445.AM2021-207
2007:癌症基因组学的cbiopportal作者和文章信息癌症杂志(2021)81 (13_Supplement): 207。https://doi.org/10.1158/1538-7445.AM2021-207
{"title":"Abstract 207: The cBioPortal for Cancer Genomics","authors":"Jianjiong Gao, T. Mazor, Ino de Bruijn, Adam Abeshouse, Diana Baiceanu, Ziya Erkoç, Benjamin E. Gross, David M Higgins, P. Jagannathan, Karthik Kalletla, P. Kumari, Ritika Kundra, Xiang Li, James Lindsay, Aaron Lisman, Pieter Lukasse, Divya Madala, Ramyasree Madupuri, Angelica Ochoa, Oleguer Plantalech, Joyce Quach, Sander Y. A. Rodenburg, Anusha Satravada, F. Schaeffer, R. Sheridan, Lucas Sikina, S. O. Sumer, Yichao Sun, P. van Dijk, P. van Nierop, Avery Wang, Manda Wilson, Hongxin Zhang, Gaofei Zhao, Sjoerd van Hagen, K. van Bochove, U. Dogrusoz, Allison P. Heath, A. Resnick, Trevor J Pugh, C. Sander, E. Cerami, N. Schultz","doi":"10.1158/1538-7445.am2021-207","DOIUrl":"https://doi.org/10.1158/1538-7445.am2021-207","url":null,"abstract":"207: The cBioPortal for Cancer Genomics   Author & Article Information Cancer Res (2021) 81 (13_Supplement): 207. https://doi.org/10.1158/1538-7445.AM2021-207","PeriodicalId":73617,"journal":{"name":"Journal of bioinformatics and systems biology : Open access","volume":"14 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84333756","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Abstract LB019: Trisicell: Scalable Tumor Phylogeny Reconstruction and Validation Reveals Developmental Origin and Therapeutic Impact of Intratumoral Heterogeneity 摘要LB019:三细胞:可扩展肿瘤系统发育重建和验证揭示肿瘤内异质性的发育起源和治疗影响
Pub Date : 2021-07-01 DOI: 10.1158/1538-7445.AM2021-LB019
F. Mehrabadi, S. Malikić, Kerrie L. Marie, Eva Pérez-Guijarro, Erfan Sadeqi Azer, Howard H. Yang, Can Kızılkale, Charli Gruen, Huaitian Liu, C. Marcelus, A. Buluç, Funda Ergün, M. Lee, G. Merlino, Chi-Ping Day, S. C. Sahinalp
Emerging sets of single-cell sequencing data makes it appealing to apply existing tumor phylogeny reconstruction methods to analyze associated intratumor heterogeneity. Unfortunately, tumor phylogeny inference is an NP-hard problem and existing principled methods typically fail to scale up to handle thousands of cells and mutations observed in emerging single-cell data sets. Even though there are greedy heuristics to build hierarchical clustering of cells and mutations, they suffer from well-documented issues in accuracy. Additionally even when “optimal” solutions are feasible, existing approaches only provide a single “most likely” tree to depict the evolutionary processes that may result in an observed collection of cells and mutations. To make matters worse, the vast majority of single-cell sequencing data sets are transcriptomic and as a result, suffer from considerable variation in coverage across mutational loci. In this paper, we introduce Trisicell, a computational toolkit for scalable tumor phylogeny reconstruction and validation from single-cell genomic, exomic or transcriptomic sequencing data. Trisicell has three components: (i) Trisicell-DnC, a new tumor phylogeny reconstruction method from genotype matrices derived from single-cell data, (ii) Trisicell-ConT a new algorithm for constructing the consensus for two or more tumor phylogenies - which may be built through the use of different data types on the same set of cells, or built through the use of different methods on the same data, and (iii) Trisicell-PF, a new partition function method for assessing the likelihood of any user-defined subtree/set of cells to be seeded by a given set of mutations in the phylogeny. Collectively, these tools provide means of identifying and validating robust portions of a tumor phylogeny, offering the ability to focus on the most important (sub)clones and the genomic alterations that seed the associated clonal expansion. We applied Trisicell to a panel of clonal sublines derived from single-cells of a parental mouse melanoma model on which we performed both whole exome and whole transcriptome sequencing. The tumor phylogenies of the clonal sublines built on exomic and transcriptomic mutations by Trisicell-DnC, were shown by Trisicell-ConT to be highly similar and the subtrees comprised of phenotypically similar clonal sublines were shown to be strongly associated by Trisicell-PF to their seeding mutations. In addition, we applied Trisicell to single-cell whole transcriptome sequencing data from a tumor derived from the same parental melanoma cell line, which was subjected to anti-CTLA-4 immunotherapy. The phylogenies generated from both studies featured distinct subtrees, strongly associated with phenotypes including cell differentiation status, tumor growth and therapeutic response. These results suggest that Trisicell can be used for scalable tumor phylogeny reconstruction and validation through both single-cell and clonal-subline sequencing data,
新出现的单细胞测序数据集使得应用现有的肿瘤系统发育重建方法来分析相关的肿瘤内异质性具有吸引力。不幸的是,肿瘤系统发育推断是一个np难题,现有的原则方法通常无法扩展到处理新兴单细胞数据集中观察到的数千个细胞和突变。尽管存在贪婪的启发式方法来构建细胞和突变的分层聚类,但它们在准确性方面存在众所周知的问题。此外,即使“最优”解决方案是可行的,现有的方法也只能提供一个单一的“最可能”树来描述可能导致观察到的细胞集合和突变的进化过程。更糟糕的是,绝大多数单细胞测序数据集都是转录组的,因此,在突变位点的覆盖范围上存在相当大的差异。在本文中,我们介绍了Trisicell,这是一个计算工具包,用于从单细胞基因组,外显子组或转录组测序数据进行可扩展的肿瘤系统发育重建和验证。Trisicell有三个组成部分:(i) Trisicell-DnC,一种新的基于单细胞数据的基因型矩阵的肿瘤系统发育重建方法,(ii) Trisicell-ConT,一种用于构建两个或多个肿瘤系统发育共识的新算法,可以通过在同一组细胞上使用不同的数据类型来构建,或者通过在同一数据上使用不同的方法来构建,以及(iii) Trisicell-PF,一种新的配分函数方法,用于评估任何用户定义的子树/细胞集被系统发育中给定的一组突变所播种的可能性。总的来说,这些工具提供了识别和验证肿瘤系统发育稳健部分的方法,提供了关注最重要(亚)克隆和为相关克隆扩增提供种子的基因组改变的能力。我们将Trisicell应用于来自亲代小鼠黑色素瘤模型单细胞的克隆亚系,并对其进行了全外显子组和全转录组测序。由Trisicell-DnC构建的外显组和转录组突变的克隆亚系的肿瘤系统发育与Trisicell-ConT显示高度相似,由表型相似的克隆亚系组成的亚树与Trisicell-PF显示的种子突变密切相关。此外,我们将Trisicell应用于来自同一亲本黑色素瘤细胞系的肿瘤的单细胞全转录组测序数据,该肿瘤接受抗ctla -4免疫治疗。两项研究产生的系统发育特征不同的亚树,与表型密切相关,包括细胞分化状态、肿瘤生长和治疗反应。这些结果表明,通过单细胞和克隆亚系测序数据,Trisicell可以用于可扩展的肿瘤系统发育重建和验证,这可能揭示出强烈的表型关联。特别是,他们认为黑色素瘤的发育状态和表型瘤内异质性源于可观察到的亚克隆变异。引文格式:Farid Rashidi Mehrabadi, Salem Malikic, Kerrie L. Marie, Eva perez - gujarro, Erfan Sadeqi Azer, Howard H. Yang, Can Kizilkale, Charli Gruen,刘怀天,Christina Marcelus, Aydin Buluc, Funda Ergun, Maxwell P. Lee, Glenn Merlino, Chi-Ping Day, S. Cenk Sahinalp。三细胞:可扩展的肿瘤系统发育重建和验证揭示了肿瘤内异质性的发育起源和治疗影响[摘要]。见:美国癌症研究协会2021年年会论文集;2021年4月10日至15日和5月17日至21日。费城(PA): AACR;癌症杂志,2021;81(13 -增刊):摘要nr LB019。
{"title":"Abstract LB019: Trisicell: Scalable Tumor Phylogeny Reconstruction and Validation Reveals Developmental Origin and Therapeutic Impact of Intratumoral Heterogeneity","authors":"F. Mehrabadi, S. Malikić, Kerrie L. Marie, Eva Pérez-Guijarro, Erfan Sadeqi Azer, Howard H. Yang, Can Kızılkale, Charli Gruen, Huaitian Liu, C. Marcelus, A. Buluç, Funda Ergün, M. Lee, G. Merlino, Chi-Ping Day, S. C. Sahinalp","doi":"10.1158/1538-7445.AM2021-LB019","DOIUrl":"https://doi.org/10.1158/1538-7445.AM2021-LB019","url":null,"abstract":"Emerging sets of single-cell sequencing data makes it appealing to apply existing tumor phylogeny reconstruction methods to analyze associated intratumor heterogeneity. Unfortunately, tumor phylogeny inference is an NP-hard problem and existing principled methods typically fail to scale up to handle thousands of cells and mutations observed in emerging single-cell data sets. Even though there are greedy heuristics to build hierarchical clustering of cells and mutations, they suffer from well-documented issues in accuracy. Additionally even when “optimal” solutions are feasible, existing approaches only provide a single “most likely” tree to depict the evolutionary processes that may result in an observed collection of cells and mutations. To make matters worse, the vast majority of single-cell sequencing data sets are transcriptomic and as a result, suffer from considerable variation in coverage across mutational loci. In this paper, we introduce Trisicell, a computational toolkit for scalable tumor phylogeny reconstruction and validation from single-cell genomic, exomic or transcriptomic sequencing data. Trisicell has three components: (i) Trisicell-DnC, a new tumor phylogeny reconstruction method from genotype matrices derived from single-cell data, (ii) Trisicell-ConT a new algorithm for constructing the consensus for two or more tumor phylogenies - which may be built through the use of different data types on the same set of cells, or built through the use of different methods on the same data, and (iii) Trisicell-PF, a new partition function method for assessing the likelihood of any user-defined subtree/set of cells to be seeded by a given set of mutations in the phylogeny. Collectively, these tools provide means of identifying and validating robust portions of a tumor phylogeny, offering the ability to focus on the most important (sub)clones and the genomic alterations that seed the associated clonal expansion. We applied Trisicell to a panel of clonal sublines derived from single-cells of a parental mouse melanoma model on which we performed both whole exome and whole transcriptome sequencing. The tumor phylogenies of the clonal sublines built on exomic and transcriptomic mutations by Trisicell-DnC, were shown by Trisicell-ConT to be highly similar and the subtrees comprised of phenotypically similar clonal sublines were shown to be strongly associated by Trisicell-PF to their seeding mutations. In addition, we applied Trisicell to single-cell whole transcriptome sequencing data from a tumor derived from the same parental melanoma cell line, which was subjected to anti-CTLA-4 immunotherapy. The phylogenies generated from both studies featured distinct subtrees, strongly associated with phenotypes including cell differentiation status, tumor growth and therapeutic response. These results suggest that Trisicell can be used for scalable tumor phylogeny reconstruction and validation through both single-cell and clonal-subline sequencing data,","PeriodicalId":73617,"journal":{"name":"Journal of bioinformatics and systems biology : Open access","volume":"24 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89145061","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of bioinformatics and systems biology : Open access
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1