首页 > 最新文献

Cancer Informatics最新文献

英文 中文
Patient Identification and Tumor Identification Management: Quality Program in a Cancer Multicentric Clinical Data Warehouse. 病人识别和肿瘤识别管理:癌症多中心临床数据仓库的质量项目。
IF 2 Q2 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2023-01-01 DOI: 10.1177/11769351231172609
Karine Pallier, Olivier Prot, Simone Naldi, Francisco Silva, Thierry Denis, Olivier Giry, Sophie Leobon, Elise Deluche, Nicole Tubiana-Mathieu

Background: The Regional Basis of Solid Tumor (RBST), a clinical data warehouse, centralizes information related to cancer patient care in 5 health establishments in 2 French departments.

Purpose: To develop algorithms matching heterogeneous data to "real" patients and "real" tumors with respect to patient identification (PI) and tumor identification (TI).

Methods: A graph database programed in java Neo4j was used to build the RBST with data from ~20 000 patients. The PI algorithm using the Levenshtein distance was based on the regulatory criteria identifying a patient. A TI algorithm was built on 6 characteristics: tumor location and laterality, date of diagnosis, histology, primary and metastatic status. Given the heterogeneous nature and semantics of the collected data, the creation of repositories (organ, synonym, and histology repositories) was required. The TI algorithm used the Dice coefficient to match tumors.

Results: Patients matched if there was complete agreement of the given name, surname, sex, and date/month/year of birth. These parameters were assigned weights of 28%, 28%, 21%, and 23% (with 18% for year, 2.5% for month, and 2.5% for day), respectively. The algorithm had a sensitivity of 99.69% (95% confidence interval [CI] [98.89%, 99.96%]) and a specificity of 100% (95% CI [99.72%, 100%]). The TI algorithm used repositories, weights were assigned to the diagnosis date and associated organ (37.5% and 37.5%, respectively), laterality (16%) histology (5%), and metastatic status (4%). This algorithm had a sensitivity of 71% (95% CI [62.68%, 78.25%]) and a specificity of 100% (95% CI [94.31%, 100%]).

Conclusion: The RBST encompasses 2 quality controls: PI and TI. It facilitates the implementation of transversal structuring and assessments of the performance of the provided care.

背景:区域实体瘤基础(RBST)是一个临床数据仓库,集中了法国2个部门5家卫生机构的癌症患者护理相关信息。目的:开发在患者识别(PI)和肿瘤识别(TI)方面将异构数据与“真实”患者和“真实”肿瘤匹配的算法。方法:采用java Neo4j编程的图形数据库构建约2万例患者的RBST数据。使用Levenshtein距离的PI算法基于识别患者的监管标准。TI算法基于6个特征:肿瘤的位置和侧边性、诊断日期、组织学、原发和转移状态。鉴于所收集数据的异构性质和语义,需要创建存储库(器官、同义词和组织学存储库)。TI算法使用Dice系数来匹配肿瘤。结果:如果患者的名字、姓氏、性别和出生日期/月/年完全一致,则患者匹配。这些参数的权重分别为28%、28%、21%和23%(年为18%,月为2.5%,日为2.5%)。该算法的灵敏度为99.69%(95%置信区间[CI][98.89%, 99.96%]),特异性为100% (95% CI[99.72%, 100%])。TI算法使用存储库,将权重分配给诊断日期和相关器官(分别为37.5%和37.5%)、侧边性(16%)、组织学(5%)和转移状态(4%)。该算法的灵敏度为71% (95% CI[62.68%, 78.25%]),特异性为100% (95% CI[94.31%, 100%])。结论:RBST包括PI和TI两种质量控制。它有助于实施横向结构和评估所提供护理的绩效。
{"title":"Patient Identification and Tumor Identification Management: Quality Program in a Cancer Multicentric Clinical Data Warehouse.","authors":"Karine Pallier,&nbsp;Olivier Prot,&nbsp;Simone Naldi,&nbsp;Francisco Silva,&nbsp;Thierry Denis,&nbsp;Olivier Giry,&nbsp;Sophie Leobon,&nbsp;Elise Deluche,&nbsp;Nicole Tubiana-Mathieu","doi":"10.1177/11769351231172609","DOIUrl":"https://doi.org/10.1177/11769351231172609","url":null,"abstract":"<p><strong>Background: </strong>The Regional Basis of Solid Tumor (RBST), a clinical data warehouse, centralizes information related to cancer patient care in 5 health establishments in 2 French departments.</p><p><strong>Purpose: </strong>To develop algorithms matching heterogeneous data to \"real\" patients and \"real\" tumors with respect to patient identification (PI) and tumor identification (TI).</p><p><strong>Methods: </strong>A graph database programed in java Neo4j was used to build the RBST with data from ~20 000 patients. The PI algorithm using the Levenshtein distance was based on the regulatory criteria identifying a patient. A TI algorithm was built on 6 characteristics: tumor location and laterality, date of diagnosis, histology, primary and metastatic status. Given the heterogeneous nature and semantics of the collected data, the creation of repositories (organ, synonym, and histology repositories) was required. The TI algorithm used the Dice coefficient to match tumors.</p><p><strong>Results: </strong>Patients matched if there was complete agreement of the given name, surname, sex, and date/month/year of birth. These parameters were assigned weights of 28%, 28%, 21%, and 23% (with 18% for year, 2.5% for month, and 2.5% for day), respectively. The algorithm had a sensitivity of 99.69% (95% confidence interval [CI] [98.89%, 99.96%]) and a specificity of 100% (95% CI [99.72%, 100%]). The TI algorithm used repositories, weights were assigned to the diagnosis date and associated organ (37.5% and 37.5%, respectively), laterality (16%) histology (5%), and metastatic status (4%). This algorithm had a sensitivity of 71% (95% CI [62.68%, 78.25%]) and a specificity of 100% (95% CI [94.31%, 100%]).</p><p><strong>Conclusion: </strong>The RBST encompasses 2 quality controls: PI and TI. It facilitates the implementation of transversal structuring and assessments of the performance of the provided care.</p>","PeriodicalId":35418,"journal":{"name":"Cancer Informatics","volume":"22 ","pages":"11769351231172609"},"PeriodicalIF":2.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/f9/25/10.1177_11769351231172609.PMC10201142.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9888090","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Trends in Subcutaneous Tumour Height and Impact on Measurement Accuracy. 皮下肿瘤高度变化趋势及其对测量精度的影响。
IF 2 Q2 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2023-01-01 DOI: 10.1177/11769351231165181
Daniel Brough, Hope Amos, Karl Turley, Jake Murkin

Tumour volume is typically calculated using only length and width measurements, using width as a proxy for height in a 1:1 ratio. When tracking tumour growth over time, important morphological information and measurement accuracy is lost by ignoring height, which we show is a unique variable. Lengths, widths, and heights of 9522 subcutaneous tumours in mice were measured using 3D and thermal imaging. The average height:width ratio was found to be 1:3 proving that using width as a proxy for height overestimates tumour volume. Comparing volumes calculated with and without tumour height to the true volumes of excised tumours indeed showed that using the volume formula including height produced volumes 36X more accurate (based off of percentage difference). Monitoring the height:width relationship (prominence) across tumour growth curves indicated that prominence varied, and that height could change independent of width. Twelve cell lines were investigated individually; the scale of tumour prominence was cell line-dependent with relatively less prominent tumours (MC38, BL2, LL/2) and more prominent tumours (RENCA, HCT116) detected. Prominence trends across the growth cycle were also dependent on cell line; prominence was correlated with tumour growth in some cell lines (4T1, CT26, LNCaP), but not others (MC38, TC-1, LL/2). When pooled, invasive cell lines produced tumours that were significantly less prominent at volumes >1200 mm3 compared to non-invasive cell lines (P < .001). Modelling was used to show the impact of the increased accuracy gained by including height in volume calculations on several efficacy study outcomes. Variations in measurement accuracy contribute to experimental variation and irreproducibility of data, therefore we strongly advise researchers to measure height to improve accuracy in tumour studies.

肿瘤体积通常仅使用长度和宽度测量来计算,以1:1的比例使用宽度代替高度。当随时间跟踪肿瘤生长时,忽略高度会丢失重要的形态学信息和测量精度,我们发现高度是一个独特的变量。采用三维和热成像技术测量9522个小鼠皮下肿瘤的长度、宽度和高度。平均高宽比为1:3,证明用宽度代替高度高估了肿瘤体积。将考虑和不考虑肿瘤高度的计算体积与切除肿瘤的真实体积进行比较确实表明,使用包括高度在内的体积公式产生的体积比实际精确36倍(基于百分比差异)。监测肿瘤生长曲线的高度:宽度关系(突出)表明突出变化,并且高度可以独立于宽度变化。分别研究了12个细胞系;肿瘤突出程度与细胞系相关,肿瘤突出程度相对较低(MC38、BL2、LL/2),肿瘤突出程度较高(RENCA、HCT116)。整个生长周期的显著趋势也依赖于细胞系;在一些细胞系(4T1、CT26、LNCaP)中,突出与肿瘤生长相关,而在其他细胞系(MC38、TC-1、LL/2)中则无关。当合并时,侵袭性细胞系产生的肿瘤在体积> 1200mm3时明显低于非侵袭性细胞系(P
{"title":"Trends in Subcutaneous Tumour Height and Impact on Measurement Accuracy.","authors":"Daniel Brough,&nbsp;Hope Amos,&nbsp;Karl Turley,&nbsp;Jake Murkin","doi":"10.1177/11769351231165181","DOIUrl":"https://doi.org/10.1177/11769351231165181","url":null,"abstract":"<p><p>Tumour volume is typically calculated using only length and width measurements, using width as a proxy for height in a 1:1 ratio. When tracking tumour growth over time, important morphological information and measurement accuracy is lost by ignoring height, which we show is a unique variable. Lengths, widths, and heights of 9522 subcutaneous tumours in mice were measured using 3D and thermal imaging. The average height:width ratio was found to be 1:3 proving that using width as a proxy for height overestimates tumour volume. Comparing volumes calculated with and without tumour height to the true volumes of excised tumours indeed showed that using the volume formula including height produced volumes 36X more accurate (based off of percentage difference). Monitoring the height:width relationship (prominence) across tumour growth curves indicated that prominence varied, and that height could change independent of width. Twelve cell lines were investigated individually; the scale of tumour prominence was cell line-dependent with relatively less prominent tumours (MC38, BL2, LL/2) and more prominent tumours (RENCA, HCT116) detected. Prominence trends across the growth cycle were also dependent on cell line; prominence was correlated with tumour growth in some cell lines (4T1, CT26, LNCaP), but not others (MC38, TC-1, LL/2). When pooled, invasive cell lines produced tumours that were significantly less prominent at volumes >1200 mm<sup>3</sup> compared to non-invasive cell lines (<i>P</i> < .001). Modelling was used to show the impact of the increased accuracy gained by including height in volume calculations on several efficacy study outcomes. Variations in measurement accuracy contribute to experimental variation and irreproducibility of data, therefore we strongly advise researchers to measure height to improve accuracy in tumour studies.</p>","PeriodicalId":35418,"journal":{"name":"Cancer Informatics","volume":"22 ","pages":"11769351231165181"},"PeriodicalIF":2.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/2a/51/10.1177_11769351231165181.PMC10126793.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9718474","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Different Tumor Types Share a Common Nuclear Map of Chromosome Territories. 不同类型的肿瘤共享一个共同的染色体区域核图谱。
IF 2 Q2 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2023-01-01 DOI: 10.1177/11769351221148592
Fritz F Parl

Different tumor types are characterized by unique histopathological patterns including distinctive nuclear architectures. I hypothesized that the difference in nuclear appearance is reflected in different nuclear maps of chromosome territories, the discrete regions occupied by individual chromosomes in the interphase nucleus. To test this hypothesis, I used interchromosomal translocations (ITLs) as an analytical tool to map chromosome territories in 11 different tumor types from the TCGA PanCancer database encompassing 6003 tumors with 5295 ITLs. For each chromosome I determined the number and percentage of all ITLs for any given tumor type. Chromosomes were ranked according to the frequency and percentage of ITLs per chromosome. The ranking showed similar patterns for all tumor types. Chromosomes 1, 8, 11, 17, and 19 were ranked in the top quarter, accounting for 35.2% of 5295 ITLs, whereas chromosomes 13, 15, 18, 21, and X were in the bottom quarter, accounting for only 10.5% ITLs. The correlation between the chromosome ranking in the total group of 6003 tumors and the ranking in individual tumor types was significant, ranging from P < .0001 to .0033. Thus, contrary to my hypothesis, different tumor types share a common nuclear map of chromosome territories. Based on the large number of ITLs in 11 different types of malignancy one can discern a shared pattern of chromosome territories in cancer and propose a probabilistic model of chromosomes 1, 8, 11, 17, 19 in the center of the nucleus and chromosomes 13, 15, 18, 21, X at the periphery.

不同的肿瘤类型具有独特的组织病理学模式,包括独特的核结构。我假设细胞核外观的差异反映在染色体区域的不同核图上,染色体区域是间期细胞核中单个染色体所占据的离散区域。为了验证这一假设,我使用染色体间易位(ITLs)作为分析工具,从TCGA PanCancer数据库中绘制了11种不同肿瘤类型的染色体区域图,该数据库包含6003个具有5295个ITLs的肿瘤。对于每条染色体,我确定了任何给定肿瘤类型的所有itl的数量和百分比。根据每条染色体出现itl的频率和百分比对染色体进行排序。排名显示所有肿瘤类型的模式相似。染色体1、8、11、17和19位于前1 / 4,占5295个itl的35.2%,而染色体13、15、18、21和X位于后1 / 4,仅占10.5%的itl。在6003个肿瘤的总组中,染色体排名与单个肿瘤类型的排名之间存在显著的相关性,从P
{"title":"Different Tumor Types Share a Common Nuclear Map of Chromosome Territories.","authors":"Fritz F Parl","doi":"10.1177/11769351221148592","DOIUrl":"https://doi.org/10.1177/11769351221148592","url":null,"abstract":"<p><p>Different tumor types are characterized by unique histopathological patterns including distinctive nuclear architectures. I hypothesized that the difference in nuclear appearance is reflected in different nuclear maps of chromosome territories, the discrete regions occupied by individual chromosomes in the interphase nucleus. To test this hypothesis, I used interchromosomal translocations (ITLs) as an analytical tool to map chromosome territories in 11 different tumor types from the TCGA PanCancer database encompassing 6003 tumors with 5295 ITLs. For each chromosome I determined the number and percentage of all ITLs for any given tumor type. Chromosomes were ranked according to the frequency and percentage of ITLs per chromosome. The ranking showed similar patterns for all tumor types. Chromosomes 1, 8, 11, 17, and 19 were ranked in the top quarter, accounting for 35.2% of 5295 ITLs, whereas chromosomes 13, 15, 18, 21, and X were in the bottom quarter, accounting for only 10.5% ITLs. The correlation between the chromosome ranking in the total group of 6003 tumors and the ranking in individual tumor types was significant, ranging from <i>P</i> < .0001 to .0033. Thus, contrary to my hypothesis, different tumor types share a common nuclear map of chromosome territories. Based on the large number of ITLs in 11 different types of malignancy one can discern a shared pattern of chromosome territories in cancer and propose a probabilistic model of chromosomes 1, 8, 11, 17, 19 in the center of the nucleus and chromosomes 13, 15, 18, 21, X at the periphery.</p>","PeriodicalId":35418,"journal":{"name":"Cancer Informatics","volume":"22 ","pages":"11769351221148592"},"PeriodicalIF":2.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/cd/06/10.1177_11769351221148592.PMC9903037.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10747546","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Computational Approach to Predict the Role of Genetic Alterations in Methyltransferase Histones Genes With Implications in Liver Cancer. 预测甲基转移酶组蛋白基因遗传改变在肝癌中的作用的计算方法。
IF 2 Q2 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2023-01-01 DOI: 10.1177/11769351231161480
Tania Isabella Aravena, Elizabeth Valdés, Nicolás Ayala, Vívian D'Afonseca

Histone methyltransferases (HMTs) comprise a subclass of epigenetic regulators. Dysregulation of these enzymes results in aberrant epigenetic regulation, commonly observed in various tumor types, including hepatocellular adenocarcinoma (HCC). Probably, these epigenetic changes could lead to tumorigenesis processes. To predict how histone methyltransferase genes and their genetic alterations (somatic mutations, somatic copy number alterations, and gene expression changes) are involved in hepatocellular adenocarcinoma processes, we performed an integrated computational analysis of genetic alterations in 50 HMT genes present in hepatocellular adenocarcinoma. Biological data were obtained through the public repository with 360 samples from patients with hepatocellular carcinoma. Through these biological data, we identified 10 HMT genes (SETDB1, ASH1L, SMYD2, SMYD3, EHMT2, SETD3, PRDM14, PRDM16, KMT2C, and NSD3) with a significant genetic alteration rate (14%) within 360 samples. Of these 10 HMT genes, KMT2C and ASH1L have the highest mutation rate in HCC samples, 5.6% and 2.8%, respectively. Regarding somatic copy number alteration, ASH1L and SETDB1 are amplified in several samples, while SETD3, PRDM14, and NSD3 showed a high rate of large deletion. Finally, SETDB1, SETD3, PRDM14, and NSD3 could play an important role in the progression of hepatocellular adenocarcinoma since alterations in these genes lead to a decrease in patient survival, unlike patients who present these genes without genetic alterations. Our computational analysis provides new insights that help to understand how HMTs are associated with hepatocellular carcinoma, as well as provide a basis for future experimental investigations using HMTs as genetic targets against hepatocellular carcinoma.

组蛋白甲基转移酶(hmt)包括一个亚类的表观遗传调控因子。这些酶的失调导致异常的表观遗传调控,通常在各种肿瘤类型中观察到,包括肝细胞腺癌(HCC)。这些表观遗传变化可能导致肿瘤发生过程。为了预测组蛋白甲基转移酶基因及其遗传改变(体细胞突变、体细胞拷贝数改变和基因表达改变)如何参与肝细胞腺癌过程,我们对肝细胞腺癌中存在的50个HMT基因的遗传改变进行了综合计算分析。生物学数据通过公共存储库获得,其中包括360例肝细胞癌患者的样本。通过这些生物学数据,我们在360个样本中鉴定出10个HMT基因(SETDB1, ASH1L, SMYD2, SMYD3, EHMT2, SETD3, PRDM14, PRDM16, KMT2C和NSD3)具有显著的遗传变异率(14%)。在这10个HMT基因中,KMT2C和ASH1L在HCC样本中的突变率最高,分别为5.6%和2.8%。在体细胞拷贝数改变方面,ASH1L和SETDB1在多个样本中被扩增,而SETD3、PRDM14和NSD3则表现出较高的大缺失率。最后,SETDB1、SETD3、PRDM14和NSD3可能在肝细胞腺癌的进展中发挥重要作用,因为这些基因的改变会导致患者生存期降低,而不像没有遗传改变的患者。我们的计算分析提供了新的见解,有助于了解hmt如何与肝细胞癌相关,并为未来使用hmt作为肝细胞癌遗传靶点的实验研究提供了基础。
{"title":"A Computational Approach to Predict the Role of Genetic Alterations in Methyltransferase Histones Genes With Implications in Liver Cancer.","authors":"Tania Isabella Aravena,&nbsp;Elizabeth Valdés,&nbsp;Nicolás Ayala,&nbsp;Vívian D'Afonseca","doi":"10.1177/11769351231161480","DOIUrl":"https://doi.org/10.1177/11769351231161480","url":null,"abstract":"<p><p>Histone methyltransferases (HMTs) comprise a subclass of epigenetic regulators. Dysregulation of these enzymes results in aberrant epigenetic regulation, commonly observed in various tumor types, including hepatocellular adenocarcinoma (HCC). Probably, these epigenetic changes could lead to tumorigenesis processes. To predict how histone methyltransferase genes and their genetic alterations (somatic mutations, somatic copy number alterations, and gene expression changes) are involved in hepatocellular adenocarcinoma processes, we performed an integrated computational analysis of genetic alterations in 50 HMT genes present in hepatocellular adenocarcinoma. Biological data were obtained through the public repository with 360 samples from patients with hepatocellular carcinoma. Through these biological data, we identified 10 HMT genes (<i>SETDB1, ASH1L, SMYD2, SMYD3, EHMT2, SETD3, PRDM14, PRDM16, KMT2C</i>, and <i>NSD3</i>) with a significant genetic alteration rate (14%) within 360 samples. Of these 10 HMT genes, <i>KMT2C</i> and <i>ASH1L</i> have the highest mutation rate in HCC samples, 5.6% and 2.8%, respectively. Regarding somatic copy number alteration, <i>ASH1L</i> and <i>SETDB1</i> are amplified in several samples, while <i>SETD3, PRDM14</i>, and <i>NSD3</i> showed a high rate of large deletion. Finally, <i>SETDB1, SETD3, PRDM14</i>, and <i>NSD3</i> could play an important role in the progression of hepatocellular adenocarcinoma since alterations in these genes lead to a decrease in patient survival, unlike patients who present these genes without genetic alterations. Our computational analysis provides new insights that help to understand how HMTs are associated with hepatocellular carcinoma, as well as provide a basis for future experimental investigations using HMTs as genetic targets against hepatocellular carcinoma.</p>","PeriodicalId":35418,"journal":{"name":"Cancer Informatics","volume":"22 ","pages":"11769351231161480"},"PeriodicalIF":2.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/1e/b4/10.1177_11769351231161480.PMC10064455.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9610566","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A Simple Method for Robust and Accurate Intrinsic Subtyping of Breast Cancer. 一种简便、可靠、准确的乳腺癌固有亚型分型方法。
IF 2 Q2 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2023-01-01 DOI: 10.1177/11769351231159893
Mehdi Hamaneh, Yi-Kuo Yu

Motivation: The PAM50 signature/method is widely used for intrinsic subtyping of breast cancer samples. However, depending on the number and composition of the samples included in a cohort, the method may assign different subtypes to the same sample. This lack of robustness is mainly due to the fact that PAM50 subtracts a reference profile, which is computed using all samples in the cohort, from each sample before classification. In this paper we propose modifications to PAM50 to develop a simple and robust single-sample classifier, called MPAM50, for intrinsic subtyping of breast cancer. Like PAM50, the modified method uses a nearest centroid approach for classification, but the centroids are computed differently, and the distances to the centroids are determined using an alternative method. Additionally, MPAM50 uses unnormalized expression values for classification and does not subtract a reference profile from the samples. In other words, MPAM50 classifies each sample independently, and so avoids the previously mentioned robustness issue.

Results: A training set was employed to find the new MPAM50 centroids. MPAM50 was then tested on 19 independent datasets (obtained using various expression profiling technologies) containing 9637 samples. Overall good agreement was observed between the PAM50- and MPAM50-assigned subtypes with a median accuracy of 0.792, which (we show) is comparable with the median concordance between various implementations of PAM50. Additionally, MPAM50- and PAM50-assigned intrinsic subtypes were found to agree comparably with the reported clinical subtypes. Also, survival analyses indicated that MPAM50 preserves the prognostic value of the intrinsic subtypes. These observations demonstrate that MPAM50 can replace PAM50 without loss of performance. On the other hand, MPAM50 was compared with 2 previously published single-sample classifiers, and with 3 alternative modified PAM50 approaches. The results indicated a superior performance by MPAM50.

Conclusions: MPAM50 is a robust, simple, and accurate single-sample classifier of intrinsic subtypes of breast cancer.

动机:PAM50特征/方法被广泛用于乳腺癌样本的内在亚型分型。然而,根据队列中样本的数量和组成,该方法可能为同一样本分配不同的亚型。这种鲁棒性的缺乏主要是由于PAM50在分类前从每个样本中减去了参考概况,该参考概况是使用队列中的所有样本计算的。在本文中,我们提出修改PAM50,以开发一个简单而稳健的单样本分类器,称为MPAM50,用于乳腺癌的内在亚型。与PAM50一样,改进的方法使用最近质心方法进行分类,但质心的计算方式不同,并且使用替代方法确定到质心的距离。此外,MPAM50使用非规范化表达式值进行分类,并且不会从样本中减去参考配置文件。换句话说,MPAM50对每个样本进行独立分类,从而避免了前面提到的鲁棒性问题。结果:利用训练集找到新的MPAM50质心。然后在包含9637个样本的19个独立数据集(使用各种表达谱分析技术获得)上测试MPAM50。在PAM50和mpam50分配的亚型之间观察到总体上良好的一致性,中位数准确性为0.792,(我们表明)与PAM50的各种实现之间的中位数一致性相当。此外,发现MPAM50和pam50分配的内在亚型与报道的临床亚型相当一致。此外,生存分析表明MPAM50保留了内在亚型的预后价值。这些观察结果表明,MPAM50可以代替PAM50而不损失性能。另一方面,将MPAM50与先前发表的2个单样本分类器以及3个可选的修改PAM50方法进行比较。结果表明,MPAM50具有优异的性能。结论:MPAM50是一种强大、简单、准确的乳腺癌固有亚型单样本分类器。
{"title":"A Simple Method for Robust and Accurate Intrinsic Subtyping of Breast Cancer.","authors":"Mehdi Hamaneh,&nbsp;Yi-Kuo Yu","doi":"10.1177/11769351231159893","DOIUrl":"https://doi.org/10.1177/11769351231159893","url":null,"abstract":"<p><strong>Motivation: </strong>The PAM50 signature/method is widely used for intrinsic subtyping of breast cancer samples. However, depending on the number and composition of the samples included in a cohort, the method may assign different subtypes to the same sample. This lack of robustness is mainly due to the fact that PAM50 subtracts a reference profile, which is computed using all samples in the cohort, from each sample before classification. In this paper we propose modifications to PAM50 to develop a simple and robust single-sample classifier, called MPAM50, for intrinsic subtyping of breast cancer. Like PAM50, the modified method uses a nearest centroid approach for classification, but the centroids are computed differently, and the distances to the centroids are determined using an alternative method. Additionally, MPAM50 uses unnormalized expression values for classification and does not subtract a reference profile from the samples. In other words, MPAM50 classifies each sample independently, and so avoids the previously mentioned robustness issue.</p><p><strong>Results: </strong>A training set was employed to find the new MPAM50 centroids. MPAM50 was then tested on 19 independent datasets (obtained using various expression profiling technologies) containing 9637 samples. Overall good agreement was observed between the PAM50- and MPAM50-assigned subtypes with a median accuracy of 0.792, which (we show) is comparable with the median concordance between various implementations of PAM50. Additionally, MPAM50- and PAM50-assigned intrinsic subtypes were found to agree comparably with the reported clinical subtypes. Also, survival analyses indicated that MPAM50 preserves the prognostic value of the intrinsic subtypes. These observations demonstrate that MPAM50 can replace PAM50 without loss of performance. On the other hand, MPAM50 was compared with 2 previously published single-sample classifiers, and with 3 alternative modified PAM50 approaches. The results indicated a superior performance by MPAM50.</p><p><strong>Conclusions: </strong>MPAM50 is a robust, simple, and accurate single-sample classifier of intrinsic subtypes of breast cancer.</p>","PeriodicalId":35418,"journal":{"name":"Cancer Informatics","volume":"22 ","pages":"11769351231159893"},"PeriodicalIF":2.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/38/68/10.1177_11769351231159893.PMC10052604.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9234981","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Prescription Non-Steroidal Anti-Inflammatory Drugs (NSAIDs) and Incidence of Depression Among Older Cancer Survivors With Osteoarthritis: A Machine Learning Analysis. 处方非甾体抗炎药(NSAIDs)和老年骨关节炎癌症幸存者抑郁发生率:机器学习分析。
IF 2 Q2 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2023-01-01 DOI: 10.1177/11769351231165161
Nazneen Fatima Shaikh, Chan Shen, Traci LeMasters, Nilanjana Dwibedi, Amit Ladani, Usha Sambamoorthi

Objectives: This study examined prescription NSAIDs as one of the leading predictors of incident depression and assessed the direction of the association among older cancer survivors with osteoarthritis.

Methods: This study used a retrospective cohort (N = 14, 992) of older adults with incident cancer (breast, prostate, colorectal cancers, or non-Hodgkin's lymphoma) and osteoarthritis. We used the longitudinal data from the linked Surveillance, Epidemiology, and End Results -Medicare data for the study period from 2006 through 2016, with a 12-month baseline and 12-month follow-up period. Cumulative NSAIDs days was assessed during the baseline period and incident depression was assessed during the follow-up period. An eXtreme Gradient Boosting (XGBoost) model was built with 10-fold repeated stratified cross-validation and hyperparameter tuning using the training dataset. The final model selected from the training data demonstrated high performance (Accuracy: 0.82, Recall: 0.75, Precision: 0.75) when applied to the test data. SHapley Additive exPlanations (SHAP) was used to interpret the output from the XGBoost model.

Results: Over 50% of the study cohort had at least one prescption of NSAIDs. Nearly 13% of the cohort were diagnosed with incident depression, with the rates ranging between 7.4% for prostate cancer and 17.0% for colorectal cancer. The highest incident depression rate of 25% was observed at 90 and 120 cumulative NSAIDs days thresholds. Cumulative NSAIDs days was the sixth leading predictor of incident depression among older adults with OA and cancer. Age, education, care fragmentation, polypharmacy, and zip code level poverty were the top 5 predictors of incident depression.

Conclusion: Overall, 1 in 8 older adults with cancer and OA were diagnosed with incident depression. Cumulative NSAIDs days was the sixth leading predictor with an overall positive association with incident depression. However, the association was complex and varied by the cumulative NSAIDs days.

目的:本研究考察了处方非甾体抗炎药作为抑郁症发生的主要预测因素之一,并评估了老年骨关节炎癌症幸存者之间的关联方向。方法:本研究采用回顾性队列研究(N = 14,992),纳入了发生癌症(乳腺癌、前列腺癌、结直肠癌或非霍奇金淋巴瘤)和骨关节炎的老年人。我们使用了2006年至2016年研究期间相关的监测、流行病学和最终结果-医疗保险数据的纵向数据,包括12个月的基线和12个月的随访期。在基线期评估累积的非甾体抗炎药天数,在随访期评估抑郁事件。利用训练数据集,通过10倍重复分层交叉验证和超参数调优,建立了极端梯度增强(XGBoost)模型。从训练数据中选择的最终模型在应用于测试数据时表现出高性能(准确率:0.82,召回率:0.75,精度:0.75)。SHapley加性解释(SHAP)用于解释XGBoost模型的输出。结果:超过50%的研究队列至少有一种非甾体抗炎药处方。近13%的人被诊断为偶发性抑郁症,前列腺癌的发病率为7.4%,结肠直肠癌的发病率为17.0%。在nsaid累计用药90和120天时观察到最高的抑郁发生率为25%。累计服用非甾体抗炎药天数是老年OA和癌症患者发生抑郁的第六大预测因子。年龄、教育程度、护理碎片化、多种药物治疗和邮政编码水平贫困是事件抑郁症的前5个预测因素。结论:总体而言,每8名患有癌症和OA的老年人中就有1人被诊断为偶发性抑郁症。累计服用非甾体抗炎药天数是与抑郁事件总体正相关的第六大预测因子。然而,随着非甾体抗炎药使用日数的增加,这种关联变得复杂和多样。
{"title":"Prescription Non-Steroidal Anti-Inflammatory Drugs (NSAIDs) and Incidence of Depression Among Older Cancer Survivors With Osteoarthritis: A Machine Learning Analysis.","authors":"Nazneen Fatima Shaikh,&nbsp;Chan Shen,&nbsp;Traci LeMasters,&nbsp;Nilanjana Dwibedi,&nbsp;Amit Ladani,&nbsp;Usha Sambamoorthi","doi":"10.1177/11769351231165161","DOIUrl":"https://doi.org/10.1177/11769351231165161","url":null,"abstract":"<p><strong>Objectives: </strong>This study examined prescription NSAIDs as one of the leading predictors of incident depression and assessed the direction of the association among older cancer survivors with osteoarthritis.</p><p><strong>Methods: </strong>This study used a retrospective cohort (N = 14, 992) of older adults with incident cancer (breast, prostate, colorectal cancers, or non-Hodgkin's lymphoma) and osteoarthritis. We used the longitudinal data from the linked Surveillance, Epidemiology, and End Results -Medicare data for the study period from 2006 through 2016, with a 12-month baseline and 12-month follow-up period. Cumulative NSAIDs days was assessed during the baseline period and incident depression was assessed during the follow-up period. An eXtreme Gradient Boosting (XGBoost) model was built with 10-fold repeated stratified cross-validation and hyperparameter tuning using the training dataset. The final model selected from the training data demonstrated high performance (Accuracy: 0.82, Recall: 0.75, Precision: 0.75) when applied to the test data. SHapley Additive exPlanations (SHAP) was used to interpret the output from the XGBoost model.</p><p><strong>Results: </strong>Over 50% of the study cohort had at least one prescption of NSAIDs. Nearly 13% of the cohort were diagnosed with incident depression, with the rates ranging between 7.4% for prostate cancer and 17.0% for colorectal cancer. The highest incident depression rate of 25% was observed at 90 and 120 cumulative NSAIDs days thresholds. Cumulative NSAIDs days was the sixth leading predictor of incident depression among older adults with OA and cancer. Age, education, care fragmentation, polypharmacy, and zip code level poverty were the top 5 predictors of incident depression.</p><p><strong>Conclusion: </strong>Overall, 1 in 8 older adults with cancer and OA were diagnosed with incident depression. Cumulative NSAIDs days was the sixth leading predictor with an overall positive association with incident depression. However, the association was complex and varied by the cumulative NSAIDs days.</p>","PeriodicalId":35418,"journal":{"name":"Cancer Informatics","volume":"22 ","pages":"11769351231165161"},"PeriodicalIF":2.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/25/bc/10.1177_11769351231165161.PMC10123903.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9356662","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Five Critical Gene-Based Biomarkers With Optimal Performance for Hepatocellular Carcinoma. 五种关键基因生物标志物在肝细胞癌中的最佳表现。
IF 2 Q2 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2023-01-01 DOI: 10.1177/11769351231190477
Yongjun Liu, Heping Zhang, Yuqing Xu, Yao-Zhong Liu, David P Al-Adra, Matthew M Yeh, Zhengjun Zhang
Hepatocellular carcinoma (HCC) is one of the most fatal cancers in the world. There is an urgent need to understand the molecular background of HCC to facilitate the identification of biomarkers and discover effective therapeutic targets. Published transcriptomic studies have reported a large number of genes that are individually significant for HCC. However, reliable biomarkers remain to be determined. In this study, built on max-linear competing risk factor models, we developed a machine learning analytical framework to analyze transcriptomic data to identify the most miniature set of differentially expressed genes (DEGs). By analyzing 9 public whole-transcriptome datasets (containing 1184 HCC samples and 672 nontumor controls), we identified 5 critical differentially expressed genes (DEGs) (ie, CCDC107, CXCL12, GIGYF1, GMNN, and IFFO1) between HCC and control samples. The classifiers built on these 5 DEGs reached nearly perfect performance in identification of HCC. The performance of the 5 DEGs was further validated in a US Caucasian cohort that we collected (containing 17 HCC with paired nontumor tissue). The conceptual advance of our work lies in modeling gene-gene interactions and correcting batch effect in the analytic framework. The classifiers built on the 5 DEGs demonstrated clear signature patterns for HCC. The results are interpretable, robust, and reproducible across diverse cohorts/populations with various disease etiologies, indicating the 5 DEGs are intrinsic variables that can describe the overall features of HCC at the genomic level. The analytical framework applied in this study may pave a new way for improving transcriptome profiling analysis of human cancers.
肝细胞癌(HCC)是世界上最致命的癌症之一。迫切需要了解HCC的分子背景,以促进生物标志物的鉴定和发现有效的治疗靶点。已发表的转录组学研究报告了大量对HCC具有显著个体意义的基因。然而,可靠的生物标志物仍有待确定。在这项研究中,基于最大线性竞争风险因素模型,我们开发了一个机器学习分析框架来分析转录组学数据,以识别最微小的差异表达基因(deg)集。通过分析9个公开的全转录组数据集(包含1184个HCC样本和672个非肿瘤对照组),我们确定了HCC和对照样本之间的5个关键差异表达基因(deg)(即CCDC107、CXCL12、GIGYF1、GMNN和IFFO1)。基于这5个deg构建的分类器在鉴别HCC方面达到了近乎完美的性能。我们收集的美国白种人队列(包含17个配对的非肿瘤组织的HCC)进一步验证了5个DEGs的性能。我们工作的概念上的进步在于建立基因-基因相互作用的模型,并在分析框架中纠正批效应。基于5个deg构建的分类器显示出HCC的明确特征模式。研究结果在不同疾病病因的不同队列/人群中具有可解释性、稳健性和可重复性,表明5个deg是内在变量,可以在基因组水平上描述HCC的总体特征。本研究应用的分析框架可能为改进人类癌症转录组分析开辟新的途径。
{"title":"Five Critical Gene-Based Biomarkers With Optimal Performance for Hepatocellular Carcinoma.","authors":"Yongjun Liu,&nbsp;Heping Zhang,&nbsp;Yuqing Xu,&nbsp;Yao-Zhong Liu,&nbsp;David P Al-Adra,&nbsp;Matthew M Yeh,&nbsp;Zhengjun Zhang","doi":"10.1177/11769351231190477","DOIUrl":"https://doi.org/10.1177/11769351231190477","url":null,"abstract":"Hepatocellular carcinoma (HCC) is one of the most fatal cancers in the world. There is an urgent need to understand the molecular background of HCC to facilitate the identification of biomarkers and discover effective therapeutic targets. Published transcriptomic studies have reported a large number of genes that are individually significant for HCC. However, reliable biomarkers remain to be determined. In this study, built on max-linear competing risk factor models, we developed a machine learning analytical framework to analyze transcriptomic data to identify the most miniature set of differentially expressed genes (DEGs). By analyzing 9 public whole-transcriptome datasets (containing 1184 HCC samples and 672 nontumor controls), we identified 5 critical differentially expressed genes (DEGs) (ie, CCDC107, CXCL12, GIGYF1, GMNN, and IFFO1) between HCC and control samples. The classifiers built on these 5 DEGs reached nearly perfect performance in identification of HCC. The performance of the 5 DEGs was further validated in a US Caucasian cohort that we collected (containing 17 HCC with paired nontumor tissue). The conceptual advance of our work lies in modeling gene-gene interactions and correcting batch effect in the analytic framework. The classifiers built on the 5 DEGs demonstrated clear signature patterns for HCC. The results are interpretable, robust, and reproducible across diverse cohorts/populations with various disease etiologies, indicating the 5 DEGs are intrinsic variables that can describe the overall features of HCC at the genomic level. The analytical framework applied in this study may pave a new way for improving transcriptome profiling analysis of human cancers.","PeriodicalId":35418,"journal":{"name":"Cancer Informatics","volume":"22 ","pages":"11769351231190477"},"PeriodicalIF":2.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/11/97/10.1177_11769351231190477.PMC10413891.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10305114","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Decision Support System and Web-Application Using Supervised Machine Learning Algorithms for Easy Cancer Classifications. 基于监督机器学习算法的简易癌症分类决策支持系统和web应用。
IF 2 Q2 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2023-01-01 DOI: 10.1177/11769351221147244
K Chandrashekar, Anagha S Setlur, Adithya Sabhapathi C, Satyam Suresh Raiker, Satyam Singh, Vidya Niranjan

Using a decision support system (DSS) that classifies various cancers provides support to the clinicians/researchers to make better decisions that can aid in early cancer diagnosis, thereby reducing chances of incorrect disease diagnosis. Thus, this work aimed at designing a classification model that can predict accurately for 5 different cancer types comprising of 20 cancer exomes, using the mutations identified from whole exome cancer analysis. Initially, a basic model was designed using supervised machine learning classification algorithms such as K-nearest neighbor (KNN), support vector machine (SVM), decision tree, naïve bayes and random forest (RF), among which decision tree and random forest performed better in terms of preliminary model accuracy. However, output predictions were incorrect due to less training scores. Thus, 16 essential features were then selected for model improvement using 2 approaches. All imbalanced datasets were balanced using SMOTE. In the first approach, all features from 20 cancer exome datasets were trained and models were designed using decision tree and random forest. Balanced datasets for decision tree model showed an accuracy of 77%, while with the RF model, the accuracy improved to 82% where all 5 cancer types were predicted correctly. Area under the curve for RF model was closer to 1, than decision tree model. In the second approach, all 15 datasets were trained, while 5 were tested. However, only 2 cancer types were predicted correctly. To cross validate RF model, Matthew's correlation co-efficient (MCC) test was performed. For method 1, the MCC test and MCC cross validation was found to be 0.7796 and 0.9356 respectively. Likewise, for second approach, MCC was observed to be 0.9365, corroborating the accuracy of the designed model. The model was successfully deployed using Streamlit as a web application for easy use. This study presents insights for allowing easy cancer classifications.

使用对各种癌症进行分类的决策支持系统(DSS)为临床医生/研究人员提供支持,帮助他们做出更好的决策,从而有助于早期癌症诊断,从而减少错误疾病诊断的机会。因此,本工作旨在设计一个分类模型,该模型可以使用从全外显子组癌症分析中鉴定的突变,准确预测由20个癌症外显子组组成的5种不同癌症类型。首先,使用k近邻(KNN)、支持向量机(SVM)、决策树、naïve贝叶斯和随机森林(RF)等监督机器学习分类算法设计基本模型,其中决策树和随机森林在模型初步精度上表现较好。然而,由于训练分数较少,输出预测是不正确的。因此,然后选择16个基本特征,使用2种方法进行模型改进。使用SMOTE对所有不平衡数据集进行平衡。在第一种方法中,对来自20个癌症外显子组数据集的所有特征进行训练,并使用决策树和随机森林设计模型。决策树模型的平衡数据集显示准确率为77%,而RF模型的准确率提高到82%,其中所有5种癌症类型都被正确预测。与决策树模型相比,射频模型的曲线下面积更接近于1。在第二种方法中,所有15个数据集都进行了训练,而5个数据集进行了测试。然而,只有两种癌症类型预测正确。为了交叉验证RF模型,采用Matthew’s相关系数检验(MCC)。方法1的MCC检验和MCC交叉验证分别为0.7796和0.9356。同样,对于第二种方法,观察到MCC为0.9365,证实了设计模型的准确性。该模型已成功部署,使用Streamlit作为web应用程序,方便使用。这项研究为简化癌症分类提供了见解。
{"title":"Decision Support System and Web-Application Using Supervised Machine Learning Algorithms for Easy Cancer Classifications.","authors":"K Chandrashekar,&nbsp;Anagha S Setlur,&nbsp;Adithya Sabhapathi C,&nbsp;Satyam Suresh Raiker,&nbsp;Satyam Singh,&nbsp;Vidya Niranjan","doi":"10.1177/11769351221147244","DOIUrl":"https://doi.org/10.1177/11769351221147244","url":null,"abstract":"<p><p>Using a decision support system (DSS) that classifies various cancers provides support to the clinicians/researchers to make better decisions that can aid in early cancer diagnosis, thereby reducing chances of incorrect disease diagnosis. Thus, this work aimed at designing a classification model that can predict accurately for 5 different cancer types comprising of 20 cancer exomes, using the mutations identified from whole exome cancer analysis. Initially, a basic model was designed using supervised machine learning classification algorithms such as K-nearest neighbor (KNN), support vector machine (SVM), decision tree, naïve bayes and random forest (RF), among which decision tree and random forest performed better in terms of preliminary model accuracy. However, output predictions were incorrect due to less training scores. Thus, 16 essential features were then selected for model improvement using 2 approaches. All imbalanced datasets were balanced using SMOTE. In the first approach, all features from 20 cancer exome datasets were trained and models were designed using decision tree and random forest. Balanced datasets for decision tree model showed an accuracy of 77%, while with the RF model, the accuracy improved to 82% where all 5 cancer types were predicted correctly. Area under the curve for RF model was closer to 1, than decision tree model. In the second approach, all 15 datasets were trained, while 5 were tested. However, only 2 cancer types were predicted correctly. To cross validate RF model, Matthew's correlation co-efficient (MCC) test was performed. For method 1, the MCC test and MCC cross validation was found to be 0.7796 and 0.9356 respectively. Likewise, for second approach, MCC was observed to be 0.9365, corroborating the accuracy of the designed model. The model was successfully deployed using Streamlit as a web application for easy use. This study presents insights for allowing easy cancer classifications.</p>","PeriodicalId":35418,"journal":{"name":"Cancer Informatics","volume":"22 ","pages":"11769351221147244"},"PeriodicalIF":2.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/c2/da/10.1177_11769351221147244.PMC9880585.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10591008","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Erratum to "Chemical Complementarity of Breast Cancer Resident, T-Cell Receptor CDR3 Domains and the Cancer Antigen, ARMC3, is Associated With Higher Levels of Survival and Granzyme Expression". “乳腺癌居民t细胞受体CDR3结构域和癌症抗原ARMC3的化学互补性与更高的生存率和颗粒酶表达水平相关”的勘误。
IF 2 Q2 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2023-01-01 DOI: 10.1177/11769351231189051

[This corrects the article DOI: 10.1177/11769351231177269.].

[这更正了文章DOI: 10.1177/11769351231177269.]。
{"title":"Erratum to \"Chemical Complementarity of Breast Cancer Resident, T-Cell Receptor CDR3 Domains and the Cancer Antigen, ARMC3, is Associated With Higher Levels of Survival and Granzyme Expression\".","authors":"","doi":"10.1177/11769351231189051","DOIUrl":"https://doi.org/10.1177/11769351231189051","url":null,"abstract":"<p><p>[This corrects the article DOI: 10.1177/11769351231177269.].</p>","PeriodicalId":35418,"journal":{"name":"Cancer Informatics","volume":"22 ","pages":"11769351231189051"},"PeriodicalIF":2.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/24/b1/10.1177_11769351231189051.PMC10350781.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9827252","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
TP53 and its Regulatory Genes as Prognosis of Cutaneous Melanoma. TP53及其调控基因与皮肤黑色素瘤预后的关系。
IF 2 Q2 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2023-01-01 DOI: 10.1177/11769351231177267
Safir Ullah Khan, Zahid Ullah, Hadia Shaukat, Sheeza Unab, Saba Jannat, Waqar Ali, Amir Ali, Muhammad Irfan, Muhammad Fiaz Khan, Rodolfo Daniel Cervantes-Villagrana

The present study was the first comprehensive investigation of genetic mutation and expression levels of the p53 signaling genes in cutaneous melanoma through various genetic databases providing large datasets. The mutational landscape of p53 and its signaling genes was higher than expected, with TP53 followed by CDKN2A being the most mutated gene in cutaneous melanoma. Furthermore, the expression analysis showed that TP53, MDM2, CDKN2A, and TP53BP1 were overexpressed, while MDM4 and CDKN2B were under-expressed in cutaneous melanoma. Overall, TCGA data revealed that among all the other p53 signaling proteins, CDKN2A was significantly higher in both sun and non-sun-exposed healthy tissues than in melanoma. Likewise, MDM4 and TP53BP1 expressions were markedly greater in non-sun-exposed healthy tissues compared to other groups. However, CDKN2B expression was higher in the sun-exposed healthy tissues than in other tissues. In addition, various genes were expressed significantly differently among males and females. In addition, CDKN2A was highly expressed in the SK-MEL-30 skin cancer cell line, whereas, Immune cell type expression analysis revealed that the MDM4 was highly expressed in naïve B-cells. Furthermore, all six genes were significantly overexpressed in extraordinarily overweight or obese tumor tissues compared to healthy tissues. MDM2 expression and tumor stage were closely related. There were differences in gene expression across patient age groups and positive nodal status. TP53 showed a positive correlation with B cells, MDM2 with CD8+T cells, macrophages and neutrophils, and MDM4 with neutrophils. CDKN2A/B had a non-significant correlation with all six types of immune cells. However, TP53BP1 was positively correlated with all five types of immune cells except B cells. Only TP53, MDM2, and CDKN2A had a role in cutaneous melanoma-specific tumor immunity. All TP53 and its regulating genes may be predictive for prognosis. The results of the present study need to be validated through future screening, in vivo, and in vitro studies.

本研究首次通过提供大型数据集的各种遗传数据库对皮肤黑色素瘤中p53信号基因的基因突变和表达水平进行了全面研究。p53及其信号基因的突变情况比预期的要高,TP53其次是CDKN2A是皮肤黑色素瘤中突变最多的基因。此外,表达分析显示TP53、MDM2、CDKN2A和TP53BP1在皮肤黑色素瘤中过表达,而MDM4和CDKN2B在皮肤黑色素瘤中过表达。总体而言,TCGA数据显示,在所有其他p53信号蛋白中,CDKN2A在阳光照射和非阳光照射的健康组织中均显著高于黑色素瘤。同样,与其他组相比,MDM4和TP53BP1在非阳光照射的健康组织中的表达明显更高。然而,CDKN2B在阳光照射的健康组织中的表达高于其他组织。此外,各基因在雄性和雌性之间的表达也存在显著差异。此外,CDKN2A在SK-MEL-30皮肤癌细胞系中高表达,而免疫细胞类型表达分析显示MDM4在naïve b细胞中高表达。此外,与健康组织相比,所有六个基因在异常超重或肥胖的肿瘤组织中都显着过表达。MDM2的表达与肿瘤分期密切相关。不同患者年龄组和阳性淋巴结状态的基因表达存在差异。TP53与B细胞呈正相关,MDM2与CD8+T细胞、巨噬细胞、中性粒细胞呈正相关,MDM4与中性粒细胞呈正相关。CDKN2A/B与所有六种免疫细胞均无显著相关性。而TP53BP1与除B细胞外的5种免疫细胞均呈正相关。只有TP53、MDM2和CDKN2A在皮肤黑色素瘤特异性肿瘤免疫中起作用。所有TP53及其调控基因均可预测预后。目前研究的结果需要通过未来的筛选、体内和体外研究来验证。
{"title":"TP53 and its Regulatory Genes as Prognosis of Cutaneous Melanoma.","authors":"Safir Ullah Khan,&nbsp;Zahid Ullah,&nbsp;Hadia Shaukat,&nbsp;Sheeza Unab,&nbsp;Saba Jannat,&nbsp;Waqar Ali,&nbsp;Amir Ali,&nbsp;Muhammad Irfan,&nbsp;Muhammad Fiaz Khan,&nbsp;Rodolfo Daniel Cervantes-Villagrana","doi":"10.1177/11769351231177267","DOIUrl":"https://doi.org/10.1177/11769351231177267","url":null,"abstract":"<p><p>The present study was the first comprehensive investigation of genetic mutation and expression levels of the p53 signaling genes in cutaneous melanoma through various genetic databases providing large datasets. The mutational landscape of p53 and its signaling genes was higher than expected, with <i>TP53</i> followed by <i>CDKN2A</i> being the most mutated gene in cutaneous melanoma. Furthermore, the expression analysis showed <i>that TP53</i>, <i>MDM2</i>, <i>CDKN2A</i>, and <i>TP53BP1</i> were overexpressed, while <i>MDM4</i> and <i>CDKN2B</i> were under-expressed in cutaneous melanoma. Overall, TCGA data revealed that among all the other p53 signaling proteins, CDKN2A was significantly higher in both sun and non-sun-exposed healthy tissues than in melanoma. Likewise, MDM4 and TP53BP1 expressions were markedly greater in non-sun-exposed healthy tissues compared to other groups. However, CDKN2B expression was higher in the sun-exposed healthy tissues than in other tissues. In addition, various genes were expressed significantly differently among males and females. In addition, <i>CDKN2A</i> was highly expressed in the SK-MEL-30 skin cancer cell line, whereas, Immune cell type expression analysis revealed that the <i>MDM4</i> was highly expressed in naïve B-cells. Furthermore, all six genes were significantly overexpressed in extraordinarily overweight or obese tumor tissues compared to healthy tissues. <i>MDM2</i> expression and tumor stage were closely related. There were differences in gene expression across patient age groups and positive nodal status. <i>TP53</i> showed a positive correlation with B cells, <i>MDM2</i> with CD8+<i>T</i> cells, macrophages and neutrophils, and <i>MDM4</i> with neutrophils. <i>CDKN2A/B</i> had a non-significant correlation with all six types of immune cells. However, <i>TP53BP1</i> was positively correlated with all five types of immune cells except B cells. Only <i>TP53, MDM2</i>, and <i>CDKN2A</i> had a role in cutaneous melanoma-specific tumor immunity. All TP53 and its regulating genes may be predictive for prognosis. The results of the present study need to be validated through future screening, in vivo, and in vitro studies.</p>","PeriodicalId":35418,"journal":{"name":"Cancer Informatics","volume":"22 ","pages":"11769351231177267"},"PeriodicalIF":2.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/c1/e4/10.1177_11769351231177267.PMC10475268.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10283819","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Cancer Informatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1