{"title":"聚焦孟加拉国人群的乳腺癌风险 SNPs 和相关表达 QTLs:硅学分析","authors":"Bristy Rani Podder , Ilora Shabnam Kheya , Sabrina Moriom Elias","doi":"10.1016/j.humgen.2024.201270","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><p>Single Nucleotide Polymorphism (SNP)s in the genome and associated genes cause susceptibility to breast cancer, the most common cancer leading to death in women. Variation in different human races' genomes makes breast cancer prognosis challenging in terms of targeted drugs and therapies. The study aimed to compile the Breast cancer associated SNPs used for screening in the existing publications focusing Bangladeshi population, followed by the identification of Expression quantitative trait loci (eQTLs) associated with those SNPs from the eQTL database. eQTLs identify genes whose expression is regulated by specific SNPs. In silico characterization in terms of variant effect prediction, co-expression, Gene Ontology (GO) enrichment, protein-protein interaction, and sequence motif analysis narrowed down a specific set of candidate genes.</p></div><div><h3>Methods</h3><p>Published reports emphasizing the SNPs screened for Breast cancer in Bangladeshi population were analyzed in PancanQTL for identification of eQTLs which uses genotype and gene expression data from The Cancer Genome Atlas. The gene description and GO associated with identified eQTLs were retrieved from the <em>Ensembl</em> database and characterizations were performed using variant effect predictor, Coexpedia, MEME suite, and STRINGdb.</p></div><div><h3>Results</h3><p>It was found from the published reports that not all variants showed strong association with the disease in Bangladeshi population. The cis-eQTLs associated with reported SNPs tested so far on Bangladeshi population are <em>ZNF575, MRPL42P5, C15orf57, C15orf62, NFATC3, XRCC1, C14orf153, CKB, BAG5, KLC1, MARK3.</em> Among them only <em>ZNF575</em> was enlisted as breast cancer associated eQTL and the rest are linked with other types of cancer. These genes are mostly associated in DNA-binding transcription factor activity, protein binding, Intracellular protein transport II, transferase activity. Protein-protein interaction could predict some functional partners to connect the eQTLs with respective SNP harboring genes. Taking the commonly screened genes for breast cancer as targets breast cancer associated cis and trans eQTLs along with the associated survival eQTLs have been retrieved from the database and a list of specific variants are recommended for future studies to get a more comprehensive scenario about the disease prognosis.</p></div><div><h3>Conclusion</h3><p>Since it was found from the existing literature that the commonly used variants are not always associated with all human races, this simple and precise in silico study was carried out to analyze publicly available data. This helped limit specific candidate genes and variants which will be helpful in future population-based screenings in understanding breast cancer prognosis with a provision to develop population specific personalized drug.</p></div>","PeriodicalId":29686,"journal":{"name":"Human Gene","volume":"39 ","pages":"Article 201270"},"PeriodicalIF":0.5000,"publicationDate":"2024-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2773044124000147/pdfft?md5=bde448fe0eec53f75b55c0f42301a893&pid=1-s2.0-S2773044124000147-main.pdf","citationCount":"0","resultStr":"{\"title\":\"Breast cancer risk SNPs and associated expression QTLs focusing Bangladeshi population: An in silico analysis\",\"authors\":\"Bristy Rani Podder , Ilora Shabnam Kheya , Sabrina Moriom Elias\",\"doi\":\"10.1016/j.humgen.2024.201270\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Background</h3><p>Single Nucleotide Polymorphism (SNP)s in the genome and associated genes cause susceptibility to breast cancer, the most common cancer leading to death in women. Variation in different human races' genomes makes breast cancer prognosis challenging in terms of targeted drugs and therapies. The study aimed to compile the Breast cancer associated SNPs used for screening in the existing publications focusing Bangladeshi population, followed by the identification of Expression quantitative trait loci (eQTLs) associated with those SNPs from the eQTL database. eQTLs identify genes whose expression is regulated by specific SNPs. In silico characterization in terms of variant effect prediction, co-expression, Gene Ontology (GO) enrichment, protein-protein interaction, and sequence motif analysis narrowed down a specific set of candidate genes.</p></div><div><h3>Methods</h3><p>Published reports emphasizing the SNPs screened for Breast cancer in Bangladeshi population were analyzed in PancanQTL for identification of eQTLs which uses genotype and gene expression data from The Cancer Genome Atlas. The gene description and GO associated with identified eQTLs were retrieved from the <em>Ensembl</em> database and characterizations were performed using variant effect predictor, Coexpedia, MEME suite, and STRINGdb.</p></div><div><h3>Results</h3><p>It was found from the published reports that not all variants showed strong association with the disease in Bangladeshi population. The cis-eQTLs associated with reported SNPs tested so far on Bangladeshi population are <em>ZNF575, MRPL42P5, C15orf57, C15orf62, NFATC3, XRCC1, C14orf153, CKB, BAG5, KLC1, MARK3.</em> Among them only <em>ZNF575</em> was enlisted as breast cancer associated eQTL and the rest are linked with other types of cancer. These genes are mostly associated in DNA-binding transcription factor activity, protein binding, Intracellular protein transport II, transferase activity. Protein-protein interaction could predict some functional partners to connect the eQTLs with respective SNP harboring genes. Taking the commonly screened genes for breast cancer as targets breast cancer associated cis and trans eQTLs along with the associated survival eQTLs have been retrieved from the database and a list of specific variants are recommended for future studies to get a more comprehensive scenario about the disease prognosis.</p></div><div><h3>Conclusion</h3><p>Since it was found from the existing literature that the commonly used variants are not always associated with all human races, this simple and precise in silico study was carried out to analyze publicly available data. This helped limit specific candidate genes and variants which will be helpful in future population-based screenings in understanding breast cancer prognosis with a provision to develop population specific personalized drug.</p></div>\",\"PeriodicalId\":29686,\"journal\":{\"name\":\"Human Gene\",\"volume\":\"39 \",\"pages\":\"Article 201270\"},\"PeriodicalIF\":0.5000,\"publicationDate\":\"2024-02-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S2773044124000147/pdfft?md5=bde448fe0eec53f75b55c0f42301a893&pid=1-s2.0-S2773044124000147-main.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Human Gene\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2773044124000147\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"GENETICS & HEREDITY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Human Gene","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2773044124000147","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0
摘要
背景基因组和相关基因中的单核苷酸多态性(SNP)导致了乳腺癌的易感性,而乳腺癌是导致女性死亡的最常见癌症。不同人种基因组的差异使乳腺癌预后的靶向药物和疗法面临挑战。本研究旨在汇编现有出版物中用于筛查孟加拉人群的乳腺癌相关 SNPs,然后从 eQTL 数据库中确定与这些 SNPs 相关的表达量性状位点(eQTLs)。在变异效应预测、共表达、基因本体(GO)富集、蛋白-蛋白相互作用和序列基序分析等方面进行的硅特征描述缩小了候选基因的具体范围。方法在 PancanQTL 中分析了孟加拉人群中筛查出的乳腺癌 SNPs,以确定 eQTLs。从 Ensembl 数据库中检索了与确定的 eQTL 相关的基因描述和 GO,并使用变异效应预测器、Coexpedia、MEME suite 和 STRINGdb 进行了特征分析。迄今为止,在孟加拉人群中与已报道 SNPs 相关的顺式-EQTLs 有 ZNF575、MRPL42P5、C15orf57、C15orf62、NFATC3、XRCC1、C14orf153、CKB、BAG5、KLC1 和 MARK3。其中只有 ZNF575 被列为与乳腺癌相关的 eQTL,其余的都与其他类型的癌症有关。这些基因主要与 DNA 结合转录因子活性、蛋白质结合、细胞内蛋白质转运 II 和转移酶活性有关。蛋白质与蛋白质之间的相互作用可以预测出一些功能性伙伴,从而将 eQTL 与各自的 SNP 携带基因联系起来。以常用的乳腺癌筛查基因为目标,从数据库中检索出了与乳腺癌相关的顺式和反式 eQTLs 以及相关的存活 eQTLs,并为今后的研究推荐了一份特定变异体列表,以便更全面地了解疾病的预后情况。这有助于限制特定的候选基因和变异,这将有助于未来基于人群的筛查,以了解乳腺癌的预后,并开发针对特定人群的个性化药物。
Breast cancer risk SNPs and associated expression QTLs focusing Bangladeshi population: An in silico analysis
Background
Single Nucleotide Polymorphism (SNP)s in the genome and associated genes cause susceptibility to breast cancer, the most common cancer leading to death in women. Variation in different human races' genomes makes breast cancer prognosis challenging in terms of targeted drugs and therapies. The study aimed to compile the Breast cancer associated SNPs used for screening in the existing publications focusing Bangladeshi population, followed by the identification of Expression quantitative trait loci (eQTLs) associated with those SNPs from the eQTL database. eQTLs identify genes whose expression is regulated by specific SNPs. In silico characterization in terms of variant effect prediction, co-expression, Gene Ontology (GO) enrichment, protein-protein interaction, and sequence motif analysis narrowed down a specific set of candidate genes.
Methods
Published reports emphasizing the SNPs screened for Breast cancer in Bangladeshi population were analyzed in PancanQTL for identification of eQTLs which uses genotype and gene expression data from The Cancer Genome Atlas. The gene description and GO associated with identified eQTLs were retrieved from the Ensembl database and characterizations were performed using variant effect predictor, Coexpedia, MEME suite, and STRINGdb.
Results
It was found from the published reports that not all variants showed strong association with the disease in Bangladeshi population. The cis-eQTLs associated with reported SNPs tested so far on Bangladeshi population are ZNF575, MRPL42P5, C15orf57, C15orf62, NFATC3, XRCC1, C14orf153, CKB, BAG5, KLC1, MARK3. Among them only ZNF575 was enlisted as breast cancer associated eQTL and the rest are linked with other types of cancer. These genes are mostly associated in DNA-binding transcription factor activity, protein binding, Intracellular protein transport II, transferase activity. Protein-protein interaction could predict some functional partners to connect the eQTLs with respective SNP harboring genes. Taking the commonly screened genes for breast cancer as targets breast cancer associated cis and trans eQTLs along with the associated survival eQTLs have been retrieved from the database and a list of specific variants are recommended for future studies to get a more comprehensive scenario about the disease prognosis.
Conclusion
Since it was found from the existing literature that the commonly used variants are not always associated with all human races, this simple and precise in silico study was carried out to analyze publicly available data. This helped limit specific candidate genes and variants which will be helpful in future population-based screenings in understanding breast cancer prognosis with a provision to develop population specific personalized drug.