Pub Date : 2021-12-01Epub Date: 2021-12-31DOI: 10.5808/gi.21049
Sol Lee, Miyoung Nam, Ah-Reum Lee, Seung-Tae Baek, Min Jung Kim, Ju Seong Kim, Andrew Hyunsoo Kong, Minho Lee, Sook-Jeong Lee, Seon-Young Kim, Dong-Uk Kim, Kwang-Lae Hoe
Tamoxifen (TAM) is an anticancer drug used to treat estrogen receptor (ER)‒positive breast cancer. However, its ER-independent cytotoxic and antifungal activities have prompted debates on its mechanism of action. To achieve a better understanding of the ER-independent antifungal action mechanisms of TAM, we systematically identified TAM-sensitive genes through microarray screening of the heterozygous gene deletion library in fission yeast (Schizosaccharomyces pombe). Secondary confirmation was followed by a spotting assay, finally yielding 13 TAM-sensitive genes under the drug-induced haploinsufficient condition. For these 13 TAM-sensitive genes, we conducted a comparative analysis of their Gene Ontology (GO) 'biological process' terms identified from other genome-wide screenings of the budding yeast deletion library and the MCF7 breast cancer cell line. Several TAM-sensitive genes overlapped between the yeast strains and MCF7 in GO terms including 'cell cycle' (cdc2, rik1, pas1, and leo1), 'signaling' (sck2, oga1, and cki3), and 'vesicle-mediated transport' (SPCC126.08c, vps54, sec72, and tvp15), suggesting their roles in the ER-independent cytotoxic effects of TAM. We recently reported that the cki3 gene with the 'signaling' GO term was related to the ER-independent antifungal action mechanisms of TAM in yeast. In this study, we report that haploinsufficiency of the essential vps54 gene, which encodes the GARP complex subunit, significantly aggravated TAM sensitivity and led to an enlarged vesicle structure in comparison with the SP286 control strain. These results strongly suggest that the vesicle-mediated transport process might be another action mechanism of the ER-independent antifungal or cytotoxic effects of TAM.
{"title":"Knockdown of vps54 aggravates tamoxifen-induced cytotoxicity in fission yeast.","authors":"Sol Lee, Miyoung Nam, Ah-Reum Lee, Seung-Tae Baek, Min Jung Kim, Ju Seong Kim, Andrew Hyunsoo Kong, Minho Lee, Sook-Jeong Lee, Seon-Young Kim, Dong-Uk Kim, Kwang-Lae Hoe","doi":"10.5808/gi.21049","DOIUrl":"https://doi.org/10.5808/gi.21049","url":null,"abstract":"<p><p>Tamoxifen (TAM) is an anticancer drug used to treat estrogen receptor (ER)‒positive breast cancer. However, its ER-independent cytotoxic and antifungal activities have prompted debates on its mechanism of action. To achieve a better understanding of the ER-independent antifungal action mechanisms of TAM, we systematically identified TAM-sensitive genes through microarray screening of the heterozygous gene deletion library in fission yeast (Schizosaccharomyces pombe). Secondary confirmation was followed by a spotting assay, finally yielding 13 TAM-sensitive genes under the drug-induced haploinsufficient condition. For these 13 TAM-sensitive genes, we conducted a comparative analysis of their Gene Ontology (GO) 'biological process' terms identified from other genome-wide screenings of the budding yeast deletion library and the MCF7 breast cancer cell line. Several TAM-sensitive genes overlapped between the yeast strains and MCF7 in GO terms including 'cell cycle' (cdc2, rik1, pas1, and leo1), 'signaling' (sck2, oga1, and cki3), and 'vesicle-mediated transport' (SPCC126.08c, vps54, sec72, and tvp15), suggesting their roles in the ER-independent cytotoxic effects of TAM. We recently reported that the cki3 gene with the 'signaling' GO term was related to the ER-independent antifungal action mechanisms of TAM in yeast. In this study, we report that haploinsufficiency of the essential vps54 gene, which encodes the GARP complex subunit, significantly aggravated TAM sensitivity and led to an enlarged vesicle structure in comparison with the SP286 control strain. These results strongly suggest that the vesicle-mediated transport process might be another action mechanism of the ER-independent antifungal or cytotoxic effects of TAM.</p>","PeriodicalId":36591,"journal":{"name":"Genomics and Informatics","volume":"19 4","pages":"e39"},"PeriodicalIF":0.0,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8752990/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39929495","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2021-12-01Epub Date: 2021-12-31DOI: 10.5808/gi.21053
Wonil Chung
Predicting individual traits and diseases from genetic variants is critical to fulfilling the promise of personalized medicine. The genetic variants from genome-wide association studies (GWAS), including variants well below GWAS significance, can be aggregated into highly significant predictions across a wide range of complex traits and diseases. The recent arrival of large-sample public biobanks enables highly accurate polygenic predictions based on genetic variants across the whole genome. Various statistical methodologies and diverse computational tools have been introduced and developed to computed the polygenic risk score (PRS) more accurately. However, many researchers utilize PRS tools without a thorough understanding of the underlying model and how to specify the parameters for the best performance. It is advantageous to study the statistical models implemented in computational tools for PRS estimation and the formulas of parameters to be specified. Here, we review a variety of recent statistical methodologies and computational tools for PRS computation.
{"title":"Statistical models and computational tools for predicting complex traits and diseases.","authors":"Wonil Chung","doi":"10.5808/gi.21053","DOIUrl":"https://doi.org/10.5808/gi.21053","url":null,"abstract":"<p><p>Predicting individual traits and diseases from genetic variants is critical to fulfilling the promise of personalized medicine. The genetic variants from genome-wide association studies (GWAS), including variants well below GWAS significance, can be aggregated into highly significant predictions across a wide range of complex traits and diseases. The recent arrival of large-sample public biobanks enables highly accurate polygenic predictions based on genetic variants across the whole genome. Various statistical methodologies and diverse computational tools have been introduced and developed to computed the polygenic risk score (PRS) more accurately. However, many researchers utilize PRS tools without a thorough understanding of the underlying model and how to specify the parameters for the best performance. It is advantageous to study the statistical models implemented in computational tools for PRS estimation and the formulas of parameters to be specified. Here, we review a variety of recent statistical methodologies and computational tools for PRS computation.</p>","PeriodicalId":36591,"journal":{"name":"Genomics and Informatics","volume":"19 4","pages":"e36"},"PeriodicalIF":0.0,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8752975/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39669904","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2021-12-01Epub Date: 2021-12-31DOI: 10.5808/gi.21029
Rajat Hegde, Smita Hegde, Suyamindra S Kulkarni, Aditya Pandurangi, Pramod B Gai, Kusal K Das
Autism is a complex neurodevelopmental disorder, the prevalence of which has increased drastically in India in recent years. Neuroligin is a type I transmembrane protein that plays a crucial role in synaptogenesis. Alterations in synaptic genes are most commonly implicated in autism and other cognitive disorders. The present study investigated the neuroligin 3 gene in the Indian autistic population by sequencing and in silico pathogenicity prediction of molecular changes. In total, 108 clinically described individuals with autism were included from the North Karnataka region of India, along with 150 age-, sex-, and ethnicity-matched healthy controls. Genomic DNA was extracted from peripheral blood, and exonic regions were sequenced. The functional and structural effects of variants of the neuroligin 3 protein were predicted. One coding sequence variant (a missense variant) and four non-coding variants (two 5'-untranslated region [UTR] variants and two 3'-UTR variants) were recorded. The novel missense variant was found in 25% of the autistic population. The C/C genotype of c.551T>C was significantly more common in autistic children than in controls (p = 0.001), and a significantly increased risk of autism (24.7-fold) was associated with this genotype (p = 0.001). The missense variant showed pathogenic effects and high evolutionary conservation over the functions of the neuroligin 3 protein. In the present study, we reported a novel missense variant, V184A, which causes abnormal neuroligin 3 and was found with high frequency in the Indian autistic population. Therefore, neuroligin is a candidate gene for future molecular investigations and functional analysis in the Indian autistic population.
{"title":"Genetic analysis of the postsynaptic transmembrane X-linked neuroligin 3 gene in autism.","authors":"Rajat Hegde, Smita Hegde, Suyamindra S Kulkarni, Aditya Pandurangi, Pramod B Gai, Kusal K Das","doi":"10.5808/gi.21029","DOIUrl":"https://doi.org/10.5808/gi.21029","url":null,"abstract":"<p><p>Autism is a complex neurodevelopmental disorder, the prevalence of which has increased drastically in India in recent years. Neuroligin is a type I transmembrane protein that plays a crucial role in synaptogenesis. Alterations in synaptic genes are most commonly implicated in autism and other cognitive disorders. The present study investigated the neuroligin 3 gene in the Indian autistic population by sequencing and in silico pathogenicity prediction of molecular changes. In total, 108 clinically described individuals with autism were included from the North Karnataka region of India, along with 150 age-, sex-, and ethnicity-matched healthy controls. Genomic DNA was extracted from peripheral blood, and exonic regions were sequenced. The functional and structural effects of variants of the neuroligin 3 protein were predicted. One coding sequence variant (a missense variant) and four non-coding variants (two 5'-untranslated region [UTR] variants and two 3'-UTR variants) were recorded. The novel missense variant was found in 25% of the autistic population. The C/C genotype of c.551T>C was significantly more common in autistic children than in controls (p = 0.001), and a significantly increased risk of autism (24.7-fold) was associated with this genotype (p = 0.001). The missense variant showed pathogenic effects and high evolutionary conservation over the functions of the neuroligin 3 protein. In the present study, we reported a novel missense variant, V184A, which causes abnormal neuroligin 3 and was found with high frequency in the Indian autistic population. Therefore, neuroligin is a candidate gene for future molecular investigations and functional analysis in the Indian autistic population.</p>","PeriodicalId":36591,"journal":{"name":"Genomics and Informatics","volume":"19 4","pages":"e44"},"PeriodicalIF":0.0,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8752989/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39684632","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Nowadays, Genomic data constitutes one of the fastest growing datasets in the world. As of 2025, it is supposed to become the fourth largest source of Big Data, and thus mandating adequate high-performance computing (HPC) platform for processing. With the latest unprecedented and unpredictable mutations in severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), the research community is in crucial need for ICT tools to process SARS-CoV-2 RNA data, e.g., by classifying it (i.e., clustering) and thus assisting in tracking virus mutations and predict future ones. In this paper, we are presenting an HPC-based SARS-CoV-2 RNAs clustering tool. We are adopting a data science approach, from data collection, through analysis, to visualization. In the analysis step, we present how our clustering approach leverages on HPC and the longest common subsequence (LCS) algorithm. The approach uses the Hadoop MapReduce programming paradigm and adapts the LCS algorithm in order to efficiently compute the length of the LCS for each pair of SARS-CoV-2 RNA sequences. The latter are extracted from the U.S. National Center for Biotechnology Information (NCBI) Virus repository. The computed LCS lengths are used to measure the dissimilarities between RNA sequences in order to work out existing clusters. In addition to that, we present a comparative study of the LCS algorithm performance based on variable workloads and different numbers of Hadoop worker nodes.
{"title":"High-performance computing for SARS-CoV-2 RNAs clustering: a data science‒based genomics approach.","authors":"Anas Oujja, Mohamed Riduan Abid, Jaouad Boumhidi, Safae Bourhnane, Asmaa Mourhir, Fatima Merchant, Driss Benhaddou","doi":"10.5808/gi.21056","DOIUrl":"https://doi.org/10.5808/gi.21056","url":null,"abstract":"<p><p>Nowadays, Genomic data constitutes one of the fastest growing datasets in the world. As of 2025, it is supposed to become the fourth largest source of Big Data, and thus mandating adequate high-performance computing (HPC) platform for processing. With the latest unprecedented and unpredictable mutations in severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), the research community is in crucial need for ICT tools to process SARS-CoV-2 RNA data, e.g., by classifying it (i.e., clustering) and thus assisting in tracking virus mutations and predict future ones. In this paper, we are presenting an HPC-based SARS-CoV-2 RNAs clustering tool. We are adopting a data science approach, from data collection, through analysis, to visualization. In the analysis step, we present how our clustering approach leverages on HPC and the longest common subsequence (LCS) algorithm. The approach uses the Hadoop MapReduce programming paradigm and adapts the LCS algorithm in order to efficiently compute the length of the LCS for each pair of SARS-CoV-2 RNA sequences. The latter are extracted from the U.S. National Center for Biotechnology Information (NCBI) Virus repository. The computed LCS lengths are used to measure the dissimilarities between RNA sequences in order to work out existing clusters. In addition to that, we present a comparative study of the LCS algorithm performance based on variable workloads and different numbers of Hadoop worker nodes.</p>","PeriodicalId":36591,"journal":{"name":"Genomics and Informatics","volume":"19 4","pages":"e49"},"PeriodicalIF":0.0,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8752974/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39684634","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2021-12-01Epub Date: 2021-12-31DOI: 10.5808/gi.21074
Ji-Hye Oh, Hee-Jo Nam, Hyun-Seok Park
This study explored the trends of Genomics & Informatics during the period of 2003-2018 in comparison with 11 other scholarly journals: BMC Bioinformatics, Algorithms for Molecular Biology: AMB, BMC Systems Biology, Journal of Computational Biology, Briefings in Bioinformatics, BMC Genomics, Nucleic Acids Research, American Journal of Human Genetics, Oncogenesis, Disease Markers, and Microarrays. In total, 22,423 research articles were reviewed. Content analysis was the main method employed in the current research. The results were interpreted using descriptive analysis, a clustering analysis, word embedding, and deep learning techniques. Trends are discussed for the 12 journals, both individually and collectively. This is an extension of our previous study (PMCID: PMC6808643).
{"title":"Estimation of the journal distance of Genomics & Informatics from other bioinformatics-driven journals, 2003-2018.","authors":"Ji-Hye Oh, Hee-Jo Nam, Hyun-Seok Park","doi":"10.5808/gi.21074","DOIUrl":"https://doi.org/10.5808/gi.21074","url":null,"abstract":"<p><p>This study explored the trends of Genomics & Informatics during the period of 2003-2018 in comparison with 11 other scholarly journals: BMC Bioinformatics, Algorithms for Molecular Biology: AMB, BMC Systems Biology, Journal of Computational Biology, Briefings in Bioinformatics, BMC Genomics, Nucleic Acids Research, American Journal of Human Genetics, Oncogenesis, Disease Markers, and Microarrays. In total, 22,423 research articles were reviewed. Content analysis was the main method employed in the current research. The results were interpreted using descriptive analysis, a clustering analysis, word embedding, and deep learning techniques. Trends are discussed for the 12 journals, both individually and collectively. This is an extension of our previous study (PMCID: PMC6808643).</p>","PeriodicalId":36591,"journal":{"name":"Genomics and Informatics","volume":"19 4","pages":"e51"},"PeriodicalIF":0.0,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8752985/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39684637","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2021-12-01Epub Date: 2021-12-31DOI: 10.5808/gi.21055
Mayur Mukut Murlidhar Sharma, Rahul Vasudeo Ramekar, Nam-Il Park, Ik-Young Choi, Seon-Kang Choi, Kyong-Cheul Park
Brassica napus is the third most important oilseed crop in the world; however, in Korea, it is greatly affected by cold stress, limiting seed growth and production. Plants have developed specific stress responses that are generally divided into three categories: cold-stress signaling, transcriptional/post-transcriptional regulation, and stress-response mechanisms. Large numbers of functional and regulatory proteins are involved in these processes when triggered by cold stress. Here, our objective was to investigate the different genetic factors involved in the cold-stress responses of B. napus. Consequently, we treated the Korean B. napus cultivar Naehan at the 4-week stage in cold chambers under different conditions, and RNA and cDNA were obtained. An in silico analysis included 80 cold-responsive genes downloaded from the National Center for Biotechnology Information (NCBI) database. Expression levels were assessed by reverse transcription polymerase chain reaction, and 14 cold-triggered genes were identified under cold-stress conditions. The most significant genes encoded zinc-finger proteins (33.7%), followed by MYB transcription factors (7.5%). In the future, we will select genes appropriate for improving the cold tolerance of B. napus.
{"title":"Characterization of transcription factor genes related to cold tolerance in Brassica napus.","authors":"Mayur Mukut Murlidhar Sharma, Rahul Vasudeo Ramekar, Nam-Il Park, Ik-Young Choi, Seon-Kang Choi, Kyong-Cheul Park","doi":"10.5808/gi.21055","DOIUrl":"https://doi.org/10.5808/gi.21055","url":null,"abstract":"Brassica napus is the third most important oilseed crop in the world; however, in Korea, it is greatly affected by cold stress, limiting seed growth and production. Plants have developed specific stress responses that are generally divided into three categories: cold-stress signaling, transcriptional/post-transcriptional regulation, and stress-response mechanisms. Large numbers of functional and regulatory proteins are involved in these processes when triggered by cold stress. Here, our objective was to investigate the different genetic factors involved in the cold-stress responses of B. napus. Consequently, we treated the Korean B. napus cultivar Naehan at the 4-week stage in cold chambers under different conditions, and RNA and cDNA were obtained. An in silico analysis included 80 cold-responsive genes downloaded from the National Center for Biotechnology Information (NCBI) database. Expression levels were assessed by reverse transcription polymerase chain reaction, and 14 cold-triggered genes were identified under cold-stress conditions. The most significant genes encoded zinc-finger proteins (33.7%), followed by MYB transcription factors (7.5%). In the future, we will select genes appropriate for improving the cold tolerance of B. napus.","PeriodicalId":36591,"journal":{"name":"Genomics and Informatics","volume":"19 4","pages":"e45"},"PeriodicalIF":0.0,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8752983/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39929498","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2021-12-01Epub Date: 2021-12-31DOI: 10.5808/gi.19.4.e1
Taesung Park
2021 Korea Genome Organization This is an open-access article distributed under the terms of the Creative Commons Attribution license (http://creativecommons. org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. This issue contains one review article, 12 original articles, one application note, and two opinions. In this editorial, I would like to focus on three articles about genetic association studies. The first article, by Wonil Chung (Soongsil University, Korea), provides a good review of statistical models and computational tools for predicting complex traits and diseases. The author focused on utilizing large-sample public biobanks to perform accurate polygenic predictions based on genetic variants across the whole genome based on polygenic risk scores (PRS). The authors reviewed various statistical methodologies and diverse computational tools that have been developed to estimate PRS more accurately from genome-wide association studies (GWASs). The author emphasized that the successful utilization of PRS tools requires a thorough understanding of what the underlying model is and how to specify the parameters to achieve the best performance. I think that this review is quite informative for researchers working on PRS computation. Next, an original article by Young Jin Kim’s group at National Institute of Health, Korea also presents a large-scale GWAS. The Division of Genome Science, Department of Precision Medicine, National Institute of Health of Korea, with which the authors are affiliated, has been in charge of the Korean Genome Epidemiology Study (KoGES) since 2001 [1]. KoGES is a cohort study focusing on the prevention of major chronic diseases such as type 2 diabetes (T2D) and hypertension. The Korea Biobank Array (KBA) was recently developed for genomic studies in the Korean population. The optimized KBA comprises approximately 830K variants [2]. Using 125,850 samples from a Korean population genotyped by the KBA, the authors validated known associations with T2D and related metabolic traits. To the best of my knowledge, this is one of the largest GWASs for T2D in Korean population. The authors considered the imputed datasets available in the KBA genomic data, publicly available East-Asian T2D summary statistics, and the linkage disequilibrium among the variants. The authors identified 1,837 statistically significant (p < 0.05) variants. Although the 0.05 threshold is relatively non-stringent, the identified variants can be used for valuable scientific evidence in future study designs, evaluations of the current power of GWASs, and future applications in precision medicine for the Korean population. Next, an original article by Jong-Keuk Lee’s group at the Asan Institute for Life Sciences, Asan Medical Center, Seoul, Korea presents a genetic study identifying rare coding variants associated with Kawasaki disease (KD) by whole-exome sequencin
{"title":"Editor's introduction to this issue (G&I 19:4, 2021).","authors":"Taesung Park","doi":"10.5808/gi.19.4.e1","DOIUrl":"https://doi.org/10.5808/gi.19.4.e1","url":null,"abstract":"2021 Korea Genome Organization This is an open-access article distributed under the terms of the Creative Commons Attribution license (http://creativecommons. org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. This issue contains one review article, 12 original articles, one application note, and two opinions. In this editorial, I would like to focus on three articles about genetic association studies. The first article, by Wonil Chung (Soongsil University, Korea), provides a good review of statistical models and computational tools for predicting complex traits and diseases. The author focused on utilizing large-sample public biobanks to perform accurate polygenic predictions based on genetic variants across the whole genome based on polygenic risk scores (PRS). The authors reviewed various statistical methodologies and diverse computational tools that have been developed to estimate PRS more accurately from genome-wide association studies (GWASs). The author emphasized that the successful utilization of PRS tools requires a thorough understanding of what the underlying model is and how to specify the parameters to achieve the best performance. I think that this review is quite informative for researchers working on PRS computation. Next, an original article by Young Jin Kim’s group at National Institute of Health, Korea also presents a large-scale GWAS. The Division of Genome Science, Department of Precision Medicine, National Institute of Health of Korea, with which the authors are affiliated, has been in charge of the Korean Genome Epidemiology Study (KoGES) since 2001 [1]. KoGES is a cohort study focusing on the prevention of major chronic diseases such as type 2 diabetes (T2D) and hypertension. The Korea Biobank Array (KBA) was recently developed for genomic studies in the Korean population. The optimized KBA comprises approximately 830K variants [2]. Using 125,850 samples from a Korean population genotyped by the KBA, the authors validated known associations with T2D and related metabolic traits. To the best of my knowledge, this is one of the largest GWASs for T2D in Korean population. The authors considered the imputed datasets available in the KBA genomic data, publicly available East-Asian T2D summary statistics, and the linkage disequilibrium among the variants. The authors identified 1,837 statistically significant (p < 0.05) variants. Although the 0.05 threshold is relatively non-stringent, the identified variants can be used for valuable scientific evidence in future study designs, evaluations of the current power of GWASs, and future applications in precision medicine for the Korean population. Next, an original article by Jong-Keuk Lee’s group at the Asan Institute for Life Sciences, Asan Medical Center, Seoul, Korea presents a genetic study identifying rare coding variants associated with Kawasaki disease (KD) by whole-exome sequencin","PeriodicalId":36591,"journal":{"name":"Genomics and Informatics","volume":"19 4","pages":"e35"},"PeriodicalIF":0.0,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8752986/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39669903","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2021-12-01Epub Date: 2021-12-31DOI: 10.5808/gi.21047
Hye Young Jeong, Jinseon Yoo, Hyunwoo Kim, Tae-Min Kim
Mutation signatures represent unique sequence footprints of somatic mutations resulting from specific DNA mutagenic and repair processes. However, their causal associations and the potential utility for genome research remain largely unknown. In this study, we performed PanCancer-scale correlative analyses to identify the genomic features associated with tumor mutation burdens (TMB) and individual mutation signatures. We observed that TMB was correlated with tumor purity, ploidy, and the level of aneuploidy, as well as with the expression of cell proliferation-related genes representing genomic covariates in evaluating TMB. Correlative analyses of mutation signature levels with genes belonging to specific DNA damage-repair processes revealed that deficiencies of NHEJ1 and ALKBH3 may contribute to mutations in the settings of APOBEC cytidine deaminase activation and DNA mismatch repair deficiency, respectively. We further employed a strategy to identify feature-driven, de novo mutation signatures and demonstrated that mutation signatures can be reconstructed using known causal features. Using the strategy, we further identified tumor hypoxia-related mutation signatures similar to the APOBEC-related mutation signatures, suggesting that APOBEC activity mediates hypoxia-related mutational consequences in cancer genomes. Our study advances the mechanistic insights into the TMB and signature-based DNA mutagenic and repair processes in cancer genomes. We also propose that feature-driven mutation signature analysis can further extend the categories of cancer-relevant mutation signatures and their causal relationships.
{"title":"Correlation-based and feature-driven mutation signature analyses to identify genetic features associated with DNA mutagenic processes in cancer genomes.","authors":"Hye Young Jeong, Jinseon Yoo, Hyunwoo Kim, Tae-Min Kim","doi":"10.5808/gi.21047","DOIUrl":"https://doi.org/10.5808/gi.21047","url":null,"abstract":"<p><p>Mutation signatures represent unique sequence footprints of somatic mutations resulting from specific DNA mutagenic and repair processes. However, their causal associations and the potential utility for genome research remain largely unknown. In this study, we performed PanCancer-scale correlative analyses to identify the genomic features associated with tumor mutation burdens (TMB) and individual mutation signatures. We observed that TMB was correlated with tumor purity, ploidy, and the level of aneuploidy, as well as with the expression of cell proliferation-related genes representing genomic covariates in evaluating TMB. Correlative analyses of mutation signature levels with genes belonging to specific DNA damage-repair processes revealed that deficiencies of NHEJ1 and ALKBH3 may contribute to mutations in the settings of APOBEC cytidine deaminase activation and DNA mismatch repair deficiency, respectively. We further employed a strategy to identify feature-driven, de novo mutation signatures and demonstrated that mutation signatures can be reconstructed using known causal features. Using the strategy, we further identified tumor hypoxia-related mutation signatures similar to the APOBEC-related mutation signatures, suggesting that APOBEC activity mediates hypoxia-related mutational consequences in cancer genomes. Our study advances the mechanistic insights into the TMB and signature-based DNA mutagenic and repair processes in cancer genomes. We also propose that feature-driven mutation signature analysis can further extend the categories of cancer-relevant mutation signatures and their causal relationships.</p>","PeriodicalId":36591,"journal":{"name":"Genomics and Informatics","volume":"19 4","pages":"e40"},"PeriodicalIF":0.0,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8752981/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39929496","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2021-12-01Epub Date: 2021-12-31DOI: 10.5808/gi.21061
Dongmoung Kim, Seung-Hyun Jung, Yeun-Jun Chung
In addition to mutations and copy number alterations, gene fusions are commonly identified in cancers. In thyroid cancer, fusions of important cancer-related genes have been commonly reported; however, extant panels do not cover all clinically important gene fusions. In this study, we aimed to develop a custom RNA-based sequencing panel to identify the key fusions in thyroid cancer. Our ThyChase panel was designed to detect 87 types of gene fusion. As quality control of RNA sequencing, five housekeeping genes were included in this panel. When we applied this panel for the analysis of fusions containing reference RNA (HD796), three expected fusions (EML4-ALK, CCDC6-RET, and TPM3-NTRK1) were successfully identified. We confirmed the fusion breakpoint sequences of the three fusions from HD796 by Sanger sequencing. Regarding the limit of detection, this panel could detect the target fusions from a tumor sample containing a 1% fusion-positive tumor cellular fraction. Taken together, our ThyChase panel would be useful to identify gene fusions in the clinical field.
{"title":"Development of an RNA sequencing panel to detect gene fusions in thyroid cancer.","authors":"Dongmoung Kim, Seung-Hyun Jung, Yeun-Jun Chung","doi":"10.5808/gi.21061","DOIUrl":"https://doi.org/10.5808/gi.21061","url":null,"abstract":"<p><p>In addition to mutations and copy number alterations, gene fusions are commonly identified in cancers. In thyroid cancer, fusions of important cancer-related genes have been commonly reported; however, extant panels do not cover all clinically important gene fusions. In this study, we aimed to develop a custom RNA-based sequencing panel to identify the key fusions in thyroid cancer. Our ThyChase panel was designed to detect 87 types of gene fusion. As quality control of RNA sequencing, five housekeeping genes were included in this panel. When we applied this panel for the analysis of fusions containing reference RNA (HD796), three expected fusions (EML4-ALK, CCDC6-RET, and TPM3-NTRK1) were successfully identified. We confirmed the fusion breakpoint sequences of the three fusions from HD796 by Sanger sequencing. Regarding the limit of detection, this panel could detect the target fusions from a tumor sample containing a 1% fusion-positive tumor cellular fraction. Taken together, our ThyChase panel would be useful to identify gene fusions in the clinical field.</p>","PeriodicalId":36591,"journal":{"name":"Genomics and Informatics","volume":"19 4","pages":"e41"},"PeriodicalIF":0.0,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8752984/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39929497","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2021-12-01Epub Date: 2021-12-31DOI: 10.5808/gi.21062
Sonali Rath, Manaswini Jagadeb, Ruchi Bhuyan
Moringa oleifera is nowadays raising as the most preferred medicinal plant, as every part of the moringa plant has potential bioactive compounds which can be used as herbal medicines. Some bioactive compounds of M. oleifera possess potential anti-cancer properties which interact with the apoptosis protein p53 in cancer cell lines of oral squamous cell carcinoma. This research work focuses on the interaction among the selected bioactive compounds derived from M. oleifera with targeted apoptosis protein p53 from the apoptosis pathway to check whether the bioactive compound will induce apoptosis after the mutation in p53. To check the toxicity and drug-likeness of the selected bioactive compound derived from M. oleifera based on Lipinski's Rule of Five. Detailed analysis of the 3D structure of apoptosis protein p53. To analyze protein's active site by CASTp 3.0 server. Molecular docking and binding affinity were analyzed between protein p53 with selected bioactive compounds in order to find the most potential inhibitor against the target. This study shows the docking between the potential bioactive compounds with targeted apoptosis protein p53. Quercetin was the most potential bioactive compound whereas kaempferol shows poor affinity towards the targeted p53 protein in the apoptosis pathway. Thus, the objective of this research can provide an insight prediction towards M. oleifera derived bioactive compounds and target apoptosis protein p53 in the structural analysis for compound isolation and in-vivo experiments on the cancer cell line.
{"title":"Molecular docking of bioactive compounds derived from Moringa oleifera with p53 protein in the apoptosis pathway of oral squamous cell carcinoma.","authors":"Sonali Rath, Manaswini Jagadeb, Ruchi Bhuyan","doi":"10.5808/gi.21062","DOIUrl":"10.5808/gi.21062","url":null,"abstract":"<p><p>Moringa oleifera is nowadays raising as the most preferred medicinal plant, as every part of the moringa plant has potential bioactive compounds which can be used as herbal medicines. Some bioactive compounds of M. oleifera possess potential anti-cancer properties which interact with the apoptosis protein p53 in cancer cell lines of oral squamous cell carcinoma. This research work focuses on the interaction among the selected bioactive compounds derived from M. oleifera with targeted apoptosis protein p53 from the apoptosis pathway to check whether the bioactive compound will induce apoptosis after the mutation in p53. To check the toxicity and drug-likeness of the selected bioactive compound derived from M. oleifera based on Lipinski's Rule of Five. Detailed analysis of the 3D structure of apoptosis protein p53. To analyze protein's active site by CASTp 3.0 server. Molecular docking and binding affinity were analyzed between protein p53 with selected bioactive compounds in order to find the most potential inhibitor against the target. This study shows the docking between the potential bioactive compounds with targeted apoptosis protein p53. Quercetin was the most potential bioactive compound whereas kaempferol shows poor affinity towards the targeted p53 protein in the apoptosis pathway. Thus, the objective of this research can provide an insight prediction towards M. oleifera derived bioactive compounds and target apoptosis protein p53 in the structural analysis for compound isolation and in-vivo experiments on the cancer cell line.</p>","PeriodicalId":36591,"journal":{"name":"Genomics and Informatics","volume":"19 4","pages":"e46"},"PeriodicalIF":0.0,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8752987/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39684633","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}