Clinicopathological features of high-frequency microsatellite instability (MSI-H) colorectal cancers (CRCs) are different from low-frequency MSI (MSI-L) and microsatellite stable (MSS) CRCs. The clinical features of MSI-L cases are unknown, and although the tumors usually show instability for dinucleotide markers, evaluation based on dinucleotides alone could lead to the misclassification of MSI-L or MSS as MSI-H. In this research, we investigated the usefulness of hypoxia-inducible factor-1α (HIF-1α) expression to discriminate MSI-L from MSS and MSI-H in human CRC. Tumor tissue from 94 CRC patients was used to determine the expression level of HIF-1α mRNA and HIF-1α protein using quantitative real-time PCR and immunohistochemistry analyses, respectively. The results indicated that HIF-1α mRNA and HIF-1α protein levels were upregulated in CRC patients compared with controls (P < 0.0001). Average HIF-1α expression in tissues with advanced stages and grades was also higher than that in earlier stages and grades. Expression of HIF-1α mRNA varied between CRC patients with different types of microsatellite instability (MSS, MSI-L and MSI-H). Taken together, our findings provide preliminary evidence that HIF-1α expression level in CRC tumors correlates with different MSI categories. HIF-1α expression may therefore represent a novel marker to separate the MSI-L group from the MSS and MSI-H groups.
Colorectal cancer (CRC), which includes the development of cancer from the colon or rectum, is one of the highly prevalent cancers in the populations of Jammu and Kashmir (J&K) in India. However, case-control genetic association studies on CRC are lacking in this population. Various genome-wide association studies have previously shown that single-nucleotide polymorphisms (SNPs) of the AT-rich interaction domain 5B (ARID5B) gene located on chromosome 10q21.2 contribute substantially to the development of colorectal cancer. The association between ARID5B and CRC risk in north Indian population groups is still unknown. To understand the role of ARID5B SNPs in CRC in the population of J&K, we designed a case-control study to investigate the association of the cancer susceptibility variant rs10740055 of ARID5B with CRC in the population of J&K. The study included 180 cases and 390 healthy controls. Genotyping of the rs10740055 variant was performed by RT-PCR using the TaqMan assay technique. Hardy-Weinberg equilibrium of the variant was assessed using the chi-squared test. The allele- and genotype-specific risks were estimated by odds ratios (ORs) with 95% confidence intervals (CIs). The rs10740055 variant showed a higher risk for colorectal cancer with an OR of 3.35 (1.99-5.65 at 95% CI) and P = 0.000005 corrected for age, gender, ethnicity, BMI, alcohol intake and smoking. Our results indicate that the A allele of rs10740055 imparts risk to the population and also that a larger sample size is needed for further statistical validation. The association of other variants in other ARID family genes should also be tested as their role cannot be ruled out.
Many plant and animal species exhibit geographic parthenogenesis, wherein unisexual (= parthenogenetic) lineages are more common in their marginal habitats such as high latitude or altitudes than their closely related bisexual counterparts. The Japanese stick insect, Phraortes elongatus (Thunberg) (Insecta: Phasmatodea), is known as a geographically parthenogenetic species due to the existence of both bisexual and unisexual populations. Here, we developed microsatellite markers to infer the genetic variation among populations of P. elongatus. Totally, 13 primer pairs were developed for the species, and they were tested on 47 samples collected from both a bisexual population and a unisexual population. All 13 loci were polymorphic in the bisexual population, whereas no loci were polymorphic in the unisexual population. The loss of variation in the unisexual population implies automixis with terminal fusion or gamete duplication as the mode of parthenogenesis. The markers developed in this study will be helpful for further comprehensive analysis of the genetic diversity and gene flow between bisexual and parthenogenetic lineages of P. elongatus.
In genetics and related fields, huge amounts of data, such as genome sequences, are accumulating, and the use of artificial intelligence (AI) suitable for big data analysis has become increasingly important. Unsupervised AI that can reveal novel knowledge from big data without prior knowledge or particular models is highly desirable for analyses of genome sequences, particularly for obtaining unexpected insights. We have developed a batch-learning self-organizing map (BLSOM) for oligonucleotide compositions that can reveal various novel genome characteristics. Here, we explain the data mining by the BLSOM: an unsupervised AI. As a specific target, we first selected SARS-CoV-2 (severe acute respiratory syndrome coronavirus 2) because a large number of viral genome sequences have been accumulated via worldwide efforts. We analyzed more than 0.6 million sequences collected primarily in the first year of the pandemic. BLSOMs for short oligonucleotides (e.g., 4-6-mers) allowed separation into known clades, but longer oligonucleotides further increased the separation ability and revealed subgrouping within known clades. In the case of 15-mers, there is mostly one copy in the genome; thus, 15-mers that appeared after the epidemic started could be connected to mutations, and the BLSOM for 15-mers revealed the mutations that contributed to separation into known clades and their subgroups. After introducing the detailed methodological strategies, we explain BLSOMs for various topics, such as the tetranucleotide BLSOM for over 5 million 5-kb fragment sequences derived from almost all microorganisms currently available and its use in metagenome studies. We also explain BLSOMs for various eukaryotes, including fishes, frogs and Drosophila species, and found a high separation ability among closely related species. When analyzing the human genome, we found enrichments in transcription factor-binding sequences in centromeric and pericentromeric heterochromatin regions. The tDNAs (tRNA genes) could be separated according to their corresponding amino acid.
Sperm are modified substantially in passing through both the male and the female reproductive tracts, only thereafter becoming functionally competent to fertilize eggs. Drosophila sperm become motile in the seminal vesicle; after ejaculation, they interact with seminal fluid proteins and undergo biochemical changes on their surface while they are stored in the female sperm storage organs. However, the molecular mechanisms underlying these maturation processes remain largely unknown. Here, we focused on Drosophila Neprilysin genes, which are the fly orthologs of the mouse Membrane metallo-endopeptidase-like 1 (Mmel1) gene. While Mmel1 knockout male mice have reduced fertility without abnormality in either testis morphology or sperm motility, there are inconsistent results regarding the association of any Neprilysin gene with male fertility in Drosophila. We examined the association of the Nep1-5 genes with male fertility by RNAi and found that Nep4 gene function is specifically required in germline cells. To investigate this in more detail, we induced mutations in the Nep4 gene by the CRISPR/Cas9 system and isolated two mutants, both of which were viable and female fertile, but male sterile. The mutant males had normal-looking testes and sperm; during copulation, sperm were transferred to females and stored in the seminal receptacle and paired spermathecae. However, following sperm transfer and storage, three defects were observed for Nep4 mutant sperm. First, sperm were quickly discarded by the females; second, the proportion of eggs fertilized was significantly lower for mutant sperm than for control sperm; and third, most eggs laid did not initiate development after sperm entry. Taking these observations together, we conclude that the Nep4 gene is essential for sperm function following sperm transfer to females.