The Culicidae family has two of the most important disease vector genus: Aedes spp. and Culex spp. Both of these are involved in the transmission of arboviruses. Here, we provide novel data for the geographical distribution of 2,383 specimens in the Culicidae family. We also report the percentage of houses infested with these vectors, and Shannon and Simpson diversity indices in three municipalities located in Cauca, Colombia. This dataset is relevant for research on vector-borne diseases because Aedes spp. can transmit arboviruses such as dengue, Zika and chikungunya, and Culex spp. is a well-known vector of West Nile virus and Venezuelan equine encephalitis.
We present LT1, the first high-quality human reference genome from the Baltic States. LT1 is a female de novo human reference genome assembly, constructed using 57× nanopore long reads and polished using 47× short paired-end reads. We utilized 72 GB of Hi-C chromosomal mapping data for scaffolding, to maximize assembly contiguity and accuracy. The contig assembly of LT1 was 2.73 Gbp in length, comprising 4490 contigs with an NG50 value of 12.0 Mbp. After scaffolding with Hi-C data and manual curation, the final assembly has an NG50 value of 137 Mbp and 4699 scaffolds. Assessment of gene prediction quality using Benchmarking Universal Single-Copy Orthologs (BUSCO) identified 89.3% of the single-copy orthologous genes included in the benchmark. Detailed characterization of LT1 suggests it has 73,744 predicted transcripts, 4.2 million autosomal SNPs, 974,616 short indels, and 12,079 large structural variants. These data may be used as a benchmark for further in-depth genomic analyses of Baltic populations.
Between 2012 and 2014, 715 green coffee samples were gathered by Almacafé S.A. (Bogotá, Colombia) from 27 countries. These were analysed at the nuclear magnetic resonance (NMR) laboratory at Universidad del Valle (Cali, Colombia). Over 1000 methanolic coffee extracts were prepared and 4563 spectra were acquired in a fully automatic manner using a 400 MHz NMR spectrometer (Bruker Biospin, Germany). The dataset spans the variance that could be expected for an industrial application of origin monitoring, including samples from different harvest times, collected over several years, and processed by at least two distinct operators. The resulting 1D and 2D spectra can be used to develop and evaluate feature extraction methods, multivariate algorithms, and automation monitoring techniques. They can also be used as datasets for teaching, or as a reference for new studies of similar samples and approaches.
Hox genes and their cofactors are essential developmental genes specifying regional identity in animals. Hox genes have a conserved arrangement in clusters in the same order in which they specify identity along the anterior-posterior axis. A few insect species have breaks in the cluster, but these are exceptions. We annotated the 10 Hox genes of the Asian citrus psyllid Diaphorina citri, and found a split in its Hox cluster between the Deformed and Sex combs reduced genes - the first time a break at this position has been observed in an insect Hox cluster. We also annotated D. citri orthologs of the Hox cofactor genes homothorax, PKNOX and extradenticle and found an additional copy of extradenticle in D. citri that appears to be a retrogene. Expression data and sequence conservation suggest that the extradenticle retrogene may have retained the original extradenticle function and allowed divergence of the parental extradenticle gene.
The circadian rhythm involves multiple genes that generate an internal molecular clock, allowing organisms to anticipate environmental conditions produced by the Earth's rotation on its axis. Here, we present the results of the manual curation of 27 genes that are associated with circadian rhythm in the genome of Diaphorina citri, the Asian citrus psyllid. This insect is the vector for the bacterial pathogen Candidatus Liberibacter asiaticus (CLas), the causal agent of citrus greening disease (Huanglongbing). This disease severely affects citrus industries and has drastically decreased crop yields worldwide. Based on cry1 and cry2 identified in the psyllid genome, D. citri likely possesses a circadian model similar to the lepidopteran butterfly, Danaus plexippus. Manual annotation will improve the quality of circadian rhythm gene models, allowing the future development of molecular therapeutics, such as RNA interference or antisense technologies, to target these genes to disrupt the psyllid biology.
Chitinases are enzymes that digest the polysaccharide polymer chitin. During insect development, breakdown of chitin is an essential step in molting of the exoskeleton. Knockdown of chitinases required for molting is lethal to insects, making chitinase genes an interesting target for RNAi-based pest control methods. The Asian citrus psyllid, Diaphorina citri, carries the bacterium causing Huanglongbing, or citrus greening disease, a devastating citrus disease. We identified and annotated 12 chitinase family genes from D. citri as part of a community effort to create high-quality gene models to aid the design of interdictory molecules for pest control. We categorized the D. citri chitinases according to an established classification scheme and re-evaluated the classification of chitinases in other hemipterans. In addition to chitinases from known groups, we identified a novel class of chitinases present in D. citri and several related hemipterans that appears to be the result of horizontal gene transfer.
The numbat (Myrmecobius fasciatus) is an endangered Australian marsupial, and the last surviving member of the Myrmecobiidae family. The numbat regularly undergoes torpor and is unique amongst marsupials as it is the only diurnal and termitivorous species. Here we sequenced the first draft genome of the numbat using 10× Genomics Chromium linked-read technology, resulting in a 3.42 Gbp genome with a scaffold N50 of 223 kbp. A global transcriptome from liver, lung and tongue was also generated to aid genome annotation, identifying 21,465 protein-coding genes. To investigate adaptation to the numbat's termitivorous diet and arid/semi-arid range, we interrogated the most highly expressed transcripts within the tongue and manually annotated taste, vomeronasal and aquaporin gene families. Antimicrobial proteins and proteins involved in digestion were highly expressed in the tongue, alongside umami taste receptors. However, sweet taste receptors were not expressed in this tissue, which combined with the putative contraction of the bitter taste receptor gene repertoire in the numbat genome, may indicate a potential evolutionary adaptation to their specialised termitivorous diet. Vomeronasal and aquaporin gene repertoires were similar to other marsupials. The draft numbat genome is a valuable tool for conservation and can be applied to population genetics/genomics studies and to investigate the unique biology of this interesting species.
Sepsis is a major healthcare problem with substantial mortality and a common reason for admission to the intensive care unit (ICU). For this reason, the management of sepsis is an important area of ICU research. A number of large-scale, freely-accessible ICU databases are available for observational research and the robust identification of septic patients in such data sets is crucial for research purposes, particularly for comparative studies between critical care sub-populations which may vary around the world. However, data structures are poorly standardised due to inevitable variances in clinical electronic health record system vendor and implementation as well as research database design choices. Robust and well-documented cohort selection (such as patients with sepsis) is crucial for reproducible research. In this work, we operationalise the Sepsis-3 definition on the AmsterdamUMCdb, a recently published large European ICU database, publishing open-access code for wider use by critical care researchers.
Ubiquitination is an ATP-dependent process that targets proteins for degradation by the proteasome. Here, we annotated 15 genes from the ubiquitin-proteasome pathway in the Asian citrus psyllid, Diaphorina citri. This psyllid vector has come to prominence in the last decade owing to its role in the transmission of the devastating bacterial pathogen, Candidatus Liberibacter asiaticus (CLas). Infection of citrus crops by this pathogen causes Huanglongbing (HLB), or citrus greening disease, and results in the eventual death of citrus trees. The identification and correct annotation of these genes in D. citri will be useful for functional genomic studies to aid the development of RNAi-based management strategies aimed at reducing the spread of HLB. Investigating the effects of CLas infection on the expression of ubiquitin-proteasome pathway genes may provide new information about the role these genes play in the acquisition and transmission of CLas by D. citri.