首页 > 最新文献

GigaByte (Hong Kong, China)最新文献

英文 中文
High-speed whole-genome sequencing of a Whippet: Rapid chromosome-level assembly and annotation of an extremely fast dog's genome. 对一只惠比特犬进行高速全基因组测序:对速度极快的狗的基因组进行染色体级快速组装和注释。
Pub Date : 2024-09-13 eCollection Date: 2024-01-01 DOI: 10.46471/gigabyte.134
Marcel Nebenführ, David Prochotta, Alexander Ben Hamadou, Axel Janke, Charlotte Gerheim, Christian Betz, Carola Greve, Hanno Jörn Bolz

The time required for genome sequencing and de novo assembly depends on the interaction between laboratory work, sequencing capacity, and the bioinformatics workflow, often constrained by external sequencing services. Bringing together academic biodiversity institutes and a medical diagnostics company with extensive sequencing capabilities, we aimed at generating a high-quality mammalian de novo genome in minimal time. We present the first chromosome-level genome assembly of the Whippet, using PacBio long-read high-fidelity sequencing and reference-guided scaffolding. The final assembly has a contig N50 of 55 Mbp and a scaffold N50 of 65.7 Mbp. The total assembly length is 2.47 Gbp, of which 2.43 Gpb were scaffolded into 39 chromosome-length scaffolds. Annotation using mammalian genomes and transcriptome data yielded 28,383 transcripts, 90.9% complete BUSCO genes, and identified 36.5% repeat content. Sequencing, assembling, and scaffolding the chromosome-level genome of the Whippet took less than a week, adding another high-quality reference genome to the available sequences of domestic dog breeds.

基因组测序和从头组装所需的时间取决于实验室工作、测序能力和生物信息学工作流程之间的相互作用,通常受到外部测序服务的限制。我们联合了生物多样性学术机构和一家具有广泛测序能力的医疗诊断公司,旨在用最短的时间生成高质量的哺乳动物从头基因组。我们利用 PacBio 长线程高保真测序技术和参考文献指导的支架技术,首次完成了惠比特犬染色体组水平的基因组组装。最终组装的等位基因 N50 为 55 Mbp,支架 N50 为 65.7 Mbp。装配总长度为 2.47 Gbp,其中 2.43 Gpb 分成 39 个染色体长度的支架。利用哺乳动物基因组和转录组数据进行了注释,得到了 28,383 个转录本、90.9% 完整的 BUSCO 基因,并确定了 36.5% 的重复内容。惠比特犬染色体级基因组的测序、组装和支架制作耗时不到一周,为现有的家犬品种序列又增加了一个高质量的参考基因组。
{"title":"High-speed whole-genome sequencing of a Whippet: Rapid chromosome-level assembly and annotation of an extremely fast dog's genome.","authors":"Marcel Nebenführ, David Prochotta, Alexander Ben Hamadou, Axel Janke, Charlotte Gerheim, Christian Betz, Carola Greve, Hanno Jörn Bolz","doi":"10.46471/gigabyte.134","DOIUrl":"10.46471/gigabyte.134","url":null,"abstract":"<p><p>The time required for genome sequencing and <i>de novo</i> assembly depends on the interaction between laboratory work, sequencing capacity, and the bioinformatics workflow, often constrained by external sequencing services. Bringing together academic biodiversity institutes and a medical diagnostics company with extensive sequencing capabilities, we aimed at generating a high-quality mammalian <i>de novo</i> genome in minimal time. We present the first chromosome-level genome assembly of the Whippet, using PacBio long-read high-fidelity sequencing and reference-guided scaffolding. The final assembly has a contig N50 of 55 Mbp and a scaffold N50 of 65.7 Mbp. The total assembly length is 2.47 Gbp, of which 2.43 Gpb were scaffolded into 39 chromosome-length scaffolds. Annotation using mammalian genomes and transcriptome data yielded 28,383 transcripts, 90.9% complete BUSCO genes, and identified 36.5% repeat content. Sequencing, assembling, and scaffolding the chromosome-level genome of the Whippet took less than a week, adding another high-quality reference genome to the available sequences of domestic dog breeds.</p>","PeriodicalId":73157,"journal":{"name":"GigaByte (Hong Kong, China)","volume":"2024 ","pages":"gigabyte134"},"PeriodicalIF":0.0,"publicationDate":"2024-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11418881/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142309262","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
RiboSnake - a user-friendly, robust, reproducible, multipurpose and documentation-extensive pipeline for 16S rRNA gene microbiome analysis. RiboSnake - 用于 16S rRNA 基因微生物组分析的用户友好型、稳健型、可重现型、多用途和文档丰富型管道。
Pub Date : 2024-08-31 eCollection Date: 2024-01-01 DOI: 10.46471/gigabyte.132
Ann-Kathrin Dörr, Josefa Welling, Adrian Dörr, Jule Gosch, Hannah Möhlen, Ricarda Schmithausen, Jan Kehrmann, Folker Meyer, Ivana Kraiselburd

Background: Next-generation sequencing for microbial communities has become a standard technique. However, the computational analysis remains resource-intensive. With declining costs and growing adoption of sequencing-based methods in many fields, validated, fully automated, reproducible and flexible pipelines are increasingly essential in various scientific fields.

Results: We present RiboSnake, a validated, automated, reproducible QIIME2-based pipeline implemented in Snakemake for analysing 16S rRNA gene amplicon sequencing data. RiboSnake includes pre-packaged validated parameter sets optimized for different sample types, from environmental samples to patient data. The configuration packages can be easily adapted and shared, requiring minimal user input.

Conclusion: RiboSnake is a new alternative for researchers employing 16S rRNA gene amplicon sequencing and looking for a customizable and user-friendly pipeline for microbiome analyses with in vitro validated settings. By automating the analysis with validated parameters for diverse sample types, RiboSnake enhances existing methods significantly. The workflow repository can be found on GitHub (https://github.com/IKIM-Essen/RiboSnake).

背景:微生物群落的新一代测序已成为一项标准技术。然而,计算分析仍然是资源密集型的。随着成本的下降和许多领域越来越多地采用基于测序的方法,经过验证的、全自动的、可重现的和灵活的管道在各个科学领域越来越重要:我们介绍了 RiboSnake,这是一种基于 QIIME2 的经过验证、自动化、可重现的管道,在 Snakemake 中实现,用于分析 16S rRNA 基因扩增片段测序数据。RiboSnake 包括针对不同样本类型(从环境样本到患者数据)优化的预打包验证参数集。这些配置包可以很容易地调整和共享,只需极少的用户输入:RiboSnake 是采用 16S rRNA 基因扩增片段测序的研究人员的新选择,他们正在寻找一种可定制且用户友好的管道,利用体外验证设置进行微生物组分析。RiboSnake 可针对不同类型的样本使用经过验证的参数进行自动化分析,从而大大增强了现有方法的功能。可在 GitHub(https://github.com/IKIM-Essen/RiboSnake)上找到该工作流程的资源库。
{"title":"RiboSnake - a user-friendly, robust, reproducible, multipurpose and documentation-extensive pipeline for 16S rRNA gene microbiome analysis.","authors":"Ann-Kathrin Dörr, Josefa Welling, Adrian Dörr, Jule Gosch, Hannah Möhlen, Ricarda Schmithausen, Jan Kehrmann, Folker Meyer, Ivana Kraiselburd","doi":"10.46471/gigabyte.132","DOIUrl":"10.46471/gigabyte.132","url":null,"abstract":"<p><strong>Background: </strong>Next-generation sequencing for microbial communities has become a standard technique. However, the computational analysis remains resource-intensive. With declining costs and growing adoption of sequencing-based methods in many fields, validated, fully automated, reproducible and flexible pipelines are increasingly essential in various scientific fields.</p><p><strong>Results: </strong>We present RiboSnake, a validated, automated, reproducible QIIME2-based pipeline implemented in Snakemake for analysing <i>16S rRNA</i> gene amplicon sequencing data. RiboSnake includes pre-packaged validated parameter sets optimized for different sample types, from environmental samples to patient data. The configuration packages can be easily adapted and shared, requiring minimal user input.</p><p><strong>Conclusion: </strong>RiboSnake is a new alternative for researchers employing <i>16S rRNA</i> gene amplicon sequencing and looking for a customizable and user-friendly pipeline for microbiome analyses with <i>in vitro</i> validated settings. By automating the analysis with validated parameters for diverse sample types, RiboSnake enhances existing methods significantly. The workflow repository can be found on GitHub (https://github.com/IKIM-Essen/RiboSnake).</p>","PeriodicalId":73157,"journal":{"name":"GigaByte (Hong Kong, China)","volume":"2024 ","pages":"gigabyte132"},"PeriodicalIF":0.0,"publicationDate":"2024-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11448241/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142373717","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Automated management of AWS instances for training. 自动管理用于培训的 AWS 实例。
Pub Date : 2024-08-29 eCollection Date: 2024-01-01 DOI: 10.46471/gigabyte.133
Jorge Buenabad-Chavez, Evelyn Greeves, James P J Chong, Emma Rand

Amazon Web Services (AWS) instances provide a convenient way to run training on complex 'omics data analysis workflows without requiring participants to install software packages or store large data volumes locally. However, efficiently managing dozens of instances is challenging for training providers. We present a set of Bash scripts that make it quick and easy to manage Linux AWS instances pre-configured with all the software analysis tools and data needed for a course, and accessible using encrypted login keys and optional domain names. Creating over 30 instances takes 10-15 minutes. A comprehensive online tutorial describes how to set up and use an AWS account and the scripts, and how to customise AWS instance templates with other software tools and data. We anticipate that others offering similar training may benefit from using the scripts regardless of the analyses being taught.

亚马逊网络服务(AWS)实例为复杂的 "omics "数据分析工作流程培训提供了一种便捷的方式,学员无需安装软件包或在本地存储大量数据。然而,对于培训提供商来说,高效管理数十个实例极具挑战性。我们提供了一套 Bash 脚本,可快速、轻松地管理预配置了课程所需的所有软件分析工具和数据的 Linux AWS 实例,并可使用加密登录密钥和可选域名进行访问。创建 30 多个实例只需 10-15 分钟。全面的在线教程介绍了如何设置和使用 AWS 账户和脚本,以及如何使用其他软件工具和数据自定义 AWS 实例模板。我们预计,无论教授何种分析,其他提供类似培训的人员都可以从使用脚本中获益。
{"title":"Automated management of AWS instances for training.","authors":"Jorge Buenabad-Chavez, Evelyn Greeves, James P J Chong, Emma Rand","doi":"10.46471/gigabyte.133","DOIUrl":"https://doi.org/10.46471/gigabyte.133","url":null,"abstract":"<p><p>Amazon Web Services (AWS) instances provide a convenient way to run training on complex 'omics data analysis workflows without requiring participants to install software packages or store large data volumes locally. However, efficiently managing dozens of instances is challenging for training providers. We present a set of Bash scripts that make it quick and easy to manage Linux AWS instances pre-configured with all the software analysis tools and data needed for a course, and accessible using encrypted login keys and optional domain names. Creating over 30 instances takes 10-15 minutes. A comprehensive online tutorial describes how to set up and use an AWS account and the scripts, and how to customise AWS instance templates with other software tools and data. We anticipate that others offering similar training may benefit from using the scripts regardless of the analyses being taught.</p>","PeriodicalId":73157,"journal":{"name":"GigaByte (Hong Kong, China)","volume":"2024 ","pages":"gigabyte133"},"PeriodicalIF":0.0,"publicationDate":"2024-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11382607/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142302548","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Chromosomal-level genome assembly and single-nucleotide polymorphism sites of black-faced spoonbill Platalea minor. 黑脸琵鹭(Platalea minor)的染色体级基因组组装和单核苷酸多态性位点。
Pub Date : 2024-07-18 eCollection Date: 2024-01-01 DOI: 10.46471/gigabyte.130

Platalea minor, or black-faced spoonbill (Threskiornithidae), is a wading bird confined to coastal areas in East Asia. Due to habitat destruction, it was classified as globally endangered by the International Union for Conservation of Nature. However, the lack of genomic resources for this species hinders the understanding of its biology and diversity, and the development of conservation measures. Here, we report the first chromosomal-level genome assembly of P. minor using a combination of PacBio SMRT and Omni-C scaffolding technologies. The assembled genome (1.24 Gb) contains 95.33% of the sequences anchored to 31 pseudomolecules. The genome assembly has high sequence continuity with scaffold length N50 = 53 Mb. We predicted 18,780 protein-coding genes and measured high BUSCO score completeness (97.3%). Finally, we revealed 6,155,417 bi-allelic single nucleotide polymorphisms, accounting for ∼5% of the genome. This resource offers new opportunities for studying the black-faced spoonbill and developing conservation measures for this species.

小琵鹭(Platalea minor),又称黑脸琵鹭(Threskiornithidae),是一种仅分布于东亚沿海地区的涉禽。由于栖息地遭到破坏,它被国际自然保护联盟列为全球濒危物种。然而,该物种基因组资源的缺乏阻碍了对其生物学和多样性的了解以及保护措施的制定。在此,我们结合 PacBio SMRT 和 Omni-C 支架技术,首次报道了 P. minor 的染色体级基因组组装。组装完成的基因组(1.24 Gb)包含了锚定在 31 个假分子上的 95.33% 的序列。基因组组装具有较高的序列连续性,支架长度 N50 = 53 Mb。我们预测了 18,780 个编码蛋白质的基因,并测出了较高的 BUSCO 评分完整性(97.3%)。最后,我们发现了 6,155,417 个双等位单核苷酸多态性,占基因组的 5%。这一资源为研究黑脸琵鹭和制定该物种的保护措施提供了新的机遇。
{"title":"Chromosomal-level genome assembly and single-nucleotide polymorphism sites of black-faced spoonbill <i>Platalea minor</i>.","authors":"","doi":"10.46471/gigabyte.130","DOIUrl":"10.46471/gigabyte.130","url":null,"abstract":"<p><p><i>Platalea minor</i>, or black-faced spoonbill (Threskiornithidae), is a wading bird confined to coastal areas in East Asia. Due to habitat destruction, it was classified as globally endangered by the International Union for Conservation of Nature. However, the lack of genomic resources for this species hinders the understanding of its biology and diversity, and the development of conservation measures. Here, we report the first chromosomal-level genome assembly of <i>P. minor</i> using a combination of PacBio SMRT and Omni-C scaffolding technologies. The assembled genome (1.24 Gb) contains 95.33% of the sequences anchored to 31 pseudomolecules. The genome assembly has high sequence continuity with scaffold length N50 = 53 Mb. We predicted 18,780 protein-coding genes and measured high BUSCO score completeness (97.3%). Finally, we revealed 6,155,417 bi-allelic single nucleotide polymorphisms, accounting for ∼5% of the genome. This resource offers new opportunities for studying the black-faced spoonbill and developing conservation measures for this species.</p>","PeriodicalId":73157,"journal":{"name":"GigaByte (Hong Kong, China)","volume":"2024 ","pages":"1-13"},"PeriodicalIF":0.0,"publicationDate":"2024-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11273517/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141790211","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Kinship analysis and pedigree reconstruction by RAD sequencing in cattle. 通过 RAD 测序对牛进行亲缘关系分析和血统重建。
Pub Date : 2024-07-18 eCollection Date: 2024-01-01 DOI: 10.46471/gigabyte.131
Yiming Xu, Wanqiu Wang, Jiefeng Huang, Minjie Xu, Binhu Wang, Yingsong Wu, Yongzhong Xie, Jianbo Jian

Kinship and pedigree, used for estimating inbreeding, heritability, selection, and gene flow, are useful for breeding and animal conservation. However, as the size of crossbred populations increases, inaccurate generation and parentage assignment in livestock farms increase. Restriction-site-associated DNA sequencing is a cost-effective platform for single nucleotide polymorphism (SNP) discovery and genotyping. Here, we performed a kinship analysis and pedigree reconstruction for Angus and Xiangxi yellow cattle. A total of 975 cattle, including 923 offspring with 24 known sires and 28 known dams, were sampled and subjected to SNP discovery and genotyping. The identified SNP panel included 7,305 SNPs capturing the maximum difference between paternal and maternal genome information, allowing us to distinguish F1 from F2 generations with 90% accuracy. In conclusion, we provided a low-cost and efficient SNP panel for kinship analyses and the improvement of local genetic resources, which are valuable for breed improvement, local resource utilization, and conservation.

亲缘关系和血统用于估计近亲繁殖、遗传率、选择和基因流,对育种和动物保护非常有用。然而,随着杂交种群规模的扩大,畜牧场中不准确的世代和亲子鉴定也在增加。限制性位点相关 DNA 测序是发现单核苷酸多态性(SNP)和进行基因分型的一种经济有效的平台。在此,我们对安格斯牛和湘西黄牛进行了亲缘关系分析和血统重建。我们对总共 975 头牛(包括 923 头后代,其中有 24 头已知的父牛和 28 头已知的母牛)进行了采样,并进行了 SNP 发现和基因分型。确定的 SNP 面板包括 7,305 个 SNP,捕获了父系和母系基因组信息的最大差异,使我们能够以 90% 的准确率区分 F1 和 F2 代。总之,我们为亲缘关系分析和地方遗传资源改良提供了一个低成本、高效率的 SNP 面板,这对品种改良、地方资源利用和保护都很有价值。
{"title":"Kinship analysis and pedigree reconstruction by RAD sequencing in cattle.","authors":"Yiming Xu, Wanqiu Wang, Jiefeng Huang, Minjie Xu, Binhu Wang, Yingsong Wu, Yongzhong Xie, Jianbo Jian","doi":"10.46471/gigabyte.131","DOIUrl":"10.46471/gigabyte.131","url":null,"abstract":"<p><p>Kinship and pedigree, used for estimating inbreeding, heritability, selection, and gene flow, are useful for breeding and animal conservation. However, as the size of crossbred populations increases, inaccurate generation and parentage assignment in livestock farms increase. Restriction-site-associated DNA sequencing is a cost-effective platform for single nucleotide polymorphism (SNP) discovery and genotyping. Here, we performed a kinship analysis and pedigree reconstruction for Angus and Xiangxi yellow cattle. A total of 975 cattle, including 923 offspring with 24 known sires and 28 known dams, were sampled and subjected to SNP discovery and genotyping. The identified SNP panel included 7,305 SNPs capturing the maximum difference between paternal and maternal genome information, allowing us to distinguish F1 from F2 generations with 90% accuracy. In conclusion, we provided a low-cost and efficient SNP panel for kinship analyses and the improvement of local genetic resources, which are valuable for breed improvement, local resource utilization, and conservation.</p>","PeriodicalId":73157,"journal":{"name":"GigaByte (Hong Kong, China)","volume":"2024 ","pages":"1-15"},"PeriodicalIF":0.0,"publicationDate":"2024-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11273509/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141790212","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Low-coverage whole genome sequencing for a highly selective cohort of severe COVID-19 patients. 为高度选择性的严重 COVID-19 患者队列进行低覆盖率全基因组测序。
Pub Date : 2024-06-20 eCollection Date: 2024-01-01 DOI: 10.46471/gigabyte.127
Renato Santos, Víctor Moreno-Torres, Ilduara Pintos, Octavio Corral, Carmen de Mendoza, Vicente Soriano, Manuel Corpas

Despite the advances in genetic marker identification associated with severe COVID-19, the full genetic characterisation of the disease remains elusive. This study explores imputation in low-coverage whole genome sequencing for a severe COVID-19 patient cohort. We generated a dataset of 79 imputed variant call format files using the GLIMPSE1 tool, each containing an average of 9.5 million single nucleotide variants. Validation revealed a high imputation accuracy (squared Pearson correlation ≍0.97) across sequencing platforms, showcasing GLIMPSE1's ability to confidently impute variants with minor allele frequencies as low as 2% in individuals with Spanish ancestry. We carried out a comprehensive analysis of the patient cohort, examining hospitalisation and intensive care utilisation, sex and age-based differences, and clinical phenotypes using a standardised set of medical terms developed to characterise severe COVID-19 symptoms. The methods and findings presented here can be leveraged for future genomic projects to gain vital insights into health challenges like COVID-19.

尽管在与重度 COVID-19 相关的遗传标记鉴定方面取得了进展,但该疾病的全部遗传特征仍然难以确定。本研究探讨了在低覆盖率全基因组测序中对重症 COVID-19 患者队列的估算。我们使用 GLIMPSE1 工具生成了一个包含 79 个估算变异调用格式文件的数据集,每个文件平均包含 950 万个单核苷酸变异。验证结果显示,GLIMPSE1 在各种测序平台上都具有很高的估算准确性(平方皮尔逊相关性 ≍0.97),展示了 GLIMPSE1 在西班牙血统个体中对小等位基因频率低至 2% 的变异进行可靠估算的能力。我们对患者队列进行了全面分析,使用一套为描述严重 COVID-19 症状而开发的标准化医学术语,检查了住院和重症监护使用情况、性别和年龄差异以及临床表型。本文介绍的方法和研究结果可用于未来的基因组项目,以深入了解 COVID-19 等健康挑战。
{"title":"Low-coverage whole genome sequencing for a highly selective cohort of severe COVID-19 patients.","authors":"Renato Santos, Víctor Moreno-Torres, Ilduara Pintos, Octavio Corral, Carmen de Mendoza, Vicente Soriano, Manuel Corpas","doi":"10.46471/gigabyte.127","DOIUrl":"10.46471/gigabyte.127","url":null,"abstract":"<p><p>Despite the advances in genetic marker identification associated with severe COVID-19, the full genetic characterisation of the disease remains elusive. This study explores imputation in low-coverage whole genome sequencing for a severe COVID-19 patient cohort. We generated a dataset of 79 imputed variant call format files using the GLIMPSE1 tool, each containing an average of 9.5 million single nucleotide variants. Validation revealed a high imputation accuracy (squared Pearson correlation ≍0.97) across sequencing platforms, showcasing GLIMPSE1's ability to confidently impute variants with minor allele frequencies as low as 2% in individuals with Spanish ancestry. We carried out a comprehensive analysis of the patient cohort, examining hospitalisation and intensive care utilisation, sex and age-based differences, and clinical phenotypes using a standardised set of medical terms developed to characterise severe COVID-19 symptoms. The methods and findings presented here can be leveraged for future genomic projects to gain vital insights into health challenges like COVID-19.</p>","PeriodicalId":73157,"journal":{"name":"GigaByte (Hong Kong, China)","volume":"2024 ","pages":"gigabyte127"},"PeriodicalIF":0.0,"publicationDate":"2024-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11211761/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141473253","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
PhysiCell Studio: a graphical tool to make agent-based modeling more accessible. PhysiCell Studio:一种使基于代理的建模更易于使用的图形工具。
Pub Date : 2024-06-19 eCollection Date: 2024-01-01 DOI: 10.46471/gigabyte.128
Randy Heiland, Daniel Bergman, Blair Lyons, Grant Waldow, Julie Cass, Heber Lima da Rocha, Marco Ruscone, Vincent Noël, Paul Macklin

Defining a multicellular model can be challenging. There may be hundreds of parameters that specify the attributes and behaviors of objects. In the best case, the model will be defined using some format specification - a markup language - that will provide easy model sharing (and a minimal step toward reproducibility). PhysiCell is an open-source, physics-based multicellular simulation framework with an active and growing user community. It uses XML to define a model and, traditionally, users needed to manually edit the XML to modify the model. PhysiCell Studio is a tool to make this task easier. It provides a GUI that allows editing the XML model definition, including the creation and deletion of fundamental objects: cell types and substrates in the microenvironment. It also lets users build their model by defining initial conditions and biological rules, run simulations, and view results interactively. PhysiCell Studio has evolved over multiple workshops and academic courses in recent years, which has led to many improvements. There is both a desktop and cloud version. Its design and development has benefited from an active undergraduate and graduate research program. Like PhysiCell, the Studio is open-source software and contributions from the community are encouraged.

定义多细胞模型是一项挑战。可能有数以百计的参数指定对象的属性和行为。在最好的情况下,模型将使用某种格式规范--标记语言--来定义,这将为模型共享提供方便(也是实现可重复性的最基本步骤)。PhysiCell 是一个开源的、基于物理学的多细胞仿真框架,拥有一个活跃的、不断增长的用户社区。它使用 XML 来定义模型,传统上,用户需要手动编辑 XML 来修改模型。PhysiCell Studio 是一个让这项工作变得更简单的工具。它提供了一个图形用户界面,允许编辑 XML 模型定义,包括创建和删除基本对象:微环境中的细胞类型和基质。用户还可以通过定义初始条件和生物规则来构建模型,运行模拟并交互式查看结果。近年来,PhysiCell Studio 在多个研讨会和学术课程中不断发展,取得了许多改进。该软件有桌面版和云计算版。它的设计和开发得益于活跃的本科生和研究生研究项目。与PhysiCell一样,Studio也是开源软件,鼓励社区贡献。
{"title":"PhysiCell Studio: a graphical tool to make agent-based modeling more accessible.","authors":"Randy Heiland, Daniel Bergman, Blair Lyons, Grant Waldow, Julie Cass, Heber Lima da Rocha, Marco Ruscone, Vincent Noël, Paul Macklin","doi":"10.46471/gigabyte.128","DOIUrl":"10.46471/gigabyte.128","url":null,"abstract":"<p><p>Defining a multicellular model can be challenging. There may be hundreds of parameters that specify the attributes and behaviors of objects. In the best case, the model will be defined using some format specification - a markup language - that will provide easy model sharing (and a minimal step toward reproducibility). PhysiCell is an open-source, physics-based multicellular simulation framework with an active and growing user community. It uses XML to define a model and, traditionally, users needed to manually edit the XML to modify the model. PhysiCell Studio is a tool to make this task easier. It provides a GUI that allows editing the XML model definition, including the creation and deletion of fundamental objects: cell types and substrates in the microenvironment. It also lets users build their model by defining initial conditions and biological rules, run simulations, and view results interactively. PhysiCell Studio has evolved over multiple workshops and academic courses in recent years, which has led to many improvements. There is both a desktop and cloud version. Its design and development has benefited from an active undergraduate and graduate research program. Like PhysiCell, the Studio is open-source software and contributions from the community are encouraged.</p>","PeriodicalId":73157,"journal":{"name":"GigaByte (Hong Kong, China)","volume":"2024 ","pages":"gigabyte128"},"PeriodicalIF":0.0,"publicationDate":"2024-06-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11211762/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141473254","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multicellular, IVT-derived, unmodified human transcriptome for nanopore-direct RNA analysis. 用于纳米孔直接 RNA 分析的多细胞、IVT 衍生、未修饰的人类转录组。
Pub Date : 2024-06-17 eCollection Date: 2024-01-01 DOI: 10.46471/gigabyte.129
Caroline A McCormick, Stuart Akeson, Sepideh Tavakoli, Dylan Bloch, Isabel N Klink, Miten Jain, Sara H Rouhanifard

Nanopore direct RNA sequencing (DRS) enables measurements of RNA modifications. Modification-free transcripts are a practical and targeted control for DRS, providing a baseline measurement for canonical nucleotides within a matched and biologically-derived sequence context. However, these controls can be challenging to generate and carry nanopore-specific nuances that can impact analyses. We produced DRS datasets using modification-free transcripts from in vitro transcription of cDNA from six immortalized human cell lines. We characterized variation across cell lines and demonstrated how these may be interpreted. These data will serve as a versatile control and resource to the community for RNA modification analyses of human transcripts.

纳米孔直接 RNA 测序(DRS)可以测量 RNA 的修饰。不含修饰的转录本是 DRS 的一种实用且有针对性的对照,可在匹配的生物序列上下文中提供标准核苷酸的基线测量。然而,这些对照组的生成可能具有挑战性,而且纳米孔特有的细微差别会影响分析结果。我们使用从六个永生化人类细胞系体外转录 cDNA 的无修饰转录本制作了 DRS 数据集。我们描述了不同细胞系之间的差异,并演示了如何解释这些差异。这些数据将作为通用对照和资源提供给社区,用于人类转录本的 RNA 修饰分析。
{"title":"Multicellular, IVT-derived, unmodified human transcriptome for nanopore-direct RNA analysis.","authors":"Caroline A McCormick, Stuart Akeson, Sepideh Tavakoli, Dylan Bloch, Isabel N Klink, Miten Jain, Sara H Rouhanifard","doi":"10.46471/gigabyte.129","DOIUrl":"10.46471/gigabyte.129","url":null,"abstract":"<p><p>Nanopore direct RNA sequencing (DRS) enables measurements of RNA modifications. Modification-free transcripts are a practical and targeted control for DRS, providing a baseline measurement for canonical nucleotides within a matched and biologically-derived sequence context. However, these controls can be challenging to generate and carry nanopore-specific nuances that can impact analyses. We produced DRS datasets using modification-free transcripts from <i>in vitro</i> transcription of cDNA from six immortalized human cell lines. We characterized variation across cell lines and demonstrated how these may be interpreted. These data will serve as a versatile control and resource to the community for RNA modification analyses of human transcripts.</p>","PeriodicalId":73157,"journal":{"name":"GigaByte (Hong Kong, China)","volume":"2024 ","pages":"gigabyte129"},"PeriodicalIF":0.0,"publicationDate":"2024-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11221353/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141499780","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Get Free Copy: a multi-repository search platform for biomedical publications. 免费获取副本:生物医学出版物的多库检索平台。
Pub Date : 2024-05-29 eCollection Date: 2024-01-01 DOI: 10.46471/gigabyte.126
Nodir Kosimkhujaev, Kuan-Lin Huang

We introduce Get Free Copy (https://getfreecopy.com), a web-based platform designed to streamline the search for biomedical literature across major repositories like arXiv, bioRxiv, medRxiv, and PubMed Central (PMC). Addressing challenges posed by paywalls and fragmented databases, it offers a unified interface for efficient retrieval of free, legitimate copies of biomedical literature. The platform's implementation involves a Node.js backend and dynamic front-end display, enhancing accessibility and research efficiency. As an open-source project, Get Free Copy represents a significant contribution to the open-access movement, inviting global researcher collaboration for further development.

我们介绍 Get Free Copy (https://getfreecopy.com),这是一个基于网络的平台,旨在简化在 arXiv、bioRxiv、medRxiv 和 PubMed Central (PMC) 等主要文献库中搜索生物医学文献的过程。为了应对付费墙和分散数据库带来的挑战,该平台提供了一个统一的界面,用于高效检索生物医学文献的免费合法副本。该平台的实施包括 Node.js 后端和动态前端显示,从而提高了可访问性和研究效率。作为一个开源项目,Get Free Copy 对开放获取运动做出了重大贡献,并邀请全球研究人员合作进一步开发。
{"title":"Get Free Copy: a multi-repository search platform for biomedical publications.","authors":"Nodir Kosimkhujaev, Kuan-Lin Huang","doi":"10.46471/gigabyte.126","DOIUrl":"10.46471/gigabyte.126","url":null,"abstract":"<p><p>We introduce Get Free Copy (https://getfreecopy.com), a web-based platform designed to streamline the search for biomedical literature across major repositories like arXiv, bioRxiv, medRxiv, and PubMed Central (PMC). Addressing challenges posed by paywalls and fragmented databases, it offers a unified interface for efficient retrieval of free, legitimate copies of biomedical literature. The platform's implementation involves a Node.js backend and dynamic front-end display, enhancing accessibility and research efficiency. As an open-source project, Get Free Copy represents a significant contribution to the open-access movement, inviting global researcher collaboration for further development.</p>","PeriodicalId":73157,"journal":{"name":"GigaByte (Hong Kong, China)","volume":"2024 ","pages":"gigabyte126"},"PeriodicalIF":0.0,"publicationDate":"2024-05-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11154096/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141285565","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Genome assembly of the rare and endangered Grantham's camellia, Camellia granthamiana. 稀有濒危的格兰瑟姆山茶花(Camellia granthamiana)的基因组组装。
Pub Date : 2024-05-20 eCollection Date: 2024-01-01 DOI: 10.46471/gigabyte.124

Grantham's camellia (Camellia granthamiana Sealy) is a rare and endangered tea species discovered in Hong Kong in 1955 and endemic to southern China. Despite its high conservation value, the genomic resources of C. granthamiana are limited. Here, we present a chromosome-scale draft genome of the tetraploid C. granthamiana (2n = 4x = 60), combining PacBio long-read sequencing and Omni-C data. The assembled genome size is ∼2.4 Gb, with most sequences anchored to 15 pseudochromosomes resembling a monoploid genome. The genome has high contiguity, with a scaffold N50 of 139.7 Mb, and high completeness (97.8% BUSCO score). Our gene model prediction resulted in 68,032 protein-coding genes (BUSCO score of 90.9%). We annotated 1.65 Gb of repeat content (68.48% of the genome). Our Grantham's camellia genome assembly is a valuable resource for investigating Grantham's camellia's biology, ecology, and phylogenomic relationships with other Camellia species, and provides a foundation for further conservation measures.

葛氏山茶(Camellia granthamiana Sealy)是1955年在香港发现的稀有濒危茶树品种,为中国南方特有物种。尽管它具有很高的保护价值,但C. granthamiana的基因组资源却很有限。在此,我们结合 PacBio 长线程测序和 Omni-C 数据,提出了四倍体 C. granthamiana(2n = 4x = 60)的染色体级基因组草案。组装的基因组大小为 2.4 Gb,大部分序列锚定在 15 个假染色体上,类似于单倍体基因组。基因组的连续性很高,支架 N50 为 139.7 Mb,完整性也很高(97.8% BUSCO 得分)。我们的基因模型预测得出了 68,032 个蛋白质编码基因(BUSCO 得分为 90.9%)。我们注释了 1.65 Gb 的重复内容(占基因组的 68.48%)。我们的格兰山茶花基因组组装是研究格兰山茶花生物学、生态学以及与其他山茶花物种系统发育关系的宝贵资源,并为进一步的保护措施奠定了基础。
{"title":"Genome assembly of the rare and endangered Grantham's camellia, <i>Camellia granthamiana</i>.","authors":"","doi":"10.46471/gigabyte.124","DOIUrl":"10.46471/gigabyte.124","url":null,"abstract":"<p><p>Grantham's camellia (<i>Camellia granthamiana</i> Sealy) is a rare and endangered tea species discovered in Hong Kong in 1955 and endemic to southern China. Despite its high conservation value, the genomic resources of <i>C. granthamiana</i> are limited. Here, we present a chromosome-scale draft genome of the tetraploid <i>C. granthamiana</i> (2<i>n</i> = 4<i>x</i> = 60), combining PacBio long-read sequencing and Omni-C data. The assembled genome size is ∼2.4 Gb, with most sequences anchored to 15 pseudochromosomes resembling a monoploid genome. The genome has high contiguity, with a scaffold N50 of 139.7 Mb, and high completeness (97.8% BUSCO score). Our gene model prediction resulted in 68,032 protein-coding genes (BUSCO score of 90.9%). We annotated 1.65 Gb of repeat content (68.48% of the genome). Our Grantham's camellia genome assembly is a valuable resource for investigating Grantham's camellia's biology, ecology, and phylogenomic relationships with other <i>Camellia</i> species, and provides a foundation for further conservation measures.</p>","PeriodicalId":73157,"journal":{"name":"GigaByte (Hong Kong, China)","volume":"2024 ","pages":"gigabyte124"},"PeriodicalIF":0.0,"publicationDate":"2024-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11131091/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141163069","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
GigaByte (Hong Kong, China)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1