首页 > 最新文献

JMIR bioinformatics and biotechnology最新文献

英文 中文
The Easy-to-Use SARS-CoV-2 Assembler for Genome Sequencing: Development Study. 易于使用的SARS-CoV-2基因组测序组装器:开发研究
Pub Date : 2022-03-14 eCollection Date: 2022-01-01 DOI: 10.2196/31536
Martina Rueca, Emanuela Giombini, Francesco Messina, Barbara Bartolini, Antonino Di Caro, Maria Rosaria Capobianchi, Cesare Em Gruber

Background: Early sequencing and quick analysis of the SARS-CoV-2 genome have contributed to the understanding of the dynamics of COVID-19 epidemics and in designing countermeasures at a global level.

Objective: Amplicon-based next-generation sequencing (NGS) methods are widely used to sequence the SARS-CoV-2 genome and to identify novel variants that are emerging in rapid succession as well as harboring multiple deletions and amino acid-changing mutations.

Methods: To facilitate the analysis of NGS sequencing data obtained from amplicon-based sequencing methods, here, we propose an easy-to-use SARS-CoV-2 genome assembler: the Easy-to-use SARS-CoV-2 Assembler (ESCA) pipeline.

Results: Our results have shown that ESCA could perform high-quality genome assembly from Ion Torrent and Illumina raw data and help the user in easily correct low-coverage regions. Moreover, ESCA includes the possibility of comparing assembled genomes of multisample runs through an easy table format.

Conclusions: In conclusion, ESCA automatically furnished a variant table output file, fundamental to rapidly recognizing variants of interest. Our pipeline could be a useful method for obtaining a complete, rapid, and accurate analysis even with minimal knowledge in bioinformatics.

背景:SARS-CoV-2基因组的早期测序和快速分析有助于了解COVID-19流行动态和在全球层面制定对策。目的:基于扩增子的新一代测序(NGS)方法被广泛用于对SARS-CoV-2基因组进行测序,并鉴定快速连续出现的新变体,以及包含多个缺失和氨基酸改变突变的新变体。方法:为了便于分析基于扩增子测序方法获得的NGS测序数据,我们提出了一种易于使用的SARS-CoV-2基因组组装器:easy- use SARS-CoV-2 assembler (ESCA)管道。结果:我们的研究结果表明,ESCA可以从Ion Torrent和Illumina原始数据中进行高质量的基因组组装,并帮助用户轻松纠正低覆盖区域。此外,ESCA包括通过一个简单的表格格式比较多样本运行的组装基因组的可能性。结论:总之,ESCA自动提供了一个变体表输出文件,这是快速识别感兴趣的变体的基础。我们的管道可以成为一种有用的方法,即使在生物信息学方面的知识很少,也可以获得完整、快速和准确的分析。
{"title":"The Easy-to-Use SARS-CoV-2 Assembler for Genome Sequencing: Development Study.","authors":"Martina Rueca,&nbsp;Emanuela Giombini,&nbsp;Francesco Messina,&nbsp;Barbara Bartolini,&nbsp;Antonino Di Caro,&nbsp;Maria Rosaria Capobianchi,&nbsp;Cesare Em Gruber","doi":"10.2196/31536","DOIUrl":"https://doi.org/10.2196/31536","url":null,"abstract":"<p><strong>Background: </strong>Early sequencing and quick analysis of the SARS-CoV-2 genome have contributed to the understanding of the dynamics of COVID-19 epidemics and in designing countermeasures at a global level.</p><p><strong>Objective: </strong>Amplicon-based next-generation sequencing (NGS) methods are widely used to sequence the SARS-CoV-2 genome and to identify novel variants that are emerging in rapid succession as well as harboring multiple deletions and amino acid-changing mutations.</p><p><strong>Methods: </strong>To facilitate the analysis of NGS sequencing data obtained from amplicon-based sequencing methods, here, we propose an easy-to-use SARS-CoV-2 genome assembler: the Easy-to-use SARS-CoV-2 Assembler (ESCA) pipeline.</p><p><strong>Results: </strong>Our results have shown that ESCA could perform high-quality genome assembly from Ion Torrent and Illumina raw data and help the user in easily correct low-coverage regions. Moreover, ESCA includes the possibility of comparing assembled genomes of multisample runs through an easy table format.</p><p><strong>Conclusions: </strong>In conclusion, ESCA automatically furnished a variant table output file, fundamental to rapidly recognizing variants of interest. Our pipeline could be a useful method for obtaining a complete, rapid, and accurate analysis even with minimal knowledge in bioinformatics.</p>","PeriodicalId":73552,"journal":{"name":"JMIR bioinformatics and biotechnology","volume":" ","pages":"e31536"},"PeriodicalIF":0.0,"publicationDate":"2022-03-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8924907/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"40307905","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Novel Molecular Networks and Regulatory MicroRNAs in Type 2 Diabetes Mellitus: Multiomics Integration and Interactomics Study. 2型糖尿病的新型分子网络和调控性微小RNA:多组学整合和相互作用组学研究
Pub Date : 2022-02-23 DOI: 10.2196/32437
Manoj Khokhar, Dipayan Roy, Sojit Tomo, Ashita Gadwal, Praveen Sharma, Purvi Purohit

Background: Type 2 diabetes mellitus (T2DM) is a metabolic disorder with severe comorbidities. A multiomics approach can facilitate the identification of novel therapeutic targets and biomarkers with proper validation of potential microRNA (miRNA) interactions.

Objective: The aim of this study was to identify significant differentially expressed common target genes in various tissues and their regulating miRNAs from publicly available Gene Expression Omnibus (GEO) data sets of patients with T2DM using in silico analysis.

Methods: Using differentially expressed genes (DEGs) identified from 5 publicly available T2DM data sets, we performed functional enrichment, coexpression, and network analyses to identify pathways, protein-protein interactions, and miRNA-mRNA interactions involved in T2DM.

Results: We extracted 2852, 8631, 5501, 3662, and 3753 DEGs from the expression profiles of GEO data sets GSE38642, GSE25724, GSE20966, GSE26887, and GSE23343, respectively. DEG analysis showed that 16 common genes were enriched in insulin secretion, endocrine resistance, and other T2DM-related pathways. Four DEGs, MAML3, EEF1D, NRG1, and CDK5RAP2, were important in the cluster network regulated by commonly targeted miRNAs (hsa-let-7b-5p, hsa-mir-155-5p, hsa-mir-124-3p, hsa-mir-1-3p), which are involved in the advanced glycation end products (AGE)-receptor for advanced glycation end products (RAGE) signaling pathway, culminating in diabetic complications and endocrine resistance.

Conclusions: This study identified tissue-specific DEGs in T2DM, especially pertaining to the heart, liver, and pancreas. We identified a total of 16 common DEGs and the top four common targeting miRNAs (hsa-let-7b-5p, hsa-miR-124-3p, hsa-miR-1-3p, and has-miR-155-5p). The miRNAs identified are involved in regulating various pathways, including the phosphatidylinositol-3-kinase-protein kinase B, endocrine resistance, and AGE-RAGE signaling pathways.

2型糖尿病(T2DM)是一种伴有严重合并症的代谢紊乱。多组学方法可以通过适当验证潜在的microRNA (miRNA)相互作用,促进新的治疗靶点和生物标志物的鉴定。本研究的目的是利用计算机分析,从公开可获得的基因表达综合(GEO)数据集中的T2DM患者中,确定不同组织中显著差异表达的共同靶基因及其调节mirna。利用从5个公开可用的T2DM数据集中鉴定的差异表达基因(DEGs),我们进行了功能富集、共表达和网络分析,以确定T2DM相关的通路、蛋白-蛋白相互作用和miRNA-mRNA相互作用。我们分别从GEO数据集GSE38642、GSE25724、GSE20966、GSE26887和GSE23343中提取2852、8631、5501、3662和3753个deg。DEG分析显示,在胰岛素分泌、内分泌抵抗等t2dm相关通路中,有16个常见基因富集。四个DEGs, MAML3, EEF1D, NRG1和CDK5RAP2,在由常见靶向mirna (hsa-let-7b-5p, hsa-mir-155-5p, hsa-mir-124-3p, hsa-mir-1-3p)调节的集群网络中是重要的,这些mirna参与晚期糖化终产物(AGE)-晚期糖化终产物受体(RAGE)信号通路,最终导致糖尿病并发症和内分泌抵抗。本研究确定了T2DM的组织特异性deg,特别是与心脏、肝脏和胰腺有关的组织特异性deg。我们共鉴定出16种常见的deg和4种常见的靶向mirna (hsa-let-7b-5p、hsa-miR-124-3p、hsa-miR-1-3p和has-miR-155-5p)。所鉴定的mirna参与调节多种途径,包括磷脂酰肌醇-3-激酶-蛋白激酶B、内分泌抵抗和AGE-RAGE信号通路。
{"title":"Novel Molecular Networks and Regulatory MicroRNAs in Type 2 Diabetes Mellitus: Multiomics Integration and Interactomics Study.","authors":"Manoj Khokhar, Dipayan Roy, Sojit Tomo, Ashita Gadwal, Praveen Sharma, Purvi Purohit","doi":"10.2196/32437","DOIUrl":"10.2196/32437","url":null,"abstract":"<p><strong>Background: </strong>Type 2 diabetes mellitus (T2DM) is a metabolic disorder with severe comorbidities. A multiomics approach can facilitate the identification of novel therapeutic targets and biomarkers with proper validation of potential microRNA (miRNA) interactions.</p><p><strong>Objective: </strong>The aim of this study was to identify significant differentially expressed common target genes in various tissues and their regulating miRNAs from publicly available Gene Expression Omnibus (GEO) data sets of patients with T2DM using in silico analysis.</p><p><strong>Methods: </strong>Using differentially expressed genes (DEGs) identified from 5 publicly available T2DM data sets, we performed functional enrichment, coexpression, and network analyses to identify pathways, protein-protein interactions, and miRNA-mRNA interactions involved in T2DM.</p><p><strong>Results: </strong>We extracted 2852, 8631, 5501, 3662, and 3753 DEGs from the expression profiles of GEO data sets GSE38642, GSE25724, GSE20966, GSE26887, and GSE23343, respectively. DEG analysis showed that 16 common genes were enriched in insulin secretion, endocrine resistance, and other T2DM-related pathways. Four DEGs, MAML3, EEF1D, NRG1, and CDK5RAP2, were important in the cluster network regulated by commonly targeted miRNAs (hsa-let-7b-5p, hsa-mir-155-5p, hsa-mir-124-3p, hsa-mir-1-3p), which are involved in the advanced glycation end products (AGE)-receptor for advanced glycation end products (RAGE) signaling pathway, culminating in diabetic complications and endocrine resistance.</p><p><strong>Conclusions: </strong>This study identified tissue-specific DEGs in T2DM, especially pertaining to the heart, liver, and pancreas. We identified a total of 16 common DEGs and the top four common targeting miRNAs (hsa-let-7b-5p, hsa-miR-124-3p, hsa-miR-1-3p, and has-miR-155-5p). The miRNAs identified are involved in regulating various pathways, including the phosphatidylinositol-3-kinase-protein kinase B, endocrine resistance, and AGE-RAGE signaling pathways.</p>","PeriodicalId":73552,"journal":{"name":"JMIR bioinformatics and biotechnology","volume":" ","pages":"e32437"},"PeriodicalIF":0.0,"publicationDate":"2022-02-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11135235/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48397752","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Convolutional Neural Network-Based Automatic Classification of Colorectal and Prostate Tumor Biopsies Using Multispectral Imagery: System Development Study. 基于卷积神经网络的多光谱图像结直肠癌和前列腺肿瘤活检自动分类:系统开发研究。
Pub Date : 2022-02-09 DOI: 10.2196/27394
Remy Peyret, Duaa alSaeed, Fouad Khelifi, Nadia Al-Ghreimil, Heyam Al-Baity, Ahmed Bouridane

Background: Colorectal and prostate cancers are the most common types of cancer in men worldwide. To diagnose colorectal and prostate cancer, a pathologist performs a histological analysis on needle biopsy samples. This manual process is time-consuming and error-prone, resulting in high intra- and interobserver variability, which affects diagnosis reliability.

Objective: This study aims to develop an automatic computerized system for diagnosing colorectal and prostate tumors by using images of biopsy samples to reduce time and diagnosis error rates associated with human analysis.

Methods: In this study, we proposed a convolutional neural network (CNN) model for classifying colorectal and prostate tumors from multispectral images of biopsy samples. The key idea was to remove the last block of the convolutional layers and halve the number of filters per layer.

Results: Our results showed excellent performance, with an average test accuracy of 99.8% and 99.5% for the prostate and colorectal data sets, respectively. The system showed excellent performance when compared with pretrained CNNs and other classification methods, as it avoids the preprocessing phase while using a single CNN model for the whole classification task. Overall, the proposed CNN architecture was globally the best-performing system for classifying colorectal and prostate tumor images.

Conclusions: The proposed CNN architecture was detailed and compared with previously trained network models used as feature extractors. These CNNs were also compared with other classification techniques. As opposed to pretrained CNNs and other classification approaches, the proposed CNN yielded excellent results. The computational complexity of the CNNs was also investigated, and it was shown that the proposed CNN is better at classifying images than pretrained networks because it does not require preprocessing. Thus, the overall analysis was that the proposed CNN architecture was globally the best-performing system for classifying colorectal and prostate tumor images.

背景:结直肠癌和前列腺癌是全球男性最常见的癌症类型。要诊断结直肠癌和前列腺癌,病理学家需要对针刺活检样本进行组织学分析。这种手工操作既费时又容易出错,导致观察者内部和观察者之间的差异很大,影响了诊断的可靠性:本研究旨在开发一种利用活检样本图像诊断结直肠肿瘤和前列腺肿瘤的计算机化自动系统,以减少人工分析所需的时间和诊断错误率:在这项研究中,我们提出了一种卷积神经网络(CNN)模型,用于根据活检样本的多光谱图像对结直肠肿瘤和前列腺肿瘤进行分类。其主要思路是移除卷积层的最后一个区块,并将每层的滤波器数量减半:结果:我们的研究结果表明系统性能卓越,前列腺和结直肠数据集的平均测试准确率分别为 99.8% 和 99.5%。与预训练的 CNN 和其他分类方法相比,该系统表现出色,因为它避免了预处理阶段,同时使用单一 CNN 模型完成整个分类任务。总体而言,所提出的 CNN 架构是全球范围内对结直肠癌和前列腺肿瘤图像进行分类的最佳系统:我们详细介绍了所提出的 CNN 架构,并将其与之前训练的用作特征提取器的网络模型进行了比较。这些 CNN 还与其他分类技术进行了比较。与预训练的 CNN 和其他分类方法相比,所提出的 CNN 取得了优异的结果。此外,还对 CNN 的计算复杂性进行了研究,结果表明,与预训练网络相比,拟议的 CNN 在图像分类方面更胜一筹,因为它不需要进行预处理。因此,总体分析结果表明,在对结直肠和前列腺肿瘤图像进行分类方面,所提出的 CNN 架构是全球表现最佳的系统。
{"title":"Convolutional Neural Network-Based Automatic Classification of Colorectal and Prostate Tumor Biopsies Using Multispectral Imagery: System Development Study.","authors":"Remy Peyret, Duaa alSaeed, Fouad Khelifi, Nadia Al-Ghreimil, Heyam Al-Baity, Ahmed Bouridane","doi":"10.2196/27394","DOIUrl":"10.2196/27394","url":null,"abstract":"<p><strong>Background: </strong>Colorectal and prostate cancers are the most common types of cancer in men worldwide. To diagnose colorectal and prostate cancer, a pathologist performs a histological analysis on needle biopsy samples. This manual process is time-consuming and error-prone, resulting in high intra- and interobserver variability, which affects diagnosis reliability.</p><p><strong>Objective: </strong>This study aims to develop an automatic computerized system for diagnosing colorectal and prostate tumors by using images of biopsy samples to reduce time and diagnosis error rates associated with human analysis.</p><p><strong>Methods: </strong>In this study, we proposed a convolutional neural network (CNN) model for classifying colorectal and prostate tumors from multispectral images of biopsy samples. The key idea was to remove the last block of the convolutional layers and halve the number of filters per layer.</p><p><strong>Results: </strong>Our results showed excellent performance, with an average test accuracy of 99.8% and 99.5% for the prostate and colorectal data sets, respectively. The system showed excellent performance when compared with pretrained CNNs and other classification methods, as it avoids the preprocessing phase while using a single CNN model for the whole classification task. Overall, the proposed CNN architecture was globally the best-performing system for classifying colorectal and prostate tumor images.</p><p><strong>Conclusions: </strong>The proposed CNN architecture was detailed and compared with previously trained network models used as feature extractors. These CNNs were also compared with other classification techniques. As opposed to pretrained CNNs and other classification approaches, the proposed CNN yielded excellent results. The computational complexity of the CNNs was also investigated, and it was shown that the proposed CNN is better at classifying images than pretrained networks because it does not require preprocessing. Thus, the overall analysis was that the proposed CNN architecture was globally the best-performing system for classifying colorectal and prostate tumor images.</p>","PeriodicalId":73552,"journal":{"name":"JMIR bioinformatics and biotechnology","volume":"abs/2301.13151 1","pages":"e27394"},"PeriodicalIF":0.0,"publicationDate":"2022-02-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11135179/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"68432734","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Use of Artificial Intelligence in the Search for New Information Through Routine Laboratory Tests: Systematic Review. 人工智能在通过常规实验室测试搜索新信息中的应用:系统综述。
Pub Date : 2022-01-01 DOI: 10.2196/40473
Glauco Cardozo, Salvador Francisco Tirloni, Antônio Renato Pereira Moro, Jefferson Luiz Brum Marques

Background: In recent decades, the use of artificial intelligence has been widely explored in health care. Similarly, the amount of data generated in the most varied medical processes has practically doubled every year, requiring new methods of analysis and treatment of these data. Mainly aimed at aiding in the diagnosis and prevention of diseases, this precision medicine has shown great potential in different medical disciplines. Laboratory tests, for example, almost always present their results separately as individual values. However, physicians need to analyze a set of results to propose a supposed diagnosis, which leads us to think that sets of laboratory tests may contain more information than those presented separately for each result. In this way, the processes of medical laboratories can be strongly affected by these techniques.

Objective: In this sense, we sought to identify scientific research that used laboratory tests and machine learning techniques to predict hidden information and diagnose diseases.

Methods: The methodology adopted used the population, intervention, comparison, and outcomes principle, searching the main engineering and health sciences databases. The search terms were defined based on the list of terms used in the Medical Subject Heading database. Data from this study were presented descriptively and followed the PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses; 2020) statement flow diagram and the National Institutes of Health tool for quality assessment of articles. During the analysis, the inclusion and exclusion criteria were independently applied by 2 authors, with a third author being consulted in cases of disagreement.

Results: Following the defined requirements, 40 studies presenting good quality in the analysis process were selected and evaluated. We found that, in recent years, there has been a significant increase in the number of works that have used this methodology, mainly because of COVID-19. In general, the studies used machine learning classification models to predict new information, and the most used parameters were data from routine laboratory tests such as the complete blood count.

Conclusions: Finally, we conclude that laboratory tests, together with machine learning techniques, can predict new tests, thus helping the search for new diagnoses. This process has proved to be advantageous and innovative for medical laboratories. It is making it possible to discover hidden information and propose additional tests, reducing the number of false negatives and helping in the early discovery of unknown diseases.

背景:近几十年来,人工智能在医疗保健领域的应用得到了广泛的探索。同样,在最不同的医疗过程中产生的数据量几乎每年翻一番,需要新的分析和处理这些数据的方法。这种以帮助诊断和预防疾病为主要目的的精准医学在不同的医学学科中显示出巨大的潜力。例如,实验室测试几乎总是将其结果作为单独的值单独呈现。然而,医生需要分析一组结果来提出假定的诊断,这使我们认为,一组实验室测试可能包含比每个结果单独提供的信息更多的信息。这样,医学实验室的流程就会受到这些技术的强烈影响。目的:从这个意义上说,我们试图确定使用实验室测试和机器学习技术来预测隐藏信息和诊断疾病的科学研究。方法:采用人口、干预、比较和结果原则,检索主要的工程和健康科学数据库。搜索术语是根据医学主题标题数据库中使用的术语列表定义的。本研究的数据以描述性方式呈现,并遵循PRISMA(用于系统评价和meta分析的首选报告项目;2020)声明流程图和美国国立卫生研究院文章质量评估工具。在分析过程中,纳入和排除标准由2位作者独立应用,如有不同意见,请咨询第三位作者。结果:根据定义的要求,在分析过程中选择并评估了40项质量较好的研究。我们发现,近年来,主要由于COVID-19,使用这种方法的作品数量显著增加。总的来说,这些研究使用机器学习分类模型来预测新的信息,最常用的参数是来自常规实验室测试的数据,如全血细胞计数。结论:最后,我们得出结论,实验室测试与机器学习技术一起可以预测新的测试,从而有助于寻找新的诊断。这一过程已被证明是有利的和创新的医学实验室。它使人们能够发现隐藏的信息并提出额外的检测方法,减少假阴性的数量,并有助于及早发现未知疾病。
{"title":"Use of Artificial Intelligence in the Search for New Information Through Routine Laboratory Tests: Systematic Review.","authors":"Glauco Cardozo,&nbsp;Salvador Francisco Tirloni,&nbsp;Antônio Renato Pereira Moro,&nbsp;Jefferson Luiz Brum Marques","doi":"10.2196/40473","DOIUrl":"https://doi.org/10.2196/40473","url":null,"abstract":"<p><strong>Background: </strong>In recent decades, the use of artificial intelligence has been widely explored in health care. Similarly, the amount of data generated in the most varied medical processes has practically doubled every year, requiring new methods of analysis and treatment of these data. Mainly aimed at aiding in the diagnosis and prevention of diseases, this precision medicine has shown great potential in different medical disciplines. Laboratory tests, for example, almost always present their results separately as individual values. However, physicians need to analyze a set of results to propose a supposed diagnosis, which leads us to think that sets of laboratory tests may contain more information than those presented separately for each result. In this way, the processes of medical laboratories can be strongly affected by these techniques.</p><p><strong>Objective: </strong>In this sense, we sought to identify scientific research that used laboratory tests and machine learning techniques to predict hidden information and diagnose diseases.</p><p><strong>Methods: </strong>The methodology adopted used the population, intervention, comparison, and outcomes principle, searching the main engineering and health sciences databases. The search terms were defined based on the list of terms used in the Medical Subject Heading database. Data from this study were presented descriptively and followed the PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses; 2020) statement flow diagram and the National Institutes of Health tool for quality assessment of articles. During the analysis, the inclusion and exclusion criteria were independently applied by 2 authors, with a third author being consulted in cases of disagreement.</p><p><strong>Results: </strong>Following the defined requirements, 40 studies presenting good quality in the analysis process were selected and evaluated. We found that, in recent years, there has been a significant increase in the number of works that have used this methodology, mainly because of COVID-19. In general, the studies used machine learning classification models to predict new information, and the most used parameters were data from routine laboratory tests such as the complete blood count.</p><p><strong>Conclusions: </strong>Finally, we conclude that laboratory tests, together with machine learning techniques, can predict new tests, thus helping the search for new diagnoses. This process has proved to be advantageous and innovative for medical laboratories. It is making it possible to discover hidden information and propose additional tests, reducing the number of false negatives and helping in the early discovery of unknown diseases.</p>","PeriodicalId":73552,"journal":{"name":"JMIR bioinformatics and biotechnology","volume":"3 1","pages":"e40473"},"PeriodicalIF":0.0,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9828303/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10540975","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Nonfungible Tokens as a Blockchain Solution to Ethical Challenges for the Secondary Use of Biospecimens: Viewpoint. 不可替代代币:区块链解决生物标本二次使用的伦理挑战(预印本)
Pub Date : 2021-10-22 DOI: 10.2196/29905
Marielle S Gross, Amelia J Hood, Robert C Miller

Henrietta Lacks' deidentified tissue became HeLa cells (the paradigmatic learning health platform). In this article, we discuss separating research on Ms Lacks' tissue from obligations to promote respect, beneficence, and justice for her as a patient. This case illuminates ethical challenges for the secondary use of biospecimens, which persist in contemporary learning health systems. Deidentification and broad consent seek to maximize the benefits of learning from care by minimizing burdens on patients, but these strategies are insufficient for privacy, transparency, and engagement. The resulting supply chain for human cellular and tissue-based products may therefore recapitulate the harms experienced by the Lacks family. We introduce the potential for blockchain technology to build unprecedented transparency, engagement, and accountability into learning health system architecture without requiring deidentification. The ability of nonfungible tokens to maintain the provenance of inherently unique digital assets may optimize utility, value, and respect for patients who contribute tissue and other clinical data for research. We consider the potential benefits and survey major technical, ethical, socioeconomic, and legal challenges for the successful implementation of the proposed solutions. The potential for nonfungible tokens to promote efficiency, effectiveness, and justice in learning health systems demands further exploration.

亨丽埃塔-拉克斯(Henrietta Lacks)的身份已被确认的组织变成了 HeLa 细胞(学习健康平台的典范)。在这篇文章中,我们讨论了将对拉克斯女士组织的研究与促进对她作为病人的尊重、惠益和公正的义务分离开来的问题。这一案例揭示了生物样本二次利用所面临的伦理挑战,这些挑战在当代学习型医疗体系中依然存在。去标识化和广泛同意旨在通过最大限度地减少患者负担来最大限度地提高护理学习的效益,但这些策略对于隐私、透明度和参与度来说是不够的。因此,由此产生的人体细胞和组织产品供应链可能会重现拉克斯一家所经历的伤害。我们介绍了区块链技术在学习型医疗系统架构中建立前所未有的透明度、参与度和问责制的潜力,而不需要去身份化。不可篡改的代币能够保持固有的独特数字资产的出处,这可能会优化为研究提供组织和其他临床数据的患者的效用、价值和尊重。我们考虑了潜在的益处,并调查了成功实施拟议解决方案所面临的主要技术、伦理、社会经济和法律挑战。我们需要进一步探索不可兑换代币在促进学习型医疗系统的效率、有效性和公正性方面的潜力。
{"title":"Nonfungible Tokens as a Blockchain Solution to Ethical Challenges for the Secondary Use of Biospecimens: Viewpoint.","authors":"Marielle S Gross, Amelia J Hood, Robert C Miller","doi":"10.2196/29905","DOIUrl":"10.2196/29905","url":null,"abstract":"<p><p>Henrietta Lacks' deidentified tissue became HeLa cells (the paradigmatic learning health platform). In this article, we discuss separating research on Ms Lacks' tissue from obligations to promote respect, beneficence, and justice for her as a patient. This case illuminates ethical challenges for the secondary use of biospecimens, which persist in contemporary learning health systems. Deidentification and broad consent seek to maximize the benefits of learning from care by minimizing burdens on patients, but these strategies are insufficient for privacy, transparency, and engagement. The resulting supply chain for human cellular and tissue-based products may therefore recapitulate the harms experienced by the Lacks family. We introduce the potential for blockchain technology to build unprecedented transparency, engagement, and accountability into learning health system architecture without requiring deidentification. The ability of nonfungible tokens to maintain the provenance of inherently unique digital assets may optimize utility, value, and respect for patients who contribute tissue and other clinical data for research. We consider the potential benefits and survey major technical, ethical, socioeconomic, and legal challenges for the successful implementation of the proposed solutions. The potential for nonfungible tokens to promote efficiency, effectiveness, and justice in learning health systems demands further exploration.</p>","PeriodicalId":73552,"journal":{"name":"JMIR bioinformatics and biotechnology","volume":" ","pages":"e29905"},"PeriodicalIF":0.0,"publicationDate":"2021-10-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11168237/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42689875","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Identification of a novel c.3080delC JAG1 gene mutation associated with Alagille syndrome by whole-exome sequencing (Preprint) 用全外显子组测序鉴定与Alagille综合征相关的c.3080delC JAG1基因突变(预印本)
Pub Date : 2021-09-30 DOI: 10.2196/preprints.33946
D. Panwar, Dr. Kumar Gautam Singh, Ms. Shruti Mathur, Mr. Bhagwati Prasad, Ms. Anita Joshi, Dr. Vandana Lal, A. Thatai
BACKGROUND Alagille syndrome is an autosomal dominant disorder associated with variable clinical phenotypic features including cholestasis, congenital heart defects, vertebral defects, and dysmorphic facies. OBJECTIVE Whole-exome sequencing (WES) has become technically feasible due to the recent advances in next-generation sequencing technologies, therefore offering new opportunities for mutations/genes identification. METHODS Next-generation sequencing (NGS) - Whole-exome sequencing was used to identify pathogenic variants of the proband. In this paper, we have uncovered a novel JAG1 mutation associated with Alagille syndrome in a 5 years old girl presented with conjugated hyperbilirubinemia and infantile cholestasis. RESULTS The exome sequencing analysis revealed the presence of a novel JAG1 heterozygous c.3080delC variant in exon 25. The detected mutation determines a stop codon (p.P1027RfsTer9) in the gene sequence, encoding a truncated protein. Our exome observations were confirmed through Sanger sequencing as well. CONCLUSIONS Here, we report a case of a patient diagnosed with Alagille syndrome, and our finding emphasis the detection of novel JAG1 mutation associated with Alagille syndrome variants thereby, establishing the genetic diagnosis of the disease. CLINICALTRIAL N/A
Alagille综合征是一种常染色体显性遗传病,具有多种临床表型特征,包括胆汁淤积、先天性心脏缺陷、椎体缺损和畸形相。由于新一代测序技术的进步,全外显子组测序(WES)在技术上已经变得可行,因此为突变/基因鉴定提供了新的机会。方法采用新一代测序(NGS) -全外显子组测序技术鉴定先证者的致病变异。在本文中,我们发现了一种与Alagille综合征相关的新型JAG1突变,该突变发生在一名患有共轭高胆红素血症和婴儿胆汁淤积症的5岁女孩身上。结果外显子测序结果显示,在25号外显子中存在一个新的JAG1杂合c.3080delC变异。检测到的突变决定了基因序列中的一个终止密码子(p.P1027RfsTer9),编码一个截断的蛋白质。我们的外显子组观察结果也通过桑格测序得到了证实。在此,我们报告了一例被诊断为Alagille综合征的患者,我们的发现强调了检测与Alagille综合征变异相关的新型JAG1突变,从而建立了该疾病的遗传诊断。临床试验N /一个
{"title":"Identification of a novel c.3080delC JAG1 gene mutation associated with Alagille syndrome by whole-exome sequencing (Preprint)","authors":"D. Panwar, Dr. Kumar Gautam Singh, Ms. Shruti Mathur, Mr. Bhagwati Prasad, Ms. Anita Joshi, Dr. Vandana Lal, A. Thatai","doi":"10.2196/preprints.33946","DOIUrl":"https://doi.org/10.2196/preprints.33946","url":null,"abstract":"\u0000 BACKGROUND\u0000 Alagille syndrome is an autosomal dominant disorder associated with variable clinical phenotypic features including cholestasis, congenital heart defects, vertebral defects, and dysmorphic facies.\u0000 \u0000 \u0000 OBJECTIVE\u0000 Whole-exome sequencing (WES) has become technically feasible due to the recent advances in next-generation sequencing technologies, therefore offering new opportunities for mutations/genes identification.\u0000 \u0000 \u0000 METHODS\u0000 Next-generation sequencing (NGS) - Whole-exome sequencing was used to identify pathogenic variants of the proband. In this paper, we have uncovered a novel JAG1 mutation associated with Alagille syndrome in a 5 years old girl presented with conjugated hyperbilirubinemia and infantile cholestasis.\u0000 \u0000 \u0000 RESULTS\u0000 The exome sequencing analysis revealed the presence of a novel JAG1 heterozygous c.3080delC variant in exon 25. The detected mutation determines a stop codon (p.P1027RfsTer9) in the gene sequence, encoding a truncated protein. Our exome observations were confirmed through Sanger sequencing as well.\u0000 \u0000 \u0000 CONCLUSIONS\u0000 Here, we report a case of a patient diagnosed with Alagille syndrome, and our finding emphasis the detection of novel JAG1 mutation associated with Alagille syndrome variants thereby, establishing the genetic diagnosis of the disease.\u0000 \u0000 \u0000 CLINICALTRIAL\u0000 N/A\u0000","PeriodicalId":73552,"journal":{"name":"JMIR bioinformatics and biotechnology","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49021753","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Identification of Potential Vaccine Candidates Against SARS-CoV-2 to Fight COVID-19: Reverse Vaccinology Approach 识别对抗新冠肺炎的潜在SARS-CoV-2候选疫苗:反向疫苗学方法
Pub Date : 2021-07-26 DOI: 10.2196/32401
E. Gupta, R. Mishra, Ravi Ranjan Kumar Niraj
Background The recent emergence of COVID-19 has caused an immense global public health crisis. The etiological agent of COVID-19 is the novel coronavirus SARS-CoV-2. More research in the field of developing effective vaccines against this emergent viral disease is indeed a need of the hour. Objective The aim of this study was to identify effective vaccine candidates that can offer a new milestone in the battle against COVID-19. Methods We used a reverse vaccinology approach to explore the SARS-CoV-2 genome among strains prominent in India. Epitopes were predicted and then molecular docking and simulation were used to verify the molecular interaction of the candidate antigenic peptide with corresponding amino acid residues of the host protein. Results A promising antigenic peptide, GVYFASTEK, from the surface glycoprotein of SARS-CoV-2 (protein accession number QIA98583.1) was predicted to interact with the human major histocompatibility complex (MHC) class I human leukocyte antigen (HLA)-A*11-01 allele, showing up to 90% conservancy and a high antigenicity value. After vigorous analysis, this peptide was predicted to be a suitable epitope capable of inducing a strong cell-mediated immune response against SARS-CoV-2. Conclusions These results could facilitate selecting SARS-CoV-2 epitopes for vaccine production pipelines in the immediate future. This novel research will certainly pave the way for a fast, reliable, and effective platform to provide a timely countermeasure against this dangerous virus responsible for the COVID-19 pandemic.
近期新冠肺炎疫情的出现,引发了一场巨大的全球公共卫生危机。COVID-19的病原是新型冠状病毒SARS-CoV-2。在开发针对这一突发病毒性疾病的有效疫苗方面进行更多的研究确实是当务之急。目的寻找有效的候选疫苗,为抗击COVID-19提供新的里程碑。方法采用反向疫苗学方法对印度突出菌株的SARS-CoV-2基因组进行研究。预测抗原表位,然后通过分子对接和模拟验证候选抗原肽与宿主蛋白相应氨基酸残基的分子相互作用。结果从SARS-CoV-2表面糖蛋白中分离出一种极具潜力的抗原肽gvyfastk(蛋白加入号QIA98583.1),可与人主要组织相容性复合体(MHC) I类人白细胞抗原(HLA)-A*11-01等位基因相互作用,保护率高达90%,具有较高的抗原性。经过严格的分析,预测该肽是一个合适的表位,能够诱导针对SARS-CoV-2的强细胞介导免疫应答。结论这些结果可为今后疫苗生产管道中SARS-CoV-2抗原表位的选择提供参考。这项新研究必将为建立一个快速、可靠、有效的平台铺平道路,为应对这种导致COVID-19大流行的危险病毒提供及时的对策。
{"title":"Identification of Potential Vaccine Candidates Against SARS-CoV-2 to Fight COVID-19: Reverse Vaccinology Approach","authors":"E. Gupta, R. Mishra, Ravi Ranjan Kumar Niraj","doi":"10.2196/32401","DOIUrl":"https://doi.org/10.2196/32401","url":null,"abstract":"Background The recent emergence of COVID-19 has caused an immense global public health crisis. The etiological agent of COVID-19 is the novel coronavirus SARS-CoV-2. More research in the field of developing effective vaccines against this emergent viral disease is indeed a need of the hour. Objective The aim of this study was to identify effective vaccine candidates that can offer a new milestone in the battle against COVID-19. Methods We used a reverse vaccinology approach to explore the SARS-CoV-2 genome among strains prominent in India. Epitopes were predicted and then molecular docking and simulation were used to verify the molecular interaction of the candidate antigenic peptide with corresponding amino acid residues of the host protein. Results A promising antigenic peptide, GVYFASTEK, from the surface glycoprotein of SARS-CoV-2 (protein accession number QIA98583.1) was predicted to interact with the human major histocompatibility complex (MHC) class I human leukocyte antigen (HLA)-A*11-01 allele, showing up to 90% conservancy and a high antigenicity value. After vigorous analysis, this peptide was predicted to be a suitable epitope capable of inducing a strong cell-mediated immune response against SARS-CoV-2. Conclusions These results could facilitate selecting SARS-CoV-2 epitopes for vaccine production pipelines in the immediate future. This novel research will certainly pave the way for a fast, reliable, and effective platform to provide a timely countermeasure against this dangerous virus responsible for the COVID-19 pandemic.","PeriodicalId":73552,"journal":{"name":"JMIR bioinformatics and biotechnology","volume":"3 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41873622","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Isolating SARS-CoV-2 Strains From Countries in the Same Meridian: Genome Evolutionary Analysis. 从同一子午线国家分离 SARS-CoV-2 株系:基因组进化分析。
Pub Date : 2021-01-22 eCollection Date: 2021-01-01 DOI: 10.2196/25995
Emilio Mastriani, Alexey V Rakov, Shu-Lin Liu

Background: COVID-19, caused by the novel SARS-CoV-2, is considered the most threatening respiratory infection in the world, with over 40 million people infected and over 0.934 million related deaths reported worldwide. It is speculated that epidemiological and clinical features of COVID-19 may differ across countries or continents. Genomic comparison of 48,635 SARS-CoV-2 genomes has shown that the average number of mutations per sample was 7.23, and most SARS-CoV-2 strains belong to one of 3 clades characterized by geographic and genomic specificity: Europe, Asia, and North America.

Objective: The aim of this study was to compare the genomes of SARS-CoV-2 strains isolated from Italy, Sweden, and Congo, that is, 3 different countries in the same meridian (longitude) but with different climate conditions, and from Brazil (as an outgroup country), to analyze similarities or differences in patterns of possible evolutionary pressure signatures in their genomes.

Methods: We obtained data from the Global Initiative on Sharing All Influenza Data repository by sampling all genomes available on that date. Using HyPhy, we achieved the recombination analysis by genetic algorithm recombination detection method, trimming, removal of the stop codons, and phylogenetic tree and mixed effects model of evolution analyses. We also performed secondary structure prediction analysis for both sequences (mutated and wild-type) and "disorder" and "transmembrane" analyses of the protein. We analyzed both protein structures with an ab initio approach to predict their ontologies and 3D structures.

Results: Evolutionary analysis revealed that codon 9628 is under episodic selective pressure for all SARS-CoV-2 strains isolated from the 4 countries, suggesting it is a key site for virus evolution. Codon 9628 encodes the P0DTD3 (Y14_SARS2) uncharacterized protein 14. Further investigation showed that the codon mutation was responsible for helical modification in the secondary structure. The codon was positioned in the more ordered region of the gene (41-59) and near to the area acting as the transmembrane (54-67), suggesting its involvement in the attachment phase of the virus. The predicted protein structures of both wild-type and mutated P0DTD3 confirmed the importance of the codon to define the protein structure. Moreover, ontological analysis of the protein emphasized that the mutation enhances the binding probability.

Conclusions: Our results suggest that RNA secondary structure may be affected and, consequently, the protein product changes T (threonine) to G (glycine) in position 50 of the protein. This position is located close to the predicted transmembrane region. Mutation analysis revealed that the change from G (glycine) to D (aspartic acid) may confer a new function to the protein-binding activity, which in turn may be responsible for attaching the virus to

背景:由新型 SARS-CoV-2 引起的 COVID-19 被认为是世界上最具威胁性的呼吸道传染病,全球有超过 4000 万人感染,超过 93.4 万人因此死亡。据推测,COVID-19 的流行病学和临床特征在不同国家或大陆可能有所不同。对 48,635 个 SARS-CoV-2 基因组进行的基因组比较显示,每个样本的平均变异数为 7.23 个,大多数 SARS-CoV-2 株系属于 3 个具有地理和基因组特异性的支系之一:大多数 SARS-CoV-2 株系属于 3 个具有地理和基因组特异性的支系之一:欧洲、亚洲和北美洲:本研究的目的是比较从意大利、瑞典和刚果(即位于同一子午线(经度)但气候条件不同的 3 个不同国家)分离的 SARS-CoV-2 株系以及从巴西(作为外群国家)分离的 SARS-CoV-2 株系的基因组,分析其基因组中可能存在的进化压力特征模式的异同:我们从全球流感数据共享计划(Global Initiative on Sharing All Influenza Data Repository)中获取数据,对当日可用的所有基因组进行采样。使用 HyPhy,我们通过遗传算法重组检测方法、修剪、移除终止密码子、系统发生树和进化混合效应模型分析实现了重组分析。我们还对两个序列(突变型和野生型)进行了二级结构预测分析,并对蛋白质进行了 "紊乱 "和 "跨膜 "分析。我们用ab initio方法分析了这两种蛋白质的结构,以预测它们的本体和三维结构:进化分析表明,在这 4 个国家分离出的所有 SARS-CoV-2 株系中,密码子 9628 都受到偶发性选择压力,这表明它是病毒进化的一个关键位点。密码子 9628 编码 P0DTD3(Y14_SARS2)未定性蛋白 14。进一步研究表明,该密码子突变导致二级结构发生螺旋状改变。该密码子位于基因较有序的区域(41-59),靠近作为跨膜的区域(54-67),表明它参与了病毒的附着阶段。野生型和突变型 P0DTD3 的预测蛋白质结构证实了密码子对确定蛋白质结构的重要性。此外,对蛋白质的本体分析强调,突变增强了结合概率:我们的研究结果表明,RNA 二级结构可能受到了影响,因此蛋白质产物中第 50 位的 T(苏氨酸)变为了 G(甘氨酸)。该位置靠近预测的跨膜区。突变分析表明,从 G(甘氨酸)到 D(天冬氨酸)的变化可能赋予蛋白质结合活性一种新的功能,而这又可能是病毒附着在人类真核细胞上的原因。这些发现有助于设计体外实验,并有可能促进疫苗设计和成功的抗病毒策略。
{"title":"Isolating SARS-CoV-2 Strains From Countries in the Same Meridian: Genome Evolutionary Analysis.","authors":"Emilio Mastriani, Alexey V Rakov, Shu-Lin Liu","doi":"10.2196/25995","DOIUrl":"10.2196/25995","url":null,"abstract":"<p><strong>Background: </strong>COVID-19, caused by the novel SARS-CoV-2, is considered the most threatening respiratory infection in the world, with over 40 million people infected and over 0.934 million related deaths reported worldwide. It is speculated that epidemiological and clinical features of COVID-19 may differ across countries or continents. Genomic comparison of 48,635 SARS-CoV-2 genomes has shown that the average number of mutations per sample was 7.23, and most SARS-CoV-2 strains belong to one of 3 clades characterized by geographic and genomic specificity: Europe, Asia, and North America.</p><p><strong>Objective: </strong>The aim of this study was to compare the genomes of SARS-CoV-2 strains isolated from Italy, Sweden, and Congo, that is, 3 different countries in the same meridian (longitude) but with different climate conditions, and from Brazil (as an outgroup country), to analyze similarities or differences in patterns of possible evolutionary pressure signatures in their genomes.</p><p><strong>Methods: </strong>We obtained data from the Global Initiative on Sharing All Influenza Data repository by sampling all genomes available on that date. Using HyPhy, we achieved the recombination analysis by genetic algorithm recombination detection method, trimming, removal of the stop codons, and phylogenetic tree and mixed effects model of evolution analyses. We also performed secondary structure prediction analysis for both sequences (mutated and wild-type) and \"disorder\" and \"transmembrane\" analyses of the protein. We analyzed both protein structures with an ab initio approach to predict their ontologies and 3D structures.</p><p><strong>Results: </strong>Evolutionary analysis revealed that codon 9628 is under episodic selective pressure for all SARS-CoV-2 strains isolated from the 4 countries, suggesting it is a key site for virus evolution. Codon 9628 encodes the P0DTD3 (Y14_SARS2) uncharacterized protein 14. Further investigation showed that the codon mutation was responsible for helical modification in the secondary structure. The codon was positioned in the more ordered region of the gene (41-59) and near to the area acting as the transmembrane (54-67), suggesting its involvement in the attachment phase of the virus. The predicted protein structures of both wild-type and mutated P0DTD3 confirmed the importance of the codon to define the protein structure. Moreover, ontological analysis of the protein emphasized that the mutation enhances the binding probability.</p><p><strong>Conclusions: </strong>Our results suggest that RNA secondary structure may be affected and, consequently, the protein product changes T (threonine) to G (glycine) in position 50 of the protein. This position is located close to the predicted transmembrane region. Mutation analysis revealed that the change from G (glycine) to D (aspartic acid) may confer a new function to the protein-binding activity, which in turn may be responsible for attaching the virus to","PeriodicalId":73552,"journal":{"name":"JMIR bioinformatics and biotechnology","volume":"2 1","pages":"e25995"},"PeriodicalIF":0.0,"publicationDate":"2021-01-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7837406/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"38781957","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Novel Coronavirus Enigma: Phylogeny and Analyses of Coevolving Mutations Among the SARS-CoV-2 Viruses Circulating in India. 新型冠状病毒之谜:印度流行的 SARS-CoV-2 病毒的系统发育和共同演化突变分析。
Pub Date : 2020-09-07 eCollection Date: 2020-01-01 DOI: 10.2196/20735
Anindita Banerjee, Rakesh Sarkar, Suvrotoa Mitra, Mahadeb Lo, Shanta Dutta, Mamta Chawla-Sarkar

Background: The RNA genome of the emerging novel coronavirus is rapidly mutating, and its human-to-human transmission rate is increasing. Hence, temporal dissection of their evolutionary dynamics, the nature of variations among different strains, and understanding the single nucleotide polymorphisms in the endemic settings are crucial. Delineating the heterogeneous genomic constellations of this novel virus will help us understand its complex behavior in a particular geographical region.

Objective: This is a comprehensive analysis of 95 Indian SARS-CoV-2 genome sequences available from the Global Initiative on Sharing All Influenza Data (GISAID) repository during the first 6 months of 2020 (January through June). Evolutionary dynamics, gene-specific phylogeny, and the emergence of the novel coevolving mutations in 9 structural and nonstructural genes among circulating SARS-CoV-2 strains across 12 different Indian states were analyzed.

Methods: A total of 95 SARS-CoV-2 nucleotide sequences submitted from India were downloaded from the GISAID database. Molecular Evolutionary Genetics Analysis, version X software was used to construct the 9 phylogenetic dendrograms based on nucleotide sequences of the SARS-CoV-2 genes. Analyses of the coevolving mutations were done in comparison to the prototype SARS-CoV-2 from Wuhan, China. The secondary structure of the RNA-dependent RNA polymerase/nonstructural protein NSP12 was predicted with respect to the novel A97V mutation.

Results: Phylogenetic analyses revealed the evolution of "genome-type clusters" and adaptive selection of "L"-type SARS-CoV-2 strains with genetic closeness to the bat severe acute respiratory syndrome-like coronaviruses. These strains were distant to pangolin or Middle East respiratory syndrome-related coronavirus strains. With regard to the novel coevolving mutations, 2 groups have been seen circulating in India at present, the "major group" (66/95, 69.4%) and the "minor group" (21/95, 22.1%) , harboring 4 and 5 coexisting mutations, respectively. The "major group" mutations fall in the A2a clade. All the minor group mutations, except 11083G>T (L37F, NSP6 gene), were unique to the Indian isolates.

Conclusions: This study highlights the rapidly evolving SARS-CoV-2 virus and the cocirculation of multiple clades and subclades. This comprehensive study is a potential resource for monitoring the novel mutations in the viral genome, interpreting changes in viral pathogenesis, and designing vaccines or other therapeutics.

背景:新出现的新型冠状病毒的 RNA 基因组正在迅速变异,其在人与人之间的传播率也在不断上升。因此,对其进化动态、不同毒株间变异的性质进行时空剖析,以及了解流行环境中的单核苷酸多态性至关重要。描述这种新型病毒的异质性基因组排列将有助于我们了解其在特定地理区域的复杂行为:本文全面分析了 2020 年前 6 个月(1 月至 6 月)全球流感数据共享计划(GISAID)资源库中的 95 个印度 SARS-CoV-2 基因组序列。分析了印度 12 个不同邦的循环 SARS-CoV-2 株系中 9 个结构和非结构基因的进化动态、基因特异性系统发育和新型共变突变的出现:从 GISAID 数据库中下载了印度提交的 95 个 SARS-CoV-2 核苷酸序列。使用分子进化遗传学分析 X 版软件根据 SARS-CoV-2 基因的核苷酸序列构建了 9 个系统发生树枝图。与中国武汉的 SARS-CoV-2 原型相比,对共同演化的突变进行了分析。预测了与新型 A97V 突变相关的 RNA 依赖性 RNA 聚合酶/非结构蛋白 NSP12 的二级结构:结果:系统进化分析表明,"基因组类型群 "的进化和适应性选择产生了 "L "型SARS-CoV-2株,它们与蝙蝠严重急性呼吸综合征类冠状病毒的基因十分接近。这些毒株与穿山甲或中东呼吸综合征相关冠状病毒毒株的基因十分接近。在新的共变突变方面,目前在印度流行的有两类:"大类"(66/95,69.4%)和 "小类"(21/95,22.1%),分别携带 4 种和 5 种共存突变。主要群体 "突变属于 A2a 支系。除 11083G>T(L37F,NSP6 基因)外,所有小群体突变都是印度分离物所特有的:本研究强调了 SARS-CoV-2 病毒的快速演变以及多个支系和亚支系的共同循环。这项全面的研究为监测病毒基因组的新突变、解释病毒发病机制的变化以及设计疫苗或其他疗法提供了潜在的资源。
{"title":"The Novel Coronavirus Enigma: Phylogeny and Analyses of Coevolving Mutations Among the SARS-CoV-2 Viruses Circulating in India.","authors":"Anindita Banerjee, Rakesh Sarkar, Suvrotoa Mitra, Mahadeb Lo, Shanta Dutta, Mamta Chawla-Sarkar","doi":"10.2196/20735","DOIUrl":"10.2196/20735","url":null,"abstract":"<p><strong>Background: </strong>The RNA genome of the emerging novel coronavirus is rapidly mutating, and its human-to-human transmission rate is increasing. Hence, temporal dissection of their evolutionary dynamics, the nature of variations among different strains, and understanding the single nucleotide polymorphisms in the endemic settings are crucial. Delineating the heterogeneous genomic constellations of this novel virus will help us understand its complex behavior in a particular geographical region.</p><p><strong>Objective: </strong>This is a comprehensive analysis of 95 Indian SARS-CoV-2 genome sequences available from the Global Initiative on Sharing All Influenza Data (GISAID) repository during the first 6 months of 2020 (January through June). Evolutionary dynamics, gene-specific phylogeny, and the emergence of the novel coevolving mutations in 9 structural and nonstructural genes among circulating SARS-CoV-2 strains across 12 different Indian states were analyzed.</p><p><strong>Methods: </strong>A total of 95 SARS-CoV-2 nucleotide sequences submitted from India were downloaded from the GISAID database. Molecular Evolutionary Genetics Analysis, version X software was used to construct the 9 phylogenetic dendrograms based on nucleotide sequences of the SARS-CoV-2 genes. Analyses of the coevolving mutations were done in comparison to the prototype SARS-CoV-2 from Wuhan, China. The secondary structure of the RNA-dependent RNA polymerase/nonstructural protein NSP12 was predicted with respect to the novel A97V mutation.</p><p><strong>Results: </strong>Phylogenetic analyses revealed the evolution of \"genome-type clusters\" and adaptive selection of \"L\"-type SARS-CoV-2 strains with genetic closeness to the bat severe acute respiratory syndrome-like coronaviruses. These strains were distant to pangolin or Middle East respiratory syndrome-related coronavirus strains. With regard to the novel coevolving mutations, 2 groups have been seen circulating in India at present, the \"major group\" (66/95, 69.4%) and the \"minor group\" (21/95, 22.1%) , harboring 4 and 5 coexisting mutations, respectively. The \"major group\" mutations fall in the A2a clade. All the minor group mutations, except 11083G>T (L37F, NSP6 gene), were unique to the Indian isolates.</p><p><strong>Conclusions: </strong>This study highlights the rapidly evolving SARS-CoV-2 virus and the cocirculation of multiple clades and subclades. This comprehensive study is a potential resource for monitoring the novel mutations in the viral genome, interpreting changes in viral pathogenesis, and designing vaccines or other therapeutics.</p>","PeriodicalId":73552,"journal":{"name":"JMIR bioinformatics and biotechnology","volume":"1 1","pages":"e20735"},"PeriodicalIF":0.0,"publicationDate":"2020-09-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7720937/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"38866237","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Molecular Docking Using Chimera and Autodock Vina Software for Nonbioinformaticians. 非生物信息学家使用Chimera和Autodock Vina软件进行分子对接
Pub Date : 2020-06-19 DOI: 10.2196/14232
Sania Safdar Butt, Yasmin Badshah, Maria Shabbir, Mehak Rafiq

In the field of drug discovery, many methods of molecular modeling have been employed to study complex biological and chemical systems. Experimental strategies are integrated with computational approaches for the identification, characterization, and development of novel drugs and compounds. In modern drug designing, molecular docking is an approach that explores the confirmation of a ligand within the binding site of a macromolecule. To date, many software and tools for docking have been employed. AutoDock Vina (in UCSF [University of California, San Francisco] Chimera) is one of the computationally fastest and most accurate software employed in docking. In this paper, a sequential demonstration of molecular docking of the ligand fisetin with the target protein Akt has been provided, using AutoDock Vina in UCSF Chimera 1.12. The first step involves target protein ID retrieval from the protein database, the second step involves visualization of the protein structure in UCSF Chimera, the third step involves preparation of the target protein for docking, the fourth step involves preparation of the ligand for docking, the fifth step involves docking of the ligand and the target protein as Mol.2 files in Chimera by using AutoDock Vina, and the final step involves interpretation and analysis of the docking results. By following the guidelines and steps outlined in this paper, researchers with no previous background in bioinformatics research can perform computational docking in an easier and more user-friendly manner.

在药物发现领域,许多分子建模的方法被用于研究复杂的生物和化学系统。实验策略与计算方法相结合,用于鉴定、表征和开发新型药物和化合物。在现代药物设计中,分子对接是一种探索在大分子结合位点内确定配体的方法。迄今为止,已经使用了许多用于对接的软件和工具。AutoDock Vina(位于加州大学旧金山分校)是用于对接的计算速度最快、最准确的软件之一。本文利用UCSF Chimera 1.12软件中的AutoDock Vina,提供了配体非瑟酮与靶蛋白Akt分子对接的序列演示。第一步从蛋白质数据库中检索目标蛋白ID,第二步可视化UCSF嵌合体中蛋白质结构,第三步制备对接目标蛋白,第四步制备对接配体,第五步利用AutoDock Vina将配体与目标蛋白作为mol2文件在嵌合体中对接,最后一步对对接结果进行解读和分析。通过遵循本文概述的指导方针和步骤,没有生物信息学研究背景的研究人员可以以更容易和更友好的方式进行计算对接。
{"title":"Molecular Docking Using Chimera and Autodock Vina Software for Nonbioinformaticians.","authors":"Sania Safdar Butt, Yasmin Badshah, Maria Shabbir, Mehak Rafiq","doi":"10.2196/14232","DOIUrl":"10.2196/14232","url":null,"abstract":"<p><p>In the field of drug discovery, many methods of molecular modeling have been employed to study complex biological and chemical systems. Experimental strategies are integrated with computational approaches for the identification, characterization, and development of novel drugs and compounds. In modern drug designing, molecular docking is an approach that explores the confirmation of a ligand within the binding site of a macromolecule. To date, many software and tools for docking have been employed. AutoDock Vina (in UCSF [University of California, San Francisco] Chimera) is one of the computationally fastest and most accurate software employed in docking. In this paper, a sequential demonstration of molecular docking of the ligand fisetin with the target protein Akt has been provided, using AutoDock Vina in UCSF Chimera 1.12. The first step involves target protein ID retrieval from the protein database, the second step involves visualization of the protein structure in UCSF Chimera, the third step involves preparation of the target protein for docking, the fourth step involves preparation of the ligand for docking, the fifth step involves docking of the ligand and the target protein as Mol.2 files in Chimera by using AutoDock Vina, and the final step involves interpretation and analysis of the docking results. By following the guidelines and steps outlined in this paper, researchers with no previous background in bioinformatics research can perform computational docking in an easier and more user-friendly manner.</p>","PeriodicalId":73552,"journal":{"name":"JMIR bioinformatics and biotechnology","volume":" ","pages":"e14232"},"PeriodicalIF":0.0,"publicationDate":"2020-06-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11168238/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45566561","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
JMIR bioinformatics and biotechnology
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1