首页 > 最新文献

Open Bioinformatics Journal最新文献

英文 中文
QSAR and its Role in Target-Ligand Interaction QSAR及其在靶配体相互作用中的作用
Q3 Computer Science Pub Date : 2013-12-27 DOI: 10.2174/1875036201307010063
Anamika Singh, Rajeev Singh
Each molecule has its own specialty, structure and function and when these molecules are combined together they form a compound. Structure and function of a molecule are related to each other and QSARs (Quantitative Structure- Activity relationships) are based on the criteria that the structure of a molecule must contain the features responsible for its physical, chemical, and biological properties, and on the ability to represent the chemical by one, or more, numerical descriptor(s). By QSAR models, the biological activity of a new or untested chemical can be inferred from the molecular structure of similar compounds whose activities have already been assessed. QSARs attempt to relate physical and chemical properties of molecules to their biological activities. For this there are so many descriptors (for example, molecular weight, number of rotatable bonds, Log P) and simple statistical methods such as Multiple Linear Regression (MLR) are used to predict a model. These models describe the activity of the data set and can predict activities for further sets of (untested) compounds. These types of descriptors are simple to calculate and allow for a relatively fast analysis. 3D-QSAR uses probe-based sampling within a molecular lattice to determine three-dimensional properties of molecules (particularly steric and electrostatic values) and can then correlate these 3D descriptors with biological activity. Physicochemical descriptors, include hydrophobicity, topology, electronic properties, and steric effects etc. These descriptors can be calculated empirically, statistically or through more recent computational methods. QSARs are currently being applied in many disciplines, with many pertaining to drug design and environmental risk assessment. Key word: QSAR, Ligand Designing, LogP, Cheminformatics, Docking.
每个分子都有自己的特点、结构和功能,当这些分子结合在一起就形成了一种化合物。分子的结构和功能是相互关联的,qsar(定量结构-活性关系)是基于分子的结构必须包含负责其物理,化学和生物特性的特征的标准,以及用一个或多个数字描述符表示化学物质的能力。通过QSAR模型,可以从已经评估过活性的类似化合物的分子结构推断出新的或未经测试的化学物质的生物活性。qsar试图将分子的物理和化学性质与其生物活性联系起来。为此,有许多描述符(例如,分子量,可旋转键数,Log P)和简单的统计方法,如多元线性回归(MLR)用于预测模型。这些模型描述了数据集的活性,并可以预测进一步的(未经测试的)化合物的活性。这些类型的描述符计算起来很简单,并且允许相对快速的分析。3D- qsar在分子晶格内使用基于探针的采样来确定分子的三维特性(特别是空间和静电值),然后可以将这些3D描述符与生物活性相关联。物理化学描述,包括疏水性,拓扑结构,电子性质和空间效应等。这些描述符可以通过经验、统计或最新的计算方法来计算。qsar目前应用于许多学科,其中许多与药物设计和环境风险评估有关。关键词:QSAR,配体设计,LogP,化学信息学,对接
{"title":"QSAR and its Role in Target-Ligand Interaction","authors":"Anamika Singh, Rajeev Singh","doi":"10.2174/1875036201307010063","DOIUrl":"https://doi.org/10.2174/1875036201307010063","url":null,"abstract":"Each molecule has its own specialty, structure and function and when these molecules are combined together they form a compound. Structure and function of a molecule are related to each other and QSARs (Quantitative Structure- Activity relationships) are based on the criteria that the structure of a molecule must contain the features responsible for its physical, chemical, and biological properties, and on the ability to represent the chemical by one, or more, numerical descriptor(s). By QSAR models, the biological activity of a new or untested chemical can be inferred from the molecular structure of similar compounds whose activities have already been assessed. QSARs attempt to relate physical and chemical properties of molecules to their biological activities. For this there are so many descriptors (for example, molecular weight, number of rotatable bonds, Log P) and simple statistical methods such as Multiple Linear Regression (MLR) are used to predict a model. These models describe the activity of the data set and can predict activities for further sets of (untested) compounds. These types of descriptors are simple to calculate and allow for a relatively fast analysis. 3D-QSAR uses probe-based sampling within a molecular lattice to determine three-dimensional properties of molecules (particularly steric and electrostatic values) and can then correlate these 3D descriptors with biological activity. Physicochemical descriptors, include hydrophobicity, topology, electronic properties, and steric effects etc. These descriptors can be calculated empirically, statistically or through more recent computational methods. QSARs are currently being applied in many disciplines, with many pertaining to drug design and environmental risk assessment. Key word: QSAR, Ligand Designing, LogP, Cheminformatics, Docking.","PeriodicalId":38956,"journal":{"name":"Open Bioinformatics Journal","volume":"7 1","pages":"63-67"},"PeriodicalIF":0.0,"publicationDate":"2013-12-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"68106654","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Statistical Methods for Overdispersion in mRNA-Seq Count Data mRNA-Seq计数数据过分散的统计方法
Q3 Computer Science Pub Date : 2013-12-13 DOI: 10.2174/1875036201307010034
Hui Zhang, S. Pounds, Li Tang
Recent developments in Next-Generation Sequencing (NGS) technologies have opened doors for ultra high throughput sequencing mRNA (mRNA-seq) of the whole transcriptome. mRNA-seq has enabled researchers to comprehensively search for underlying biological determinants of diseases and ultimately discover novel preventive and therapeutic solutions. Unfortunately, given the complexity of mRNA-seq data, data generation has outgrown current analytical capacity, hindering the pace of research in this area. Thus, there is an urgent need to develop novel statistical methodology that addresses problems related to mRNA-seq data. This review addresses the common challenge of the presence of overdispersion in mRNA count data. We review current methods for modeling overdispersion, such as negative binomial, quasi-likelihood Poisson method, and the two-stage adaptive method; introduce related statistical theories; and discuss their applications to mRNA-seq count data.
新一代测序(NGS)技术的最新发展为全转录组的超高通量测序mRNA (mRNA-seq)打开了大门。mRNA-seq使研究人员能够全面搜索疾病的潜在生物学决定因素,并最终发现新的预防和治疗解决方案。不幸的是,鉴于mRNA-seq数据的复杂性,数据生成已经超出了当前的分析能力,阻碍了这一领域的研究步伐。因此,迫切需要开发新的统计方法来解决与mRNA-seq数据相关的问题。这篇综述解决了mRNA计数数据中存在过分散的共同挑战。综述了目前过度色散建模的方法,如负二项法、准似然泊松法和两阶段自适应法;介绍相关统计理论;并讨论它们在mRNA-seq计数数据中的应用。
{"title":"Statistical Methods for Overdispersion in mRNA-Seq Count Data","authors":"Hui Zhang, S. Pounds, Li Tang","doi":"10.2174/1875036201307010034","DOIUrl":"https://doi.org/10.2174/1875036201307010034","url":null,"abstract":"Recent developments in Next-Generation Sequencing (NGS) technologies have opened doors for ultra high throughput sequencing mRNA (mRNA-seq) of the whole transcriptome. mRNA-seq has enabled researchers to comprehensively search for underlying biological determinants of diseases and ultimately discover novel preventive and therapeutic solutions. Unfortunately, given the complexity of mRNA-seq data, data generation has outgrown current analytical capacity, hindering the pace of research in this area. Thus, there is an urgent need to develop novel statistical methodology that addresses problems related to mRNA-seq data. This review addresses the common challenge of the presence of overdispersion in mRNA count data. We review current methods for modeling overdispersion, such as negative binomial, quasi-likelihood Poisson method, and the two-stage adaptive method; introduce related statistical theories; and discuss their applications to mRNA-seq count data.","PeriodicalId":38956,"journal":{"name":"Open Bioinformatics Journal","volume":"60 1","pages":"34-40"},"PeriodicalIF":0.0,"publicationDate":"2013-12-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"68106753","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
A Short Survey on Genetic Sequences, Chou’s Pseudo Amino Acid Composition and its Combination with Fuzzy Set Theory 基因序列、周氏伪氨基酸组成及其与模糊集合理论的结合综述
Q3 Computer Science Pub Date : 2013-12-13 DOI: 10.2174/1875036201307010041
D. Georgiou, T. Karakasidis, A. Megaritis
The study of genetic sequences is of great importance in biology and medicine. Sequence analysis and taxonomy are two major fields of application of bioinformatics. In this survey, we present results concerning genetic sequences and Chou's pseudo amino acid composition as well as methodologies developed based on this concept along with elements of fuzzy set theory, and emphasize on fuzzy clustering and its application in analysis of genetic sequences.
基因序列的研究在生物学和医学中具有重要意义。序列分析和分类学是生物信息学的两个主要应用领域。在本文中,我们介绍了有关基因序列和Chou的伪氨基酸组成的研究成果,以及基于模糊集理论的概念和元素开发的方法,并重点介绍了模糊聚类及其在基因序列分析中的应用。
{"title":"A Short Survey on Genetic Sequences, Chou’s Pseudo Amino Acid Composition and its Combination with Fuzzy Set Theory","authors":"D. Georgiou, T. Karakasidis, A. Megaritis","doi":"10.2174/1875036201307010041","DOIUrl":"https://doi.org/10.2174/1875036201307010041","url":null,"abstract":"The study of genetic sequences is of great importance in biology and medicine. Sequence analysis and taxonomy are two major fields of application of bioinformatics. In this survey, we present results concerning genetic sequences and Chou's pseudo amino acid composition as well as methodologies developed based on this concept along with elements of fuzzy set theory, and emphasize on fuzzy clustering and its application in analysis of genetic sequences.","PeriodicalId":38956,"journal":{"name":"Open Bioinformatics Journal","volume":"7 1","pages":"41-48"},"PeriodicalIF":0.0,"publicationDate":"2013-12-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"68106240","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 57
Artificial Neural Network in Drug Delivery and Pharmaceutical Research 人工神经网络在药物传递和药物研究中的应用
Q3 Computer Science Pub Date : 2013-12-13 DOI: 10.2174/1875036201307010049
V. Sutariya, A. Groshev, Prabodh Sadana, D. Bhatia, Y. Pathak
Artificial neural networks (ANNs) technology models the pattern recognition capabilities of the neural networks of the brain. Similarly to a single neuron in the brain, artificial neuron unit receives inputs from many external sources, processes them, and makes decisions. Interestingly, ANN simulates the biological nervous system and draws on analogues of adaptive biological neurons. ANNs do not require rigidly structured experimental designs and can map functions using historical or incomplete data, which makes them a powerful tool for simulation of various non-linear systems.ANNs have many applications in various fields, including engineering, psychology, medicinal chemistry and pharmaceutical research. Because of their capacity for making predictions, pattern recognition, and modeling, ANNs have been very useful in many aspects of pharmaceutical research including modeling of the brain neural network, analytical data analysis, drug modeling, protein structure and function, dosage optimization and manufacturing, pharmacokinetics and pharmacodynamics modeling, and in vitro in vivo correlations. This review discusses the applications of ANNs in drug delivery and pharmacological research.
人工神经网络(ANNs)技术对大脑神经网络的模式识别能力进行建模。与大脑中的单个神经元类似,人工神经元单元接收来自许多外部来源的输入,对其进行处理,并做出决定。有趣的是,人工神经网络模拟了生物神经系统,并利用了自适应生物神经元的类似物。人工神经网络不需要严格结构化的实验设计,并且可以使用历史或不完整的数据来映射函数,这使它们成为模拟各种非线性系统的强大工具。人工神经网络在各个领域都有广泛的应用,包括工程、心理学、药物化学和药物研究。由于具有预测、模式识别和建模的能力,人工神经网络在药物研究的许多方面都非常有用,包括脑神经网络建模、分析数据分析、药物建模、蛋白质结构和功能、剂量优化和制造、药代动力学和药效学建模以及体外体内相关性。本文综述了人工神经网络在药物传递和药理研究中的应用。
{"title":"Artificial Neural Network in Drug Delivery and Pharmaceutical Research","authors":"V. Sutariya, A. Groshev, Prabodh Sadana, D. Bhatia, Y. Pathak","doi":"10.2174/1875036201307010049","DOIUrl":"https://doi.org/10.2174/1875036201307010049","url":null,"abstract":"Artificial neural networks (ANNs) technology models the pattern recognition capabilities of the neural networks of the brain. Similarly to a single neuron in the brain, artificial neuron unit receives inputs from many external sources, processes them, and makes decisions. Interestingly, ANN simulates the biological nervous system and draws on analogues of adaptive biological neurons. ANNs do not require rigidly structured experimental designs and can map functions using historical or incomplete data, which makes them a powerful tool for simulation of various non-linear systems.ANNs have many applications in various fields, including engineering, psychology, medicinal chemistry and pharmaceutical research. Because of their capacity for making predictions, pattern recognition, and modeling, ANNs have been very useful in many aspects of pharmaceutical research including modeling of the brain neural network, analytical data analysis, drug modeling, protein structure and function, dosage optimization and manufacturing, pharmacokinetics and pharmacodynamics modeling, and in vitro in vivo correlations. This review discusses the applications of ANNs in drug delivery and pharmacological research.","PeriodicalId":38956,"journal":{"name":"Open Bioinformatics Journal","volume":"7 1","pages":"49-62"},"PeriodicalIF":0.0,"publicationDate":"2013-12-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"68106249","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 50
Editorial: Bioinformatics Algorithms and Genomics 编辑:生物信息学算法和基因组学
Q3 Computer Science Pub Date : 2013-12-13 DOI: 10.2174/1875036201307010025
Feng Cheng
{"title":"Editorial: Bioinformatics Algorithms and Genomics","authors":"Feng Cheng","doi":"10.2174/1875036201307010025","DOIUrl":"https://doi.org/10.2174/1875036201307010025","url":null,"abstract":"","PeriodicalId":38956,"journal":{"name":"Open Bioinformatics Journal","volume":"18 1","pages":"25-26"},"PeriodicalIF":0.0,"publicationDate":"2013-12-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"68106696","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Genetic Studies: The Linear Mixed Models in Genome-wide Association Studies 遗传研究:全基因组关联研究中的线性混合模型
Q3 Computer Science Pub Date : 2013-12-13 DOI: 10.2174/1875036201307010027
Gengxin Li, Hongjiang Zhu
With the availability of high-density genomic data containing millions of single nucleotide polymorphisms and tens or hundreds of thousands of individuals, genetic association study is likely to identify the variants contributing to complex traits in a genome-wide scale. However, genome-wide association studies are confounded by some spurious associations due to not properly interpreting sample structure (containing population structure, family structure and cryptic relatedness). The absence of complete genealogy of population in the genome-wide association studies model greatly motivates the development of new methods to correct the inflation of false positive. In this process, linear mixed model based approaches with the advantage of capturing multilevel relatedness have gained large ground. We summarize current literatures dealing with sample structure, and our review focuses on the following four areas: (i) The approaches handling population structure in genome-wide association studies; (ii) The linear mixed model based approaches in genome-wide association studies; (iii) The performance of linear mixed model based approaches in genome-wide association studies and (iv) The unsolved issues and future work of linear mixed model based approaches.
随着包含数百万个单核苷酸多态性和数万或数十万个个体的高密度基因组数据的可用性,遗传关联研究有可能在全基因组范围内识别导致复杂性状的变异。然而,全基因组关联研究由于不能正确解释样本结构(包括种群结构、家族结构和隐性亲缘关系)而被一些虚假关联所混淆。全基因组关联研究模型中缺乏完整的群体谱系,这极大地推动了纠正假阳性膨胀的新方法的发展。在此过程中,基于线性混合模型的方法以其捕获多层次相关性的优点获得了广泛的应用。本文对目前研究样本结构的文献进行了总结,并着重从以下四个方面进行了综述:(1)全基因组关联研究中处理群体结构的方法;(ii)基于线性混合模型的全基因组关联研究方法;(iii)基于线性混合模型的方法在全基因组关联研究中的表现;(iv)基于线性混合模型的方法尚未解决的问题和未来的工作。
{"title":"Genetic Studies: The Linear Mixed Models in Genome-wide Association Studies","authors":"Gengxin Li, Hongjiang Zhu","doi":"10.2174/1875036201307010027","DOIUrl":"https://doi.org/10.2174/1875036201307010027","url":null,"abstract":"With the availability of high-density genomic data containing millions of single nucleotide polymorphisms and tens or hundreds of thousands of individuals, genetic association study is likely to identify the variants contributing to complex traits in a genome-wide scale. However, genome-wide association studies are confounded by some spurious associations due to not properly interpreting sample structure (containing population structure, family structure and cryptic relatedness). The absence of complete genealogy of population in the genome-wide association studies model greatly motivates the development of new methods to correct the inflation of false positive. In this process, linear mixed model based approaches with the advantage of capturing multilevel relatedness have gained large ground. We summarize current literatures dealing with sample structure, and our review focuses on the following four areas: (i) The approaches handling population structure in genome-wide association studies; (ii) The linear mixed model based approaches in genome-wide association studies; (iii) The performance of linear mixed model based approaches in genome-wide association studies and (iv) The unsolved issues and future work of linear mixed model based approaches.","PeriodicalId":38956,"journal":{"name":"Open Bioinformatics Journal","volume":"7 1","pages":"27-33"},"PeriodicalIF":0.0,"publicationDate":"2013-12-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"68106735","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 22
Haplotype Classification Using Copy Number Variation and Principal Components Analysis 利用拷贝数变异和主成分分析进行单倍型分类
Q3 Computer Science Pub Date : 2013-11-29 DOI: 10.2174/1875036201307010019
K. Blighe
Elaborate downstream methods are required to analyze large microarray data-sets. At times, where the end goal is to look for relationships between (or patterns within) different subgroups or even just individual samples, large data-sets must first be filtered using statistical thresholds in order to reduce their overall volume. As an example, in anthropological microarray studies, such 'dimension reduction' techniques are essential to elucidate any links between polymorphisms and phenotypes for given populations. In such large data-sets, a subset can first be taken to represent the larger data-set. For example, polling results taken during elections are used to infer the opinions of the population at large. However, what is the best and easiest method of capturing a sub-set of variation in a data-set that can represent the overall portrait of variation? In this article, principal components analysis (PCA) is discussed in detail, including its history, the mathematics behind the process, and in which ways it can be applied to modern large-scale biological datasets. New methods of analysis using PCA are also suggested, with tentative results outlined.
需要详细的下游方法来分析大型微阵列数据集。有时,如果最终目标是寻找不同子组之间的关系(或模式之间的关系),甚至只是单个样本,则必须首先使用统计阈值过滤大型数据集,以减少它们的总体容量。例如,在人类学微阵列研究中,这种“降维”技术对于阐明特定人群的多态性和表型之间的任何联系至关重要。在这样大的数据集中,首先可以取一个子集来表示更大的数据集。例如,在选举期间进行的民意调查结果被用来推断广大民众的意见。然而,什么是最好的和最简单的方法来捕获数据集中的一个子集的变化,可以代表变化的整体肖像?在本文中,详细讨论了主成分分析(PCA),包括其历史,过程背后的数学,以及它可以应用于现代大规模生物数据集的方式。本文还提出了新的PCA分析方法,并概述了初步结果。
{"title":"Haplotype Classification Using Copy Number Variation and Principal Components Analysis","authors":"K. Blighe","doi":"10.2174/1875036201307010019","DOIUrl":"https://doi.org/10.2174/1875036201307010019","url":null,"abstract":"Elaborate downstream methods are required to analyze large microarray data-sets. At times, where the end goal is to look for relationships between (or patterns within) different subgroups or even just individual samples, large data-sets must first be filtered using statistical thresholds in order to reduce their overall volume. As an example, in anthropological microarray studies, such 'dimension reduction' techniques are essential to elucidate any links between polymorphisms and phenotypes for given populations. In such large data-sets, a subset can first be taken to represent the larger data-set. For example, polling results taken during elections are used to infer the opinions of the population at large. However, what is the best and easiest method of capturing a sub-set of variation in a data-set that can represent the overall portrait of variation? In this article, principal components analysis (PCA) is discussed in detail, including its history, the mathematics behind the process, and in which ways it can be applied to modern large-scale biological datasets. New methods of analysis using PCA are also suggested, with tentative results outlined.","PeriodicalId":38956,"journal":{"name":"Open Bioinformatics Journal","volume":"7 1","pages":"19-24"},"PeriodicalIF":0.0,"publicationDate":"2013-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"68106651","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Community annotation and the evolution of cooperation: How patience matters 社区注释与合作的演变:耐心的重要性
Q3 Computer Science Pub Date : 2013-07-26 DOI: 10.2174/1875036201307010009
A. Basuchoudhary, Vahan Simoyan, R. Mazumder
We investigate why biologists fail to contribute to biological databases although almost all of them use these databases for research. We find, using evolutionary game theory and computer simulations, that (a) the initial distribution of contributors who are patient determines whether a culture of contribution will prevail or not (b) institutions (where institution means "a significant practice, relationship, or organization in a society or culture") that incentivize patience and therefore limit free riding make contribution more likely and, (c) a stable institution, whether it incentivizes patience or not, will increase contribution. As a result we suggest there is a trade-off between the benefits of changing institutions to incentivize patience and the costs of the change itself. Moreover, even if it is possible to create institutions that incentivize patience among scientists such institutions may nevertheless fail. We create a computer simulation of a population of biologists based on our theory. These simulations suggest that institutions should focus more on rewards rather than penalties to incentivize a culture of contribution. Our approach therefore provides a methodology for developing a practical blueprint for organizing scientists to encourage cooperation and maximizing scientific output.
我们调查了为什么生物学家不能为生物数据库做出贡献,尽管他们几乎所有人都使用这些数据库进行研究。我们发现,利用进化博弈论和计算机模拟,(a)有耐心的贡献者的初始分布决定了一种贡献文化是否会盛行;(b)激励耐心并因此限制搭便车的制度(制度指的是“社会或文化中的重要实践、关系或组织”)更有可能做出贡献;(c)一个稳定的制度,无论它是否激励耐心。将增加贡献。因此,我们认为,在改变制度以激励耐心的好处与改变本身的成本之间存在权衡。此外,即使有可能建立激励科学家耐心的制度,这些制度也可能失败。我们根据我们的理论创建了一个生物学家群体的计算机模拟。这些模拟表明,机构应该更多地关注奖励,而不是惩罚,以激励贡献文化。因此,我们的方法提供了一种方法,可以为组织科学家鼓励合作和最大化科学产出制定实用蓝图。
{"title":"Community annotation and the evolution of cooperation: How patience matters","authors":"A. Basuchoudhary, Vahan Simoyan, R. Mazumder","doi":"10.2174/1875036201307010009","DOIUrl":"https://doi.org/10.2174/1875036201307010009","url":null,"abstract":"We investigate why biologists fail to contribute to biological databases although almost all of them use these databases for research. We find, using evolutionary game theory and computer simulations, that (a) the initial distribution of contributors who are patient determines whether a culture of contribution will prevail or not (b) institutions (where institution means \"a significant practice, relationship, or organization in a society or culture\") that incentivize patience and therefore limit free riding make contribution more likely and, (c) a stable institution, whether it incentivizes patience or not, will increase contribution. As a result we suggest there is a trade-off between the benefits of changing institutions to incentivize patience and the costs of the change itself. Moreover, even if it is possible to create institutions that incentivize patience among scientists such institutions may nevertheless fail. We create a computer simulation of a population of biologists based on our theory. These simulations suggest that institutions should focus more on rewards rather than penalties to incentivize a culture of contribution. Our approach therefore provides a methodology for developing a practical blueprint for organizing scientists to encourage cooperation and maximizing scientific output.","PeriodicalId":38956,"journal":{"name":"Open Bioinformatics Journal","volume":"7 1","pages":"9-18"},"PeriodicalIF":0.0,"publicationDate":"2013-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"68106642","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Comparison of Sequencing Utility Programs 排序实用程序的比较
Q3 Computer Science Pub Date : 2013-01-31 DOI: 10.2174/1875036201307010001
Erik Aronesty
High throughput sequencing (HTS) has resulted in extreme growth rates of sequencing data. At our lab, we generate terabytes of data every day. It is usually seen as required for data output to be "cleaned" and processed in various ways prior to use for common tasks such as variant calling, expression quantification and assembly. Two common tasks associated with HTS are adapter trimming and paired-end joining. I have developed two tools at Expression Analysis, Inc. to address these common tasks. The names of these programs are fastq-mcf and fastq-join. I compared the performance of these tools to similar open-source utilities, both in terms of resource efficiency, and effectiveness.
高通量测序(HTS)导致了测序数据的急剧增长。在我们的实验室,我们每天生成数tb的数据。通常认为,数据输出在用于诸如变量调用、表达式量化和汇编等常见任务之前,需要以各种方式进行“清理”和处理。与HTS相关的两个常见任务是适配器修剪和对端连接。我在Expression Analysis, Inc.开发了两个工具来处理这些常见任务。这些程序的名称是fastq-mcf和fastq-join。我将这些工具的性能与类似的开源实用程序进行了比较,包括资源效率和有效性。
{"title":"Comparison of Sequencing Utility Programs","authors":"Erik Aronesty","doi":"10.2174/1875036201307010001","DOIUrl":"https://doi.org/10.2174/1875036201307010001","url":null,"abstract":"High throughput sequencing (HTS) has resulted in extreme growth rates of sequencing data. At our lab, we generate terabytes of data every day. It is usually seen as required for data output to be \"cleaned\" and processed in various ways prior to use for common tasks such as variant calling, expression quantification and assembly. Two common tasks associated with HTS are adapter trimming and paired-end joining. I have developed two tools at Expression Analysis, Inc. to address these common tasks. The names of these programs are fastq-mcf and fastq-join. I compared the performance of these tools to similar open-source utilities, both in terms of resource efficiency, and effectiveness.","PeriodicalId":38956,"journal":{"name":"Open Bioinformatics Journal","volume":"7 1","pages":"1-8"},"PeriodicalIF":0.0,"publicationDate":"2013-01-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"68106525","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 930
Primer1: Primer Design Web Service for Tetra-Primer ARMS-PCR Primer1:引物设计Web服务
Q3 Computer Science Pub Date : 2012-11-30 DOI: 10.2174/1875036201206010055
A. Collins, X. Ke
Tetra-primer ARMS-PCR is used extensively as a low cost, single PCR assay requiring no post-PCR manipulation. The design of successful primers depends on a number of variables such as melting temperatures, GC content, complementarity and selection of mismatch bases. The optimal selection of primers can be achieved in an automated way using a program which evaluates candidate primers for a given sequence. The Primer1 software was developed originally for use in the context of restriction fragment length polymorphism analysis using gel electrophoresis. However, recent applications have been more diverse, reviewed here, and we present an overview of the Primer1 software for primer design and web-service. We have updated the Primer1 program, and provide more complete details of the implementation. We also provide test data and output. The program is now available on a new, efficient, LAMP web service for users at: http://primer1.soton.ac.uk/primer1.html
四引物ARMS-PCR作为一种低成本的单次PCR检测方法被广泛使用,不需要PCR后操作。成功引物的设计取决于许多变量,如熔融温度、GC含量、互补性和错配碱基的选择。引物的最佳选择可以用一个程序来自动地实现,该程序对给定序列的候选引物进行评估。Primer1软件最初是用于凝胶电泳的限制性内切片段长度多态性分析。然而,最近的应用程序已经更加多样化,在这里回顾,我们提出了Primer1软件的概述,用于引物设计和网络服务。我们已经更新了Primer1程序,并提供了更完整的实现细节。我们也提供测试数据和输出。该程序现在可以在一个新的、高效的LAMP网络服务上使用:http://primer1.soton.ac.uk/primer1.html
{"title":"Primer1: Primer Design Web Service for Tetra-Primer ARMS-PCR","authors":"A. Collins, X. Ke","doi":"10.2174/1875036201206010055","DOIUrl":"https://doi.org/10.2174/1875036201206010055","url":null,"abstract":"Tetra-primer ARMS-PCR is used extensively as a low cost, single PCR assay requiring no post-PCR manipulation. The design of successful primers depends on a number of variables such as melting temperatures, GC content, complementarity and selection of mismatch bases. The optimal selection of primers can be achieved in an automated way using a program which evaluates candidate primers for a given sequence. The Primer1 software was developed originally for use in the context of restriction fragment length polymorphism analysis using gel electrophoresis. However, recent applications have been more diverse, reviewed here, and we present an overview of the Primer1 software for primer design and web-service. We have updated the Primer1 program, and provide more complete details of the implementation. We also provide test data and output. The program is now available on a new, efficient, LAMP web service for users at: http://primer1.soton.ac.uk/primer1.html","PeriodicalId":38956,"journal":{"name":"Open Bioinformatics Journal","volume":"6 1","pages":"55-58"},"PeriodicalIF":0.0,"publicationDate":"2012-11-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"68106515","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 112
期刊
Open Bioinformatics Journal
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1