首页 > 最新文献

Biostatistics, bioinformatics and biomathematics最新文献

英文 中文
An improvement of the 2ˆ(-delta delta CT) method for quantitative real-time polymerase chain reaction data analysis. 改进了用于定量实时聚合酶链反应数据分析的2°(- δ δ CT)方法。
Xiayu Rao, Xuelin Huang, Zhicheng Zhou, Xin Lin

Background: The 2-ΔΔCT method has been extensively used as a relative quantification strategy for quantitative real-time polymerase chain reaction (qPCR) data analysis. This method is a convenient way to calculate relative gene expression levels between different samples in that it directly uses the threshold cycles (CTs) generated by the qPCR system for calculation. However, this approach relies heavily on an invalid assumption of 100% PCR amplification efficiency across all samples. In addition, the 2-ΔΔCT method is applied to data with automatic removal of background fluorescence by the qPCR software. Since the background fluorescence is unknown, subtracting an inaccurate background can lead to distortion of the results. To address these problems, we present an improved method, the individual efficiency corrected calculation.

Results: Our method takes into account the PCR efficiency of each individual sample. In addition, it eliminates the need for background fluorescence estimation or subtraction because the background can be cancelled out using the differencing strategy. The DNA amount for a certain gene and the relative DNA amount among different samples estimated using our method were closer to the true values compared to the results of the 2-ΔΔCT method.

Conclusions: The improved method, the individual efficiency corrected calculation, produces more accurate estimates in relative gene expression than the 2-ΔΔCT method and is thus a better way to calculate relative gene expression.

背景:2-ΔΔCT方法已被广泛用作定量实时聚合酶链反应(qPCR)数据分析的相对定量策略。该方法直接使用qPCR系统产生的阈值循环(ct)进行计算,是计算不同样品间相对基因表达水平的方便方法。然而,这种方法严重依赖于所有样本100% PCR扩增效率的无效假设。此外,2-ΔΔCT方法应用于qPCR软件自动去除背景荧光的数据。由于背景荧光是未知的,减去不准确的背景会导致结果失真。针对这些问题,我们提出了一种改进的方法——个体效率修正计算。结果:我们的方法考虑了每个样品的PCR效率。此外,它消除了对背景荧光估计或减法的需要,因为可以使用差分策略抵消背景。与2-ΔΔCT方法相比,我们方法估计的某一基因的DNA量和不同样品间的相对DNA量更接近真实值。结论:改进后的个体效率校正计算法比2-ΔΔCT法更准确地估计了基因相对表达量,是一种更好的计算基因相对表达量的方法。
{"title":"An improvement of the 2ˆ(-delta delta CT) method for quantitative real-time polymerase chain reaction data analysis.","authors":"Xiayu Rao,&nbsp;Xuelin Huang,&nbsp;Zhicheng Zhou,&nbsp;Xin Lin","doi":"","DOIUrl":"","url":null,"abstract":"<p><strong>Background: </strong>The 2<sup>-ΔΔ</sup><i><sup>CT</sup></i> method has been extensively used as a relative quantification strategy for quantitative real-time polymerase chain reaction (qPCR) data analysis. This method is a convenient way to calculate relative gene expression levels between different samples in that it directly uses the threshold cycles (<i>CTs</i>) generated by the qPCR system for calculation. However, this approach relies heavily on an invalid assumption of 100% PCR amplification efficiency across all samples. In addition, the 2<sup>-ΔΔ</sup><i><sup>CT</sup></i> method is applied to data with automatic removal of background fluorescence by the qPCR software. Since the background fluorescence is unknown, subtracting an inaccurate background can lead to distortion of the results. To address these problems, we present an improved method, the individual efficiency corrected calculation.</p><p><strong>Results: </strong>Our method takes into account the PCR efficiency of each individual sample. In addition, it eliminates the need for background fluorescence estimation or subtraction because the background can be cancelled out using the differencing strategy. The DNA amount for a certain gene and the relative DNA amount among different samples estimated using our method were closer to the true values compared to the results of the 2<sup>-ΔΔ</sup><i><sup>CT</sup></i> method.</p><p><strong>Conclusions: </strong>The improved method, the individual efficiency corrected calculation, produces more accurate estimates in relative gene expression than the 2<sup>-ΔΔ</sup><i><sup>CT</sup></i> method and is thus a better way to calculate relative gene expression.</p>","PeriodicalId":90456,"journal":{"name":"Biostatistics, bioinformatics and biomathematics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2013-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4280562/pdf/nihms633016.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"32949433","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Gene Selection with Sequential Classification and Regression Tree Algorithm. 序列分类与回归树算法的基因选择。
Caleb D Bastian, Grzegorz A Rempala

Background: In the typical setting of gene-selection problems from high-dimensional data, e.g., gene expression data from microarray or next-generation sequencing-based technologies, an enormous volume of high-throughput data is generated, and there is often a need for a simple, computationally-inexpensive, non-parametric screening procedure than can quickly and accurately find a low-dimensional variable subset that preserves biological information from the original very high-dimensional data (dimension p > 40,000). This is in contrast to the very sophisticated variable selection methods that are computationally expensive, need pre-processing routines, and often require calibration of priors.

Results: We present a tree-based sequential CART (S-CART) approach to variable selection in the binary classification setting and compare it against the more sophisticated procedures using simulated and real biological data. In simulated data, we analyze S-CART performance versus (i) a random forest (RF), (ii) a fully-parametric Bayesian stochastic search variable selection (SSVS), and (iii) the moderated t-test statistic from the LIMMA package in R. The simulation study is based on a hierarchical Bayesian model, where dataset dimensionality, percentage of significant variables, and substructure via dependency vary. Selection efficacy is measured through false-discovery and missed-discovery rates. In all scenarios, the S-CART method is seen to consistently outperform SSVS and RF in both speed and detection accuracy. We demonstrate the utility of the S-CART technique both on simulated data and in a control-treatment mouse study. We show that the network analysis based on the S-CART-selected gene subset in essence recapitulates the biological findings of the study using only a fraction of the original set of genes considered in the study's analysis.

Conclusions: The relatively simple-minded gene selection algorithms like S-CART may often in practical circumstances be preferred over much more sophisticated ones. The advantage of the "greedy" selection methods utilized by S-CART and the likes is that they scale well with the problem size and require virtually no tuning or training while remaining efficient in extracting the relevant information from microarray-like datasets containing large number of redundant or irrelevant variables.

Availability: The MATLAB 7.4b code for the S-CART implementation is available for download from https://neyman.mcg.edu/posts/scart.zip.

背景:在高维数据的基因选择问题的典型设置中,例如,来自微阵列或基于下一代测序技术的基因表达数据,产生了大量高通量数据,并且通常需要一种简单的,计算上便宜的,非参数筛选程序可以快速准确地从原始的非常高维数据(维度p > 40,000)中找到保留生物信息的低维变量子集。这与非常复杂的变量选择方法形成对比,后者计算成本高,需要预处理例程,并且通常需要校准先验。结果:我们提出了一种基于树的顺序CART (S-CART)方法来进行二元分类设置中的变量选择,并将其与使用模拟和真实生物学数据的更复杂的程序进行比较。在模拟数据中,我们分析了S-CART性能与(i)随机森林(RF), (ii)全参数贝叶斯随机搜索变量选择(SSVS),以及(iii) r中LIMMA软件包的缓和t检验统计量的关系。模拟研究基于分层贝叶斯模型,其中数据集维度,显著变量百分比和依赖关系的子结构各不相同。选择效率是通过错误发现率和未发现率来衡量的。在所有情况下,S-CART方法在速度和检测精度方面都优于SSVS和RF。我们展示了S-CART技术在模拟数据和对照处理小鼠研究中的实用性。我们表明,基于s - cart选择的基因子集的网络分析本质上概括了该研究的生物学发现,仅使用了研究分析中考虑的原始基因集的一小部分。结论:相对简单的基因选择算法,如S-CART,在实际情况下可能比更复杂的算法更受欢迎。S-CART等使用的“贪婪”选择方法的优点是,它们可以很好地随问题规模扩展,几乎不需要调整或训练,同时在从包含大量冗余或不相关变量的微阵列类数据集中提取相关信息方面保持高效。可用性:S-CART实现的MATLAB 7.4b代码可从https://neyman.mcg.edu/posts/scart.zip下载。
{"title":"Gene Selection with Sequential Classification and Regression Tree Algorithm.","authors":"Caleb D Bastian,&nbsp;Grzegorz A Rempala","doi":"","DOIUrl":"","url":null,"abstract":"<p><strong>Background: </strong>In the typical setting of gene-selection problems from high-dimensional data, e.g., gene expression data from microarray or next-generation sequencing-based technologies, an enormous volume of high-throughput data is generated, and there is often a need for a simple, computationally-inexpensive, non-parametric screening procedure than can quickly and accurately find a low-dimensional variable subset that preserves biological information from the original very high-dimensional data (dimension <i>p</i> > 40,000). This is in contrast to the very sophisticated variable selection methods that are computationally expensive, need pre-processing routines, and often require calibration of priors.</p><p><strong>Results: </strong>We present a tree-based sequential CART (S-CART) approach to variable selection in the binary classification setting and compare it against the more sophisticated procedures using simulated and real biological data. In simulated data, we analyze S-CART performance versus (i) a random forest (RF), (ii) a fully-parametric Bayesian stochastic search variable selection (SSVS), and (iii) the moderated <i>t</i>-test statistic from the LIMMA package in R. The simulation study is based on a hierarchical Bayesian model, where dataset dimensionality, percentage of significant variables, and substructure via dependency vary. Selection efficacy is measured through false-discovery and missed-discovery rates. In all scenarios, the S-CART method is seen to consistently outperform SSVS and RF in both speed and detection accuracy. We demonstrate the utility of the S-CART technique both on simulated data and in a control-treatment mouse study. We show that the network analysis based on the S-CART-selected gene subset in essence recapitulates the biological findings of the study using only a fraction of the original set of genes considered in the study's analysis.</p><p><strong>Conclusions: </strong>The relatively simple-minded gene selection algorithms like S-CART may often in practical circumstances be preferred over much more sophisticated ones. The advantage of the \"greedy\" selection methods utilized by S-CART and the likes is that they scale well with the problem size and require virtually no tuning or training while remaining efficient in extracting the relevant information from microarray-like datasets containing large number of redundant or irrelevant variables.</p><p><strong>Availability: </strong>The MATLAB 7.4b code for the S-CART implementation is available for download from https://neyman.mcg.edu/posts/scart.zip.</p>","PeriodicalId":90456,"journal":{"name":"Biostatistics, bioinformatics and biomathematics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2011-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4214923/pdf/nihms376173.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"32787446","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Biostatistics, bioinformatics and biomathematics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1