{"title":"amIcompositional: Simple Tests for Compositional Behaviour of High Throughput Data with Common Transformations","authors":"G. Gloor","doi":"10.17713/ajs.v52i4.1617","DOIUrl":null,"url":null,"abstract":"\n\n\nCompositional approaches are beginning to permeate high throughput biomedical sciences in the areas of microbiome, genomics, transcriptomics and proteomics. Yet non-compositional approaches are still commonly observed. Non-compositional approaches are particularly problematic in network analysis based on correlation, ordination and exploratory data analysis based on distance, and differential abundance analysis based on normalization. Here we describe the aIc R package, a simple tool that answers the fundamental question: does the dataset or normalization exhibit compositional artefacts that will skew interpretations when analyzing high throughput biomedical data? The aIc R package includes options for several of the most widely used normalizations and filtering methods. The R package includes tests for subcompositional dominance and coherence along with perturbation and scale invariance. Exploratory analysis is facilitated by an R Shiny app that makes the process simple for those not wishing to use an R console. This simple approach will allow research groups to acknowledge and account for potential artefacts in data analysis resulting in more robust and reliable inferences.\n\n\n","PeriodicalId":51761,"journal":{"name":"Austrian Journal of Statistics","volume":"40 1","pages":""},"PeriodicalIF":0.6000,"publicationDate":"2023-07-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Austrian Journal of Statistics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.17713/ajs.v52i4.1617","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"STATISTICS & PROBABILITY","Score":null,"Total":0}
引用次数: 1
Abstract
Compositional approaches are beginning to permeate high throughput biomedical sciences in the areas of microbiome, genomics, transcriptomics and proteomics. Yet non-compositional approaches are still commonly observed. Non-compositional approaches are particularly problematic in network analysis based on correlation, ordination and exploratory data analysis based on distance, and differential abundance analysis based on normalization. Here we describe the aIc R package, a simple tool that answers the fundamental question: does the dataset or normalization exhibit compositional artefacts that will skew interpretations when analyzing high throughput biomedical data? The aIc R package includes options for several of the most widely used normalizations and filtering methods. The R package includes tests for subcompositional dominance and coherence along with perturbation and scale invariance. Exploratory analysis is facilitated by an R Shiny app that makes the process simple for those not wishing to use an R console. This simple approach will allow research groups to acknowledge and account for potential artefacts in data analysis resulting in more robust and reliable inferences.
期刊介绍:
The Austrian Journal of Statistics is an open-access journal (without any fees) with a long history and is published approximately quarterly by the Austrian Statistical Society. Its general objective is to promote and extend the use of statistical methods in all kind of theoretical and applied disciplines. The Austrian Journal of Statistics is indexed in many data bases, such as Scopus (by Elsevier), Web of Science - ESCI by Clarivate Analytics (formely Thompson & Reuters), DOAJ, Scimago, and many more. The current estimated impact factor (via Publish or Perish) is 0.775, see HERE, or even more indices HERE. Austrian Journal of Statistics ISNN number is 1026597X Original papers and review articles in English will be published in the Austrian Journal of Statistics if judged consistently with these general aims. All papers will be refereed. Special topics sections will appear from time to time. Each section will have as a theme a specialized area of statistical application, theory, or methodology. Technical notes or problems for considerations under Shorter Communications are also invited. A special section is reserved for book reviews.