{"title":"PPlot是一个web应用程序,用于划分地球化学数据并使用概率图建模分离混合亚种群","authors":"Francisco Campos, Otávio Licht, Nivaldo Campos","doi":"10.21715/gb2358-2812.202337002","DOIUrl":null,"url":null,"abstract":"Statistical methods are mostly designed to handle datasets comprising statistically single normal or log-normal populations, but geochemical and geophysical surveys usually deviate from this expectation. A reason for this is the heterogeneity in the occurrence of geological objects, so the complete dataset may correspond to multiple mixed subpopulations. Specifically, multiple mixed subpopulations can refer to differences between mineralized and barren areas, different geochemical facies of a geological unit, or contaminated and healthy areas. This implies a restriction on using classical or even robust statistical estimates, unless the underlying subpopulations can be extracted from the dataset. The probability plot can be used to assess a dataset and to infer a possible combination of subpopulations, either normal or log-normal, whose combination may generate it. The web-based app PPlot, presented in this paper, allows the plotting of the probability plot of a dataset and modeling the underlying subpopulations present in it, either automatically or manually. After modeling the dataset by the application, the user will obtain numerical results and plots of the range of values that delimit each subpopulation, as well as the mean and standard deviation for each of them. Computer-generated and real datasets were used to validate the procedure and coding, and an example of usage is presented. The app was developed using HTML5 and JavaScript and it runs in any modern browser, and is freely available in https://pplotweb.firebaseapp.com/.","PeriodicalId":34597,"journal":{"name":"Geochimica Brasiliensis","volume":"7 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"PPlot, a webapp to partition geochemical data and isolate mixed subpopulations using probability plot modeling\",\"authors\":\"Francisco Campos, Otávio Licht, Nivaldo Campos\",\"doi\":\"10.21715/gb2358-2812.202337002\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Statistical methods are mostly designed to handle datasets comprising statistically single normal or log-normal populations, but geochemical and geophysical surveys usually deviate from this expectation. A reason for this is the heterogeneity in the occurrence of geological objects, so the complete dataset may correspond to multiple mixed subpopulations. Specifically, multiple mixed subpopulations can refer to differences between mineralized and barren areas, different geochemical facies of a geological unit, or contaminated and healthy areas. This implies a restriction on using classical or even robust statistical estimates, unless the underlying subpopulations can be extracted from the dataset. The probability plot can be used to assess a dataset and to infer a possible combination of subpopulations, either normal or log-normal, whose combination may generate it. The web-based app PPlot, presented in this paper, allows the plotting of the probability plot of a dataset and modeling the underlying subpopulations present in it, either automatically or manually. After modeling the dataset by the application, the user will obtain numerical results and plots of the range of values that delimit each subpopulation, as well as the mean and standard deviation for each of them. Computer-generated and real datasets were used to validate the procedure and coding, and an example of usage is presented. The app was developed using HTML5 and JavaScript and it runs in any modern browser, and is freely available in https://pplotweb.firebaseapp.com/.\",\"PeriodicalId\":34597,\"journal\":{\"name\":\"Geochimica Brasiliensis\",\"volume\":\"7 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-06-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Geochimica Brasiliensis\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.21715/gb2358-2812.202337002\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Geochimica Brasiliensis","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.21715/gb2358-2812.202337002","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
PPlot, a webapp to partition geochemical data and isolate mixed subpopulations using probability plot modeling
Statistical methods are mostly designed to handle datasets comprising statistically single normal or log-normal populations, but geochemical and geophysical surveys usually deviate from this expectation. A reason for this is the heterogeneity in the occurrence of geological objects, so the complete dataset may correspond to multiple mixed subpopulations. Specifically, multiple mixed subpopulations can refer to differences between mineralized and barren areas, different geochemical facies of a geological unit, or contaminated and healthy areas. This implies a restriction on using classical or even robust statistical estimates, unless the underlying subpopulations can be extracted from the dataset. The probability plot can be used to assess a dataset and to infer a possible combination of subpopulations, either normal or log-normal, whose combination may generate it. The web-based app PPlot, presented in this paper, allows the plotting of the probability plot of a dataset and modeling the underlying subpopulations present in it, either automatically or manually. After modeling the dataset by the application, the user will obtain numerical results and plots of the range of values that delimit each subpopulation, as well as the mean and standard deviation for each of them. Computer-generated and real datasets were used to validate the procedure and coding, and an example of usage is presented. The app was developed using HTML5 and JavaScript and it runs in any modern browser, and is freely available in https://pplotweb.firebaseapp.com/.