首页 > 最新文献

arXiv: Applications最新文献

英文 中文
Centrality-oriented causality. A study of EU agricultural subsidies and digital development in Poland Centrality-oriented因果关系。欧盟农业补贴与波兰数字发展研究
Pub Date : 2019-08-29 DOI: 10.37190/ord200303
K. Daniel, J. Rydlewski
Results of a convincing causal statistical inference related to socio-economic phenomena are treated as especially desired background for conducting various socio-economic programs or government interventions. Unfortunately, quite often real socio-economic issues do not fulfill restrictive assumptions of procedures of causal analysis proposed in the literature. This paper indicates certain empirical challenges and conceptual opportunities related to applications of procedures of data depth concept into a process of causal inference as to socio-economic phenomena. We show, how to apply a statistical functional depths in order to indicate factual and counterfactual distributions commonly used within procedures of causal inference. Thus a modification of Rubin causality concept is proposed, i.e., a centrality-oriented causality concept. The presented framework is especially useful in a context of conducting causal inference basing on official statistics, i.e., basing on already existing databases. Methodological considerations related to extremal depth, modified band depth, Fraiman-Muniz depth, and multivariate Wilcoxon sum rank statistic are illustrated by means of example related to a study of an impact of EU direct agricultural subsidies on a digital development in Poland in a period of 2012-2019.
与社会经济现象相关的令人信服的因果统计推断结果被视为实施各种社会经济计划或政府干预的特别理想背景。不幸的是,真实的社会经济问题往往不满足文献中提出的因果分析程序的限制性假设。本文指出了将数据深度概念程序应用于社会经济现象的因果推理过程中的某些经验挑战和概念机会。我们展示了如何应用统计函数深度来指示因果推理过程中常用的事实和反事实分布。在此基础上提出了Rubin因果关系概念的修正,即中心性导向的因果关系概念。所提出的框架在基于官方统计数据(即基于现有数据库)进行因果推理的情况下特别有用。通过研究2012-2019年期间欧盟直接农业补贴对波兰数字发展影响的实例,说明了与极值深度、修正波段深度、Fraiman-Muniz深度和多元Wilcoxon和秩统计相关的方法考虑。
{"title":"Centrality-oriented causality. A study of EU agricultural subsidies and digital development in Poland","authors":"K. Daniel, J. Rydlewski","doi":"10.37190/ord200303","DOIUrl":"https://doi.org/10.37190/ord200303","url":null,"abstract":"Results of a convincing causal statistical inference related to socio-economic phenomena are treated as especially desired background for conducting various socio-economic programs or government interventions. Unfortunately, quite often real socio-economic issues do not fulfill restrictive assumptions of procedures of causal analysis proposed in the literature. This paper indicates certain empirical challenges and conceptual opportunities related to applications of procedures of data depth concept into a process of causal inference as to socio-economic phenomena. We show, how to apply a statistical functional depths in order to indicate factual and counterfactual distributions commonly used within procedures of causal inference. Thus a modification of Rubin causality concept is proposed, i.e., a centrality-oriented causality concept. The presented framework is especially useful in a context of conducting causal inference basing on official statistics, i.e., basing on already existing databases. Methodological considerations related to extremal depth, modified band depth, Fraiman-Muniz depth, and multivariate Wilcoxon sum rank statistic are illustrated by means of example related to a study of an impact of EU direct agricultural subsidies on a digital development in Poland in a period of 2012-2019.","PeriodicalId":409996,"journal":{"name":"arXiv: Applications","volume":"123 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114706585","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Probabilistic forecasting of the Arctic sea ice edge with contour modeling 用等高线模拟进行北极海冰边缘的概率预报
Pub Date : 2019-08-25 DOI: 10.1214/20-aoas1405
Hannah M. Director, A. Raftery, C. Bitz
Sea ice, or frozen ocean water, annually freezes and melts in the Arctic. The need for accurate forecasts of where sea ice will be located weeks to months in advance has increased as the amount of sea ice reduces due to climate change. Typical sea ice forecasts are made with ensemble models, physics-based deterministic models of sea ice and the surrounding ocean and atmosphere. This paper introduces Mixture Contour Forecasting, a method to forecast sea ice that post-processes output from ensembles and weights them with observed sea ice patterns in recent years. These forecasts are better calibrated than unadjusted dynamic ensemble forecasts and other statistical reference forecasts. To produce these forecasts, a novel statistical technique is introduced that directly models the sea ice edge contour, the boundary around the region that is ice-covered. Most of the computational effort in post-processing can therefore be placed on the sea ice edge contour, which is of particular importance due to its role in maritime planning. Mixture Contour Forecasting and reference methods are evaluated for monthly sea ice forecasts for 2008-2016 at lead times ranging from 0.5-6.5 months using the European Centre for Medium-Range Weather Forecasts ensemble.
在北极,海冰或冰冻的海水每年都会结冰和融化。由于气候变化导致海冰数量减少,因此需要提前数周到数月准确预测海冰的位置。典型的海冰预报是用集合模式,即基于物理的海冰及其周围海洋和大气的确定性模式来进行的。混合轮廓线预测是一种预测海冰的方法,它对集合的输出进行后处理,并将其与近年来观测到的海冰模式加权。与未经调整的动态集合预报和其他统计参考预报相比,这些预报的校准效果更好。为了产生这些预测,引入了一种新的统计技术,直接模拟海冰边缘轮廓,即海冰覆盖区域周围的边界。因此,后处理中的大部分计算工作都可以放在海冰边缘轮廓上,由于海冰边缘轮廓在海洋规划中的作用,这一点尤为重要。使用欧洲中期天气预报中心集合,评估了混合等高线预报和参考方法对2008-2016年每月海冰预报的预估时间从0.5-6.5个月不等。
{"title":"Probabilistic forecasting of the Arctic sea ice edge with contour modeling","authors":"Hannah M. Director, A. Raftery, C. Bitz","doi":"10.1214/20-aoas1405","DOIUrl":"https://doi.org/10.1214/20-aoas1405","url":null,"abstract":"Sea ice, or frozen ocean water, annually freezes and melts in the Arctic. The need for accurate forecasts of where sea ice will be located weeks to months in advance has increased as the amount of sea ice reduces due to climate change. Typical sea ice forecasts are made with ensemble models, physics-based deterministic models of sea ice and the surrounding ocean and atmosphere. This paper introduces Mixture Contour Forecasting, a method to forecast sea ice that post-processes output from ensembles and weights them with observed sea ice patterns in recent years. These forecasts are better calibrated than unadjusted dynamic ensemble forecasts and other statistical reference forecasts. To produce these forecasts, a novel statistical technique is introduced that directly models the sea ice edge contour, the boundary around the region that is ice-covered. Most of the computational effort in post-processing can therefore be placed on the sea ice edge contour, which is of particular importance due to its role in maritime planning. Mixture Contour Forecasting and reference methods are evaluated for monthly sea ice forecasts for 2008-2016 at lead times ranging from 0.5-6.5 months using the European Centre for Medium-Range Weather Forecasts ensemble.","PeriodicalId":409996,"journal":{"name":"arXiv: Applications","volume":"390 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-08-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132878765","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Climate extreme event attribution using multivariate peaks-over-thresholds modeling and counterfactual theory 气候极端事件归因的多变量峰值超过阈值模型和反事实理论
Pub Date : 2019-08-08 DOI: 10.1214/20-aoas1355
A. Kiriliouk, P. Naveau
Numerical climate models are complex and combine a large number of physical processes. They are key tools in quantifying the relative contribution of potential anthropogenic causes (e.g., the current increase in greenhouse gases) on high impact atmospheric variables like heavy rainfall. These so-called climate extreme event attribution problems are particularly challenging in a multivariate context, that is, when the atmospheric variables are measured on a possibly high-dimensional grid. In this paper, we leverage two statistical theories to assess causality in the context of multivariate extreme event attribution. As we consider an event to be extreme when at least one of the components of the vector of interest is large, extreme-value theory justifies, in an asymptotical sense, a multivariate generalized Pareto distribution to model joint extremes. Under this class of distributions, we derive and study probabilities of necessary and sufficient causation as defined by the counterfactual theory of Pearl. To increase causal evidence, we propose a dimension reduction strategy based on the optimal linear projection that maximizes such causation probabilities. Our approach is tested on simulated examples and applied to weekly winter maxima precipitation outputs of the French CNRM from the recent CMIP6 experiment.
数值气候模式是复杂的,并且结合了大量的物理过程。它们是量化潜在人为原因(例如,目前温室气体的增加)对强降雨等高影响大气变量的相对贡献的关键工具。这些所谓的气候极端事件归因问题在多变量背景下尤其具有挑战性,也就是说,当大气变量在可能的高维网格上测量时。在本文中,我们利用两种统计理论来评估多元极端事件归因背景下的因果关系。当我们认为一个事件是极端的,当感兴趣的向量的至少一个分量很大时,极值理论证明,在渐近意义上,一个多元广义帕累托分布来模拟联合极值。在这类分布下,我们推导并研究了由Pearl的反事实理论定义的充分必要因果的概率。为了增加因果证据,我们提出了一种基于最优线性投影的降维策略,使因果概率最大化。我们的方法在模拟实例上进行了测试,并应用于法国CNRM最近的CMIP6实验的每周冬季最大降水输出。
{"title":"Climate extreme event attribution using multivariate peaks-over-thresholds modeling and counterfactual theory","authors":"A. Kiriliouk, P. Naveau","doi":"10.1214/20-aoas1355","DOIUrl":"https://doi.org/10.1214/20-aoas1355","url":null,"abstract":"Numerical climate models are complex and combine a large number of physical processes. They are key tools in quantifying the relative contribution of potential anthropogenic causes (e.g., the current increase in greenhouse gases) on high impact atmospheric variables like heavy rainfall. These so-called climate extreme event attribution problems are particularly challenging in a multivariate context, that is, when the atmospheric variables are measured on a possibly high-dimensional grid. \u0000In this paper, we leverage two statistical theories to assess causality in the context of multivariate extreme event attribution. As we consider an event to be extreme when at least one of the components of the vector of interest is large, extreme-value theory justifies, in an asymptotical sense, a multivariate generalized Pareto distribution to model joint extremes. Under this class of distributions, we derive and study probabilities of necessary and sufficient causation as defined by the counterfactual theory of Pearl. To increase causal evidence, we propose a dimension reduction strategy based on the optimal linear projection that maximizes such causation probabilities. Our approach is tested on simulated examples and applied to weekly winter maxima precipitation outputs of the French CNRM from the recent CMIP6 experiment.","PeriodicalId":409996,"journal":{"name":"arXiv: Applications","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130002006","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 16
How to Apply Multiple Imputation in Propensity Score Matching with Partially Observed Confounders: A Simulation Study and Practical Recommendations 如何在部分观察混杂因素的倾向评分匹配中应用多重归算:模拟研究和实用建议
Pub Date : 2019-04-16 DOI: 10.22237/jmasm/1608552120
Albee Y. Ling, M. Montez-Rath, Maya B. Mathur, K. Kapphahn, M. Desai
Propensity score matching (PSM) has been widely used to mitigate confounding in observational studies, although complications arise when the covariates used to estimate the PS are only partially observed. Multiple imputation (MI) is a potential solution for handling missing covariates in the estimation of the PS. Unfortunately, it is not clear how to best apply MI strategies in the context of PSM. We conducted a simulation study to compare the performances of popular non-MI missing data methods and various MI-based strategies under different missing data mechanisms (MDMs). We found that commonly applied missing data methods resulted in biased and inefficient estimates, and we observed large variation in performance across MI-based strategies. Based on our findings, we recommend 1) deriving the PS after applying MI (referred to as MI-derPassive); 2) conducting PSM within each imputed data set followed by averaging the treatment effects to arrive at one summarized finding (INT-within) for mild MDMs and averaging the PSs across multiply imputed datasets before obtaining one treatment effect using PSM (INT-across) for more complex MDMs; 3) a bootstrapped-based variance to account for uncertainty of PS estimation, matching, and imputation; and 4) inclusion of key auxiliary variables in the imputation model.
倾向评分匹配(PSM)已广泛用于减轻观察性研究中的混淆,尽管当用于估计PS的协变量仅部分观察到时,会出现并发症。多重插值(Multiple imputation, MI)是处理PS估计中缺失协变量的潜在解决方案。不幸的是,如何在PSM的背景下最好地应用MI策略尚不清楚。我们进行了一项模拟研究,比较了流行的非mi缺失数据方法和各种基于mi的策略在不同缺失数据机制(mdm)下的性能。我们发现,通常应用的缺失数据方法会导致有偏差和低效的估计,并且我们观察到基于mi的策略在性能上存在很大差异。根据我们的研究结果,我们建议1)在应用MI(称为MI- derpassive)后获得PS;2)在每个输入数据集中进行PSM,然后对轻度MDMs进行平均治疗效果,得出一个总结结果(INT-within),然后对多个输入数据集进行平均治疗效果,然后对更复杂的MDMs使用PSM (INT-across)获得一个治疗效果;3)基于自举的方差,以考虑PS估计、匹配和imputation的不确定性;4)将关键辅助变量纳入估算模型。
{"title":"How to Apply Multiple Imputation in Propensity Score Matching with Partially Observed Confounders: A Simulation Study and Practical Recommendations","authors":"Albee Y. Ling, M. Montez-Rath, Maya B. Mathur, K. Kapphahn, M. Desai","doi":"10.22237/jmasm/1608552120","DOIUrl":"https://doi.org/10.22237/jmasm/1608552120","url":null,"abstract":"Propensity score matching (PSM) has been widely used to mitigate confounding in observational studies, although complications arise when the covariates used to estimate the PS are only partially observed. Multiple imputation (MI) is a potential solution for handling missing covariates in the estimation of the PS. Unfortunately, it is not clear how to best apply MI strategies in the context of PSM. We conducted a simulation study to compare the performances of popular non-MI missing data methods and various MI-based strategies under different missing data mechanisms (MDMs). We found that commonly applied missing data methods resulted in biased and inefficient estimates, and we observed large variation in performance across MI-based strategies. Based on our findings, we recommend 1) deriving the PS after applying MI (referred to as MI-derPassive); 2) conducting PSM within each imputed data set followed by averaging the treatment effects to arrive at one summarized finding (INT-within) for mild MDMs and averaging the PSs across multiply imputed datasets before obtaining one treatment effect using PSM (INT-across) for more complex MDMs; 3) a bootstrapped-based variance to account for uncertainty of PS estimation, matching, and imputation; and 4) inclusion of key auxiliary variables in the imputation model.","PeriodicalId":409996,"journal":{"name":"arXiv: Applications","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133090647","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
Modelling of crash types at signalized intersections based on random effect model 基于随机效应模型的信号交叉口碰撞类型建模
Pub Date : 2018-05-16 DOI: 10.11908/j.issn.0253-374x.2016.01.012
Xuesong Wang, Jinghui Yuan, Xiaohan Yang
Approach-level models were developed to accommodate the diversity of approaches within the same intersection. A random effect term, which indicates the intersection-specific effect, was incorporated into each crash type model to deal with the spatial correlation between different approaches within the same intersection. The model parameters were estimated under the Bayesian framework. Results show that different crash types are correlated with different groups of factors, and each factor shows diverse effects on different crash types, which indicates the importance of crash type models. Besides, the significance of random effect term confirms the existence of spatial correlations among different approaches within the same intersection.
开发了方法级模型,以适应同一交叉口内方法的多样性。在每个碰撞类型模型中加入一个表示路口特定效应的随机效应项,以处理同一路口内不同路径之间的空间相关性。在贝叶斯框架下对模型参数进行估计。结果表明,不同的碰撞类型与不同的因素组相关,并且每个因素对不同的碰撞类型表现出不同的影响,这表明碰撞类型模型的重要性。此外,随机效应项的显著性证实了同一交叉口内不同方法之间存在空间相关性。
{"title":"Modelling of crash types at signalized intersections based on random effect model","authors":"Xuesong Wang, Jinghui Yuan, Xiaohan Yang","doi":"10.11908/j.issn.0253-374x.2016.01.012","DOIUrl":"https://doi.org/10.11908/j.issn.0253-374x.2016.01.012","url":null,"abstract":"Approach-level models were developed to accommodate the diversity of approaches within the same intersection. A random effect term, which indicates the intersection-specific effect, was incorporated into each crash type model to deal with the spatial correlation between different approaches within the same intersection. The model parameters were estimated under the Bayesian framework. Results show that different crash types are correlated with different groups of factors, and each factor shows diverse effects on different crash types, which indicates the importance of crash type models. Besides, the significance of random effect term confirms the existence of spatial correlations among different approaches within the same intersection.","PeriodicalId":409996,"journal":{"name":"arXiv: Applications","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-05-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123927004","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
A Discrete View of the Indian Monsoon to Identify Spatial Patterns of Rainfall 印度季风的离散视图以确定降雨的空间模式
Pub Date : 2018-05-01 DOI: 10.1093/CLIMSYS/DZY009
Adway Mitra, A. Apte, R. Govindarajan, V. Vasan, S. Vadlamani
We propose a representation of the Indian summer monsoon rainfall in terms of a probabilistic model based on a Markov Random Field, consisting of discrete state variables representing low and high rainfall at grid-scale and daily rainfall patterns across space and in time. These discrete states are conditioned on observed daily gridded rainfall data from the period 2000-2007. The model gives us a set of 10 spatial patterns of daily monsoon rainfall over India, which are robust over a range of user-chosen parameters as well as coherent in space and time. Each day in the monsoon season is assigned precisely one of the spatial patterns, that approximates the spatial distribution of rainfall on that day. Such approximations are quite accurate for nearly 95% of the days. Remarkably, these patterns are representative (with similar accuracy) of the monsoon seasons from 1901 to 2000 as well. Finally, we compare the proposed model with alternative approaches to extract spatial patterns of rainfall, using empirical orthogonal functions as well as clustering algorithms such as K-means and spectral clustering.
我们提出了一个基于马尔可夫随机场的概率模型来表示印度夏季季风降雨,该模型由离散状态变量组成,表示网格尺度上的低降雨和高降雨以及跨越空间和时间的日降雨模式。这些离散状态是根据2000-2007年期间观测到的逐日网格化降雨数据得出的。该模型为我们提供了印度每日季风降雨的10种空间模式,这些模式在用户选择的参数范围内是稳健的,并且在空间和时间上是一致的。季风季节的每一天都精确地分配了一种空间模式,这近似于当天降雨量的空间分布。这种近似在95%的日子里是相当准确的。值得注意的是,这些模式也具有1901年至2000年季风季节的代表性(精度相似)。最后,我们将所提出的模型与使用经验正交函数以及K-means和光谱聚类等聚类算法提取降雨空间模式的其他方法进行了比较。
{"title":"A Discrete View of the Indian Monsoon to Identify Spatial Patterns of Rainfall","authors":"Adway Mitra, A. Apte, R. Govindarajan, V. Vasan, S. Vadlamani","doi":"10.1093/CLIMSYS/DZY009","DOIUrl":"https://doi.org/10.1093/CLIMSYS/DZY009","url":null,"abstract":"We propose a representation of the Indian summer monsoon rainfall in terms of a probabilistic model based on a Markov Random Field, consisting of discrete state variables representing low and high rainfall at grid-scale and daily rainfall patterns across space and in time. These discrete states are conditioned on observed daily gridded rainfall data from the period 2000-2007. The model gives us a set of 10 spatial patterns of daily monsoon rainfall over India, which are robust over a range of user-chosen parameters as well as coherent in space and time. Each day in the monsoon season is assigned precisely one of the spatial patterns, that approximates the spatial distribution of rainfall on that day. Such approximations are quite accurate for nearly 95% of the days. Remarkably, these patterns are representative (with similar accuracy) of the monsoon seasons from 1901 to 2000 as well. Finally, we compare the proposed model with alternative approaches to extract spatial patterns of rainfall, using empirical orthogonal functions as well as clustering algorithms such as K-means and spectral clustering.","PeriodicalId":409996,"journal":{"name":"arXiv: Applications","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132338314","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Principal Component Analysis on the Philippine Health Data. 菲律宾卫生数据的主成分分析。
Pub Date : 2018-04-16 DOI: 10.2139/ssrn.3339627
Marites F. Carillo, F. Largo, Roel F Ceballos
This study was conducted to determine the structures of a set of n correlated variables and creates a new set of uncorrelated indices which are the underlying components of the Philippine health data.The data utilized in this study was the 2009 Philippine Health Data which was made available by National Statistical Coordination Board(NSCB) in its 2009 publication.The publication contains the health data of 81 provinces of the Philippines consisting of ten system-related determinants which was considered as the variables in this study. From the ten health system-related determinants, it was found out that there are three significant underlying components that could summarize the Philippine health data. The first component was named as importance of safe water supply and emphasis on child heat while the second and third component were named as importance of Barangay Health Stations, government health workers and emphasis on pregnant women's health and emphasis on women's health, respectively. These three components jointly account for a total of 73.01% of the total variance explained by the component.
进行这项研究是为了确定一组n个相关变量的结构,并创建一组新的不相关指数,这些指数是菲律宾卫生数据的基本组成部分。本研究使用的数据是由国家统计协调委员会(NSCB)在其2009年出版物中提供的2009年菲律宾卫生数据。该出版物包含菲律宾81个省的健康数据,包括10个与系统相关的决定因素,这些决定因素被视为本研究中的变量。从与卫生系统相关的十个决定因素中,发现有三个重要的基本组成部分可以总结菲律宾的卫生数据。第一个组成部分被命名为安全供水的重要性和对儿童供暖的重视,第二个和第三个组成部分分别被命名为Barangay卫生站、政府卫生工作者的重要性和对孕妇健康的重视和对妇女健康的重视。这三个分量合计占该分量解释的总方差的73.01%。
{"title":"Principal Component Analysis on the Philippine Health Data.","authors":"Marites F. Carillo, F. Largo, Roel F Ceballos","doi":"10.2139/ssrn.3339627","DOIUrl":"https://doi.org/10.2139/ssrn.3339627","url":null,"abstract":"This study was conducted to determine the structures of a set of n correlated variables and creates a new set of uncorrelated indices which are the underlying components of the Philippine health data.The data utilized in this study was the 2009 Philippine Health Data which was made available by National Statistical Coordination Board(NSCB) in its 2009 publication.The publication contains the health data of 81 provinces of the Philippines consisting of ten system-related determinants which was considered as the variables in this study. From the ten health system-related determinants, it was found out that there are three significant underlying components that could summarize the Philippine health data. The first component was named as importance of safe water supply and emphasis on child heat while the second and third component were named as importance of Barangay Health Stations, government health workers and emphasis on pregnant women's health and emphasis on women's health, respectively. These three components jointly account for a total of 73.01% of the total variance explained by the component.","PeriodicalId":409996,"journal":{"name":"arXiv: Applications","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125151341","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Point process models for quasi-periodic volcanic earthquakes 准周期性火山地震的点过程模型
Pub Date : 2018-03-20 DOI: 10.5038/2163-338X.4.2
A. Ignatieva, A. Bell, B. Worton
Long period (LP) earthquakes are common at active volcanoes, and are ubiquitous at persistently active andesitic and dacitic subduction zone volcanoes. They provide critical information regarding the state of volcanic unrest, and their occurrence rates are key data for eruption forecasting. LPs are commonly quasi-periodic or 'anti-clustered', unlike volcano-tectonic (VT) earthquakes, so the existing Poisson point process methods used to model occurrence rates of VT earthquakes are unlikely to be optimal for LP data. We evaluate the performance of candidate formulations for LP data, based on inhomogeneous point process models with four different inter-event time distributions: exponential (IP), Gamma (IG), inverse Gaussian (IIG), and Weibull (IW). We examine how well these models explain the observed data, and the quality of retrospective forecasts of eruption time. We use a Bayesian MCMC approach to fit the models. Goodness-of-fit is assessed using Quantile-Quantile and Kolmogorov-Smirnov methods, and benchmarking against results obtained from synthetic datasets. IG and IIG models were both found to fit the data well, with the IIG model slightly outperforming the IG model. Retrospective forecasting analysis shows that the IG model performs best, with the initial preference for the IIG model controlled by catalogue incompleteness late in the sequence. The IG model fits the data significantly better than the IP model, and simulations show it produces better forecasts for highly periodic data. Simulations also show that forecast precision increases with the degree of periodicity of the earthquake process using the IG model, and so should be better for LP earthquakes than VTs. These results provide a new framework for point process modelling of volcanic earthquake time series, and verification of alternative models.
长周期地震在活火山上很常见,在持续活跃的安山岩和英安质俯冲带火山上普遍存在。它们提供了关于火山不稳定状态的关键信息,它们的发生率是火山爆发预测的关键数据。与火山构造地震(VT)不同,LP通常是准周期性或“反聚集性”地震,因此现有的泊松点过程方法用于模拟VT地震的发生率,不太可能是LP数据的最佳选择。我们基于具有四种不同事件间时间分布的非齐次点过程模型评估LP数据候选公式的性能:指数(IP)、伽马(IG)、逆高斯(IIG)和威布尔(IW)。我们研究了这些模型如何很好地解释观测数据,以及对喷发时间的回顾性预测的质量。我们使用贝叶斯MCMC方法来拟合模型。使用分位数-分位数和Kolmogorov-Smirnov方法评估拟合优度,并对从合成数据集获得的结果进行基准测试。我们发现IG和IIG模型都能很好地拟合数据,IIG模型的表现略优于IG模型。回顾性预测分析表明,IG模型表现最好,序列后期目录不完全性控制了对IIG模型的初始偏好。IG模型比IP模型对数据的拟合效果要好得多,仿真结果表明,IG模型对高周期数据的预测效果更好。模拟结果还表明,IG模型的预报精度随着地震过程周期性程度的增加而增加,因此对中低压地震的预报精度要优于对中低压地震的预报。这些结果为火山地震时间序列的点过程模拟和替代模型的验证提供了一个新的框架。
{"title":"Point process models for quasi-periodic volcanic earthquakes","authors":"A. Ignatieva, A. Bell, B. Worton","doi":"10.5038/2163-338X.4.2","DOIUrl":"https://doi.org/10.5038/2163-338X.4.2","url":null,"abstract":"Long period (LP) earthquakes are common at active volcanoes, and are ubiquitous at persistently active andesitic and dacitic subduction zone volcanoes. They provide critical information regarding the state of volcanic unrest, and their occurrence rates are key data for eruption forecasting. LPs are commonly quasi-periodic or 'anti-clustered', unlike volcano-tectonic (VT) earthquakes, so the existing Poisson point process methods used to model occurrence rates of VT earthquakes are unlikely to be optimal for LP data. We evaluate the performance of candidate formulations for LP data, based on inhomogeneous point process models with four different inter-event time distributions: exponential (IP), Gamma (IG), inverse Gaussian (IIG), and Weibull (IW). We examine how well these models explain the observed data, and the quality of retrospective forecasts of eruption time. We use a Bayesian MCMC approach to fit the models. Goodness-of-fit is assessed using Quantile-Quantile and Kolmogorov-Smirnov methods, and benchmarking against results obtained from synthetic datasets. IG and IIG models were both found to fit the data well, with the IIG model slightly outperforming the IG model. Retrospective forecasting analysis shows that the IG model performs best, with the initial preference for the IIG model controlled by catalogue incompleteness late in the sequence. The IG model fits the data significantly better than the IP model, and simulations show it produces better forecasts for highly periodic data. Simulations also show that forecast precision increases with the degree of periodicity of the earthquake process using the IG model, and so should be better for LP earthquakes than VTs. These results provide a new framework for point process modelling of volcanic earthquake time series, and verification of alternative models.","PeriodicalId":409996,"journal":{"name":"arXiv: Applications","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-03-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131748581","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
'Cultural Additivity' and How the Values and Norms of Confucianism, Buddhism, and Taoism Co-Exist, Interact, and Influence Vietnamese Society: A Bayesian Analysis of Long-Standing Folktales, Using R and Stan “文化可加性”与儒释道价值观与规范如何共存、互动、影响越南社会:基于R和Stan的长期民间故事贝叶斯分析
Pub Date : 2018-03-04 DOI: 10.2139/SSRN.3134541
Q. Vuong, Manh-Tung Ho, Viet-Phuong La, D. Nhue, Bui Quang Khiem, Nghiem Phu Kien Cuong, Thu-Trang Vuong, Manh-Toan Ho, H. Nguyen, Viet-Ha T. Nguyen, Hiep-Hung Pham, N. Napier
Every year, the Vietnamese people reportedly burned about 50,000 tons of joss papers, which took the form of not only bank notes, but iPhones, cars, clothes, even housekeepers, in hope of pleasing the dead. The practice was mistakenly attributed to traditional Buddhist teachings but originated in fact from China, which most Vietnamese were not aware of. In other aspects of life, there were many similar examples of Vietnamese so ready and comfortable with adding new norms, values, and beliefs, even contradictory ones, to their culture. This phenomenon, dubbed "cultural additivity", prompted us to study the co-existence, interaction, and influences among core values and norms of the Three Teachings –Confucianism, Buddhism, and Taoism–as shown through Vietnamese folktales. By applying Bayesian logistic regression, we evaluated the possibility of whether the key message of a story was dominated by a religion (dependent variables), as affected by the appearance of values and anti-values pertaining to the Three Teachings in the story (independent variables). Our main findings included the existence of the cultural additivity of Confucian and Taoist values. More specifically, empirical results showed that the interaction or addition of the values of Taoism and Confucianism in folktales together helped predict whether the key message of a story was about Confucianism, β{VT ⋅ VC} = 0.86. Meanwhile, there was no such statistical tendency for Buddhism. The results lead to a number of important implications. First, this showed the dominance of Confucianism because the fact that Confucian and Taoist values appeared together in a story led to the story’s key message dominated by Confucianism. Thus, it presented the evidence of Confucian dominance and against liberal interpretations of the concept of the Common Roots of Three Religions ("tam giao đồng nguyen") as religious unification or unicity. Second, the concept of "cultural additivity" could help explain many interesting socio-cultural phenomena, namely the absence of religious intolerance and extremism in the Vietnamese society, outrageous cases of sophistry in education, the low productivity in creative endeavors like science and technology, the misleading branding strategy in business. We are aware that our results are only preliminary and more studies, both theoretical and empirical, must be carried out to give a full account of the explanatory reach of "cultural additivity".
据报道,越南人每年焚烧约5万吨纸钱,这些纸钱的形式不仅包括纸币,还包括iphone、汽车、衣服,甚至家政人员,希望以此来取悦死者。这种做法被错误地归因于传统的佛教教义,但实际上起源于中国,大多数越南人并不知道这一点。在生活的其他方面,也有许多类似的例子,越南人乐于为自己的文化增添新的规范、价值观和信仰,甚至是相互矛盾的东西。这种现象被称为“文化可加性”,促使我们通过越南民间故事来研究儒、释、道三教的核心价值和规范之间的共存、互动和影响。通过应用贝叶斯逻辑回归,我们评估了故事的关键信息是否由宗教(因变量)主导的可能性,是否受到故事中与三教有关的价值观和反价值观的影响(自变量)。我们的主要发现包括儒家和道教价值观的文化可加性的存在。更具体地说,实证结果表明,道教和儒家价值观在民间故事中的相互作用或共同作用有助于预测故事的关键信息是否与儒家有关,β{VT⋅VC} = 0.86。而佛教则没有这样的统计趋势。研究结果引出了一些重要的启示。首先,这表明儒家的主导地位,因为儒家和道家的价值观同时出现在一个故事中,导致故事的关键信息由儒家主导。因此,它提出了儒家主导地位的证据,反对自由主义对三教共同根源(“tam giao đồng nguyen”)概念的解释,认为这是宗教统一或单一的。其次,“文化可加性”的概念可以帮助解释许多有趣的社会文化现象,即越南社会中没有宗教不容忍和极端主义,教育中的荒谬诡辩,科学和技术等创造性活动的低生产率,以及商业中误导性的品牌战略。我们意识到,我们的结果只是初步的,必须进行更多的理论和实证研究,以充分说明“文化可加性”的解释范围。
{"title":"'Cultural Additivity' and How the Values and Norms of Confucianism, Buddhism, and Taoism Co-Exist, Interact, and Influence Vietnamese Society: A Bayesian Analysis of Long-Standing Folktales, Using R and Stan","authors":"Q. Vuong, Manh-Tung Ho, Viet-Phuong La, D. Nhue, Bui Quang Khiem, Nghiem Phu Kien Cuong, Thu-Trang Vuong, Manh-Toan Ho, H. Nguyen, Viet-Ha T. Nguyen, Hiep-Hung Pham, N. Napier","doi":"10.2139/SSRN.3134541","DOIUrl":"https://doi.org/10.2139/SSRN.3134541","url":null,"abstract":"Every year, the Vietnamese people reportedly burned about 50,000 tons of joss papers, which took the form of not only bank notes, but iPhones, cars, clothes, even housekeepers, in hope of pleasing the dead. The practice was mistakenly attributed to traditional Buddhist teachings but originated in fact from China, which most Vietnamese were not aware of. In other aspects of life, there were many similar examples of Vietnamese so ready and comfortable with adding new norms, values, and beliefs, even contradictory ones, to their culture. This phenomenon, dubbed \"cultural additivity\", prompted us to study the co-existence, interaction, and influences among core values and norms of the Three Teachings –Confucianism, Buddhism, and Taoism–as shown through Vietnamese folktales. By applying Bayesian logistic regression, we evaluated the possibility of whether the key message of a story was dominated by a religion (dependent variables), as affected by the appearance of values and anti-values pertaining to the Three Teachings in the story (independent variables). Our main findings included the existence of the cultural additivity of Confucian and Taoist values. More specifically, empirical results showed that the interaction or addition of the values of Taoism and Confucianism in folktales together helped predict whether the key message of a story was about Confucianism, β{VT ⋅ VC} = 0.86. Meanwhile, there was no such statistical tendency for Buddhism. The results lead to a number of important implications. First, this showed the dominance of Confucianism because the fact that Confucian and Taoist values appeared together in a story led to the story’s key message dominated by Confucianism. Thus, it presented the evidence of Confucian dominance and against liberal interpretations of the concept of the Common Roots of Three Religions (\"tam giao đồng nguyen\") as religious unification or unicity. Second, the concept of \"cultural additivity\" could help explain many interesting socio-cultural phenomena, namely the absence of religious intolerance and extremism in the Vietnamese society, outrageous cases of sophistry in education, the low productivity in creative endeavors like science and technology, the misleading branding strategy in business. We are aware that our results are only preliminary and more studies, both theoretical and empirical, must be carried out to give a full account of the explanatory reach of \"cultural additivity\".","PeriodicalId":409996,"journal":{"name":"arXiv: Applications","volume":"122 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116619818","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
Probability-Scale Residuals in HIV/AIDS Research: Diagnostics and Inference HIV/AIDS研究中的概率尺度残差:诊断和推断
Pub Date : 2018-03-01 DOI: 10.1201/9781315120805-11
B. Shepherd, Qi Liu, Valentine Wanga, C. Li
The probability-scale residual (PSR) is well defined across a wide variety of variable types and models, making it useful for studies of HIV/AIDS. In this manuscript, we highlight some of the properties of the PSR and illustrate its application with HIV data. As a residual, it can be useful for model diagnostics; we demonstrate its use with ordered categorical data and semiparametric transformation models. The PSR can also be used to construct tests of residual correlation. In fact, partial Spearman's rank correlation between $X$ and $Y$ while adjusting for covariates $Z$ can be constructed as the correlation between PSRs from models of $Y$ on $Z$ and of $X$ on $Z$. The covariance of PSRs is also useful in some settings. We apply these methods to a variety of HIV datasets including 1) a study examining risk factors for more severe forms of cervical lesions among 145 women living with HIV in Zambia, 2) a study investigating the association between 21 metabolomic biomarkers among 70 HIV-positive patients in the southeastern United States, and 3) a genome wide association study investigating the association between single nucleotide polymorphisms and tenofovir clearance among 501 HIV-positive persons participating in a multi-site randomized clinical trial.
概率尺度残差(PSR)在各种变量类型和模型中都有很好的定义,这使得它对艾滋病毒/艾滋病的研究很有用。在这篇文章中,我们强调了PSR的一些特性,并说明了它在HIV数据中的应用。作为残差,它可以用于模型诊断;我们演示了它在有序分类数据和半参数转换模型中的应用。PSR也可以用来构造残差相关性的检验。事实上,在调整协变量$Z$时,$X$和$Y$之间的部分Spearman等级相关性可以构建为$Y$对$Z$和$X$对$Z$模型的psr之间的相关性。psr的协方差在某些情况下也很有用。我们将这些方法应用于各种艾滋病毒数据集,包括1)一项研究,检查了赞比亚145名艾滋病毒感染者中更严重形式的宫颈病变的危险因素;2)一项研究,调查了美国东南部70名艾滋病毒阳性患者中21种代谢组学生物标志物之间的关系。3)一项全基因组关联研究,调查了501名参加多地点随机临床试验的hiv阳性患者的单核苷酸多态性与替诺福韦清除率之间的关系。
{"title":"Probability-Scale Residuals in HIV/AIDS Research: Diagnostics and Inference","authors":"B. Shepherd, Qi Liu, Valentine Wanga, C. Li","doi":"10.1201/9781315120805-11","DOIUrl":"https://doi.org/10.1201/9781315120805-11","url":null,"abstract":"The probability-scale residual (PSR) is well defined across a wide variety of variable types and models, making it useful for studies of HIV/AIDS. In this manuscript, we highlight some of the properties of the PSR and illustrate its application with HIV data. As a residual, it can be useful for model diagnostics; we demonstrate its use with ordered categorical data and semiparametric transformation models. The PSR can also be used to construct tests of residual correlation. In fact, partial Spearman's rank correlation between $X$ and $Y$ while adjusting for covariates $Z$ can be constructed as the correlation between PSRs from models of $Y$ on $Z$ and of $X$ on $Z$. The covariance of PSRs is also useful in some settings. We apply these methods to a variety of HIV datasets including 1) a study examining risk factors for more severe forms of cervical lesions among 145 women living with HIV in Zambia, 2) a study investigating the association between 21 metabolomic biomarkers among 70 HIV-positive patients in the southeastern United States, and 3) a genome wide association study investigating the association between single nucleotide polymorphisms and tenofovir clearance among 501 HIV-positive persons participating in a multi-site randomized clinical trial.","PeriodicalId":409996,"journal":{"name":"arXiv: Applications","volume":"107 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127010994","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
arXiv: Applications
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1