Objectives: To introduce a new, patient-oriented predictive index as a measure of gain in certainty.
Study design: Algebraic equations.
Results: A new measure is suggested based on error rates in a patient population. The new Predictive Summary Index (PSI) reflects the true total gain in certainty obtained by performing a diagnostic test based on knowledge of disease prevalence, i.e., the overall additional certainty. We show that the overall gain in certainty can be expressed in the form of the following expression: PSI = PPV+NPV-1. PSI is a more comprehensive measure than the post-test probability or the Youden Index (J). The reciprocal of J is interpreted as the number of persons with a given disease who need to be examined in order to detect correctly one person with the disease. The reciprocal of PSI is suggested as the number of persons who need to be examined in order to correctly predict a diagnosis of the disease.
Conclusion: PSI provides more information than J and the predictive values, making it more appropriate in a clinical setting.
{"title":"New patient-oriented summary measure of net total gain in certainty for dichotomous diagnostic tests.","authors":"Shai Linn, Peter D Grunau","doi":"10.1186/1742-5573-3-11","DOIUrl":"10.1186/1742-5573-3-11","url":null,"abstract":"<p><strong>Objectives: </strong>To introduce a new, patient-oriented predictive index as a measure of gain in certainty.</p><p><strong>Study design: </strong>Algebraic equations.</p><p><strong>Results: </strong>A new measure is suggested based on error rates in a patient population. The new Predictive Summary Index (PSI) reflects the true total gain in certainty obtained by performing a diagnostic test based on knowledge of disease prevalence, i.e., the overall additional certainty. We show that the overall gain in certainty can be expressed in the form of the following expression: PSI = PPV+NPV-1. PSI is a more comprehensive measure than the post-test probability or the Youden Index (J). The reciprocal of J is interpreted as the number of persons with a given disease who need to be examined in order to detect correctly one person with the disease. The reciprocal of PSI is suggested as the number of persons who need to be examined in order to correctly predict a diagnosis of the disease.</p><p><strong>Conclusion: </strong>PSI provides more information than J and the predictive values, making it more appropriate in a clinical setting.</p>","PeriodicalId":87082,"journal":{"name":"Epidemiologic perspectives & innovations : EP+I","volume":"3 ","pages":"11"},"PeriodicalIF":0.0,"publicationDate":"2006-10-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1635036/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"26294185","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Annette Peters, Stephanie von Klot, Niklas Berglind, Allmut Hörmann, Hannelore Löwel, Fredrik Nyberg, Juha Pekkanen, Carlo A Perucci, Massimo Stafoggia, Jordi Sunyer, Pekka Tiittanen, Francesco Forastiere
Background: Short-term fluctuations of ambient air pollution have been associated with exacerbation of cardiovascular disease. A multi-city study was designed to assess the probability of recurrent hospitalization in a cohort of incident myocardial infarction survivors in five European cities. The objective of this paper is to discuss the methods for analyzing short-term health effects in a cohort study based on a case-series.
Methods: Three methods were considered for the analyses of the cohort data: Poisson regression approach, case-crossover analyses and extended Cox regression analyses. The major challenge of these analyses is to appropriately consider changes within the cohort over time due to changes in the underlying risk following a myocardial infarction, slow time trends in risk factors within the population, dynamic cohort size and seasonal variation.
Results: Poisson regression analyses, case-crossover analyses and Extended Cox regression analyses gave similar results. Application of smoothing methods showed the capability to adequately model the complex time trends.
Conclusion: From a practical point of view, Poisson regression analyses are less time-consuming, and therefore might be used for confounder selection and most of the analyses. However, replication of the results with Cox models is desirable to assure that the results are independent of the analytical approach used. In addition, extended Cox regression analyses would allow a joint estimation of long-term and short-term health effects of time-varying exposures.
{"title":"Comparison of different methods in analyzing short-term air pollution effects in a cohort study of susceptible individuals.","authors":"Annette Peters, Stephanie von Klot, Niklas Berglind, Allmut Hörmann, Hannelore Löwel, Fredrik Nyberg, Juha Pekkanen, Carlo A Perucci, Massimo Stafoggia, Jordi Sunyer, Pekka Tiittanen, Francesco Forastiere","doi":"10.1186/1742-5573-3-10","DOIUrl":"https://doi.org/10.1186/1742-5573-3-10","url":null,"abstract":"<p><strong>Background: </strong>Short-term fluctuations of ambient air pollution have been associated with exacerbation of cardiovascular disease. A multi-city study was designed to assess the probability of recurrent hospitalization in a cohort of incident myocardial infarction survivors in five European cities. The objective of this paper is to discuss the methods for analyzing short-term health effects in a cohort study based on a case-series.</p><p><strong>Methods: </strong>Three methods were considered for the analyses of the cohort data: Poisson regression approach, case-crossover analyses and extended Cox regression analyses. The major challenge of these analyses is to appropriately consider changes within the cohort over time due to changes in the underlying risk following a myocardial infarction, slow time trends in risk factors within the population, dynamic cohort size and seasonal variation.</p><p><strong>Results: </strong>Poisson regression analyses, case-crossover analyses and Extended Cox regression analyses gave similar results. Application of smoothing methods showed the capability to adequately model the complex time trends.</p><p><strong>Conclusion: </strong>From a practical point of view, Poisson regression analyses are less time-consuming, and therefore might be used for confounder selection and most of the analyses. However, replication of the results with Cox models is desirable to assure that the results are independent of the analytical approach used. In addition, extended Cox regression analyses would allow a joint estimation of long-term and short-term health effects of time-varying exposures.</p>","PeriodicalId":87082,"journal":{"name":"Epidemiologic perspectives & innovations : EP+I","volume":"3 ","pages":"10"},"PeriodicalIF":0.0,"publicationDate":"2006-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/1742-5573-3-10","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"26192895","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Growth, current size and the role of the 'reversal paradox' in the foetal origins of adult disease: an illustration using vector geometry.","authors":"Yu-Kang Tu, George T H Ellison, Mark S Gilthorpe","doi":"10.1186/1742-5573-3-9","DOIUrl":"https://doi.org/10.1186/1742-5573-3-9","url":null,"abstract":"","PeriodicalId":87082,"journal":{"name":"Epidemiologic perspectives & innovations : EP+I","volume":"3 ","pages":"9"},"PeriodicalIF":0.0,"publicationDate":"2006-08-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/1742-5573-3-9","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"26178232","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Eric A Whitsel, P Miguel Quibrera, Richard L Smith, Diane J Catellier, Duanping Liao, Amanda C Henley, Gerardo Heiss
Background: Published studies of geocoding accuracy often focus on a single geographic area, address source or vendor, do not adjust accuracy measures for address characteristics, and do not examine effects of inaccuracy on exposure measures. We addressed these issues in a Women's Health Initiative ancillary study, the Environmental Epidemiology of Arrhythmogenesis in WHI.
Results: Addresses in 49 U.S. states (n = 3,615) with established coordinates were geocoded by four vendors (A-D). There were important differences among vendors in address match rate (98%; 82%; 81%; 30%), concordance between established and vendor-assigned census tracts (85%; 88%; 87%; 98%) and distance between established and vendor-assigned coordinates (mean rho [meters]: 1809; 748; 704; 228). Mean rho was lowest among street-matched, complete, zip-coded, unedited and urban addresses, and addresses with North American Datum of 1983 or World Geodetic System of 1984 coordinates. In mixed models restricted to vendors with minimally acceptable match rates (A-C) and adjusted for address characteristics, within-address correlation, and among-vendor heteroscedasticity of rho, differences in mean rho were small for street-type matches (280; 268; 275), i.e. likely to bias results relying on them about equally for most applications. In contrast, differences between centroid-type matches were substantial in some vendor contrasts, but not others (5497; 4303; 4210) p(interaction) < 10(-4), i.e. more likely to bias results differently in many applications. The adjusted odds of an address match was higher for vendor A versus C (odds ratio = 66, 95% confidence interval: 47, 93), but not B versus C (OR = 1.1, 95% CI: 0.9, 1.3). That of census tract concordance was no higher for vendor A versus C (OR = 1.0, 95% CI: 0.9, 1.2) or B versus C (OR = 1.1, 95% CI: 0.9, 1.3). Misclassification of a related exposure measure--distance to the nearest highway--increased with mean rho and in the absence of confounding, non-differential misclassification of this distance biased its hypothetical association with coronary heart disease mortality toward the null.
Conclusion: Geocoding error depends on measures used to evaluate it, address characteristics and vendor. Vendor selection presents a trade-off between potential for missing data and error in estimating spatially defined attributes. Informed selection is needed to control the trade-off and adjust analyses for its effects.
背景:已发表的关于地理编码准确性的研究通常只关注单一地理区域、地址来源或供应商,不会根据地址特征调整准确性测量,也不会检查不准确性对暴露测量的影响。我们在 "妇女健康倡议 "辅助研究 "WHI 中心律失常发生的环境流行病学 "中解决了这些问题:四家供应商(A-D)对美国 49 个州(n = 3,615 个)具有既定坐标的地址进行了地理编码。不同供应商在地址匹配率(98%;82%;81%;30%)、既定普查区与供应商分配的普查区之间的一致性(85%;88%;87%;98%)以及既定坐标与供应商分配的坐标之间的距离(平均 rho [米]:1809;748;704;228)方面存在显著差异。在街道匹配、完整、邮政编码、未编辑和城市地址,以及具有 1983 年北美基准或 1984 年世界大地测量系统坐标的地址中,平均 rho 值最低。在混合模型中,仅限于匹配率最低的供应商(A-C),并根据地址特征、地址内相关性和供应商间 rho 的异方差性进行调整,街道类型匹配的平均 rho 差异较小(280;268;275),也就是说,在大多数应用中,依靠街道类型匹配的结果可能会有偏差。与此相反,在某些供应商对比中,中心点类型匹配的差异很大,而在其他供应商对比中则不然(5497;4303;4210),p(交互作用) < 10(-4),即在许多应用中更有可能对结果产生不同的偏差。供应商 A 与供应商 C 的地址匹配调整后几率更高(几率比 = 66,95% 置信区间:47, 93),但供应商 B 与供应商 C 的地址匹配调整后几率不高(OR = 1.1,95% 置信区间:0.9, 1.3)。人口普查区的一致性在供应商 A 与供应商 C(OR = 1.0,95% CI:0.9,1.2)或供应商 B 与供应商 C(OR = 1.1,95% CI:0.9,1.3)之间并不高。在没有混杂因素的情况下,该距离的非差异性错误分类使其与冠心病死亡率的假定关联偏向于空值:地理编码误差取决于评估误差的方法、地址特征和供应商。供应商的选择需要在潜在的数据缺失和空间定义属性估计误差之间进行权衡。需要在知情的情况下进行选择,以控制权衡,并根据其影响调整分析。
{"title":"Accuracy of commercial geocoding: assessment and implications.","authors":"Eric A Whitsel, P Miguel Quibrera, Richard L Smith, Diane J Catellier, Duanping Liao, Amanda C Henley, Gerardo Heiss","doi":"10.1186/1742-5573-3-8","DOIUrl":"10.1186/1742-5573-3-8","url":null,"abstract":"<p><strong>Background: </strong>Published studies of geocoding accuracy often focus on a single geographic area, address source or vendor, do not adjust accuracy measures for address characteristics, and do not examine effects of inaccuracy on exposure measures. We addressed these issues in a Women's Health Initiative ancillary study, the Environmental Epidemiology of Arrhythmogenesis in WHI.</p><p><strong>Results: </strong>Addresses in 49 U.S. states (n = 3,615) with established coordinates were geocoded by four vendors (A-D). There were important differences among vendors in address match rate (98%; 82%; 81%; 30%), concordance between established and vendor-assigned census tracts (85%; 88%; 87%; 98%) and distance between established and vendor-assigned coordinates (mean rho [meters]: 1809; 748; 704; 228). Mean rho was lowest among street-matched, complete, zip-coded, unedited and urban addresses, and addresses with North American Datum of 1983 or World Geodetic System of 1984 coordinates. In mixed models restricted to vendors with minimally acceptable match rates (A-C) and adjusted for address characteristics, within-address correlation, and among-vendor heteroscedasticity of rho, differences in mean rho were small for street-type matches (280; 268; 275), i.e. likely to bias results relying on them about equally for most applications. In contrast, differences between centroid-type matches were substantial in some vendor contrasts, but not others (5497; 4303; 4210) p(interaction) < 10(-4), i.e. more likely to bias results differently in many applications. The adjusted odds of an address match was higher for vendor A versus C (odds ratio = 66, 95% confidence interval: 47, 93), but not B versus C (OR = 1.1, 95% CI: 0.9, 1.3). That of census tract concordance was no higher for vendor A versus C (OR = 1.0, 95% CI: 0.9, 1.2) or B versus C (OR = 1.1, 95% CI: 0.9, 1.3). Misclassification of a related exposure measure--distance to the nearest highway--increased with mean rho and in the absence of confounding, non-differential misclassification of this distance biased its hypothetical association with coronary heart disease mortality toward the null.</p><p><strong>Conclusion: </strong>Geocoding error depends on measures used to evaluate it, address characteristics and vendor. Vendor selection presents a trade-off between potential for missing data and error in estimating spatially defined attributes. Informed selection is needed to control the trade-off and adjust analyses for its effects.</p>","PeriodicalId":87082,"journal":{"name":"Epidemiologic perspectives & innovations : EP+I","volume":"3 ","pages":"8"},"PeriodicalIF":0.0,"publicationDate":"2006-07-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1557664/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"26156355","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Causal thinking and causal language in epidemiology: a cause by any other name is still a cause: response to Lipton and Ødegaard.","authors":"Clarence C Tam","doi":"10.1186/1742-5573-3-7","DOIUrl":"https://doi.org/10.1186/1742-5573-3-7","url":null,"abstract":"","PeriodicalId":87082,"journal":{"name":"Epidemiologic perspectives & innovations : EP+I","volume":"3 ","pages":"7"},"PeriodicalIF":0.0,"publicationDate":"2006-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/1742-5573-3-7","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"26072249","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Andria Q Jones, Catherine E Dewey, Kathryn Doré, Shannon E Majowicz, Scott A McEwen, David Waltner-Toews
Background: Exposure assessment is typically the greatest weakness of epidemiologic studies of disinfection by-products (DBPs) in drinking water, which largely stems from the difficulty in obtaining accurate data on individual-level water consumption patterns and activity. Thus, surrogate measures for such waterborne exposures are commonly used. Little attention however, has been directed towards formal validation of these measures.
Methods: We conducted a study in the City of Hamilton, Ontario (Canada) in 2001-2002, to assess the accuracy of two surrogate measures of home water source: (a) urban/rural status as assigned using residential postal codes, and (b) mapping of residential postal codes to municipal water systems within a Geographic Information System (GIS). We then assessed the accuracy of a commonly-used surrogate measure of an individual's actual drinking water source, namely, their home water source.
Results: The surrogates for home water source provided good classification of residents served by municipal water systems (approximately 98% predictive value), but did not perform well in classifying those served by private water systems (average: 63.5% predictive value). More importantly, we found that home water source was a poor surrogate measure of the individuals' actual drinking water source(s), being associated with high misclassification errors.
Conclusion: This study demonstrated substantial misclassification errors associated with a surrogate measure commonly used in studies of drinking water disinfection byproducts. Further, the limited accuracy of two surrogate measures of an individual's home water source heeds caution in their use in exposure classification methodology. While these surrogates are inexpensive and convenient, they should not be substituted for direct collection of accurate data pertaining to the subjects' waterborne disease exposure. In instances where such surrogates must be used, estimation of the misclassification and its subsequent effects are recommended for the interpretation and communication of results. Our results also lend support for further investigation into the quantification of the exposure misclassification associated with these surrogate measures, which would provide useful estimates for consideration in interpretation of waterborne disease studies.
{"title":"Exposure assessment in investigations of waterborne illness: a quantitative estimate of measurement error.","authors":"Andria Q Jones, Catherine E Dewey, Kathryn Doré, Shannon E Majowicz, Scott A McEwen, David Waltner-Toews","doi":"10.1186/1742-5573-3-6","DOIUrl":"https://doi.org/10.1186/1742-5573-3-6","url":null,"abstract":"<p><strong>Background: </strong>Exposure assessment is typically the greatest weakness of epidemiologic studies of disinfection by-products (DBPs) in drinking water, which largely stems from the difficulty in obtaining accurate data on individual-level water consumption patterns and activity. Thus, surrogate measures for such waterborne exposures are commonly used. Little attention however, has been directed towards formal validation of these measures.</p><p><strong>Methods: </strong>We conducted a study in the City of Hamilton, Ontario (Canada) in 2001-2002, to assess the accuracy of two surrogate measures of home water source: (a) urban/rural status as assigned using residential postal codes, and (b) mapping of residential postal codes to municipal water systems within a Geographic Information System (GIS). We then assessed the accuracy of a commonly-used surrogate measure of an individual's actual drinking water source, namely, their home water source.</p><p><strong>Results: </strong>The surrogates for home water source provided good classification of residents served by municipal water systems (approximately 98% predictive value), but did not perform well in classifying those served by private water systems (average: 63.5% predictive value). More importantly, we found that home water source was a poor surrogate measure of the individuals' actual drinking water source(s), being associated with high misclassification errors.</p><p><strong>Conclusion: </strong>This study demonstrated substantial misclassification errors associated with a surrogate measure commonly used in studies of drinking water disinfection byproducts. Further, the limited accuracy of two surrogate measures of an individual's home water source heeds caution in their use in exposure classification methodology. While these surrogates are inexpensive and convenient, they should not be substituted for direct collection of accurate data pertaining to the subjects' waterborne disease exposure. In instances where such surrogates must be used, estimation of the misclassification and its subsequent effects are recommended for the interpretation and communication of results. Our results also lend support for further investigation into the quantification of the exposure misclassification associated with these surrogate measures, which would provide useful estimates for consideration in interpretation of waterborne disease studies.</p>","PeriodicalId":87082,"journal":{"name":"Epidemiologic perspectives & innovations : EP+I","volume":"3 ","pages":"6"},"PeriodicalIF":0.0,"publicationDate":"2006-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/1742-5573-3-6","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"26050031","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Interaction measured on the additive scale has been argued to be better correlated with biologic interaction than when measured on the multiplicative scale. Measures of interaction on the additive scale have been developed using risk ratios. However, in studies that use odds ratios as the sole measure of effect, the calculation of these measures of additive interaction is usually performed by directly substituting odds ratios for risk ratios. Yet assessing additive interaction based on replacing risk ratios by odds ratios in formulas that were derived using the former may be erroneous. In this paper, we evaluate the extent to which three measures of additive interaction - the interaction contrast ratio (ICR), the attributable proportion due to interaction (AP), and the synergy index (S), estimated using odds ratios versus using risk ratios differ as the incidence of the outcome of interest increases in the source population and/or as the magnitude of interaction increases. Our analysis shows that the difference between the two depends on the measure of interaction used, the type of interaction present, and the baseline incidence of the outcome. Substituting odds ratios for risk ratios, when calculating measures of additive interaction, may result in misleading conclusions. Of the three measures, AP appears to be the most robust to this direct substitution. Formulas that use stratum specific odds and odds ratios to accurately calculate measures of additive interaction are presented.
{"title":"Measuring additive interaction using odds ratios.","authors":"Linda Kalilani, Julius Atashili","doi":"10.1186/1742-5573-3-5","DOIUrl":"https://doi.org/10.1186/1742-5573-3-5","url":null,"abstract":"<p><p>Interaction measured on the additive scale has been argued to be better correlated with biologic interaction than when measured on the multiplicative scale. Measures of interaction on the additive scale have been developed using risk ratios. However, in studies that use odds ratios as the sole measure of effect, the calculation of these measures of additive interaction is usually performed by directly substituting odds ratios for risk ratios. Yet assessing additive interaction based on replacing risk ratios by odds ratios in formulas that were derived using the former may be erroneous. In this paper, we evaluate the extent to which three measures of additive interaction - the interaction contrast ratio (ICR), the attributable proportion due to interaction (AP), and the synergy index (S), estimated using odds ratios versus using risk ratios differ as the incidence of the outcome of interest increases in the source population and/or as the magnitude of interaction increases. Our analysis shows that the difference between the two depends on the measure of interaction used, the type of interaction present, and the baseline incidence of the outcome. Substituting odds ratios for risk ratios, when calculating measures of additive interaction, may result in misleading conclusions. Of the three measures, AP appears to be the most robust to this direct substitution. Formulas that use stratum specific odds and odds ratios to accurately calculate measures of additive interaction are presented.</p>","PeriodicalId":87082,"journal":{"name":"Epidemiologic perspectives & innovations : EP+I","volume":"3 ","pages":"5"},"PeriodicalIF":0.0,"publicationDate":"2006-04-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/1742-5573-3-5","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"25978411","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Janet F Grant, Catherine R Chittleborough, Anne W Taylor, Eleonora Dal Grande, David H Wilson, Patrick J Phillips, Robert J Adams, Julianne Cheek, Kay Price, Tiffany Gill, Richard E Ruffin
The North West Adelaide Health Study is a population-based biomedical cohort study investigating the prevalence of a number of chronic conditions and health-related risk factors along a continuum. This methodology may assist with evidence-based decisions for health policy makers and planners, and inform health professionals who are involved in chronic disease prevention and management, by providing a better description of people at risk of developing or already diagnosed with selected chronic conditions for more accurate targeting groups for health gain and improved health outcomes. Longitudinal data will provide information on progression of chronic conditions and allow description of those who move forward and back along the continuum over time. Detailed methods are provided regarding the random recruitment and examination of a representative sample of participants (n = 4060), including the rationale for various processes and valuable lessons learnt. Self-reported and biomedical data were obtained on risk factors (smoking, alcohol consumption, physical activity, family history, body mass index, blood pressure, cholesterol) and chronic conditions (asthma, chronic obstructive pulmonary disease, diabetes) to classify participants according to their status along a continuum. Segmenting this population sample along a continuum showed that 71.5% had at least one risk factor for developing asthma, chronic obstructive pulmonary disease or diabetes. Almost one-fifth (18.8%) had been previously diagnosed with at least one of these chronic conditions, and an additional 3.9% had at least one of these conditions but had not been diagnosed. This paper provides a novel opportunity to examine how a cohort study was born. It presents detailed methodology behind the selection, recruitment and examination of a cohort and how participants with selected chronic conditions can be segmented along a continuum that may assist with health promotion and health services planning.
{"title":"The North West Adelaide Health Study: detailed methods and baseline segmentation of a cohort for selected chronic diseases.","authors":"Janet F Grant, Catherine R Chittleborough, Anne W Taylor, Eleonora Dal Grande, David H Wilson, Patrick J Phillips, Robert J Adams, Julianne Cheek, Kay Price, Tiffany Gill, Richard E Ruffin","doi":"10.1186/1742-5573-3-4","DOIUrl":"https://doi.org/10.1186/1742-5573-3-4","url":null,"abstract":"<p><p>The North West Adelaide Health Study is a population-based biomedical cohort study investigating the prevalence of a number of chronic conditions and health-related risk factors along a continuum. This methodology may assist with evidence-based decisions for health policy makers and planners, and inform health professionals who are involved in chronic disease prevention and management, by providing a better description of people at risk of developing or already diagnosed with selected chronic conditions for more accurate targeting groups for health gain and improved health outcomes. Longitudinal data will provide information on progression of chronic conditions and allow description of those who move forward and back along the continuum over time. Detailed methods are provided regarding the random recruitment and examination of a representative sample of participants (n = 4060), including the rationale for various processes and valuable lessons learnt. Self-reported and biomedical data were obtained on risk factors (smoking, alcohol consumption, physical activity, family history, body mass index, blood pressure, cholesterol) and chronic conditions (asthma, chronic obstructive pulmonary disease, diabetes) to classify participants according to their status along a continuum. Segmenting this population sample along a continuum showed that 71.5% had at least one risk factor for developing asthma, chronic obstructive pulmonary disease or diabetes. Almost one-fifth (18.8%) had been previously diagnosed with at least one of these chronic conditions, and an additional 3.9% had at least one of these conditions but had not been diagnosed. This paper provides a novel opportunity to examine how a cohort study was born. It presents detailed methodology behind the selection, recruitment and examination of a cohort and how participants with selected chronic conditions can be segmented along a continuum that may assist with health promotion and health services planning.</p>","PeriodicalId":87082,"journal":{"name":"Epidemiologic perspectives & innovations : EP+I","volume":"3 ","pages":"4"},"PeriodicalIF":0.0,"publicationDate":"2006-04-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/1742-5573-3-4","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"25967160","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Donald J Slymen, Guadalupe X Ayala, Elva M Arredondo, John P Elder
Counting outcomes such as days of physical activity or servings of fruits and vegetables often have distributions that are highly skewed toward the right with a preponderance of zeros, posing analytical challenges. This paper demonstrates how such outcomes may be analyzed with several modifications to Poisson regression. Five regression models 1) Poisson, 2) overdispersed Poisson, 3) negative binomial, 4) zero-inflated Poisson (ZIP), and 5) zero-inflated negative binomial (ZINB) are fitted to data assessing predictors of vigorous physical activity (VPA) among Latina women. The models are described, and analytical and graphical approaches are discussed to aid in model selection. Poisson regression provided a poor fit where 82% of the subjects reported no days of VPA. The fit improved considerably with the negative binomial and ZIP models. There was little difference in fit between the ZIP and ZINB models. Overall, the ZIP model fit best. No days of VPA were associated with poorer self-reported health and less assimilation to Anglo culture, and marginally associated with increasing BMI. The intensity portion of the model suggested that increasing days of VPA were associated with more education, and marginally associated with increasing age. These underutilized models provide useful approaches for handling counting outcomes.
{"title":"A demonstration of modeling count data with an application to physical activity.","authors":"Donald J Slymen, Guadalupe X Ayala, Elva M Arredondo, John P Elder","doi":"10.1186/1742-5573-3-3","DOIUrl":"https://doi.org/10.1186/1742-5573-3-3","url":null,"abstract":"<p><p>Counting outcomes such as days of physical activity or servings of fruits and vegetables often have distributions that are highly skewed toward the right with a preponderance of zeros, posing analytical challenges. This paper demonstrates how such outcomes may be analyzed with several modifications to Poisson regression. Five regression models 1) Poisson, 2) overdispersed Poisson, 3) negative binomial, 4) zero-inflated Poisson (ZIP), and 5) zero-inflated negative binomial (ZINB) are fitted to data assessing predictors of vigorous physical activity (VPA) among Latina women. The models are described, and analytical and graphical approaches are discussed to aid in model selection. Poisson regression provided a poor fit where 82% of the subjects reported no days of VPA. The fit improved considerably with the negative binomial and ZIP models. There was little difference in fit between the ZIP and ZINB models. Overall, the ZIP model fit best. No days of VPA were associated with poorer self-reported health and less assimilation to Anglo culture, and marginally associated with increasing BMI. The intensity portion of the model suggested that increasing days of VPA were associated with more education, and marginally associated with increasing age. These underutilized models provide useful approaches for handling counting outcomes.</p>","PeriodicalId":87082,"journal":{"name":"Epidemiologic perspectives & innovations : EP+I","volume":"3 ","pages":"3"},"PeriodicalIF":0.0,"publicationDate":"2006-03-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/1742-5573-3-3","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"25918099","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
In a series of papers, Robins and colleagues describe inverse probability of treatment weighted (IPTW) estimation in marginal structural models (MSMs), a method of causal analysis of longitudinal data based on counterfactual principles. This family of statistical techniques is similar in concept to weighting of survey data, except that the weights are estimated using study data rather than defined so as to reflect sampling design and post-stratification to an external population. Several decades ago Miettinen described an elementary method of causal analysis of case-control data based on indirect standardization. In this paper we extend the Miettinen approach using ideas closely related to IPTW estimation in MSMs. The technique is illustrated using data from a case-control study of oral contraceptives and myocardial infarction.
{"title":"Causal analysis of case-control data.","authors":"Stephen C Newman","doi":"10.1186/1742-5573-3-2","DOIUrl":"10.1186/1742-5573-3-2","url":null,"abstract":"<p><p>In a series of papers, Robins and colleagues describe inverse probability of treatment weighted (IPTW) estimation in marginal structural models (MSMs), a method of causal analysis of longitudinal data based on counterfactual principles. This family of statistical techniques is similar in concept to weighting of survey data, except that the weights are estimated using study data rather than defined so as to reflect sampling design and post-stratification to an external population. Several decades ago Miettinen described an elementary method of causal analysis of case-control data based on indirect standardization. In this paper we extend the Miettinen approach using ideas closely related to IPTW estimation in MSMs. The technique is illustrated using data from a case-control study of oral contraceptives and myocardial infarction.</p>","PeriodicalId":87082,"journal":{"name":"Epidemiologic perspectives & innovations : EP+I","volume":"3 ","pages":"2"},"PeriodicalIF":0.0,"publicationDate":"2006-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/1742-5573-3-2","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"25824424","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}