Determining and visualising at-risk groups in case-control data.

Journal of epidemiology and biostatistics Pub Date : 2001-01-01 DOI:10.1080/13595220152601819

R. Marshall

{"title":"Determining and visualising at-risk groups in case-control data.","authors":"R. Marshall","doi":"10.1080/13595220152601819","DOIUrl":null,"url":null,"abstract":"BACKGROUND Case-control research is often exploratory; to determine factors that increase risk. Often, regression methods are used to determine combinations of risk factors that predispose to excess risk. Recently, tree-based methods have also been proposed. Both have limitations. An alternative approach is suggested, based on a search algorithm to identify at-risk subgroups. METHODS Statistical methods to determine and visualise at-risk sub-groups in case-control studies are presented. The method of determining sub-groups--search partition analysis (SPAN)--searches among different Boolean combinations of risk factors. Sub-groups that have been identified are visualised by scaled rectangle diagrams. These show the size of sub-groups and the extent to which they overlap. RESULTS Theory is presented for applying the method to case-control data. The methods are illustrated by analysis of three case-control studies: one on sudden infant death syndrome, a second on heart disease and a third on child pedestrian injuries. CONCLUSIONS The methods provide a useful alternative to regression and tree-based analysis. They demarcate subgroups that, in the three examples, are easy to interpret and would not have been found by other methods. Scaled rectangle diagrams are a useful way to visualise the results.","PeriodicalId":80024,"journal":{"name":"Journal of epidemiology and biostatistics","volume":"6 4 1","pages":"343-8"},"PeriodicalIF":0.0000,"publicationDate":"2001-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of epidemiology and biostatistics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1080/13595220152601819","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

Abstract

BACKGROUND Case-control research is often exploratory; to determine factors that increase risk. Often, regression methods are used to determine combinations of risk factors that predispose to excess risk. Recently, tree-based methods have also been proposed. Both have limitations. An alternative approach is suggested, based on a search algorithm to identify at-risk subgroups. METHODS Statistical methods to determine and visualise at-risk sub-groups in case-control studies are presented. The method of determining sub-groups--search partition analysis (SPAN)--searches among different Boolean combinations of risk factors. Sub-groups that have been identified are visualised by scaled rectangle diagrams. These show the size of sub-groups and the extent to which they overlap. RESULTS Theory is presented for applying the method to case-control data. The methods are illustrated by analysis of three case-control studies: one on sudden infant death syndrome, a second on heart disease and a third on child pedestrian injuries. CONCLUSIONS The methods provide a useful alternative to regression and tree-based analysis. They demarcate subgroups that, in the three examples, are easy to interpret and would not have been found by other methods. Scaled rectangle diagrams are a useful way to visualise the results.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

在病例对照数据中确定和可视化高危人群。

病例对照研究往往是探索性的;确定增加风险的因素。通常，回归方法用于确定易导致过度风险的风险因素的组合。最近，基于树的方法也被提出。两者都有局限性。提出了另一种方法，基于搜索算法来识别有风险的子群体。方法介绍了病例对照研究中确定和可视化高危亚组的统计方法。确定子群的方法——搜索分区分析(SPAN)——在不同的风险因素布尔组合中搜索。已确定的子组通过缩放矩形图可视化。这些显示了子群体的大小和它们重叠的程度。结果为该方法在病例对照资料中的应用提供了理论依据。这些方法是通过对三个病例对照研究的分析来说明的:一个是关于婴儿猝死综合症的，第二个是关于心脏病的，第三个是关于儿童行人伤害的。结论该方法是一种有效的替代回归分析和树分析法的方法。在上述三个例子中，它们划分的子群体很容易解释，用其他方法是找不到的。缩放矩形图是可视化结果的有效方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Journal of epidemiology and biostatistics

自引率

0.00%

发文量

期刊最新文献

Statistical Inference Population, Exposure, and Outcome Measures of Disease Frequency Practice Problem Workbook Solutions Case Reports and Case Series