{"title":"Marginal Log‐linear Parameters and their Collapsibility for Categorical Data","authors":"S. Ghosh, P. Vellaisamy","doi":"10.1111/stan.12332","DOIUrl":null,"url":null,"abstract":"Collapsibility is a practical and useful technique for dimension reduction in multidimensional contingency tables. In this paper, we consider marginal log‐linear models for studying collapsibility and related aspects in such tables. These models generalize ordinary log‐linear and multivariate logistic models, besides several others. First, we obtain some characteristic properties of marginal log‐linear parameters. Then we define collapsibility and strict collapsibility of these parameters in a general sense. Several necessary and sufficient conditions for collapsibility and strict collapsibility are derived based on simple functions of only the cell probabilities, which are easily verifiable. These include results for an arbitrary set of marginal log‐linear parameters having some common effects. The connections of strict collapsibility to various forms of independence of the variables are explored. We analyze some real‐life datasets to illustrate the above results on collapsibility and strict collapsibility. Finally, we obtain a result relating parameters with the same effect, but different margins for an arbitrary table, and demonstrate smoothness of marginal log‐linear models under collapsibility conditions. This article is protected by copyright. All rights reserved.","PeriodicalId":51178,"journal":{"name":"Statistica Neerlandica","volume":"11 2","pages":"0"},"PeriodicalIF":1.4000,"publicationDate":"2023-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Statistica Neerlandica","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1111/stan.12332","RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"STATISTICS & PROBABILITY","Score":null,"Total":0}
引用次数: 1
Abstract
Collapsibility is a practical and useful technique for dimension reduction in multidimensional contingency tables. In this paper, we consider marginal log‐linear models for studying collapsibility and related aspects in such tables. These models generalize ordinary log‐linear and multivariate logistic models, besides several others. First, we obtain some characteristic properties of marginal log‐linear parameters. Then we define collapsibility and strict collapsibility of these parameters in a general sense. Several necessary and sufficient conditions for collapsibility and strict collapsibility are derived based on simple functions of only the cell probabilities, which are easily verifiable. These include results for an arbitrary set of marginal log‐linear parameters having some common effects. The connections of strict collapsibility to various forms of independence of the variables are explored. We analyze some real‐life datasets to illustrate the above results on collapsibility and strict collapsibility. Finally, we obtain a result relating parameters with the same effect, but different margins for an arbitrary table, and demonstrate smoothness of marginal log‐linear models under collapsibility conditions. This article is protected by copyright. All rights reserved.
期刊介绍:
Statistica Neerlandica has been the journal of the Netherlands Society for Statistics and Operations Research since 1946. It covers all areas of statistics, from theoretical to applied, with a special emphasis on mathematical statistics, statistics for the behavioural sciences and biostatistics. This wide scope is reflected by the expertise of the journal’s editors representing these areas. The diverse editorial board is committed to a fast and fair reviewing process, and will judge submissions on quality, correctness, relevance and originality. Statistica Neerlandica encourages transparency and reproducibility, and offers online resources to make data, code, simulation results and other additional materials publicly available.