{"title":"识别和鉴定序列群中的偏差案例:原因与方法","authors":"Raffaella Piccarreta, Emanuela Struffolino","doi":"10.1007/s10680-023-09682-3","DOIUrl":null,"url":null,"abstract":"<p><p>Sequence analysis is employed in different fields-e.g., demography, sociology, and political sciences-to describe longitudinal processes represented as sequences of categorical states. In many applications, sequences are clustered to identify relevant types, which reflect the different empirical realisations of the temporal process under study. We explore criteria to inspect internal cluster composition and to detect deviant sequences, that is, cases characterised by rare patterns or outliers that might compromise cluster homogeneity. We also introduce tools to visualise and distinguish the features of regular and deviant cases. Our proposals offer a more accurate and granular description of the data structure, by identifying-besides the most typical types-peculiar sequences that might be interesting from a substantive and theoretical point of view. This analysis could be very useful in applications where-under the assumption of within homogeneity-clusters are used as outcome or explanatory variables in regressions. We demonstrate the added value of our proposal in a motivating application from life-course socio-demography, focusing on Italian women's employment trajectories and on their link with their mothers' participation in the labour market across geographical areas.</p>","PeriodicalId":51496,"journal":{"name":"European Journal of Population-Revue Europeenne De Demographie","volume":"40 1","pages":"1"},"PeriodicalIF":1.9000,"publicationDate":"2023-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10730788/pdf/","citationCount":"0","resultStr":"{\"title\":\"Identifying and Qualifying Deviant Cases in Clusters of Sequences: The Why and The How.\",\"authors\":\"Raffaella Piccarreta, Emanuela Struffolino\",\"doi\":\"10.1007/s10680-023-09682-3\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Sequence analysis is employed in different fields-e.g., demography, sociology, and political sciences-to describe longitudinal processes represented as sequences of categorical states. In many applications, sequences are clustered to identify relevant types, which reflect the different empirical realisations of the temporal process under study. We explore criteria to inspect internal cluster composition and to detect deviant sequences, that is, cases characterised by rare patterns or outliers that might compromise cluster homogeneity. We also introduce tools to visualise and distinguish the features of regular and deviant cases. Our proposals offer a more accurate and granular description of the data structure, by identifying-besides the most typical types-peculiar sequences that might be interesting from a substantive and theoretical point of view. This analysis could be very useful in applications where-under the assumption of within homogeneity-clusters are used as outcome or explanatory variables in regressions. We demonstrate the added value of our proposal in a motivating application from life-course socio-demography, focusing on Italian women's employment trajectories and on their link with their mothers' participation in the labour market across geographical areas.</p>\",\"PeriodicalId\":51496,\"journal\":{\"name\":\"European Journal of Population-Revue Europeenne De Demographie\",\"volume\":\"40 1\",\"pages\":\"1\"},\"PeriodicalIF\":1.9000,\"publicationDate\":\"2023-12-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10730788/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"European Journal of Population-Revue Europeenne De Demographie\",\"FirstCategoryId\":\"90\",\"ListUrlMain\":\"https://doi.org/10.1007/s10680-023-09682-3\",\"RegionNum\":2,\"RegionCategory\":\"社会学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"DEMOGRAPHY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"European Journal of Population-Revue Europeenne De Demographie","FirstCategoryId":"90","ListUrlMain":"https://doi.org/10.1007/s10680-023-09682-3","RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"DEMOGRAPHY","Score":null,"Total":0}
Identifying and Qualifying Deviant Cases in Clusters of Sequences: The Why and The How.
Sequence analysis is employed in different fields-e.g., demography, sociology, and political sciences-to describe longitudinal processes represented as sequences of categorical states. In many applications, sequences are clustered to identify relevant types, which reflect the different empirical realisations of the temporal process under study. We explore criteria to inspect internal cluster composition and to detect deviant sequences, that is, cases characterised by rare patterns or outliers that might compromise cluster homogeneity. We also introduce tools to visualise and distinguish the features of regular and deviant cases. Our proposals offer a more accurate and granular description of the data structure, by identifying-besides the most typical types-peculiar sequences that might be interesting from a substantive and theoretical point of view. This analysis could be very useful in applications where-under the assumption of within homogeneity-clusters are used as outcome or explanatory variables in regressions. We demonstrate the added value of our proposal in a motivating application from life-course socio-demography, focusing on Italian women's employment trajectories and on their link with their mothers' participation in the labour market across geographical areas.
期刊介绍:
European Journal of Population addresses a broad public of researchers, policy makers and others concerned with population processes and their consequences. Its aim is to improve understanding of population phenomena by giving priority to work that contributes to the development of theory and method, and that spans the boundaries between demography and such disciplines as sociology, anthropology, economics, geography, history, political science, epidemiology and other sciences contributing to public health. The Journal is open to authors from all over the world, and its articles cover European and non-European countries (specifically including developing countries) alike.