Sarah Holmes Watkins, Christian Testa, Jarvis T Chen, Immaculata De Vivo, Andrew J Simpkin, Kate Tilling, Ana V Diez Roux, George Davey Smith, Pamela D Waterman, Matthew Suderman, Caroline Relton, Nancy Krieger
{"title":"表观遗传学时钟以及缺乏关于其开发对象的数据所产生的研究影响:对已报告和缺失的社会人口特征的审查。","authors":"Sarah Holmes Watkins, Christian Testa, Jarvis T Chen, Immaculata De Vivo, Andrew J Simpkin, Kate Tilling, Ana V Diez Roux, George Davey Smith, Pamela D Waterman, Matthew Suderman, Caroline Relton, Nancy Krieger","doi":"10.1093/eep/dvad005","DOIUrl":null,"url":null,"abstract":"<p><p>Epigenetic clocks are increasingly being used as a tool to assess the impact of a wide variety of phenotypes and exposures on healthy ageing, with a recent focus on social determinants of health. However, little attention has been paid to the sociodemographic characteristics of participants on whom these clocks have been based. Participant characteristics are important because sociodemographic and socioeconomic factors are known to be associated with both DNA methylation variation and healthy ageing. It is also well known that machine learning algorithms have the potential to exacerbate health inequities through the use of unrepresentative samples - prediction models may underperform in social groups that were poorly represented in the training data used to construct the model. To address this gap in the literature, we conducted a review of the sociodemographic characteristics of the participants whose data were used to construct 13 commonly used epigenetic clocks. We found that although some of the epigenetic clocks were created utilizing data provided by individuals from different ages, sexes/genders, and racialized groups, sociodemographic characteristics are generally poorly reported. Reported information is limited by inadequate conceptualization of the social dimensions and exposure implications of gender and racialized inequality, and socioeconomic data are infrequently reported. It is important for future work to ensure clear reporting of tangible data on the sociodemographic and socioeconomic characteristics of all the participants in the study to ensure that other researchers can make informed judgements about the appropriateness of the model for their study population.</p>","PeriodicalId":11774,"journal":{"name":"Environmental Epigenetics","volume":"9 1","pages":"dvad005"},"PeriodicalIF":4.8000,"publicationDate":"2023-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10411856/pdf/","citationCount":"0","resultStr":"{\"title\":\"Epigenetic clocks and research implications of the lack of data on whom they have been developed: a review of reported and missing sociodemographic characteristics.\",\"authors\":\"Sarah Holmes Watkins, Christian Testa, Jarvis T Chen, Immaculata De Vivo, Andrew J Simpkin, Kate Tilling, Ana V Diez Roux, George Davey Smith, Pamela D Waterman, Matthew Suderman, Caroline Relton, Nancy Krieger\",\"doi\":\"10.1093/eep/dvad005\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Epigenetic clocks are increasingly being used as a tool to assess the impact of a wide variety of phenotypes and exposures on healthy ageing, with a recent focus on social determinants of health. However, little attention has been paid to the sociodemographic characteristics of participants on whom these clocks have been based. Participant characteristics are important because sociodemographic and socioeconomic factors are known to be associated with both DNA methylation variation and healthy ageing. It is also well known that machine learning algorithms have the potential to exacerbate health inequities through the use of unrepresentative samples - prediction models may underperform in social groups that were poorly represented in the training data used to construct the model. To address this gap in the literature, we conducted a review of the sociodemographic characteristics of the participants whose data were used to construct 13 commonly used epigenetic clocks. We found that although some of the epigenetic clocks were created utilizing data provided by individuals from different ages, sexes/genders, and racialized groups, sociodemographic characteristics are generally poorly reported. Reported information is limited by inadequate conceptualization of the social dimensions and exposure implications of gender and racialized inequality, and socioeconomic data are infrequently reported. It is important for future work to ensure clear reporting of tangible data on the sociodemographic and socioeconomic characteristics of all the participants in the study to ensure that other researchers can make informed judgements about the appropriateness of the model for their study population.</p>\",\"PeriodicalId\":11774,\"journal\":{\"name\":\"Environmental Epigenetics\",\"volume\":\"9 1\",\"pages\":\"dvad005\"},\"PeriodicalIF\":4.8000,\"publicationDate\":\"2023-07-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10411856/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Environmental Epigenetics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1093/eep/dvad005\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2023/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q1\",\"JCRName\":\"GENETICS & HEREDITY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environmental Epigenetics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/eep/dvad005","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2023/1/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0
摘要
表观遗传时钟越来越多地被用作评估各种表型和暴露对健康老龄化影响的工具,最近的重点是健康的社会决定因素。然而,人们很少关注这些时钟所依据的参与者的社会人口特征。参与者特征非常重要,因为已知社会人口和社会经济因素与 DNA 甲基化变异和健康老龄化有关。同样众所周知的是,机器学习算法有可能通过使用不具代表性的样本而加剧健康不平等--预测模型在用于构建模型的训练数据中代表性较差的社会群体中可能表现不佳。为了弥补文献中的这一空白,我们对参与者的社会人口特征进行了回顾,这些参与者的数据被用来构建 13 个常用的表观遗传时钟。我们发现,尽管一些表观遗传时钟是利用不同年龄、性别和种族群体的个人提供的数据创建的,但社会人口学特征的报告普遍较少。由于对性别和种族不平等的社会层面和暴露影响概念化不足,报告的信息受到限制,社会经济数据也很少报告。在今后的工作中,必须确保清楚地报告研究中所有参与者的社会人口和社会经济特征的具体数据,以确保其他研究人员能够对该模型是否适合其研究人群做出明智的判断。
Epigenetic clocks and research implications of the lack of data on whom they have been developed: a review of reported and missing sociodemographic characteristics.
Epigenetic clocks are increasingly being used as a tool to assess the impact of a wide variety of phenotypes and exposures on healthy ageing, with a recent focus on social determinants of health. However, little attention has been paid to the sociodemographic characteristics of participants on whom these clocks have been based. Participant characteristics are important because sociodemographic and socioeconomic factors are known to be associated with both DNA methylation variation and healthy ageing. It is also well known that machine learning algorithms have the potential to exacerbate health inequities through the use of unrepresentative samples - prediction models may underperform in social groups that were poorly represented in the training data used to construct the model. To address this gap in the literature, we conducted a review of the sociodemographic characteristics of the participants whose data were used to construct 13 commonly used epigenetic clocks. We found that although some of the epigenetic clocks were created utilizing data provided by individuals from different ages, sexes/genders, and racialized groups, sociodemographic characteristics are generally poorly reported. Reported information is limited by inadequate conceptualization of the social dimensions and exposure implications of gender and racialized inequality, and socioeconomic data are infrequently reported. It is important for future work to ensure clear reporting of tangible data on the sociodemographic and socioeconomic characteristics of all the participants in the study to ensure that other researchers can make informed judgements about the appropriateness of the model for their study population.