Development trends of etiological research contents and methods of noncommunicable diseases

Dafang Chen, Yujia Ma, Han Xiao, Zeyu Yan
{"title":"Development trends of etiological research contents and methods of noncommunicable diseases","authors":"Dafang Chen,&nbsp;Yujia Ma,&nbsp;Han Xiao,&nbsp;Zeyu Yan","doi":"10.1002/hcs2.69","DOIUrl":null,"url":null,"abstract":"<p>Noncommunicable diseases (NCDs) are a significant public concern, greatly impacting the economic and social development in China. In 2019, NCDs accounted for a staggering 88.5% of total deaths in China, with cardiovascular diseases, cancer, chronic respiratory diseases, and diabetes—the four major chronic diseases—contributing to a premature mortality rate of 16.5% [<span>1</span>]. The complexity of NCDs arises from the involvement of multiple genetic and environmental factors that interact in intricate ways. The complexity is characterized by a multitude of interactions among genes, proteins, and metabolic pathways throughout the various stages of life. Furthermore, these interactions demonstrate time-dependent specificity during the different phases of the life course. Prior research on the etiology of NCDs tended to focus on “specificity,” which overlooked the concept of “universality.” Studies are often conducted from one risk factor, one disease, or one dimension, leading to an insufficient understanding of NCD etiology and less than satisfactory outcomes in prevention and control efforts. Therefore, the aim of this review is to highlight and propose a new trend in NCD etiology research, considering the research focus and research methodology.</p><p>The relationships among NCDs are intricate, and patients often show distinct patterns of multiple diseases, reflecting population heterogeneity in comorbidity. The study of comorbidity patterns among populations affected by NCDs can offer valuable insights for developing effective prevention and management strategies. In a retrospective study by Jansana et al. [<span>2</span>] using electronic health records, five multimorbidity clusters were identified among breast cancer survivors in Spain; notably, the “musculoskeletal and cardiovascular disease” pattern showed a significantly higher risk of mortality than other NCDs. Advancements in computational science contribute to the emergence of network analysis based on graph theory as a powerful tool for understanding the complexity of comorbidity from a holistic and systemic perspective. Graph theory in network analysis facilitates the construction of comorbidity networks in which disease status is represented as nodes and risk associations are shown as edges, thereby visualizing the co-occurrence of diseases in a concise and intuitive manner. Such topological approaches enable the prioritization of disease severity and identification of the core disease within a comorbidity network. Furthermore, network clustering techniques have been applied to identify specific comorbidity patterns in NCDs. However, cautiousness in interpreting the identified patterns is essential because some network topology indexes may lack practical significance. The challenge in interpreting the identified patterns can be addressed by considering association rules. Typically, association rule mining is used to identify comorbidity patterns, and network analysis is used to visualize and determine the core diseases within a comorbidity network. For example, Hernández et al. [<span>3</span>] discovered several comorbidity patterns in Irish adults using association rules and subsequently found that high cholesterol, hypertension, and arthritis had the highest number of associations with other medical conditions by network analysis, designating them as the core diseases in the comorbidity network.</p><p>The development of NCDs is a prolonged and gradual process characterized by the accumulation of risks over time. The intricate variations during disease progression mean that patients may have different trajectories leading to the same disease pattern. It is crucial to consider the temporal characteristics of the progression of each disease component, even within a specific comorbidity pattern. Identifying disease trajectories at the population level is vital for preventing comorbidity among specific NCD populations and provides essential epidemiological evidence for understanding the etiology of comorbidity, making comorbidity trajectory research a current research hotspot. Jensen et al. [<span>4</span>] conducted a discovery-driven analysis of temporal disease progression patterns using data from an electronic health registry that covered the entire population of Denmark. They identified 1171 significant trajectories and grouped them into patterns centered on key diagnoses, such as chronic obstructive pulmonary disease and gout, which was critical to disease progression and early diagnosis to mitigate adverse outcomes. Comorbidity trajectory research in the general population poses challenges in study design, data analysis, and result interpretation, and therefore such research is often carried out in populations with a specific disease, thereby simplifying the design, data analysis, and result interpretation. For example, Jeong et al. [<span>5</span>] investigated type 2 diabetes using population-wide claim data in a nested case‒control study design. They constructed time-dependent type 2 diabetes trajectories and formed a comorbidity development network of patients with type 2 diabetes, then calculated the relative risk of progression from type 2 diabetes to other diseases. Several less-reported comorbidities, such as depression and hearing impairment, as well as time-critical associations between type 2 diabetes and other diseases, were discovered in the Jeong et al. [<span>5</span>] study. Their findings were beneficial for improving disease management for patients with type 2 diabetes.</p><p>Comorbidity of NCDs suggests there may be shared genetic architecture and environmental risk factors among these conditions. At the exposome level, the shared risk factors for comorbidity can be classified as: (1) general external exposure, including climate, social, economic, and psychological factors; (2) specific external exposure, such as occupational factors and individual environmental factors, such as lifestyle behaviors, dietary preferences, and medical interventions; and (3) internal exposure to endogenous substances generated by pleiotropic effectors at the molecular, cellular, tissue, and organ levels [<span>6, 7</span>]. For metabolic disorders, Guo et al. [<span>8</span>] proposed that multiple core pathological processes, including neuroendocrine disruption, insulin resistance, oxidative stress, chronic inflammatory response, and gut microbiome dysbiosis, interconnect and contribute to the onset and progression of metabolic diseases, forming a “multiple hit” scenario. Identifying potential comorbidity mechanisms and shared environmental risk factors and implementing rational preventive measures are effective strategies to reduce the disease burden of NCDs. Pietzner et al. [<span>9</span>] integrated electronic medical records with plasma metabolomics data to construct an “NCDs–clinical risk factors–metabolites” network and identified shared pathways among obesity, smoking, impaired glucose homeostasis, inflammation, lipoprotein metabolism, liver function, and kidney function, showing the potential for “network for comorbidity prevention” approaches. Similarly, Li et al. [<span>10</span>] explored pleiotropic drug targets between heart failure and five prevalent chronic diseases (diabetes, obesity, chronic obstructive pulmonary disease, chronic kidney disease, and obstructive sleep apnea) using public databases. They found that the PI3K/AKT pathway played a crucial role and identified potential drugs, such as sodium-glucose cotransporter-2 inhibitors, IL-1β inhibitors, and metformin, which could be used simultaneously by network analysis. A “disease–genetics–environment” network can help researchers (1) identify shared pathogenic pathways among important comorbidities and provide evidence for investigating underlying mechanisms and potential therapeutic strategies for comorbid conditions and (2) identify modifiable environmental factors associated with comorbidities and offer feasible interventions. These two objectives are key for the development of research on comorbidity patterns in NCDs.</p><p>The “omics” technologies that were inspired by the Human Genome Project have brought about a paradigm shift in research and provided crucial technical support for the concept of holism. This transformative development disrupts the traditional fragmented approach of the conventional “single-factor versus single-outcome” paradigm and moves it to the more comprehensive “multi-factor versus multi-phenotype” paradigm. This profound shift in perspective expands the scope of research, allowing organisms to be studied holistically and revolutionizing the understanding of life from a simplistic viewpoint to one that acknowledges its inherent complexity [<span>11</span>].</p><p>The etiology of NCDs exhibits remarkable adaptability and self-organization, and has three main attributes. (1) Pleiotropy, where a single molecule can give rise to multiple phenotypes. Hu et al. [<span>12</span>] proposed five models to explain the mapping between genotype and phenotype, which provided a theoretical basis for studying NCD etiology from a pleiotropic perspective. (2) Robustness, where the original function of a molecule is maintained under internal and external perturbations. The interactions are often governed by nonlinear and dynamic control, whereas specific proteins, such as chaperones, act as adaptive mechanisms to buffer the impact of disturbances [<span>13, 14</span>]. (3) Rewiring, the inherent restructuring of interactions between biological units in response to conditional change, where adaptive modifications in intrinsic interactions are involved. These distinct attributes of NCD etiology necessitate a comprehensive analysis from a multilevel and multifactorial perspective. Technological limitations meant that simplistic “one-versus-one” approaches were used to study the etiology of NCDs; however, the newer omics technologies have helped to overcome these limitations and allowed trans-omics network analysis to emerge as a promising approach [<span>15</span>]. In trans-omics analysis, a group of biological molecules or phenotypes is treated as a single variable that is integrated as an information layer, forming a multilevel structural database. This approach enables an objective and comprehensive reconstruction of the intricate network that connects the human genome, exposome, and phenome within the human body. Given the complexity of the network, biological network models have become the preferred choice [<span>16</span>]. Such models abandon the singular perspective of studying disease etiology from a single molecular or omics level and instead use bioinformatics and computational techniques to discover interactions between molecules, thereby establishing high-dimensional internal connections between different types of biological data layers. This approach results in the formation of a complex molecular information network and aligns with the principles of systematic biology [<span>17</span>]. Consequently, identifying pathogenic pathways based on trans-omics data and constructing etiological networks have become indispensable in unraveling the underlying causes of NCDs [<span>16, 18, 19</span>]. Bodein et al. [<span>20</span>] made full use of longitudinal data from transcriptomics, proteomics, and metabolomics to construct single omics networks, then used network propagation via a random walk to establish regulatory networks between multiple omics layers. By identifying inter-omics interactions that are not captured by single omics analysis, they discovered two core dynamic biological clusters that connected the etiological network of diabetes with renal tubular acidosis and restless leg syndrome. This breakthrough discovery provides new insights into the potential mechanisms and interactions underlying the onset and progression of diabetes.</p><p>Network comparison is essential to obtain statistical evidence of pathogenic networks and pathways. There are two typical analysis strategies for network comparison. (1) Hypothesis-driven strategy, where a comprehensive understanding of the physiological, biochemical, and pathological mechanisms of the disease of interest is required. Based on a priori understanding from previous cellular experiments, animal studies, or omics analyses, a reasonable hypothetical pathogenic network/pathway is outlined in advance. Subsequently, intergroup differences and effects of the network/pathway nodes are examined at the population level to assess the validity and practicality of the initial hypothesis-based pathogenic network/pathway in the population. (2) Data-driven strategy, where high-throughput omics markers are acquired at the population level without any predefined hypothesis. Systematic biology methods are used to construct a network connecting exposure factors, biomarkers, and disease endpoints. Intergroup differences and effects of the network/pathway are evaluated at the population level and used to provide a basis for further experimental validation, drug target identification, and the development of prevention or treatment measures [<span>21</span>]. Ji et al. [<span>22</span>] proposed a powerful score-based statistical test (NetDifM) to measure group differences in weighted biological networks. They successfully captured differences in gene expression networks between patients with ovarian cancer and healthy controls and identified pathogenic PI3K-AKT signaling pathways, Notch signaling pathways, and their downstream subnetworks.</p><p>In etiological research on NCDs, the dynamic attributes of complex biological systems necessitate temporality and high dimensionality jointly, which indicates the need to investigate the metabolic characteristics of disease occurrence and development across the whole life course. Currently, NCD research has focused mainly on adults; for example, the Framingham study of a population aged 28–74 years [<span>23</span>], the UK Biobank study of a population aged 37–85 years [<span>24</span>], and the China Chronic Disease Prospective study of a population aged 30–79 years [<span>25</span>]. However, the Developmental Origins of Health and Disease theory proposes that maternal nutrition and environmental exposures during pregnancy may affect the risk of the offspring developing NCDs in adulthood [<span>26, 27</span>]. This proposal suggests that etiological research on NCDs should transition from a stage-specific perspective predominantly focused on adults to a life course approach that encompasses pregnancy, childhood, adolescence, youth, middle age, and old age to identify risk factors associated with the development of NCDs and other health outcomes across the entire lifespan, known as life course epidemiology [<span>28</span>].</p><p>Life course epidemiology consists mainly of a risk accumulation model and a critical period model. The risk accumulation model assumes that risk factors, such as environmental exposures, socioeconomic status, and behavioral factors, independently or synergistically have long-term effects on health. Thus, this model focuses on the accumulation and clustering of exposures because diseases are associated not only with individual exposure but also with household exposure and socioeconomic status [<span>28, 29</span>]. The critical period model emphasizes that biological programming during critical developmental periods may be modified by later physiological or psychological stress [<span>28, 29</span>]. Trajectory analysis is a commonly used longitudinal data processing method in life course epidemiology. Trajectory analysis methods are used to fit growth trajectories to individual exposure data with repeated longitudinal measurements, identify subgroups with potentially different growth trajectories within a population, describe trends in exposure factor growth curves collectively and individually, and explore the cumulative effects and critical/sensitive periods of exposure on disease occurrence and development by analyzing growth curve parameters [<span>30, 31</span>]. Conventional trajectory analysis methods include growth curve fitting based on Z scores, multilevel modeling, group-based trajectory modeling, and latent class mixed effects models [<span>32, 33</span>]. Zhang et al. [<span>34</span>] performed life course trajectory analysis and mediation analysis to quantify the life course cumulative burden of childhood to adulthood obesity and showed that the adverse effects of obesity on cardiovascular health began in childhood and accumulated over the life course. This study provides new evidence for the early-life origins of cardiovascular disease and has significant implications for formulating early prevention strategies and measures related to obesity-associated atherosclerosis.</p><p>In summary, previous research on the etiology of NCDs, despite integrating information from multiple omics, has focused mainly on “one-versus-one” associations between single molecular biomarkers and the occurrence of a single disease (specificity). Regarding the causal relationships between diseases and health, Professor Chen noted that “Any study on causal relationships is, in fact, extracted from a complex, interdependent network of relationships, and represents a relationship which we conceive may exist” [<span>35, 36</span>]. Therefore, research on NCD etiology should strive to elucidate the complex and interdependent underlying network. Trans-omics causal network studies integrate omics data into a holistic system to discover multidimensional, multilevel, and multitime point interactions between trans-omics networks and phenotype networks. Given our understanding of the characteristics of NCD etiology, we believe that the perspective of research on NCD etiology needs to shift from local to systemic and from single biology to systemic biology. Establishing a complete functional atlas between genes and phenotypes throughout the entire life course, known as a genotype–phenotype map, will be essential for a comprehensive understanding of the etiology of NCDs.</p><p><b>Dafang Chen</b>: Conceptualization (lead); funding acquisition (lead); project administration (lead); supervision (lead). <b>Yujia Ma</b>: Investigation (lead); methodology (lead); writing—original draft (lead); writing—review &amp; editing (lead). <b>Han Xiao</b>: Investigation (equal). <b>Zeyu Yan</b>: Investigation (equal).</p><p>The authors declare no conflict of interest.</p><p>Not applicable.</p><p>Not applicable.</p>","PeriodicalId":100601,"journal":{"name":"Health Care Science","volume":"2 5","pages":"352-357"},"PeriodicalIF":0.0000,"publicationDate":"2023-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/hcs2.69","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Health Care Science","FirstCategoryId":"1085","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/hcs2.69","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Noncommunicable diseases (NCDs) are a significant public concern, greatly impacting the economic and social development in China. In 2019, NCDs accounted for a staggering 88.5% of total deaths in China, with cardiovascular diseases, cancer, chronic respiratory diseases, and diabetes—the four major chronic diseases—contributing to a premature mortality rate of 16.5% [1]. The complexity of NCDs arises from the involvement of multiple genetic and environmental factors that interact in intricate ways. The complexity is characterized by a multitude of interactions among genes, proteins, and metabolic pathways throughout the various stages of life. Furthermore, these interactions demonstrate time-dependent specificity during the different phases of the life course. Prior research on the etiology of NCDs tended to focus on “specificity,” which overlooked the concept of “universality.” Studies are often conducted from one risk factor, one disease, or one dimension, leading to an insufficient understanding of NCD etiology and less than satisfactory outcomes in prevention and control efforts. Therefore, the aim of this review is to highlight and propose a new trend in NCD etiology research, considering the research focus and research methodology.

The relationships among NCDs are intricate, and patients often show distinct patterns of multiple diseases, reflecting population heterogeneity in comorbidity. The study of comorbidity patterns among populations affected by NCDs can offer valuable insights for developing effective prevention and management strategies. In a retrospective study by Jansana et al. [2] using electronic health records, five multimorbidity clusters were identified among breast cancer survivors in Spain; notably, the “musculoskeletal and cardiovascular disease” pattern showed a significantly higher risk of mortality than other NCDs. Advancements in computational science contribute to the emergence of network analysis based on graph theory as a powerful tool for understanding the complexity of comorbidity from a holistic and systemic perspective. Graph theory in network analysis facilitates the construction of comorbidity networks in which disease status is represented as nodes and risk associations are shown as edges, thereby visualizing the co-occurrence of diseases in a concise and intuitive manner. Such topological approaches enable the prioritization of disease severity and identification of the core disease within a comorbidity network. Furthermore, network clustering techniques have been applied to identify specific comorbidity patterns in NCDs. However, cautiousness in interpreting the identified patterns is essential because some network topology indexes may lack practical significance. The challenge in interpreting the identified patterns can be addressed by considering association rules. Typically, association rule mining is used to identify comorbidity patterns, and network analysis is used to visualize and determine the core diseases within a comorbidity network. For example, Hernández et al. [3] discovered several comorbidity patterns in Irish adults using association rules and subsequently found that high cholesterol, hypertension, and arthritis had the highest number of associations with other medical conditions by network analysis, designating them as the core diseases in the comorbidity network.

The development of NCDs is a prolonged and gradual process characterized by the accumulation of risks over time. The intricate variations during disease progression mean that patients may have different trajectories leading to the same disease pattern. It is crucial to consider the temporal characteristics of the progression of each disease component, even within a specific comorbidity pattern. Identifying disease trajectories at the population level is vital for preventing comorbidity among specific NCD populations and provides essential epidemiological evidence for understanding the etiology of comorbidity, making comorbidity trajectory research a current research hotspot. Jensen et al. [4] conducted a discovery-driven analysis of temporal disease progression patterns using data from an electronic health registry that covered the entire population of Denmark. They identified 1171 significant trajectories and grouped them into patterns centered on key diagnoses, such as chronic obstructive pulmonary disease and gout, which was critical to disease progression and early diagnosis to mitigate adverse outcomes. Comorbidity trajectory research in the general population poses challenges in study design, data analysis, and result interpretation, and therefore such research is often carried out in populations with a specific disease, thereby simplifying the design, data analysis, and result interpretation. For example, Jeong et al. [5] investigated type 2 diabetes using population-wide claim data in a nested case‒control study design. They constructed time-dependent type 2 diabetes trajectories and formed a comorbidity development network of patients with type 2 diabetes, then calculated the relative risk of progression from type 2 diabetes to other diseases. Several less-reported comorbidities, such as depression and hearing impairment, as well as time-critical associations between type 2 diabetes and other diseases, were discovered in the Jeong et al. [5] study. Their findings were beneficial for improving disease management for patients with type 2 diabetes.

Comorbidity of NCDs suggests there may be shared genetic architecture and environmental risk factors among these conditions. At the exposome level, the shared risk factors for comorbidity can be classified as: (1) general external exposure, including climate, social, economic, and psychological factors; (2) specific external exposure, such as occupational factors and individual environmental factors, such as lifestyle behaviors, dietary preferences, and medical interventions; and (3) internal exposure to endogenous substances generated by pleiotropic effectors at the molecular, cellular, tissue, and organ levels [6, 7]. For metabolic disorders, Guo et al. [8] proposed that multiple core pathological processes, including neuroendocrine disruption, insulin resistance, oxidative stress, chronic inflammatory response, and gut microbiome dysbiosis, interconnect and contribute to the onset and progression of metabolic diseases, forming a “multiple hit” scenario. Identifying potential comorbidity mechanisms and shared environmental risk factors and implementing rational preventive measures are effective strategies to reduce the disease burden of NCDs. Pietzner et al. [9] integrated electronic medical records with plasma metabolomics data to construct an “NCDs–clinical risk factors–metabolites” network and identified shared pathways among obesity, smoking, impaired glucose homeostasis, inflammation, lipoprotein metabolism, liver function, and kidney function, showing the potential for “network for comorbidity prevention” approaches. Similarly, Li et al. [10] explored pleiotropic drug targets between heart failure and five prevalent chronic diseases (diabetes, obesity, chronic obstructive pulmonary disease, chronic kidney disease, and obstructive sleep apnea) using public databases. They found that the PI3K/AKT pathway played a crucial role and identified potential drugs, such as sodium-glucose cotransporter-2 inhibitors, IL-1β inhibitors, and metformin, which could be used simultaneously by network analysis. A “disease–genetics–environment” network can help researchers (1) identify shared pathogenic pathways among important comorbidities and provide evidence for investigating underlying mechanisms and potential therapeutic strategies for comorbid conditions and (2) identify modifiable environmental factors associated with comorbidities and offer feasible interventions. These two objectives are key for the development of research on comorbidity patterns in NCDs.

The “omics” technologies that were inspired by the Human Genome Project have brought about a paradigm shift in research and provided crucial technical support for the concept of holism. This transformative development disrupts the traditional fragmented approach of the conventional “single-factor versus single-outcome” paradigm and moves it to the more comprehensive “multi-factor versus multi-phenotype” paradigm. This profound shift in perspective expands the scope of research, allowing organisms to be studied holistically and revolutionizing the understanding of life from a simplistic viewpoint to one that acknowledges its inherent complexity [11].

The etiology of NCDs exhibits remarkable adaptability and self-organization, and has three main attributes. (1) Pleiotropy, where a single molecule can give rise to multiple phenotypes. Hu et al. [12] proposed five models to explain the mapping between genotype and phenotype, which provided a theoretical basis for studying NCD etiology from a pleiotropic perspective. (2) Robustness, where the original function of a molecule is maintained under internal and external perturbations. The interactions are often governed by nonlinear and dynamic control, whereas specific proteins, such as chaperones, act as adaptive mechanisms to buffer the impact of disturbances [13, 14]. (3) Rewiring, the inherent restructuring of interactions between biological units in response to conditional change, where adaptive modifications in intrinsic interactions are involved. These distinct attributes of NCD etiology necessitate a comprehensive analysis from a multilevel and multifactorial perspective. Technological limitations meant that simplistic “one-versus-one” approaches were used to study the etiology of NCDs; however, the newer omics technologies have helped to overcome these limitations and allowed trans-omics network analysis to emerge as a promising approach [15]. In trans-omics analysis, a group of biological molecules or phenotypes is treated as a single variable that is integrated as an information layer, forming a multilevel structural database. This approach enables an objective and comprehensive reconstruction of the intricate network that connects the human genome, exposome, and phenome within the human body. Given the complexity of the network, biological network models have become the preferred choice [16]. Such models abandon the singular perspective of studying disease etiology from a single molecular or omics level and instead use bioinformatics and computational techniques to discover interactions between molecules, thereby establishing high-dimensional internal connections between different types of biological data layers. This approach results in the formation of a complex molecular information network and aligns with the principles of systematic biology [17]. Consequently, identifying pathogenic pathways based on trans-omics data and constructing etiological networks have become indispensable in unraveling the underlying causes of NCDs [16, 18, 19]. Bodein et al. [20] made full use of longitudinal data from transcriptomics, proteomics, and metabolomics to construct single omics networks, then used network propagation via a random walk to establish regulatory networks between multiple omics layers. By identifying inter-omics interactions that are not captured by single omics analysis, they discovered two core dynamic biological clusters that connected the etiological network of diabetes with renal tubular acidosis and restless leg syndrome. This breakthrough discovery provides new insights into the potential mechanisms and interactions underlying the onset and progression of diabetes.

Network comparison is essential to obtain statistical evidence of pathogenic networks and pathways. There are two typical analysis strategies for network comparison. (1) Hypothesis-driven strategy, where a comprehensive understanding of the physiological, biochemical, and pathological mechanisms of the disease of interest is required. Based on a priori understanding from previous cellular experiments, animal studies, or omics analyses, a reasonable hypothetical pathogenic network/pathway is outlined in advance. Subsequently, intergroup differences and effects of the network/pathway nodes are examined at the population level to assess the validity and practicality of the initial hypothesis-based pathogenic network/pathway in the population. (2) Data-driven strategy, where high-throughput omics markers are acquired at the population level without any predefined hypothesis. Systematic biology methods are used to construct a network connecting exposure factors, biomarkers, and disease endpoints. Intergroup differences and effects of the network/pathway are evaluated at the population level and used to provide a basis for further experimental validation, drug target identification, and the development of prevention or treatment measures [21]. Ji et al. [22] proposed a powerful score-based statistical test (NetDifM) to measure group differences in weighted biological networks. They successfully captured differences in gene expression networks between patients with ovarian cancer and healthy controls and identified pathogenic PI3K-AKT signaling pathways, Notch signaling pathways, and their downstream subnetworks.

In etiological research on NCDs, the dynamic attributes of complex biological systems necessitate temporality and high dimensionality jointly, which indicates the need to investigate the metabolic characteristics of disease occurrence and development across the whole life course. Currently, NCD research has focused mainly on adults; for example, the Framingham study of a population aged 28–74 years [23], the UK Biobank study of a population aged 37–85 years [24], and the China Chronic Disease Prospective study of a population aged 30–79 years [25]. However, the Developmental Origins of Health and Disease theory proposes that maternal nutrition and environmental exposures during pregnancy may affect the risk of the offspring developing NCDs in adulthood [26, 27]. This proposal suggests that etiological research on NCDs should transition from a stage-specific perspective predominantly focused on adults to a life course approach that encompasses pregnancy, childhood, adolescence, youth, middle age, and old age to identify risk factors associated with the development of NCDs and other health outcomes across the entire lifespan, known as life course epidemiology [28].

Life course epidemiology consists mainly of a risk accumulation model and a critical period model. The risk accumulation model assumes that risk factors, such as environmental exposures, socioeconomic status, and behavioral factors, independently or synergistically have long-term effects on health. Thus, this model focuses on the accumulation and clustering of exposures because diseases are associated not only with individual exposure but also with household exposure and socioeconomic status [28, 29]. The critical period model emphasizes that biological programming during critical developmental periods may be modified by later physiological or psychological stress [28, 29]. Trajectory analysis is a commonly used longitudinal data processing method in life course epidemiology. Trajectory analysis methods are used to fit growth trajectories to individual exposure data with repeated longitudinal measurements, identify subgroups with potentially different growth trajectories within a population, describe trends in exposure factor growth curves collectively and individually, and explore the cumulative effects and critical/sensitive periods of exposure on disease occurrence and development by analyzing growth curve parameters [30, 31]. Conventional trajectory analysis methods include growth curve fitting based on Z scores, multilevel modeling, group-based trajectory modeling, and latent class mixed effects models [32, 33]. Zhang et al. [34] performed life course trajectory analysis and mediation analysis to quantify the life course cumulative burden of childhood to adulthood obesity and showed that the adverse effects of obesity on cardiovascular health began in childhood and accumulated over the life course. This study provides new evidence for the early-life origins of cardiovascular disease and has significant implications for formulating early prevention strategies and measures related to obesity-associated atherosclerosis.

In summary, previous research on the etiology of NCDs, despite integrating information from multiple omics, has focused mainly on “one-versus-one” associations between single molecular biomarkers and the occurrence of a single disease (specificity). Regarding the causal relationships between diseases and health, Professor Chen noted that “Any study on causal relationships is, in fact, extracted from a complex, interdependent network of relationships, and represents a relationship which we conceive may exist” [35, 36]. Therefore, research on NCD etiology should strive to elucidate the complex and interdependent underlying network. Trans-omics causal network studies integrate omics data into a holistic system to discover multidimensional, multilevel, and multitime point interactions between trans-omics networks and phenotype networks. Given our understanding of the characteristics of NCD etiology, we believe that the perspective of research on NCD etiology needs to shift from local to systemic and from single biology to systemic biology. Establishing a complete functional atlas between genes and phenotypes throughout the entire life course, known as a genotype–phenotype map, will be essential for a comprehensive understanding of the etiology of NCDs.

Dafang Chen: Conceptualization (lead); funding acquisition (lead); project administration (lead); supervision (lead). Yujia Ma: Investigation (lead); methodology (lead); writing—original draft (lead); writing—review & editing (lead). Han Xiao: Investigation (equal). Zeyu Yan: Investigation (equal).

The authors declare no conflict of interest.

Not applicable.

Not applicable.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
非传染性疾病病原学研究内容和方法的发展趋势
非传染性疾病(NCDs)是公众关注的一个重要问题,极大地影响了中国的经济和社会发展。2019年,非传染性疾病占中国总死亡人数的88.5%,心血管疾病、癌症、慢性呼吸道疾病和糖尿病这四种主要慢性病的过早死亡率高达16.5%[1]。非传染性疾病的复杂性源于多种遗传和环境因素的参与,这些因素以复杂的方式相互作用。这种复杂性的特点是在生命的各个阶段,基因、蛋白质和代谢途径之间存在大量的相互作用。此外,在生命过程的不同阶段,这些相互作用表现出与时间相关的特异性。先前对非传染性疾病病因的研究往往侧重于“特异性”,而忽略了“普遍性”的概念。研究往往从一个风险因素、一种疾病或一个维度进行,导致对非传染性病毒病因的理解不足,预防和控制工作的结果也不令人满意。因此,本综述的目的是在考虑研究重点和研究方法的基础上,突出并提出非传染性疾病病因研究的新趋势。非传染性疾病之间的关系是复杂的,患者往往表现出多种疾病的不同模式,反映出共病的人群异质性。对非传染性疾病患者共病模式的研究可以为制定有效的预防和管理策略提供有价值的见解。在Jansana等人[2]使用电子健康记录进行的回顾性研究中,在西班牙的癌症幸存者中确定了五个多发病集群;值得注意的是,“肌肉骨骼和心血管疾病”模式的死亡率明显高于其他非传染性疾病。计算科学的进步有助于基于图论的网络分析的出现,它是从整体和系统的角度理解共病复杂性的有力工具。网络分析中的图论有助于构建共病网络,其中疾病状态表示为节点,风险关联表示为边缘,从而以简洁直观的方式可视化疾病的共现。这种拓扑方法能够在共病网络中确定疾病严重程度的优先级和核心疾病的识别。此外,网络聚类技术已被应用于识别非传染性疾病的特定共病模式。然而,谨慎地解释识别的模式是至关重要的,因为一些网络拓扑索引可能缺乏实际意义。解释已识别模式的挑战可以通过考虑关联规则来解决。通常,关联规则挖掘用于识别共病模式,网络分析用于可视化和确定共病网络中的核心疾病。例如,Hernández等人[3]使用关联规则在爱尔兰成年人中发现了几种共病模式,随后通过网络分析发现,高胆固醇、高血压和关节炎与其他疾病的关联次数最多,将其列为共病网络中的核心疾病。非传染性疾病的发展是一个漫长而渐进的过程,其特点是风险随着时间的推移而积累。疾病进展过程中的复杂变化意味着患者可能有不同的轨迹,导致相同的疾病模式。至关重要的是要考虑每种疾病成分进展的时间特征,即使是在特定的共病模式中。在人群层面识别疾病轨迹对于预防特定非传染性疾病人群的共病至关重要,并为了解共病病因提供了重要的流行病学证据,使共病轨迹研究成为当前的研究热点。Jensen等人[4]利用覆盖丹麦全体人口的电子健康登记处的数据,对暂时性疾病进展模式进行了发现驱动的分析。他们确定了1171个重要的轨迹,并将其分组为以关键诊断为中心的模式,如慢性阻塞性肺病和痛风,这对疾病进展和早期诊断至关重要,以减轻不良后果。普通人群的共病轨迹研究在研究设计、数据分析和结果解释方面提出了挑战,因此此类研究通常在患有特定疾病的人群中进行,从而简化了设计、数据解析和结果解释。例如,Jeong等人[5]在嵌套的病例对照研究设计中,使用全人群的索赔数据调查了2型糖尿病。 在反式组学分析中,一组生物分子或表型被视为单一变量,整合为信息层,形成多级结构数据库。这种方法能够客观全面地重建连接人类基因组、暴露体和人体内现象的复杂网络。鉴于网络的复杂性,生物网络模型已成为首选[16]。这种模型放弃了从单个分子或组学水平研究疾病病因的单一视角,而是使用生物信息学和计算技术来发现分子之间的相互作用,从而在不同类型的生物数据层之间建立高维的内部联系。这种方法导致了复杂分子信息网络的形成,并符合系统生物学的原理[17]。因此,基于跨组学数据识别致病途径和构建致病网络对于揭示非传染性疾病的根本原因变得不可或缺[16,18,19]。Bodein等人[20]充分利用转录组学、蛋白质组学和代谢组学的纵向数据构建单个组学网络,然后通过随机游动使用网络传播在多个组学层之间建立调控网络。通过识别单组学分析无法捕捉到的组间相互作用,他们发现了两个核心的动态生物学集群,它们将糖尿病的病因网络与肾小管酸中毒和不宁腿综合征联系起来。这一突破性发现为糖尿病发病和进展的潜在机制和相互作用提供了新的见解。网络比较对于获得致病网络和途径的统计证据至关重要。网络比较有两种典型的分析策略。(1) 假设驱动策略,需要全面了解感兴趣疾病的生理、生化和病理机制。基于先前细胞实验、动物研究或组学分析的先验理解,提前概述了合理的假设致病网络/途径。随后,在人群水平上检查组间差异和网络/通路节点的影响,以评估基于初始假设的致病网络/通路在人群中的有效性和实用性。(2) 数据驱动策略,在没有任何预定义假设的情况下,在人群水平上获得高通量组学标记。系统生物学方法用于构建一个连接暴露因素、生物标志物和疾病终点的网络。在人群水平上评估网络/途径的组间差异和效果,并用于为进一步的实验验证、药物靶点识别和预防或治疗措施的制定提供基础[21]。Ji等人[22]提出了一种强大的基于分数的统计测试(NetDifM)来测量加权生物网络中的群体差异。他们成功地捕捉到了卵巢癌症患者和健康对照者之间基因表达网络的差异,并确定了致病性PI3K-AKT信号通路、Notch信号通路及其下游子网络。在非传染性疾病的病因研究中,复杂生物系统的动态属性共同要求时间性和高维性,这表明需要研究疾病发生和发展的整个生命过程的代谢特征。目前,非传染性疾病的研究主要集中在成年人身上;例如,对28-74岁人群的Framingham研究[23],对37-85岁人群的英国生物库研究[24],以及对30-79岁人群的中国慢性病前瞻性研究[25]。然而,健康与疾病的发育起源理论提出,母亲在怀孕期间的营养和环境暴露可能会影响后代成年后患非传染性疾病的风险[26,27]。该提案建议,非传染性疾病的病因研究应从主要关注成年人的特定阶段视角过渡到包括妊娠、儿童、青少年、青年、中年和老年在内的生命历程方法,以确定与非传染性疾病发展和整个生命期其他健康结果相关的风险因素,称为生命过程流行病学[28]。生命过程流行病学主要由风险累积模型和关键时期模型组成。风险累积模型假设环境暴露、社会经济地位和行为因素等风险因素独立或协同对健康产生长期影响。 因此,该模型侧重于暴露的积累和聚集,因为疾病不仅与个人暴露有关,还与家庭暴露和社会经济地位有关[28,29]。关键时期模型强调,关键发育时期的生物编程可能会受到后期生理或心理压力的影响[28,29]。轨迹分析是生命历程流行病学中常用的纵向数据处理方法。轨迹分析方法用于通过重复的纵向测量将生长轨迹与个体暴露数据拟合,识别群体中具有潜在不同生长轨迹的亚组,集体和单独描述暴露因子生长曲线的趋势,通过分析生长曲线参数,探讨暴露对疾病发生和发展的累积影响和临界/敏感期[30,31]。传统的轨迹分析方法包括基于Z分数的增长曲线拟合、多级建模、基于组的轨迹建模和潜在类别混合效应模型[32,33]。张等人[34]进行了生命历程轨迹分析和中介分析,以量化儿童至成年肥胖的生命历程累积负担,并表明肥胖对心血管健康的不利影响始于儿童时期,并在整个生命历程中累积。这项研究为心血管疾病的早期起源提供了新的证据,并对制定与肥胖相关的动脉粥样硬化的早期预防策略和措施具有重要意义。总之,尽管整合了来自多种组学的信息,但先前对非传染性疾病病因的研究主要集中在单分子生物标志物与单一疾病发生之间的“一对一”关联(特异性)。关于疾病与健康之间的因果关系,陈教授指出,“任何关于因果关系的研究,实际上都是从一个复杂的、相互依存的关系网络中提取出来的,代表了我们认为可能存在的关系”[35,36]。因此,对非传染性疾病病因的研究应努力阐明复杂而相互依存的潜在网络。跨组学因果网络研究将组学数据整合到一个整体系统中,以发现跨组学网络和表型网络之间的多维、多层次和多时间点相互作用。鉴于我们对非传染性疾病病因特征的理解,我们认为非传染性疾病的病因研究视角需要从局部转向系统,从单一生物学转向系统生物学。在整个生命过程中建立一个完整的基因和表型功能图谱,称为基因型-表型图谱,对于全面了解非传染性疾病的病因至关重要。陈大方:概念化(铅);融资收购(牵头);项目管理(牵头);监督(领导)。马玉佳:调查(牵头);方法论(主导);书写——原始草稿(铅);写作——复习;编辑(引导)。韩晓:调查(平等)。严泽宇:调查(平等)。作者声明没有利益冲突。不适用。不适用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
0.90
自引率
0.00%
发文量
0
期刊最新文献
Study protocol: A national cross-sectional study on psychology and behavior investigation of Chinese residents in 2023. Caregiving in Asia: Priority areas for research, policy, and practice to support family caregivers. Innovative public strategies in response to COVID-19: A review of practices from China. Sixty years of ethical evolution: The 2024 revision of the Declaration of Helsinki (DoH). A novel ensemble ARIMA-LSTM approach for evaluating COVID-19 cases and future outbreak preparedness.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1