The state of mechanistic research in the evidence-based medicine era: A sandwalk between triangulation and hierarchies

IF 2.8 4区医学 Q2 PHYSIOLOGY Experimental Physiology Pub Date : 2025-02-21 DOI:10.1113/EP092157

Ronan M. G. Berg, Cody G. Durrer, Jan Kyrre Berg Olsen Friis, Mathias Ried-Larsen

{"title":"The state of mechanistic research in the evidence-based medicine era: A sandwalk between triangulation and hierarchies","authors":"Ronan M. G. Berg, Cody G. Durrer, Jan Kyrre Berg Olsen Friis, Mathias Ried-Larsen","doi":"10.1113/EP092157","DOIUrl":null,"url":null,"abstract":"Formally, to have evidence is to have a ‘conceptual warrant for belief or action’ (Goldenberg, 2006), and to paraphrase Frank Herbert (1926–1980) in his novel Dune, evidence is certainly the melange, or ‘spice,’ around which all science is centred (Herbert, 1965). The concept of evidence that seems to dominate most biomedical sciences these days is that advocated by the evidence-based medicine (EBM) movement, which emerged as a new paradigm in the early 1990s with the ambition of basing clinical practice and teaching strictly on evidence as defined within the ‘hierarchy of evidence’ (Timmermans & Mauck, 2005) (Figure 1). Despite our enduring quest for truth within the field of experimental physiology, which seems more important than ever these days (Drummond & Tipton, 2024), mechanistic studies are consistently placed at the bottom in various incarnations of this hierarchy (Djulbegovic & Guyatt, 2017; Williamson, 2019). Does this mean that all our efforts as researchers within experimental physiology and other lines of mechanistic research contribute nothing more than ‘low quality’ evidence? Certainly not! Here, we will make the case that while EBM and experimental physiology benefit from each other and are complementary in many ways, they operate with fundamentally different frameworks. The main scientific philosophical concepts we will discuss are summarised in Box 1. We will make the case that EBM-based criteria for what constitutes ‘good evidence’ cannot uncritically be extrapolated to mechanistic research. And vice versa for that matter.The field of experimental physiology emerged in France and Germany during the early 19th century, liberating physiology from natural philosophy, which had dominated its early history (Bailey et al., 2023a; Cunningham, 2002). Consequently, scientists in this new field radically discarded the concept of vitalism, which posited a mystical life force as the distinguishing element between living and non-living matter. Instead, life's processes were to be determined within the realms of scientific materialism, that is, by the laws of physics and chemistry alone and thus requiring analysis through these exact scientific disciplines (Coleman, 1987; Culotta, 1970). While this new field of experimental physiology was a basic science, its scope was quite practical: to inform clinical practice by providing physicians with a theoretical framework upon which to base their decisions (Goldenberg, 2006).At its core, experimental physiology is a positivistic science, founded in so-called scientific realism, which is based on the premise that a real world exists, with its own structural and functional properties, which can be objectively studied through experimentation (Goldenberg, 2006). Although it underwent gradual modification, this positivism, with its reliance on deductive and inductive reasoning from mechanistic principles and theories, dominated clinical medicine for most of the 20th century (Timmermans & Mauck, 2005). EBM specifically evolved as a reaction against this, as mechanistic studies and theories were repeatedly found to be ineffective in predicting treatment effects in the clinical setting, and sometimes even leading to practices that turned out to be directly harmful. Even from the standpoint of scientific realism, this makes sense, because experimental models and conditions rarely reflect clinical reality in all its complexity, such that seeking for monocausal relationships for phenomena that are in reality multicausal has a high likelihood of failure. Furthermore, when one mechanism is targeted, its operation can be stopped or screened off by another causal factor, because several mechanisms typically operate simultaneously in vivo. Only the joint knowledge of all mechanisms and their interactions would suffice for taking mechanistic causal claims as a basis for decisions regarding interventions, something that even the most skilled experimentalist would find extremely challenging, if possible at all!EBM advocated for a radically different approach. Rather than relying on mechanistic theories, treatment effects should be evaluated specifically on the basis of clinical and patient-centred outcomes. These include mortality, hospital discharge or readmission, changes in treatments and health-related quality of life, as well as through the systematic registration of adverse events (Timmermans & Mauck, 2005) and/or by changes in context-specific biomarkers that are known to correlate with a given clinical outcome of interest (Manyara et al., 2024). While the exact philosophical underpinnings of EBM are unclear and a matter of debate (Djulbegovic & Guyatt, 2017; Kulkarni, 2005; Thomas, 2023), it is undeniably positivistic, yet clearly opposes the scientific realism of mechanistic reasoning, at least in the context of clinical decision-making. Rather, EBM has clear elements of pragmatism, that is, emphasising the practical application of knowledge over theoretical reasoning, such that knowledge comes from observations and experiences rather than innate ideas or reason. Taken to the extreme, this implies that it is only relevant if a given cause–effect relationship is present or not, and any theoretical considerations of how and why are irrelevant (Goldenberg, 2006). While the purpose is thus not theory-building with the goal of understanding the real world, EBM mainly has an instrumentalistic approach to theories. This implies that EBM may accept theory-building as a tool for predicting and controlling phenomena, but without considering such theories true descriptions of the real world. This is consistent with the fact that studies conducted within an EBM framework often test hypotheses regarding treatment effects based on mechanistic theories. Indeed, it would rarely be considered ethical to conduct a clinical trial on a new treatment with no theoretical rationale to support its benefit.When it comes to obtaining evidence to inform clinical decision-making, EBM requires that the efficacy and adverse events of a treatment are systematically evaluated on patients in the clinical setting. The primary focus is on design to minimise confounding variables; this is principally achieved through randomised controlled trials (Williamson, 2019). However, it is important to note that considerations regarding what constitutes ‘good evidence’, as depicted in the hierarchy of evidence, specifically relate to the ability to inform clinical decision-making. Thus, the placement of randomised trials at the top and mechanistic studies at the bottom does not reflect a difference between their implicit scientific value, but merely in their utility for directly informing clinical decision-making and health recommendations. Hence, just as mechanistic studies are insufficient for informing clinical decision-making, clinical studies—here understood as studies on patient populations focusing on clinical and patient-centred outcomes—are rarely suited for making causal claims regarding mechanisms (Maziarz, 2023).Here, it may be worth noting that due to their different philosophical underpinnings, EBM and mechanistic research operate with different concepts of causality. As EBM has its philosophical roots in pragmatism, it builds on a strictly manipulative causality concept, asserting that the experimental manipulation of a cause will result in the manipulation of an effect, thus practically making randomised controlled trials the only means for estimating the average treatment effect and potential harms, provided that they are well designed and well conducted. As such, the best evidence is obtained when the same hypothesis regarding a treatment effect repeatedly resists falsification in similarly conducted studies in similar populations.In contrast, causality is best appreciated as pluralistic, relying on both manipulative and descriptive reasoning in mechanistic research (Maziarz, 2023; Williamson, 2019). While causal claims can strictly be applied only to the specific experimental setting, model and system under study in mechanistic research, scientific realism permits interpretation of cause–effect relationships within its own adaptable theoretical framework, provided they can be replicated and consistently resist experimental falsification. This collective evidence is then used to make general claims about physiological mechanisms. Indeed, it is this ‘inductive leap’ of scientific realism that builds theories from which new hypotheses can be formulated, including clinical studies conducted within an EBM framework, such that these do not rely solely on incidental discoveries to generate new ideas for potential therapies.While the use of mechanistic studies to inform working hypotheses for clinical trials is the modus operandi, it works both ways. Clinical trial results can also yield incidental findings that inspire new mechanistic hypotheses, prompting further research. A notable example is the Women's Health Initiative trial, which unexpectedly revealed that combined oestrogen–progestin hormone replacement therapy increased the risk of breast cancer and cardiovascular disease compared to placebo in post-menopausal women (Rossouw et al., 2002). This was contrary to the prevailing belief that hormone replacement therapy might be protective against these conditions and given that the Women's Health Initiative trial did not provide the actual mechanisms, the findings led to a long line of mechanistic studies into the roles of oestrogen and progestins in both carcinogenesis and vascular function. Similarly, a randomised controlled trial of intensive insulin treatment to maintain strict blood glucose control in critically ill surgical patients showed that this reduced mortality specifically due to septic complications (Van den Berghe et al., 2001), which led to subsequent mechanistic studies on the immune-modulatory effects of insulin and related peptides.Mechanistic research has learned many important lessons from EBM, particularly the emphasis on design, where random assignment to treatment and control groups ensures, at least in principle, an equal distribution of unknown confounders, although these may be unequally distributed by chance, particularly if the sample size is small. This is notably important in clinical trials because the risk of unknown confounders that are unbalanced between groups is particularly high for clinical populations. Experimental physiology and related fields within mechanistic research, such as pharmacology, biochemistry and microbiology, encompass a wide range of research methods in both laboratory and applied (i.e., environmental or clinical) settings. For experimental physiology, this includes in vitro, ex vivo and in vivo models, which are applied to a variety of systems, including individual cells, cells in culture, tissue preparations and various isolated organ preparations, as well as animals and studies involving human subjects. From its inception, experimental physiology drew from the controlled experiments that formed the basis of physics and chemistry (Coleman, 1987). In its simplest form, the controlled experiment involves predicting an event by assessing the impact of changes in preconditions within a highly controlled environment. The experimental conditions are then systematically modified and adapted to manipulate or observe spontaneous changes in an independent variable while standardising conditions to rule out the effects of other confounding variables. This is done as part of an iterative process that often involves multiple cycles of switching between deduction and induction, thereby identifying cause–effect relationships between the independent and dependent variables. While successful randomisation is in principle the only means of eliminating both known and unknown confounders, several other procedures may be used to effectively minimise this in mechanistic studies. This includes various experimental manipulations that target the biological pathways under study through pharmacological, environmental, behavioural and/or genetic activation or inhibition. This may be relevant when randomisation is either impossible or unethical, such as when the natural history of a disease is studied or when a disease is compared to the healthy state, or when the exposure under study is assumed to have harmful effects, as based on theoretical reasoning or other lines of empirical evidence. Furthermore, controlling for various known confounders in the statistical analysis can also be effective here, for example, via inclusion of covariates in the analysis or by weighted regression.Despite some thematic overlap, it is important to note that although experimental physiology has historically been conceived to inform clinical medicine, it is a basic science with various aspects of physiology having a much broader scope than health-related outcomes. Challenges may thus arise when mechanistic studies in humans are classified as clinical studies for legal or ethical reasons, often leading to the mistaken belief that this classification also applies to the scientific aspects of the study (Richter et al., 2024). Clearly, when conducting studies on humans in various applied (including the clinical) settings, the risk of confounders is higher than in the controlled laboratory setting—for we can only to a very limited extent control previous or concurrent factors, including various genetic and environmental exposures, that may contribute to the observed cause–effect relationships. Of note, the same applies to studies on non-laboratory animals, such as pets, livestock and wild animals.While randomisation is probably the most powerful tool for avoiding unbalanced unknown confounders between groups, it is important to note that randomisation in itself does not control for confounders if improperly implemented or if sample sizes are too small. Along with imprecise outcome assessments, and poorly described experimental set-ups, this increases the risk of so-called magnitude and sign errors (Gelman & Carlin, 2014). Furthermore, properly performed randomisation with adequate sample sizes is not always possible or even relevant in mechanistic studies, particularly in the applied setting. In any event, it is useful to clearly distinguish actual hypothesis-testing studies from exploratory studies. We posit that many different designs can be used to test mechanistic hypotheses in applied settings, but the highest degree of certainty is achieved when based on a randomised controlled design, in which it is possible to prioritise outcomes and define an expected effect size, that is, a physiologically relevant difference, for sample size calculation. Consequently, most studies in the applied setting are exploratory, as they rarely focus on a single variable or outcome but rather on several interconnected variables that together show a pattern of change, rendering the prioritisation of outcomes meaningless. Furthermore, it is often impossible to define the minimal difference that is mechanistically relevant. However, as we will argue below, results from exploratory studies are by no means to be considered ‘low quality evidence’ as in EBM.How to define a minimal physiological difference that is mechanistically relevant is a matter of considerable debate, with some emphasising that a physiologically relevant difference should be determined by either theoretical or statistical predictions or field-established thresholds (Mesquida & Lakens, 2024; Williams et al., 2023), and others arguing that it should simply be set at the established minimally clinically relevant difference (Ciani et al., 2023). However, the latter is rarely available for the given context and furthermore only reflects the utility of the variable as a surrogate measure of a given clinical outcome, and thus not its mechanistic involvement. Alternatively, one could argue that any measurable change may in principle be physiologically relevant, thus highlighting the importance of the validity and reliability of the specific physiological measurement technique at play (Hartmann et al., 2023). However, unless one has unlimited resources, it often becomes impossible to falsify a hypothesis based on this, making relatively low statistical power a prerequisite in many physiological studies (Berg et al., 2024).A major strength of EBM lies in its ability to incorporate systematic reviews and meta-analyses, which synthesise findings from multiple studies and provide an assessment of the degree of the certainty of existing evidence regarding treatment effects on well-defined outcomes in clinical populations. While mechanistic evidence draws on collective evidence from different lines of research rather than individual studies, it can still benefit from a similar approach to data synthesis through systematic reviews and meta-analyses to assess the degree of certainty of evidence in favour of a given mechanism. Rather than focusing on randomisation as embedded within the evidence hierarchy, this should be based on so-called triangulation (Box 2). Here, triangulation involves the use of multiple models, systems and settings to address one question, each with its own unrelated assumptions, strengths and weaknesses, because results that agree across these different models and systems are less likely to be artefacts (Munafò & Smith, 2018). This approach would also ensure that findings from many mechanistic studies, both hypothesis-testing and exploratory, including the many individual studies that are underpowered due to constraints of scale, resources and ethical considerations (Berg et al., 2024; Schulz & Grimes, 2005), all contribute meaningfully to the collective evidence base.It is clear that both EBM and mechanistic research strive to attain evidence in the form of a conceptual warrant for belief (mechanistic research) and action (EBM), respectively, but from quite different philosophical standpoints. However, as the global replication crisis within all life science research has intensified since the turn of the millennium (Bailey et al., 2023b; Schooler, 2014), it is relevant to discuss and establish what constitutes ‘good evidence’ in experimental physiology and other types of mechanistic research. This should be done while keeping in mind that succumbing entirely to the principles of EBM would devalue more than 150 years of research within our field; simply put, it is egregiously bad practice to judge what is good or bad evidence based on incorrect premises! However, just as our field has benefited from the preregistration of study protocols and analysis plans to avoid selective analysis and publication bias (Christensen et al., 2025), an approach also endorsed by the Registered Report publication type in this journal (Rasmussen et al., 2025), may it be time to develop field-specific consensus guidelines for data synthesis and reporting in systematic reviews and meta-analyses on mechanistic studies?'Step… drag… drag… step… step… wait… drag… step,' Frank Herbert writes in Dune (1965) to describe the so-called sandwalk—a mystical, dance-like but arrhythmic walk, seemingly changing direction abruptly, with a jittery and unpredictable style. This method, essential for non-native travellers crossing the deadly deserts of the planet Arrakis, is used to avoid attracting the territorial and gigantic sandworms that dwell beneath the sands. Similarly, developing consensus guidelines for data synthesis and reporting in systematic reviews and meta-analyses of mechanistic studies will inevitably be a ‘sandwalk’ between the different traditions and philosophies outlined above. But despite the thematic overlap, mechanistic research fundamentally differs from clinical research, focusing on causal claims regarding mechanisms rather than informing clinical decision-making. This necessitates requirements that draw from, rather than merely copy, the best of EBM.In our opinion, field-specific guidelines for data synthesis and reporting in systematic reviews and meta-analyses in mechanistic research would mandate a triangulation-based approach that could potentially form the basis for field-specific criteria to assess the certainty of evidence. While the exact implementation of triangulation remains to be determined, we propose the conceptual model presented in Figure 2. Put briefly, evidence supporting a mechanism is dependent findings that are replicable and documented by (1) multiple sources of data, that is, models, systems and/or settings; (2) multiple measurement methods; and (3) multiple experimental manipulations. Importantly, any conclusions about a mechanism should remain confined to the model, system and setting in which they were demonstrated. In essence, the more data sources, measurement methods and experimental manipulations that successfully document a mechanism, the greater the certainty of evidence. Ultimately, this approach could lead to the development of quantitative tools for evidence synthesis and a formal assessment of its certainty, akin to the GRADE (Grading of Recommendations, Assessment, Development and Evaluations) framework used in EBM (Neumann & Schünemann, 2024).As Frank Herbert wrote (Herbert, 1965): ‘He who controls the spice controls the universe.’ While evidence may be the ‘spice’ of any scientific inquiry, this is perhaps too daunting for any scientist when considering how difficult it is in reality to control even the simplest experiment, whether in the laboratory or applied setting. The systematic assessment of whether the results are confounded and whether they can be replicated and reproduced may help ensure that the ‘inductive leap’ to generalisability can be justified, enabling the integration of the mechanism into existing physiological theory. The spice must flow!Ronan M. G. Berg: conception, first draft, revisions. Cody G. Durrer: prepared figures, revisions. Jan Kyrre Berg Olsen Friis: revisions. Mathias Ried-Larsen: conception, revisions. All authors have read and approved the final version of this manuscript and agree to be accountable for all aspects of the work in ensuring that questions related to the accuracy or integrity of any part of the work are appropriately investigated and resolved. All persons designated as authors qualify for authorship, and all those who qualify for authorship are listed.MR-L is employed by Novo Nordisk A/S. Novo Nordisk A/S had no role in the decision to prepare, write, or publish the manuscript. No other authors have any conflict of interest to declare.The Centre for Physical Activity Research (CFAS) is supported by TrygFonden (grants ID 101390, ID 20045, ID 125132, and ID 177225). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. C.G.D. was supported by the Canadian Institutes of Health Research (MFE-176582). The funders had no role in the decision to prepare, write, or publish the manuscript.","PeriodicalId":12092,"journal":{"name":"Experimental Physiology","volume":"110 9","pages":"1179-1185"},"PeriodicalIF":2.8000,"publicationDate":"2025-02-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://physoc.onlinelibrary.wiley.com/doi/epdf/10.1113/EP092157","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Experimental Physiology","FirstCategoryId":"3","ListUrlMain":"https://physoc.onlinelibrary.wiley.com/doi/10.1113/EP092157","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"PHYSIOLOGY","Score":null,"Total":0}

引用次数: 0

Abstract

Formally, to have evidence is to have a ‘conceptual warrant for belief or action’ (Goldenberg, 2006), and to paraphrase Frank Herbert (1926–1980) in his novel Dune, evidence is certainly the melange, or ‘spice,’ around which all science is centred (Herbert, 1965). The concept of evidence that seems to dominate most biomedical sciences these days is that advocated by the evidence-based medicine (EBM) movement, which emerged as a new paradigm in the early 1990s with the ambition of basing clinical practice and teaching strictly on evidence as defined within the ‘hierarchy of evidence’ (Timmermans & Mauck, 2005) (Figure 1). Despite our enduring quest for truth within the field of experimental physiology, which seems more important than ever these days (Drummond & Tipton, 2024), mechanistic studies are consistently placed at the bottom in various incarnations of this hierarchy (Djulbegovic & Guyatt, 2017; Williamson, 2019). Does this mean that all our efforts as researchers within experimental physiology and other lines of mechanistic research contribute nothing more than ‘low quality’ evidence? Certainly not! Here, we will make the case that while EBM and experimental physiology benefit from each other and are complementary in many ways, they operate with fundamentally different frameworks. The main scientific philosophical concepts we will discuss are summarised in Box 1. We will make the case that EBM-based criteria for what constitutes ‘good evidence’ cannot uncritically be extrapolated to mechanistic research. And vice versa for that matter.

The field of experimental physiology emerged in France and Germany during the early 19th century, liberating physiology from natural philosophy, which had dominated its early history (Bailey et al., 2023a; Cunningham, 2002). Consequently, scientists in this new field radically discarded the concept of vitalism, which posited a mystical life force as the distinguishing element between living and non-living matter. Instead, life's processes were to be determined within the realms of scientific materialism, that is, by the laws of physics and chemistry alone and thus requiring analysis through these exact scientific disciplines (Coleman, 1987; Culotta, 1970). While this new field of experimental physiology was a basic science, its scope was quite practical: to inform clinical practice by providing physicians with a theoretical framework upon which to base their decisions (Goldenberg, 2006).

At its core, experimental physiology is a positivistic science, founded in so-called scientific realism, which is based on the premise that a real world exists, with its own structural and functional properties, which can be objectively studied through experimentation (Goldenberg, 2006). Although it underwent gradual modification, this positivism, with its reliance on deductive and inductive reasoning from mechanistic principles and theories, dominated clinical medicine for most of the 20th century (Timmermans & Mauck, 2005). EBM specifically evolved as a reaction against this, as mechanistic studies and theories were repeatedly found to be ineffective in predicting treatment effects in the clinical setting, and sometimes even leading to practices that turned out to be directly harmful. Even from the standpoint of scientific realism, this makes sense, because experimental models and conditions rarely reflect clinical reality in all its complexity, such that seeking for monocausal relationships for phenomena that are in reality multicausal has a high likelihood of failure. Furthermore, when one mechanism is targeted, its operation can be stopped or screened off by another causal factor, because several mechanisms typically operate simultaneously in vivo. Only the joint knowledge of all mechanisms and their interactions would suffice for taking mechanistic causal claims as a basis for decisions regarding interventions, something that even the most skilled experimentalist would find extremely challenging, if possible at all!

EBM advocated for a radically different approach. Rather than relying on mechanistic theories, treatment effects should be evaluated specifically on the basis of clinical and patient-centred outcomes. These include mortality, hospital discharge or readmission, changes in treatments and health-related quality of life, as well as through the systematic registration of adverse events (Timmermans & Mauck, 2005) and/or by changes in context-specific biomarkers that are known to correlate with a given clinical outcome of interest (Manyara et al., 2024). While the exact philosophical underpinnings of EBM are unclear and a matter of debate (Djulbegovic & Guyatt, 2017; Kulkarni, 2005; Thomas, 2023), it is undeniably positivistic, yet clearly opposes the scientific realism of mechanistic reasoning, at least in the context of clinical decision-making. Rather, EBM has clear elements of pragmatism, that is, emphasising the practical application of knowledge over theoretical reasoning, such that knowledge comes from observations and experiences rather than innate ideas or reason. Taken to the extreme, this implies that it is only relevant if a given cause–effect relationship is present or not, and any theoretical considerations of how and why are irrelevant (Goldenberg, 2006). While the purpose is thus not theory-building with the goal of understanding the real world, EBM mainly has an instrumentalistic approach to theories. This implies that EBM may accept theory-building as a tool for predicting and controlling phenomena, but without considering such theories true descriptions of the real world. This is consistent with the fact that studies conducted within an EBM framework often test hypotheses regarding treatment effects based on mechanistic theories. Indeed, it would rarely be considered ethical to conduct a clinical trial on a new treatment with no theoretical rationale to support its benefit.

When it comes to obtaining evidence to inform clinical decision-making, EBM requires that the efficacy and adverse events of a treatment are systematically evaluated on patients in the clinical setting. The primary focus is on design to minimise confounding variables; this is principally achieved through randomised controlled trials (Williamson, 2019). However, it is important to note that considerations regarding what constitutes ‘good evidence’, as depicted in the hierarchy of evidence, specifically relate to the ability to inform clinical decision-making. Thus, the placement of randomised trials at the top and mechanistic studies at the bottom does not reflect a difference between their implicit scientific value, but merely in their utility for directly informing clinical decision-making and health recommendations. Hence, just as mechanistic studies are insufficient for informing clinical decision-making, clinical studies—here understood as studies on patient populations focusing on clinical and patient-centred outcomes—are rarely suited for making causal claims regarding mechanisms (Maziarz, 2023).

Here, it may be worth noting that due to their different philosophical underpinnings, EBM and mechanistic research operate with different concepts of causality. As EBM has its philosophical roots in pragmatism, it builds on a strictly manipulative causality concept, asserting that the experimental manipulation of a cause will result in the manipulation of an effect, thus practically making randomised controlled trials the only means for estimating the average treatment effect and potential harms, provided that they are well designed and well conducted. As such, the best evidence is obtained when the same hypothesis regarding a treatment effect repeatedly resists falsification in similarly conducted studies in similar populations.

In contrast, causality is best appreciated as pluralistic, relying on both manipulative and descriptive reasoning in mechanistic research (Maziarz, 2023; Williamson, 2019). While causal claims can strictly be applied only to the specific experimental setting, model and system under study in mechanistic research, scientific realism permits interpretation of cause–effect relationships within its own adaptable theoretical framework, provided they can be replicated and consistently resist experimental falsification. This collective evidence is then used to make general claims about physiological mechanisms. Indeed, it is this ‘inductive leap’ of scientific realism that builds theories from which new hypotheses can be formulated, including clinical studies conducted within an EBM framework, such that these do not rely solely on incidental discoveries to generate new ideas for potential therapies.

While the use of mechanistic studies to inform working hypotheses for clinical trials is the modus operandi, it works both ways. Clinical trial results can also yield incidental findings that inspire new mechanistic hypotheses, prompting further research. A notable example is the Women's Health Initiative trial, which unexpectedly revealed that combined oestrogen–progestin hormone replacement therapy increased the risk of breast cancer and cardiovascular disease compared to placebo in post-menopausal women (Rossouw et al., 2002). This was contrary to the prevailing belief that hormone replacement therapy might be protective against these conditions and given that the Women's Health Initiative trial did not provide the actual mechanisms, the findings led to a long line of mechanistic studies into the roles of oestrogen and progestins in both carcinogenesis and vascular function. Similarly, a randomised controlled trial of intensive insulin treatment to maintain strict blood glucose control in critically ill surgical patients showed that this reduced mortality specifically due to septic complications (Van den Berghe et al., 2001), which led to subsequent mechanistic studies on the immune-modulatory effects of insulin and related peptides.

Mechanistic research has learned many important lessons from EBM, particularly the emphasis on design, where random assignment to treatment and control groups ensures, at least in principle, an equal distribution of unknown confounders, although these may be unequally distributed by chance, particularly if the sample size is small. This is notably important in clinical trials because the risk of unknown confounders that are unbalanced between groups is particularly high for clinical populations. Experimental physiology and related fields within mechanistic research, such as pharmacology, biochemistry and microbiology, encompass a wide range of research methods in both laboratory and applied (i.e., environmental or clinical) settings. For experimental physiology, this includes in vitro, ex vivo and in vivo models, which are applied to a variety of systems, including individual cells, cells in culture, tissue preparations and various isolated organ preparations, as well as animals and studies involving human subjects. From its inception, experimental physiology drew from the controlled experiments that formed the basis of physics and chemistry (Coleman, 1987). In its simplest form, the controlled experiment involves predicting an event by assessing the impact of changes in preconditions within a highly controlled environment. The experimental conditions are then systematically modified and adapted to manipulate or observe spontaneous changes in an independent variable while standardising conditions to rule out the effects of other confounding variables. This is done as part of an iterative process that often involves multiple cycles of switching between deduction and induction, thereby identifying cause–effect relationships between the independent and dependent variables. While successful randomisation is in principle the only means of eliminating both known and unknown confounders, several other procedures may be used to effectively minimise this in mechanistic studies. This includes various experimental manipulations that target the biological pathways under study through pharmacological, environmental, behavioural and/or genetic activation or inhibition. This may be relevant when randomisation is either impossible or unethical, such as when the natural history of a disease is studied or when a disease is compared to the healthy state, or when the exposure under study is assumed to have harmful effects, as based on theoretical reasoning or other lines of empirical evidence. Furthermore, controlling for various known confounders in the statistical analysis can also be effective here, for example, via inclusion of covariates in the analysis or by weighted regression.

Despite some thematic overlap, it is important to note that although experimental physiology has historically been conceived to inform clinical medicine, it is a basic science with various aspects of physiology having a much broader scope than health-related outcomes. Challenges may thus arise when mechanistic studies in humans are classified as clinical studies for legal or ethical reasons, often leading to the mistaken belief that this classification also applies to the scientific aspects of the study (Richter et al., 2024). Clearly, when conducting studies on humans in various applied (including the clinical) settings, the risk of confounders is higher than in the controlled laboratory setting—for we can only to a very limited extent control previous or concurrent factors, including various genetic and environmental exposures, that may contribute to the observed cause–effect relationships. Of note, the same applies to studies on non-laboratory animals, such as pets, livestock and wild animals.

While randomisation is probably the most powerful tool for avoiding unbalanced unknown confounders between groups, it is important to note that randomisation in itself does not control for confounders if improperly implemented or if sample sizes are too small. Along with imprecise outcome assessments, and poorly described experimental set-ups, this increases the risk of so-called magnitude and sign errors (Gelman & Carlin, 2014). Furthermore, properly performed randomisation with adequate sample sizes is not always possible or even relevant in mechanistic studies, particularly in the applied setting. In any event, it is useful to clearly distinguish actual hypothesis-testing studies from exploratory studies. We posit that many different designs can be used to test mechanistic hypotheses in applied settings, but the highest degree of certainty is achieved when based on a randomised controlled design, in which it is possible to prioritise outcomes and define an expected effect size, that is, a physiologically relevant difference, for sample size calculation. Consequently, most studies in the applied setting are exploratory, as they rarely focus on a single variable or outcome but rather on several interconnected variables that together show a pattern of change, rendering the prioritisation of outcomes meaningless. Furthermore, it is often impossible to define the minimal difference that is mechanistically relevant. However, as we will argue below, results from exploratory studies are by no means to be considered ‘low quality evidence’ as in EBM.

How to define a minimal physiological difference that is mechanistically relevant is a matter of considerable debate, with some emphasising that a physiologically relevant difference should be determined by either theoretical or statistical predictions or field-established thresholds (Mesquida & Lakens, 2024; Williams et al., 2023), and others arguing that it should simply be set at the established minimally clinically relevant difference (Ciani et al., 2023). However, the latter is rarely available for the given context and furthermore only reflects the utility of the variable as a surrogate measure of a given clinical outcome, and thus not its mechanistic involvement. Alternatively, one could argue that any measurable change may in principle be physiologically relevant, thus highlighting the importance of the validity and reliability of the specific physiological measurement technique at play (Hartmann et al., 2023). However, unless one has unlimited resources, it often becomes impossible to falsify a hypothesis based on this, making relatively low statistical power a prerequisite in many physiological studies (Berg et al., 2024).

A major strength of EBM lies in its ability to incorporate systematic reviews and meta-analyses, which synthesise findings from multiple studies and provide an assessment of the degree of the certainty of existing evidence regarding treatment effects on well-defined outcomes in clinical populations. While mechanistic evidence draws on collective evidence from different lines of research rather than individual studies, it can still benefit from a similar approach to data synthesis through systematic reviews and meta-analyses to assess the degree of certainty of evidence in favour of a given mechanism. Rather than focusing on randomisation as embedded within the evidence hierarchy, this should be based on so-called triangulation (Box 2). Here, triangulation involves the use of multiple models, systems and settings to address one question, each with its own unrelated assumptions, strengths and weaknesses, because results that agree across these different models and systems are less likely to be artefacts (Munafò & Smith, 2018). This approach would also ensure that findings from many mechanistic studies, both hypothesis-testing and exploratory, including the many individual studies that are underpowered due to constraints of scale, resources and ethical considerations (Berg et al., 2024; Schulz & Grimes, 2005), all contribute meaningfully to the collective evidence base.

It is clear that both EBM and mechanistic research strive to attain evidence in the form of a conceptual warrant for belief (mechanistic research) and action (EBM), respectively, but from quite different philosophical standpoints. However, as the global replication crisis within all life science research has intensified since the turn of the millennium (Bailey et al., 2023b; Schooler, 2014), it is relevant to discuss and establish what constitutes ‘good evidence’ in experimental physiology and other types of mechanistic research. This should be done while keeping in mind that succumbing entirely to the principles of EBM would devalue more than 150 years of research within our field; simply put, it is egregiously bad practice to judge what is good or bad evidence based on incorrect premises! However, just as our field has benefited from the preregistration of study protocols and analysis plans to avoid selective analysis and publication bias (Christensen et al., 2025), an approach also endorsed by the Registered Report publication type in this journal (Rasmussen et al., 2025), may it be time to develop field-specific consensus guidelines for data synthesis and reporting in systematic reviews and meta-analyses on mechanistic studies?

'Step… drag… drag… step… step… wait… drag… step,' Frank Herbert writes in Dune (1965) to describe the so-called sandwalk—a mystical, dance-like but arrhythmic walk, seemingly changing direction abruptly, with a jittery and unpredictable style. This method, essential for non-native travellers crossing the deadly deserts of the planet Arrakis, is used to avoid attracting the territorial and gigantic sandworms that dwell beneath the sands. Similarly, developing consensus guidelines for data synthesis and reporting in systematic reviews and meta-analyses of mechanistic studies will inevitably be a ‘sandwalk’ between the different traditions and philosophies outlined above. But despite the thematic overlap, mechanistic research fundamentally differs from clinical research, focusing on causal claims regarding mechanisms rather than informing clinical decision-making. This necessitates requirements that draw from, rather than merely copy, the best of EBM.

In our opinion, field-specific guidelines for data synthesis and reporting in systematic reviews and meta-analyses in mechanistic research would mandate a triangulation-based approach that could potentially form the basis for field-specific criteria to assess the certainty of evidence. While the exact implementation of triangulation remains to be determined, we propose the conceptual model presented in Figure 2. Put briefly, evidence supporting a mechanism is dependent findings that are replicable and documented by (1) multiple sources of data, that is, models, systems and/or settings; (2) multiple measurement methods; and (3) multiple experimental manipulations. Importantly, any conclusions about a mechanism should remain confined to the model, system and setting in which they were demonstrated. In essence, the more data sources, measurement methods and experimental manipulations that successfully document a mechanism, the greater the certainty of evidence. Ultimately, this approach could lead to the development of quantitative tools for evidence synthesis and a formal assessment of its certainty, akin to the GRADE (Grading of Recommendations, Assessment, Development and Evaluations) framework used in EBM (Neumann & Schünemann, 2024).

As Frank Herbert wrote (Herbert, 1965): ‘He who controls the spice controls the universe.’ While evidence may be the ‘spice’ of any scientific inquiry, this is perhaps too daunting for any scientist when considering how difficult it is in reality to control even the simplest experiment, whether in the laboratory or applied setting. The systematic assessment of whether the results are confounded and whether they can be replicated and reproduced may help ensure that the ‘inductive leap’ to generalisability can be justified, enabling the integration of the mechanism into existing physiological theory. The spice must flow!

Ronan M. G. Berg: conception, first draft, revisions. Cody G. Durrer: prepared figures, revisions. Jan Kyrre Berg Olsen Friis: revisions. Mathias Ried-Larsen: conception, revisions. All authors have read and approved the final version of this manuscript and agree to be accountable for all aspects of the work in ensuring that questions related to the accuracy or integrity of any part of the work are appropriately investigated and resolved. All persons designated as authors qualify for authorship, and all those who qualify for authorship are listed.

MR-L is employed by Novo Nordisk A/S. Novo Nordisk A/S had no role in the decision to prepare, write, or publish the manuscript. No other authors have any conflict of interest to declare.

The Centre for Physical Activity Research (CFAS) is supported by TrygFonden (grants ID 101390, ID 20045, ID 125132, and ID 177225). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. C.G.D. was supported by the Canadian Institutes of Health Research (MFE-176582). The funders had no role in the decision to prepare, write, or publish the manuscript.

Abstract Image

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

循证医学时代的机械研究现状：三角测量与层次之间的夹道。

， 2001)，这导致了随后对胰岛素和相关肽免疫调节作用的机制研究。机械研究从循证医学中学到了许多重要的经验，尤其是对设计的强调，其中随机分配给实验组和对照组至少在原则上确保了未知混杂因素的均匀分布，尽管这些混杂因素可能是随机分布的，特别是在样本量很小的情况下。这在临床试验中尤为重要，因为在临床人群中，组间不平衡的未知混杂因素的风险特别高。实验生理学和机械研究中的相关领域，如药理学、生物化学和微生物学，涵盖了实验室和应用（即环境或临床）设置的广泛研究方法。对于实验生理学，这包括体外、离体和体内模型，这些模型适用于各种系统，包括单个细胞、培养细胞、组织制备和各种分离器官制备，以及动物和涉及人类受试者的研究。从一开始，实验生理学就借鉴了构成物理和化学基础的对照实验（Coleman, 1987）。在其最简单的形式中，受控实验涉及通过评估在高度受控的环境中先决条件变化的影响来预测事件。然后系统地修改和适应实验条件，以操纵或观察自变量的自发变化，同时标准化条件以排除其他混杂变量的影响。这是作为迭代过程的一部分完成的，该过程通常涉及在演绎和归纳之间切换的多个循环，从而确定自变量和因变量之间的因果关系。虽然成功的随机化原则上是消除已知和未知混杂因素的唯一方法，但在机械研究中，可以使用其他几种方法来有效地减少这种混杂因素。这包括通过药理学、环境、行为和/或基因激活或抑制来针对正在研究的生物途径的各种实验操作。当随机化不可能或不道德时，这可能是相关的，例如当研究疾病的自然史时，或当将疾病与健康状态进行比较时，或当根据理论推理或其他经验证据假定所研究的暴露具有有害影响时。此外，在统计分析中控制各种已知的混杂因素在这里也是有效的，例如，通过在分析中包含协变量或通过加权回归。尽管有一些主题重叠，但重要的是要注意，尽管实验生理学历来被认为是为临床医学提供信息，但它是一门基础科学，生理学的各个方面具有比健康相关结果更广泛的范围。因此，当人体机械研究出于法律或伦理原因被归类为临床研究时，可能会出现挑战，这往往会导致人们错误地认为这种分类也适用于研究的科学方面（Richter et al., 2024）。显然，当在各种应用（包括临床）环境中对人类进行研究时，混杂因素的风险比在受控的实验室环境中更高，因为我们只能在非常有限的程度上控制先前或并发的因素，包括各种遗传和环境暴露，这些因素可能有助于观察到的因果关系。值得注意的是，这同样适用于对非实验动物的研究，如宠物、牲畜和野生动物。虽然随机化可能是避免组间不平衡未知混杂因素的最强大工具，但重要的是要注意，如果执行不当或样本量太小，随机化本身并不能控制混杂因素。伴随着不精确的结果评估和描述不佳的实验设置，这增加了所谓的幅度和符号错误的风险（Gelman & Carlin, 2014）。此外，在机械研究中，特别是在应用环境中，适当执行足够样本量的随机化并不总是可能的，甚至也不相关。无论如何，明确区分实际的假设检验研究和探索性研究是有用的。我们假设许多不同的设计可用于在应用环境中测试机制假设，但最高程度的确定性是基于随机对照设计实现的，在随机对照设计中，可以优先考虑结果并定义预期效应大小，即生理相关差异，用于样本量计算。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Experimental Physiology 医学-生理学

CiteScore

5.10

自引率

3.70%

发文量

262

审稿时长

1 months

期刊介绍： Experimental Physiology publishes research papers that report novel insights into homeostatic and adaptive responses in health, as well as those that further our understanding of pathophysiological mechanisms in disease. We encourage papers that embrace the journal’s orientation of translation and integration, including studies of the adaptive responses to exercise, acute and chronic environmental stressors, growth and aging, and diseases where integrative homeostatic mechanisms play a key role in the response to and evolution of the disease process. Examples of such diseases include hypertension, heart failure, hypoxic lung disease, endocrine and neurological disorders. We are also keen to publish research that has a translational aspect or clinical application. Comparative physiology work that can be applied to aid the understanding human physiology is also encouraged. Manuscripts that report the use of bioinformatic, genomic, molecular, proteomic and cellular techniques to provide novel insights into integrative physiological and pathophysiological mechanisms are welcomed.