
Evaluation Review: Latest Articles

Randomized Controlled Trial Aversion among Public Sector Leadership: A Survey Experiment.
IF 3; Zone 4 (Sociology); Q1 SOCIAL SCIENCES, INTERDISCIPLINARY; Pub Date: 2024-08-01; Epub Date: 2023-08-07; DOI: 10.1177/0193841X231193483
Emily Cardon, Leonard Lopoo

Background: While randomized controlled trials (RCTs) are typically considered the gold standard of program evaluation, they are infrequently chosen by public sector leaders, defined as government and nonprofit decision-makers, when an impact evaluation is required. Objectives: This study provides descriptive evidence on RCT aversion among public sector leaders and attempts to understand what factors affect their likelihood of choosing RCTs for impact evaluations. Research Design: The authors ask if public sector leaders follow similar preference patterns found among non-public sector leaders when choosing either an RCT or a quasi-experimental design and use a survey experiment to determine which factors affect the RCT choice. Subjects: The study sample includes 2050 public sector leaders and a comparison group of 2060 respondents who do not lead public sector organizations. Measures: The primary outcome measure is selecting an RCT as the preferred evaluation option. Results: When asked to make a decision about an impact evaluation, the majority of people do not choose an RCT. While also averse to RCTs, public sector leaders are about 13% more likely to prefer an RCT to a quasi-experimental evaluation compared to the general population. Public sector leaders are less likely to use RCTs for evaluations of more intense interventions, potentially because such interventions are perceived to be superior to the options available for the control group. Conclusion: Funders should be aware that when given a choice, public sector leaders prefer other options to RCTs. Greater awareness of the benefits of RCTs could increase their use in the public sector.

Evaluation Review. Pages: 579-609. Publication type: Journal Article.
Citations: 0
Funding Innovation and Risk: A Grey-Based Startup Investment Decision.
IF 3; Zone 4 (Sociology); Q1 SOCIAL SCIENCES, INTERDISCIPLINARY; Pub Date: 2024-07-24; DOI: 10.1177/0193841X241262887
Manoj Kumar Srivastava, Ashutosh Dash, Imlak Shaikh

As found in behavioral decision theory, venture capitalists (VCs) rely on heuristics and bias, owing to their bounded rationality, either by limited alternatives or information and resources. India's booming startup scene challenges VCs in decision-making owing to information overload from numerous evolving ventures, which hinders informed judgment. VC investment behavior, due diligence, and cognitive factors related to decision-making have always drawn the attention of researchers. We provide an alternative approach for an optimal decision by VCs by identifying the attributes that influence investment or funding decisions at an early stage of a venture in tech-based industries. Through a literature review, we identify eight attributes, both on internal and external criteria, that venture investors consider when making investment decisions. Based on interviews with 20 experts, we further identify eight key tech-based sectors. Using grey system theory, we then determine the rankings of eight tech startups for investors' early-stage investment decisions. This study presents a linguistic variable-based approach of grey numbers to decide weights and ratings, the grey possibility degree to compare and rank different tech startups, and based on the results, suggests the ideal tech startup. We find that agritech ranks first; thus, investors should prefer venturing into such startups for early-stage investment. E-commerce and edutech ranked second and third, respectively, followed by electric vehicle infrastructure, insurtech, fintech, space tech, and software as a service.
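
The abstract above does not give its formulas, so the following is only a minimal sketch of how a grey possibility degree comparison is commonly set up for interval grey numbers. The function names, the tuple representation, and the sample ratings are illustrative assumptions, not the authors' implementation or data.

```python
from typing import Dict, List, Tuple

# An interval grey number, written as (lower bound, upper bound).
Grey = Tuple[float, float]


def grey_possibility_leq(a: Grey, b: Grey) -> float:
    """Possibility degree P(a <= b) for two interval grey numbers.

    Uses the common interval form
        P = max(0, L - max(0, a_hi - b_lo)) / L,
    where L is the sum of the two interval lengths.
    """
    a_lo, a_hi = a
    b_lo, b_hi = b
    total_len = (a_hi - a_lo) + (b_hi - b_lo)
    if total_len == 0:
        # Both numbers are crisp: fall back to an ordinary comparison.
        if a_lo < b_lo:
            return 1.0
        if a_lo == b_lo:
            return 0.5
        return 0.0
    return max(0.0, total_len - max(0.0, a_hi - b_lo)) / total_len


def rank_by_possibility(scores: Dict[str, Grey], ideal: Grey) -> List[str]:
    """Order alternatives by P(score <= ideal), smallest first.

    A smaller possibility of falling below the ideal reference means the
    alternative is closer to it, so it ranks higher.
    """
    return sorted(scores, key=lambda name: grey_possibility_leq(scores[name], ideal))


# Hypothetical ratings, for illustration only.
example_scores = {"A": (6.0, 8.0), "B": (2.0, 4.0), "C": (4.0, 7.0)}
example_ranking = rank_by_possibility(example_scores, ideal=(6.0, 8.0))
```

With these made-up intervals, alternative "A" coincides with the ideal reference and ranks first, while "B" lies entirely below it and ranks last.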

Evaluation Review. Pages: 193841X241262887. Publication type: Journal Article.
Citations: 0
Evaluating the Effectiveness of Maternal, Neonatal, and Child Healthcare in Moroccan Hospitals and SDG 3: Using Two-Stage Data Envelopment Analysis and Tobit Regression.
IF 3; Zone 4 (Sociology); Q1 SOCIAL SCIENCES, INTERDISCIPLINARY; Pub Date: 2024-07-20; DOI: 10.1177/0193841X241264863
Youssef Er-Rays, Meriem M'dioud

Maternal, neonatal, and child health play crucial roles in achieving the objectives of Sustainable Development Goal (SDG) 2030, particularly in promoting health and wellbeing. However, maternal, neonatal, and child services in Moroccan public hospitals face challenges, particularly concerning mortality rates and inefficient resource allocation, which hinder optimal outcomes. This study aimed to evaluate the operational effectiveness of 76 neonatal and child health services networks (MNCSN) within Moroccan public hospitals. Using Data Envelopment Analysis (DEA), we assessed technical efficiency (TE) employing both Variable Returns to Scale for inputs (VRS-I) and outputs (VRS-O) orientation. Additionally, the Tobit method (TM) was utilized to explore factors influencing inefficiency, with hospital, doctor, and paramedical staff considered as inputs, and admissions, cesarean interventions, functional capacity, and hospitalization days as outputs. Our findings revealed that VRS-I exhibited a higher average TE score of 0.76 compared to VRS-O (0.23). Notably, the Casablanca-Anfa MNCSN received the highest referrals (30) under VRS-I, followed by the Khemisset MNCSN (24). In contrast, under VRS-O, Ben Msick, Rabat, and Mediouna MNCSN each had three peers, with 71, 22, and 17 references, respectively. Moreover, the average Malmquist Index under VRS-I indicated a 7.7% increase in productivity over the 9-year study period, while under VRS-O, the average Malmquist Index decreased by 8.7%. Furthermore, doctors and functional bed capacity received the highest Tobit model score of 0.01, followed by hospitalization days and cesarean sections. This study underscores the imperative for policymakers to strategically prioritize input factors to enhance efficiency and ensure optimal maternal, neonatal, and child healthcare outcomes.

Evaluation Review. Pages: 193841X241264863. Publication type: Journal Article.
Citations: 0
The Ripple Effect of Managerial Behavior: Exploring Post-experimental Impact of Leading by Example on Small Firms' Cooperation and Performance.
IF 0.9; Zone 4 (Sociology); Q1 SOCIAL SCIENCES, INTERDISCIPLINARY; Pub Date: 2024-06-08; DOI: 10.1177/0193841X241260466
Quang Nguyen, Huong Trang Kim

Cooperation between employees in a company is an important input to firm performance. This study examines how a manager's cooperative behavior and the visibility of this behavior affect the cooperation amongst employees, and subsequently firm performance. To do so, we conducted a field experiment with managers and their employees from 320 Vietnamese small and micro firms to determine the impact of a manager's leading by example (LBE) on employees' behavior, corporate culture, and firm performance. Both managers and employees participated in a Public Good experiment which aimed to elicit individual cooperative behavior. Notably, the decision made by a manager in the experiment was given as an example to employees before they made their own decision in that same experiment. We considered that the example of cooperation by managers in the Public Good experiment communicated a powerful signal to the employees regarding the importance of fostering cooperation in the workplace. Such a signal by the manager, who is at the top of the organizational hierarchy, would impact their employees' behavior in the workplace and the firm's outcomes beyond the experiment. Interestingly, we found that concealing a manager's identity from their employees enhances the impacts of LBE.

Evaluation Review. Pages: 193841X241260466. Publication type: Journal Article.
Citations: 0
Contexts of Convenience: Generalizing from Published Evaluations of School Finance Policies.
IF 0.9; Zone 4 (Sociology); Q1 SOCIAL SCIENCES, INTERDISCIPLINARY; Pub Date: 2024-06-01; Epub Date: 2024-01-31; DOI: 10.1177/0193841X241228335
Danielle V Handel, Eric A Hanushek

Recent attention to the causal identification of spending impacts provides improved estimates of spending outcomes in a variety of circumstances, but the estimates are substantially different across studies. Half of the variation in estimated funding impact on test scores and over three-quarters of the variation of impacts on school attainment reflect differences in the true parameters across study contexts. Unfortunately, inability to describe the circumstances underlying effective school spending impedes any attempts to generalize from the extant results to new policy situations. The evidence indicates that how funds are used is crucial to the outcomes, but such factors as targeting of funds or court interventions fail to explain the existing pattern of results.
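
A statement such as "half of the variation reflects differences in the true parameters" is the kind of quantity usually summarized in meta-analysis by the I² statistic. The abstract does not say which estimator the authors used, so the sketch below is only an assumption-laden illustration based on Cochran's Q with inverse-variance weights. The function name and the two-study inputs are hypothetical and are not taken from the paper.

```python
from typing import Sequence, Tuple


def heterogeneity(effects: Sequence[float],
                  std_errors: Sequence[float]) -> Tuple[float, float, float]:
    """Return (pooled estimate, Cochran's Q, I-squared).

    The pooled estimate is the inverse-variance weighted mean. I-squared is
    max(0, (Q - df) / Q): the share of observed variation across studies
    attributed to differences in true effects rather than sampling error.
    """
    if len(effects) != len(std_errors) or len(effects) < 2:
        raise ValueError("need at least two studies with matching inputs")
    weights = [1.0 / se ** 2 for se in std_errors]
    pooled = sum(w * e for w, e in zip(weights, effects)) / sum(weights)
    q = sum(w * (e - pooled) ** 2 for w, e in zip(weights, effects))
    df = len(effects) - 1
    i_squared = max(0.0, (q - df) / q) if q > 0 else 0.0
    return pooled, q, i_squared


# Hypothetical inputs: two studies with equal precision.
pooled, q, i_squared = heterogeneity([0.0, 0.2], [0.1, 0.1])
```

For these two made-up studies the pooled estimate is 0.1, Q is 2 on one degree of freedom, and I² is therefore 0.5, meaning half of the observed spread would be attributed to genuine differences between contexts.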

Evaluation Review. Pages: 461-494. Publication type: Journal Article.
Citations: 0
The Logic of Generalization From Systematic Reviews and Meta-Analyses of Impact Evaluations.
IF 0.9; Zone 4 (Sociology); Q1 SOCIAL SCIENCES, INTERDISCIPLINARY; Pub Date: 2024-06-01; Epub Date: 2024-01-23; DOI: 10.1177/0193841X241227481
Julia H Littell

Systematic reviews and meta-analyses are viewed as potent tools for generalized causal inference. These reviews are routinely used to inform decision makers about expected effects of interventions. However, the logic of generalization from research reviews to diverse policy and practice contexts is not well developed. Building on sampling theory, concerns about epistemic uncertainty, and principles of generalized causal inference, this article presents a pragmatic approach to generalizability assessment for use with systematic reviews and meta-analyses. This approach is applied to two systematic reviews and meta-analyses of effects of "evidence-based" psychosocial interventions for youth and families. Evaluations included in systematic reviews are not necessarily representative of populations and treatments of interest. Generalizability of results is limited by high risks of bias, uncertain estimates, and insufficient descriptive data from impact evaluations. Systematic reviews and meta-analyses can be used to test generalizability claims, explore heterogeneity, and identify potential moderators of effects. These reviews can also produce pooled estimates that are not representative of any larger sets of studies, programs, or people. Further work is needed to improve the conduct and reporting of impact evaluations and systematic reviews, and to develop practical approaches to generalizability assessment and guide applications of interventions in diverse policy and practice contexts.

Evaluation Review. Pages: 427-460. Publication type: Journal Article.
Citations: 0
Improving the Usefulness and Use of Meta-Analysis to Inform Policy and Practice.
IF 0.9; Zone 4 (Sociology); Q1 SOCIAL SCIENCES, INTERDISCIPLINARY; Pub Date: 2024-06-01; Epub Date: 2024-02-03; DOI: 10.1177/0193841X241229885
Rebecca Maynard

This chapter begins with an overview of recent developments that have encouraged and facilitated greater use of research syntheses, including Meta-Analysis, to guide public policy and practice in education, workforce development, and social services. It discusses the role of Meta-Analysis for improving knowledge of the effectiveness of programs, policies, and practices and the applicability and generalizability of that knowledge to conditions other than those represented by the study samples and settings. The chapter concludes with recommendations for improving the potential of Meta-Analysis to accelerate knowledge development through changing how we design, conduct, and report findings of individual studies to maximize their usefulness in Meta-Analysis as well as how we produce and report Meta-Analysis findings. The paper includes references to resources supporting the recommendations.

Evaluation Review. Pages: 515-543. Publication type: Journal Article. Open-access PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11003195/pdf/
Citations: 0
Transferability of Lessons From Program Evaluations: Iron Laws, Hiding Hands and the Evidence Ecosystem.
IF 0.9; Zone 4 (Sociology); Q1 SOCIAL SCIENCES, INTERDISCIPLINARY; Pub Date: 2024-06-01; Epub Date: 2024-01-18; DOI: 10.1177/0193841X241228332
Tom Ling

Assessing the transferability of lessons from social research or evaluation continues to raise challenges. Efforts to identify transferable lessons can be based on two different forms of argumentation. The first draws upon statistics and causal inferences. The second involves constructing a reasoned case based on weighing up different data collected along the causal chain from design to delivery. Both approaches benefit from designing research based upon existing evidence and ensuring that the descriptions of the programme, context, and intended beneficiaries are sufficiently rich. Identifying transferable lessons should not be thought of as a one-off event but involves contributing to the iterative learning of a scientific community. To understand the circumstances under which findings can be confidently transferred, we need to understand: (1) How far and why outcomes of interest have multiple, interacting and fluctuating causes. (2) The program design and implementation capacity. (3) Prior knowledge and causal landscapes (and how far these are included in the theory of change). (4) New and relevant knowledge; what can we learn in our 'disputatious community of truth seekers'.

Citations: 0
How Mixed-Methods Research Can Improve the Policy Relevance of Impact Evaluations.
IF 0.9 Zone 4 Sociology Q1 SOCIAL SCIENCES, INTERDISCIPLINARY Pub Date: 2024-06-01 Epub Date: 2024-02-01 DOI: 10.1177/0193841X241227480
Burt S Barnow, Sanjay K Pandey, Qian Eric Luo

This paper describes how mixed methods can improve the value and policy relevance of impact evaluations, paying particular attention to how mixed methods can be used to address external validity and generalization issues. We briefly review the literature on the rationales for using mixed methods; provide documentation of the extent to which mixed methods have been used in impact evaluations in recent years; describe how we developed a list of recent impact evaluations using mixed methods and the process used to conduct full-text reviews of these articles; summarize the findings from our analysis of the articles; discuss three exemplars of using mixed methods in impact evaluations; and discuss how mixed methods have been used for studying and improving external validity and potential improvements that could be made in this area. We find that mixed methods are rarely used in impact evaluations, and we believe that increased use of mixed methods would be useful because they can reinforce findings from the quantitative analysis (triangulation), and they can also help us understand the mechanism by which programs have their impacts and the reasons why programs fail.

Citations: 0
Validity Evidence for an Observational Fidelity Measure to Inform Scale-Up of Evidence-Based Interventions
IF 0.9 Zone 4 Sociology Q1 SOCIAL SCIENCES, INTERDISCIPLINARY Pub Date: 2024-04-30 DOI: 10.1177/0193841x241248864
Pamela R. Buckley, Katie Massey Combs, Karen M. Drewelow, Brittany L. Hubler, Marion Amanda Lain
As evidence-based interventions are scaled, fidelity of implementation, and thus effectiveness, often wanes. Validated fidelity measures can improve researchers’ ability to attribute outcomes to the intervention and help practitioners feel more confident in implementing the intervention as intended. We aim to provide a model for the validation of fidelity observation protocols to guide future research studying evidence-based interventions scaled up under real-world conditions. We describe a process to build evidence of validity for items within the Session Review Form, an observational tool measuring fidelity to interactive drug prevention programs such as the Botvin LifeSkills Training program. Following Kane’s (2006) assumptions framework requiring that validity evidence be built across four areas (scoring, generalizability, extrapolation, and decision), confirmatory factor analysis supported the hypothesized two-factor structure measuring quality of delivery (seven items assessing how well the material is implemented) and participant responsiveness (three items evaluating how well the intervention is received), and measurement invariance tests suggested the structure held across grade level and schools serving different student populations. These findings provide some evidence supporting the extrapolation assumption, though additional research is warranted since a more complete overall depiction of the validity argument is needed to evaluate fidelity measures.
Citations: 0