首页 > 最新文献

Journal of behavioral data science最新文献

英文 中文
Rephrasing the Lengthy and Involved Proof of Kristof’s Theorem: A Tutorial with Some New Findings 重新表述克里斯托夫定理冗长而复杂的证明:附带一些新发现的教程
Pub Date : 2024-07-27 DOI: 10.35566/jbds/ogasawara2
Haruhiko Ogasawara
Kristof’s theorem gives the global maximum and minimum of the trace of some matrix products without using calculus or Lagrange multipliers with various applications in psychometrics and multivariate analysis. However, the underutilization has been seen irrespective of its great use in practice. This may partially be due to the lengthy and involved proof of the theorem. In this tutorial, some known or new lemmas are rephrased or provided to understand the essential points in the proof. ten Berge’s generalized Kristof theorem is also addressed. Then, the modified Kristof and ten Berge theorems using parent orthonormal matrices are shown, which may be of use to see the properties of the Kristof and ten Berge theorems. 
克里斯托夫定理给出了某些矩阵乘积迹的全局最大值和最小值,而无需使用微积分或拉格朗日乘法器,在心理测量学和多元分析中有着广泛的应用。然而,尽管该定理在实践中应用广泛,但却一直未得到充分利用。部分原因可能是该定理的证明过程冗长而复杂。在本教程中,将重新表述或提供一些已知或新的公理,以便理解证明中的要点。然后,展示了使用父正交矩阵的修正克利斯朵夫定理和ten Berge定理,这可能有助于了解克利斯朵夫定理和ten Berge定理的性质。
{"title":"Rephrasing the Lengthy and Involved Proof of Kristof’s Theorem: A Tutorial with Some New Findings","authors":"Haruhiko Ogasawara","doi":"10.35566/jbds/ogasawara2","DOIUrl":"https://doi.org/10.35566/jbds/ogasawara2","url":null,"abstract":"Kristof’s theorem gives the global maximum and minimum of the trace of some matrix products without using calculus or Lagrange multipliers with various applications in psychometrics and multivariate analysis. However, the underutilization has been seen irrespective of its great use in practice. This may partially be due to the lengthy and involved proof of the theorem. In this tutorial, some known or new lemmas are rephrased or provided to understand the essential points in the proof. ten Berge’s generalized Kristof theorem is also addressed. Then, the modified Kristof and ten Berge theorems using parent orthonormal matrices are shown, which may be of use to see the properties of the Kristof and ten Berge theorems.\u0000 ","PeriodicalId":93575,"journal":{"name":"Journal of behavioral data science","volume":"76 2","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-07-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141798469","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Stability and Spread: Transition Metrics that are Robust to Time Interval Misspecification 稳定性与传播:不受时间间隔错误规范影响的过渡指标
Pub Date : 2024-06-11 DOI: 10.35566/jbds/daniel
Katharine Daniel, Robert Moulder, Matthew Southward, Jennifer Cheavens, Steven Boker
Intensive longitudinal data collected via ecological momentary assessment (EMA) are often sampled with unequal time spacing between surveys. Given the popularity of EMA data, it is important to understand whether time series methods are robust to such time interval misspecification. The present study demonstrates via simulation that stability and spread—two metrics for quantifying different aspects of transitioning behavior within multivariate binary time series data—are unbiased when applied to data that are collected along an off/on burst sampling schedule, a between-person random sampling schedule, and a within-person random sampling schedule. These results held in randomly generated data with differing numbers of time series variables (k=10 and k=20) and in data simulated based on the proportions of observed data from a prior EMA study. Further, stability and spread demonstrated approximately 95% coverage for all between- and within-person random sampling schedules. However, coverage for stability and spread was poor in the off/on burst sampling schedules (around 67%). We also applied these transition metrics—which measure repetitiveness and diversity of transitions, respectively—to a foundational EMA dataset that was among the first to show that adults regularly use many different emotion regulation strategies throughout their daily life citep{heiy2014back}. As hypothesized, we found a stronger positive relation between mood and higher stability/lower spread in emotion regulation among people with fewer depressive symptoms than those with more depressive symptoms. Taken together, stability and spread appear to be appropriate metrics to use with data collected using common unequal time spacing conditions and can be used to uncover theoretically consistent insights in real psychosocial data.
通过生态瞬时评估(EMA)收集的密集纵向数据通常是在调查时间间隔不等的情况下采样的。鉴于 EMA 数据的普及性,了解时间序列方法是否对这种时间间隔错误规范具有鲁棒性非常重要。本研究通过仿真证明,稳定性和传播度--这两个用于量化多元二元时间序列数据中过渡行为不同方面的指标--在应用于按照离/开突发抽样计划、人与人之间随机抽样计划和人与人之间随机抽样计划收集的数据时是无偏的。这些结果适用于随机生成的具有不同数量时间序列变量(k=10 和 k=20)的数据,以及基于先前 EMA 研究中观察到的数据比例模拟的数据。此外,在所有人与人之间和人与人之间的随机抽样计划中,稳定性和传播性的覆盖率约为 95%。然而,在关/开突发采样计划中,稳定性和扩散的覆盖率较低(约为 67%)。我们还将这些过渡度量指标--它们分别测量过渡的重复性和多样性--应用于一个基础性的EMA数据集,该数据集是最早显示成年人在日常生活中经常使用多种不同的情绪调节策略的数据集之一(citep{heiy2014back})。正如假设的那样,我们发现与抑郁症状较多的人相比,抑郁症状较少的人的情绪与情绪调节的较高稳定性/较低分散性之间存在更强的正相关关系。综合来看,稳定性和扩散性似乎是使用常见的不等时间间隔条件收集数据的合适指标,可用于在真实的社会心理数据中发现理论上一致的见解。
{"title":"Stability and Spread: Transition Metrics that are Robust to Time Interval Misspecification","authors":"Katharine Daniel, Robert Moulder, Matthew Southward, Jennifer Cheavens, Steven Boker","doi":"10.35566/jbds/daniel","DOIUrl":"https://doi.org/10.35566/jbds/daniel","url":null,"abstract":"Intensive longitudinal data collected via ecological momentary assessment (EMA) are often sampled with unequal time spacing between surveys. Given the popularity of EMA data, it is important to understand whether time series methods are robust to such time interval misspecification. The present study demonstrates via simulation that stability and spread—two metrics for quantifying different aspects of transitioning behavior within multivariate binary time series data—are unbiased when applied to data that are collected along an off/on burst sampling schedule, a between-person random sampling schedule, and a within-person random sampling schedule. These results held in randomly generated data with differing numbers of time series variables (k=10 and k=20) and in data simulated based on the proportions of observed data from a prior EMA study. Further, stability and spread demonstrated approximately 95% coverage for all between- and within-person random sampling schedules. However, coverage for stability and spread was poor in the off/on burst sampling schedules (around 67%). We also applied these transition metrics—which measure repetitiveness and diversity of transitions, respectively—to a foundational EMA dataset that was among the first to show that adults regularly use many different emotion regulation strategies throughout their daily life citep{heiy2014back}. As hypothesized, we found a stronger positive relation between mood and higher stability/lower spread in emotion regulation among people with fewer depressive symptoms than those with more depressive symptoms. Taken together, stability and spread appear to be appropriate metrics to use with data collected using common unequal time spacing conditions and can be used to uncover theoretically consistent insights in real psychosocial data.","PeriodicalId":93575,"journal":{"name":"Journal of behavioral data science","volume":"8 4","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-06-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141356163","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Novel Approach for Identifying Unobserved Heterogeneity in Longitudinal Growth Trajectories Using Natural Cubic Smoothing Splines 利用自然立方平滑样条确定纵向增长轨迹中未观察到的异质性的新方法
Pub Date : 2024-05-12 DOI: 10.35566/jbds/marcoulides
Katerina M. Marcoulides, Laura Trinchera
A novel algorithmic modeling method is proposed to determine dissimilarities between subjects for longitudinal data clustering using natural cubic smoothing splines. Although various modeling techniques have to date been suggested for conducting such analyses, a major problem with many of these approaches is that they often impose overly restrictive assumptions. As a consequence, potentially problematic interpretations of data clustering regarding both the number and the nature of the growth trajectory patterns can occur. The proposed method is shown to be highly effective in identifying heterogeneity of growth trajectories in settings with data exhibiting complex nonlinear longitudinal patterns and without imposing potentially problematic constraints on the model.
本文提出了一种新颖的算法建模方法,利用自然立方平滑样条确定纵向数据聚类中受试者之间的差异性。虽然迄今为止已有多种建模技术用于进行此类分析,但其中许多方法的一个主要问题是,它们往往施加了限制性过强的假设。因此,对数据聚类的解释可能会在增长轨迹模式的数量和性质方面出现问题。事实证明,在数据呈现复杂的非线性纵向模式的情况下,所提出的方法在识别成长轨迹的异质性方面非常有效,而且不会对模型施加可能存在问题的限制。
{"title":"A Novel Approach for Identifying Unobserved Heterogeneity in Longitudinal Growth Trajectories Using Natural Cubic Smoothing Splines","authors":"Katerina M. Marcoulides, Laura Trinchera","doi":"10.35566/jbds/marcoulides","DOIUrl":"https://doi.org/10.35566/jbds/marcoulides","url":null,"abstract":"A novel algorithmic modeling method is proposed to determine dissimilarities between subjects for longitudinal data clustering using natural cubic smoothing splines. Although various modeling techniques have to date been suggested for conducting such analyses, a major problem with many of these approaches is that they often impose overly restrictive assumptions. As a consequence, potentially problematic interpretations of data clustering regarding both the number and the nature of the growth trajectory patterns can occur. The proposed method is shown to be highly effective in identifying heterogeneity of growth trajectories in settings with data exhibiting complex nonlinear longitudinal patterns and without imposing potentially problematic constraints on the model.","PeriodicalId":93575,"journal":{"name":"Journal of behavioral data science","volume":"122 21","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140986514","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Proof-of-Concept Study Demonstrating How FITBIR Datasets Can be Harmonized to Examine Posttraumatic Stress Disorder-Traumatic Brain Injury Associations 概念验证研究展示如何协调 FITBIR 数据集,以检查创伤后应激障碍与创伤性脑损伤之间的关系
Pub Date : 2024-04-25 DOI: 10.35566/jbds/oneil
Maya E O'Neil, David C Cameron, Kate Clauss, Danielle Krushnic, William Baker-Robinson, Sara Hannon, Tamara Cheney, Josh Kaplan, Lawrence Cook, Meike Niederhausen, Miranda Pappas, David Cifu
Background: Although posttraumatic stress disorder (PTSD) is common following traumatic brain injury (TBI), the specific associations between these conditions is difficult to elucidate in part due to the diverse methodologies, small samples, and limited longitudinal data in the extant literature. Objective: Conduct a proof-of-concept study demonstrating our ability to compile patient-level TBI data from shared studies in the Federal Interagency Traumatic Brain Injury Research (FITBIR) Informatics System to address these shortcomings and improve our understanding of TBI outcomes including the rates PTSD comorbidity. Method: We searched the FITBIR database for shared studies reporting rates of probable PTSD among participants with no TBI, history of mild TBI, or history of moderate/severe TBI. We merged and harmonized data across the relevant studies and analyzed rates of probable PTSD across TBI history and severity categories. Results: Four FITBIR studies with 2,312 participants included PTSD outcome data. The final sample for comparative analyses comprised 1,633 participants from two studies with TBI group comparison data. Approximately 79% had a history of mild TBI and 32-37% screened positive for probable PTSD. Participants with a history of mild TBI had 2.8 greater odds of probable PTSD compared to those without TBI (95% CI: 2.0, 3.7). Conclusions: Only two FITBIR studies reported data examining PTSD outcomes for mild TBI as of January 2021. The analyses are consistent with prior literature, suggesting mild TBI is associated with higher rates of probable PTSD than no TBI. This study developed the methods, shared the harmonization and analysis code, and publicly shared the TBI and PTSD meta-dataset back to FITBIR for dissemination through their website, allowing future research teams to update these and other, related analyses as more studies are contributed to and shared via the FITBIR platform.
背景:虽然创伤后应激障碍(PTSD)在创伤性脑损伤(TBI)后很常见,但这些病症之间的具体关联却很难阐明,部分原因是现有文献中的方法多样、样本较少且纵向数据有限。目标:开展一项概念验证研究,证明我们有能力从联邦机构间创伤性脑损伤研究(FITBIR)信息系统中的共享研究中汇编患者级别的创伤性脑损伤数据,以解决这些不足之处,并提高我们对创伤性脑损伤结果(包括创伤后应激障碍合并症发生率)的认识。方法:我们在 FITBIR 数据库中搜索了报告无创伤性脑损伤、轻度创伤性脑损伤或中度/重度创伤性脑损伤参与者中创伤后应激障碍可能发生率的共享研究。我们合并并统一了相关研究的数据,分析了不同创伤后应激障碍病史和严重程度类别的可能创伤后应激障碍发生率。研究结果四项 FITBIR 研究共纳入了 2312 名参与者的创伤后应激障碍结果数据。用于对比分析的最终样本包括来自两项研究的 1,633 名参与者,其中包含 TBI 组对比数据。约 79% 的人有轻度创伤后应激障碍病史,32%-37% 的人可能患有创伤后应激障碍。有轻度创伤性脑损伤病史的参与者与没有创伤性脑损伤病史的参与者相比,可能患有创伤后应激障碍的几率要高出 2.8(95% CI:2.0,3.7)。结论:截至 2021 年 1 月,只有两项 FITBIR 研究报告了轻度 TBI 的创伤后应激障碍结果数据。分析结果与之前的文献一致,表明轻度创伤性脑损伤与可能的创伤后应激障碍发生率高于无创伤性脑损伤。本研究开发了相关方法,共享了协调和分析代码,并将 TBI 和创伤后应激障碍元数据集公开共享给 FITBIR,以便通过其网站进行传播,从而使未来的研究团队能够随着更多研究的贡献和通过 FITBIR 平台的共享而更新这些及其他相关分析。
{"title":"A Proof-of-Concept Study Demonstrating How FITBIR Datasets Can be Harmonized to Examine Posttraumatic Stress Disorder-Traumatic Brain Injury Associations","authors":"Maya E O'Neil, David C Cameron, Kate Clauss, Danielle Krushnic, William Baker-Robinson, Sara Hannon, Tamara Cheney, Josh Kaplan, Lawrence Cook, Meike Niederhausen, Miranda Pappas, David Cifu","doi":"10.35566/jbds/oneil","DOIUrl":"https://doi.org/10.35566/jbds/oneil","url":null,"abstract":"Background: Although posttraumatic stress disorder (PTSD) is common following traumatic brain injury (TBI), the specific associations between these conditions is difficult to elucidate in part due to the diverse methodologies, small samples, and limited longitudinal data in the extant literature. \u0000Objective: Conduct a proof-of-concept study demonstrating our ability to compile patient-level TBI data from shared studies in the Federal Interagency Traumatic Brain Injury Research (FITBIR) Informatics System to address these shortcomings and improve our understanding of TBI outcomes including the rates PTSD comorbidity. \u0000Method: We searched the FITBIR database for shared studies reporting rates of probable PTSD among participants with no TBI, history of mild TBI, or history of moderate/severe TBI. We merged and harmonized data across the relevant studies and analyzed rates of probable PTSD across TBI history and severity categories. \u0000Results: Four FITBIR studies with 2,312 participants included PTSD outcome data. The final sample for comparative analyses comprised 1,633 participants from two studies with TBI group comparison data. Approximately 79% had a history of mild TBI and 32-37% screened positive for probable PTSD. Participants with a history of mild TBI had 2.8 greater odds of probable PTSD compared to those without TBI (95% CI: 2.0, 3.7). \u0000Conclusions: Only two FITBIR studies reported data examining PTSD outcomes for mild TBI as of January 2021. The analyses are consistent with prior literature, suggesting mild TBI is associated with higher rates of probable PTSD than no TBI. This study developed the methods, shared the harmonization and analysis code, and publicly shared the TBI and PTSD meta-dataset back to FITBIR for dissemination through their website, allowing future research teams to update these and other, related analyses as more studies are contributed to and shared via the FITBIR platform.","PeriodicalId":93575,"journal":{"name":"Journal of behavioral data science","volume":"3 11","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-04-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140654610","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Loss Aversion Distribution: The Science Behind Loss Aversion Exhibited by Sellers of Perishable Good 损失厌恶分布:易腐商品卖家损失规避行为背后的科学原理
Pub Date : 2024-03-24 DOI: 10.35566/jbds/koh
Daniel Koh
This research introduces the concept of the loss aversion distribution, a pioneering framework designed for the analysis of consumer behavior. Departing from the conventions of traditional exponential models, this innovative approach incorporates a non-memoryless characteristic, which modulates the consumer's response to loss aversion throughout the product's life cycle. This modulation is achieved by a variable exponent influenced by the parameter $b$, representing the psychological impact of loss aversion, and the constant $k$, which reflects the market value of the good at the time of manufacture. Together, these parameters adeptly encapsulate the dynamic nature of consumer loss aversion from the moment of manufacture to the point of expiry. The model elucidates an initial muted response from consumers at the onset of ownership, which then intensifies during the mid-life cycle of the product, before ultimately diminishing as the product approaches its expiry. Through a meticulous derivative analysis of the probability density function, the study delineates the distribution's key properties, including its monotonicity, boundedness within the interval [0, 1], and its adherence to non-negativity. This framework not only enhances our comprehension of consumer behavior in relation to perishable goods but also paves the way for further investigations into psychometrics and the intricacies of loss aversion modeling.
这项研究引入了损失厌恶分布的概念,这是一个专为分析消费者行为而设计的开创性框架。与传统指数模型的惯例不同,这种创新方法包含了一种无记忆特性,可在产品的整个生命周期中调节消费者对损失厌恶的反应。这种调节是通过一个可变指数实现的,该指数受代表损失厌恶心理影响的参数 $b$ 和反映商品生产时市场价值的常数 $k$ 的影响。这些参数巧妙地概括了消费者从生产到过期的损失规避的动态性质。该模型阐明了消费者在购买产品之初的反应,这种反应在产品的生命周期中期会加剧,最终随着产品的到期而减弱。通过对概率密度函数进行细致的导数分析,研究界定了该分布的关键属性,包括单调性、区间[0, 1]内的有界性以及非负性。这一框架不仅增强了我们对与易腐商品相关的消费者行为的理解,还为进一步研究心理计量学和错综复杂的损失厌恶模型铺平了道路。
{"title":"Loss Aversion Distribution: The Science Behind Loss Aversion Exhibited by Sellers of Perishable Good","authors":"Daniel Koh","doi":"10.35566/jbds/koh","DOIUrl":"https://doi.org/10.35566/jbds/koh","url":null,"abstract":"This research introduces the concept of the loss aversion distribution, a pioneering framework designed for the analysis of consumer behavior. Departing from the conventions of traditional exponential models, this innovative approach incorporates a non-memoryless characteristic, which modulates the consumer's response to loss aversion throughout the product's life cycle. This modulation is achieved by a variable exponent influenced by the parameter $b$, representing the psychological impact of loss aversion, and the constant $k$, which reflects the market value of the good at the time of manufacture. Together, these parameters adeptly encapsulate the dynamic nature of consumer loss aversion from the moment of manufacture to the point of expiry. The model elucidates an initial muted response from consumers at the onset of ownership, which then intensifies during the mid-life cycle of the product, before ultimately diminishing as the product approaches its expiry. Through a meticulous derivative analysis of the probability density function, the study delineates the distribution's key properties, including its monotonicity, boundedness within the interval [0, 1], and its adherence to non-negativity. This framework not only enhances our comprehension of consumer behavior in relation to perishable goods but also paves the way for further investigations into psychometrics and the intricacies of loss aversion modeling.","PeriodicalId":93575,"journal":{"name":"Journal of behavioral data science","volume":" 114","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-03-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140385882","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Tutorial on Bayesian Linear Regression with Compositional Predictors Using JAGS 使用 JAGS 对组合预测因子进行贝叶斯线性回归的教程
Pub Date : 2024-01-28 DOI: 10.35566/jbds/tongliu
Yunli Liu, Xin Tong
This tutorial offers an exploration of advanced Bayesian methodologies for compositional data analysis, specifically the Bayesian Lasso and Bayesian Spike-and-Slab Lasso (SSL) techniques. Our focus is on a novel Bayesian methodology that integrates Lasso and SSL priors, enhancing both parameter estimation and variable selection for linear regression with compositional predictors. The tutorial is structured to streamline the learning process, breaking down complex analyses into a series of straightforward steps. We demonstrate these methods using R and JAGS, employing simulated datasets to illustrate key concepts. Our objective is to provide a clear and comprehensive understanding of these sophisticated Bayesian techniques, preparing readers to adeptly navigate and apply these methods in their own compositional data analysis endeavors.
本教程探讨了用于组合数据分析的高级贝叶斯方法,特别是贝叶斯拉索(Bayesian Lasso)和贝叶斯尖峰-斜线拉索(Bayesian Spike-and-Slab Lasso,SSL)技术。我们的重点是一种新颖的贝叶斯方法,该方法将 Lasso 和 SSL 先验整合在一起,增强了参数估计和变量选择的能力,适用于带有组合预测因子的线性回归。本教程的结构简化了学习过程,将复杂的分析分解为一系列简单明了的步骤。我们使用 R 和 JAGS 演示了这些方法,并使用模拟数据集来说明关键概念。我们的目标是让读者对这些复杂的贝叶斯技术有一个清晰而全面的了解,为他们在自己的成分数据分析工作中熟练地掌握和应用这些方法做好准备。
{"title":"A Tutorial on Bayesian Linear Regression with Compositional Predictors Using JAGS","authors":"Yunli Liu, Xin Tong","doi":"10.35566/jbds/tongliu","DOIUrl":"https://doi.org/10.35566/jbds/tongliu","url":null,"abstract":"This tutorial offers an exploration of advanced Bayesian methodologies for compositional data analysis, specifically the Bayesian Lasso and Bayesian Spike-and-Slab Lasso (SSL) techniques. Our focus is on a novel Bayesian methodology that integrates Lasso and SSL priors, enhancing both parameter estimation and variable selection for linear regression with compositional predictors. The tutorial is structured to streamline the learning process, breaking down complex analyses into a series of straightforward steps. We demonstrate these methods using R and JAGS, employing simulated datasets to illustrate key concepts. Our objective is to provide a clear and comprehensive understanding of these sophisticated Bayesian techniques, preparing readers to adeptly navigate and apply these methods in their own compositional data analysis endeavors.","PeriodicalId":93575,"journal":{"name":"Journal of behavioral data science","volume":"348 5-6","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-01-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140490852","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Considering the Distributional Form of Zeroes When Calculating Mediation Effects with Zero-Inflated Count Outcomes 在计算零膨胀计数结果的中介效应时考虑零的分布形式
Pub Date : 2023-11-10 DOI: 10.35566/jbds/v3n2/orourke
Holly O'Rourke, Da Eun Han
Recent work has demonstrated how to calculate conditional mediated effects for mediation models with zero-inflated count outcomes in a non-causal framework (O’Rourke & Vazquez, 2019); however, those formulas do not distinguish between logistic and count portions of the data distribution when calculating mediated effects separately for zeroes and counts. When calculating conditional mediated effects for the counts in a zero-inflated count outcome Y, the b path should use the partial derivative of the log-linear regression equation for X and M predicting Y. When calculating conditional mediated effects for the zeroes, the b path should use the partial derivative of the logistic regression equation for X and M predicting Y instead of the log-linear equation. This paper presents adjustments to the analytical formulas of conditional mediated effects for mediation with zero-inflated count outcomes when zeroes and counts are differentially predicted. Using a Monte Carlo simulation, we also empirically show that these adjustments produce different results than when the distributional form of zeroes is ignored.
最近的工作已经证明了如何在非因果框架中计算具有零膨胀计数结果的中介模型的条件中介效应(O 'Rourke &巴斯克斯,2019);然而,在分别计算零和计数的中介效应时,这些公式没有区分数据分布的逻辑部分和计数部分。在计算零膨胀计数结果Y中的计数的条件中介效应时,b路径应使用X和M预测Y的对数线性回归方程的偏导数。在计算零的条件中介效应时,b路径应使用X和M预测Y的逻辑回归方程的偏导数,而不是对数线性方程。本文提出了对零膨胀计数结果的条件中介效应分析公式的调整,当零和计数有差异预测时。使用蒙特卡罗模拟,我们也经验地表明,这些调整产生不同的结果比当零的分布形式被忽略。
{"title":"Considering the Distributional Form of Zeroes When Calculating Mediation Effects with Zero-Inflated Count Outcomes","authors":"Holly O'Rourke, Da Eun Han","doi":"10.35566/jbds/v3n2/orourke","DOIUrl":"https://doi.org/10.35566/jbds/v3n2/orourke","url":null,"abstract":"Recent work has demonstrated how to calculate conditional mediated effects for mediation models with zero-inflated count outcomes in a non-causal framework (O’Rourke & Vazquez, 2019); however, those formulas do not distinguish between logistic and count portions of the data distribution when calculating mediated effects separately for zeroes and counts. When calculating conditional mediated effects for the counts in a zero-inflated count outcome Y, the b path should use the partial derivative of the log-linear regression equation for X and M predicting Y. When calculating conditional mediated effects for the zeroes, the b path should use the partial derivative of the logistic regression equation for X and M predicting Y instead of the log-linear equation. This paper presents adjustments to the analytical formulas of conditional mediated effects for mediation with zero-inflated count outcomes when zeroes and counts are differentially predicted. Using a Monte Carlo simulation, we also empirically show that these adjustments produce different results than when the distributional form of zeroes is ignored.","PeriodicalId":93575,"journal":{"name":"Journal of behavioral data science","volume":" 15","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135141756","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Predicting Dyslexia with Machine Learning: A Comprehensive Review of Feature Selection, Algorithms, and Evaluation Metrics 用机器学习预测阅读障碍:特征选择、算法和评估指标的综合综述
Pub Date : 2023-07-28 DOI: 10.35566/jbds/v3n1/s
Velmurugan S
This literature review explores the use of machine learning-based approaches for the diagnosis and treatment of dyslexia, a learning disorder that affects reading and spelling skills. Various machine learning models, such as artificial neural networks (ANNs), support vector machines (SVMs), and decision trees, have been used to classify individuals as either dyslexic or non-dyslexic based on functional magnetic resonance imaging (fMRI) and electroencephalography (EEG) data. These models have shown promising results for early detection and personalized treatment plans. However, further research is needed to validate these approaches and identify optimal features and models for dyslexia diagnosis and treatment.
这篇文献综述探讨了基于机器学习的方法在阅读障碍诊断和治疗中的应用,阅读障碍是一种影响阅读和拼写技能的学习障碍。各种机器学习模型,如人工神经网络(Ann)、支持向量机(SVM)和决策树,已被用于基于功能磁共振成像(fMRI)和脑电图(EEG)数据将个体分类为阅读障碍或非阅读障碍。这些模型在早期发现和个性化治疗计划方面显示出了有希望的结果。然而,还需要进一步的研究来验证这些方法,并确定阅读障碍诊断和治疗的最佳特征和模型。
{"title":"Predicting Dyslexia with Machine Learning: A Comprehensive Review of Feature Selection, Algorithms, and Evaluation Metrics","authors":"Velmurugan S","doi":"10.35566/jbds/v3n1/s","DOIUrl":"https://doi.org/10.35566/jbds/v3n1/s","url":null,"abstract":"This literature review explores the use of machine learning-based approaches for the diagnosis and treatment of dyslexia, a learning disorder that affects reading and spelling skills. Various machine learning models, such as artificial neural networks (ANNs), support vector machines (SVMs), and decision trees, have been used to classify individuals as either dyslexic or non-dyslexic based on functional magnetic resonance imaging (fMRI) and electroencephalography (EEG) data. These models have shown promising results for early detection and personalized treatment plans. However, further research is needed to validate these approaches and identify optimal features and models for dyslexia diagnosis and treatment.","PeriodicalId":93575,"journal":{"name":"Journal of behavioral data science","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-07-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47245030","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
API Face Value API面值
Pub Date : 2023-07-13 DOI: 10.35566/jbds/v3n1/wyman
Austin Wyman, Zhiyong Zhang
Emotion recognition application programming interface (API) is a recent advancement in computing technology that synthesizes computer vision, machine-learning algorithms, deep-learning neural networks, and other information to detect and label human emotions. The strongest iterations of this technology are produced by technology giants with large, cloud infrastructure (i.e., Google, and Microsoft), bolstering high true positive rates. We review the current status of applications of emotion recognition API in psychological research and find that, despite evidence of spatial, age, and race bias effects, API is improving the accessibility of clinical and educational research. Specifically, emotion detection software can assist individuals with emotion-related deficits (e.g., Autism Spectrum Disorder, Attention Deficit-Hyperactivity Disorder, Alexithymia). API has been incorporated in various computer-assisted interventions for Autism, where it has been used to diagnose, train, and monitor emotional responses to one's environment. We identify AP's potential to enhance interventions in other emotional dysfunction populations and to address various professional needs. Future work should aim to address the bias limitations of API software and expand its utility in subfields of clinical, educational, neurocognitive, and industrial-organizational psychology.
情绪识别应用编程接口(API)是计算技术的最新进展,它综合了计算机视觉、机器学习算法、深度学习神经网络和其他信息来检测和标记人类情绪。这项技术的最强迭代是由拥有大型云基础设施的科技巨头(即谷歌和微软)生产的,从而提高了高真阳性率。我们回顾了情感识别API在心理学研究中的应用现状,发现尽管有证据表明存在空间、年龄和种族偏见效应,但API正在提高临床和教育研究的可及性。具体而言,情绪检测软件可以帮助患有情绪相关缺陷的个体(例如,自闭症谱系障碍、注意力缺陷多动障碍、述情障碍)。API已被纳入自闭症的各种计算机辅助干预措施中,用于诊断、训练和监测对环境的情绪反应。我们确定AP在加强对其他情绪功能障碍人群的干预和满足各种专业需求方面的潜力。未来的工作应旨在解决API软件的偏见限制,并扩大其在临床、教育、神经认知和工业组织心理学子领域的实用性。
{"title":"API Face Value","authors":"Austin Wyman, Zhiyong Zhang","doi":"10.35566/jbds/v3n1/wyman","DOIUrl":"https://doi.org/10.35566/jbds/v3n1/wyman","url":null,"abstract":"Emotion recognition application programming interface (API) is a recent advancement in computing technology that synthesizes computer vision, machine-learning algorithms, deep-learning neural networks, and other information to detect and label human emotions. The strongest iterations of this technology are produced by technology giants with large, cloud infrastructure (i.e., Google, and Microsoft), bolstering high true positive rates. We review the current status of applications of emotion recognition API in psychological research and find that, despite evidence of spatial, age, and race bias effects, API is improving the accessibility of clinical and educational research. Specifically, emotion detection software can assist individuals with emotion-related deficits (e.g., Autism Spectrum Disorder, Attention Deficit-Hyperactivity Disorder, Alexithymia). API has been incorporated in various computer-assisted interventions for Autism, where it has been used to diagnose, train, and monitor emotional responses to one's environment. We identify AP's potential to enhance interventions in other emotional dysfunction populations and to address various professional needs. Future work should aim to address the bias limitations of API software and expand its utility in subfields of clinical, educational, neurocognitive, and industrial-organizational psychology.","PeriodicalId":93575,"journal":{"name":"Journal of behavioral data science","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-07-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47350702","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Using Bayesian Piecewise Growth Curve Models to Handle Complex Nonlinear Trajectories 用贝叶斯分段增长曲线模型处理复杂的非线性轨迹
Pub Date : 2023-07-13 DOI: 10.35566/jbds/v3n1/marvin
Luca Marvin, Haiyan Liu, S. Depaoli
Bayesian growth curve modeling is a popular method for studying longitudinal data. In this study, we discuss a flexible extension, the Bayesian piecewise growth curve model (BPGCM), which allows the researcher to break up a trajectory into phases joined at change points called knots. By fitting BPGCMs, the researcher can specify three or more phases of growth without concern for model identification. Our goal is to provide substantive researchers with a guide for implementing this important class of models. We present a simple application of Bayesian linear BPGCMs to childrens' math achievement. Our tutorial includes Mplus code, strategies for specifying knots, and how to interpret model selection and fit indices. Extensions of the model are discussed.
贝叶斯生长曲线建模是研究纵向数据的常用方法。在这项研究中,我们讨论了一个灵活的扩展,贝叶斯分段增长曲线模型(BPGCM),它允许研究人员将轨迹分解为在称为结点的变化点连接的阶段。通过拟合bpgcm,研究人员可以指定三个或更多的生长阶段,而无需考虑模型识别。我们的目标是为实质性的研究人员提供实现这类重要模型的指南。我们提出了贝叶斯线性bpgcm在儿童数学成绩中的一个简单应用。我们的教程包括Mplus代码,指定结的策略,以及如何解释模型选择和拟合指数。讨论了模型的扩展。
{"title":"Using Bayesian Piecewise Growth Curve Models to Handle Complex Nonlinear Trajectories","authors":"Luca Marvin, Haiyan Liu, S. Depaoli","doi":"10.35566/jbds/v3n1/marvin","DOIUrl":"https://doi.org/10.35566/jbds/v3n1/marvin","url":null,"abstract":"Bayesian growth curve modeling is a popular method for studying longitudinal data. In this study, we discuss a flexible extension, the Bayesian piecewise growth curve model (BPGCM), which allows the researcher to break up a trajectory into phases joined at change points called knots. By fitting BPGCMs, the researcher can specify three or more phases of growth without concern for model identification. Our goal is to provide substantive researchers with a guide for implementing this important class of models. We present a simple application of Bayesian linear BPGCMs to childrens' math achievement. Our tutorial includes Mplus code, strategies for specifying knots, and how to interpret model selection and fit indices. Extensions of the model are discussed.","PeriodicalId":93575,"journal":{"name":"Journal of behavioral data science","volume":"1 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-07-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43379689","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of behavioral data science
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1