首页 > 最新文献

Journal of data science最新文献

英文 中文
Prediction of Fetal Health Status Using Machine Learning 利用机器学习预测胎儿健康状况
Pub Date : 2024-07-01 DOI: 10.61453/jods.v2024no17
Naidile S Saragodu, Shreedhara N Hegde, Harprith Kaur
The goal of this promising area of study is to enhance prenatal care and lower fetal morbidity and mortality by utilizingmachine learning to anticipate fetal disease. In this study, we present a machine learning-based strategy for predicting fetaldiseases from clinical data. First, we gathered a sizable collection of clinical information from expectant mothers with various fetal disorders. Using clinical guidelines, we pre-processed the data and retrieved pertinent features. We integrated a range of machine learning algorithms, including logistic regression, support vector machines, decision trees, and random forests, to train and test our model. We evaluated the performance of our model using severalfactors, such as accuracy, sensitivity, specificity, and area under the receiver operating characteristic curve (AUC-ROC).The results of this study demonstrate how machine learning algorithms can accurately forecast fetal health status. The developed models achieve good accuracy and AUC-ROC ratings todistinguish between healthy and at-risk fetuses. The interpretability study identifies key clinical characteristics that have a significant impact on the prediction, providing medical practitioners with useful information when making decisions about prenatal care. Through the provision of more unbiasedand precise assessments of fetal health status, machine learning techniques incorporated into prenatal care have the potential to transform the industry. By providing accurate and early projections, this technology can assist healthcare professionals in identifying high-risk pregnancies and carrying out the necessary procedures, improving mother and fetal outcomes. Future research should concentrate on verifying and improving predictive models on larger and more varied datasets to ensure real-world applicability and reliability
这一前景广阔的研究领域的目标是利用机器学习预测胎儿疾病,从而加强产前护理,降低胎儿发病率和死亡率。在这项研究中,我们提出了一种基于机器学习的策略,从临床数据中预测胎儿疾病。首先,我们收集了大量患有各种胎儿疾病的准妈妈的临床信息。利用临床指南,我们对数据进行了预处理,并检索了相关特征。我们整合了一系列机器学习算法,包括逻辑回归、支持向量机、决策树和随机森林,以训练和测试我们的模型。我们使用准确性、灵敏度、特异性和接收者操作特征曲线下面积(AUC-ROC)等多个因素评估了模型的性能。所开发的模型在区分健康胎儿和高危胎儿方面具有良好的准确性和 AUC-ROC 评级。可解释性研究确定了对预测有重大影响的关键临床特征,为医疗从业人员在产前护理决策时提供了有用的信息。通过对胎儿健康状况进行更公正、更精确的评估,将机器学习技术融入产前护理有望改变整个行业。通过提供准确的早期预测,这项技术可以帮助医疗保健专业人员识别高危妊娠并实施必要的手术,从而改善母亲和胎儿的预后。未来的研究应集中于在更大、更多样的数据集上验证和改进预测模型,以确保其在现实世界中的适用性和可靠性。
{"title":"Prediction of Fetal Health Status Using Machine Learning","authors":"Naidile S Saragodu, Shreedhara N Hegde, Harprith Kaur","doi":"10.61453/jods.v2024no17","DOIUrl":"https://doi.org/10.61453/jods.v2024no17","url":null,"abstract":"The goal of this promising area of study is to enhance prenatal care and lower fetal morbidity and mortality by utilizingmachine learning to anticipate fetal disease. In this study, we present a machine learning-based strategy for predicting fetaldiseases from clinical data. First, we gathered a sizable collection of clinical information from expectant mothers with various fetal disorders. Using clinical guidelines, we pre-processed the data and retrieved pertinent features. We integrated a range of machine learning algorithms, including logistic regression, support vector machines, decision trees, and random forests, to train and test our model. We evaluated the performance of our model using severalfactors, such as accuracy, sensitivity, specificity, and area under the receiver operating characteristic curve (AUC-ROC).The results of this study demonstrate how machine learning algorithms can accurately forecast fetal health status. The developed models achieve good accuracy and AUC-ROC ratings todistinguish between healthy and at-risk fetuses. The interpretability study identifies key clinical characteristics that have a significant impact on the prediction, providing medical practitioners with useful information when making decisions about prenatal care. Through the provision of more unbiasedand precise assessments of fetal health status, machine learning techniques incorporated into prenatal care have the potential to transform the industry. By providing accurate and early projections, this technology can assist healthcare professionals in identifying high-risk pregnancies and carrying out the necessary procedures, improving mother and fetal outcomes. Future research should concentrate on verifying and improving predictive models on larger and more varied datasets to ensure real-world applicability and reliability","PeriodicalId":15636,"journal":{"name":"Journal of data science","volume":"5 20","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141846732","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Teaching Visual Accessibility in Introductory Data Science Classes with Multi-Modal Data Representations 使用多模态数据表示在数据科学入门课程中教授可视化可访问性
Pub Date : 2023-01-01 DOI: 10.6339/23-jds1095
JooYoung Seo, Mine Dogucu
Although there are various ways to represent data patterns and models, visualization has been primarily taught in many data science courses for its efficiency. Such vision-dependent output may cause critical barriers against those who are blind and visually impaired and people with learning disabilities. We argue that instructors need to teach multiple data representation methods so that all students can produce data products that are more accessible. In this paper, we argue that accessibility should be taught as early as the introductory course as part of the data science curriculum so that regardless of whether learners major in data science or not, they can have foundational exposure to accessibility. As data science educators who teach accessibility as part of our lower-division courses in two different institutions, we share specific examples that can be utilized by other data science instructors.
尽管有各种表示数据模式和模型的方法,但可视化主要是在许多数据科学课程中教授的,因为它的效率很高。这种依赖视觉的输出可能对盲人和视力受损者以及有学习障碍的人造成严重障碍。我们认为,教师需要教授多种数据表示方法,以便所有学生都能产生更易于访问的数据产品。在本文中,我们认为可访问性应该早在入门课程中就作为数据科学课程的一部分进行教授,这样无论学习者是否主修数据科学,他们都可以对可访问性有基本的了解。作为数据科学教育工作者,我们在两所不同的机构中教授可访问性作为我们低级别课程的一部分,我们分享了其他数据科学教师可以使用的具体示例。
{"title":"Teaching Visual Accessibility in Introductory Data Science Classes with Multi-Modal Data Representations","authors":"JooYoung Seo, Mine Dogucu","doi":"10.6339/23-jds1095","DOIUrl":"https://doi.org/10.6339/23-jds1095","url":null,"abstract":"Although there are various ways to represent data patterns and models, visualization has been primarily taught in many data science courses for its efficiency. Such vision-dependent output may cause critical barriers against those who are blind and visually impaired and people with learning disabilities. We argue that instructors need to teach multiple data representation methods so that all students can produce data products that are more accessible. In this paper, we argue that accessibility should be taught as early as the introductory course as part of the data science curriculum so that regardless of whether learners major in data science or not, they can have foundational exposure to accessibility. As data science educators who teach accessibility as part of our lower-division courses in two different institutions, we share specific examples that can be utilized by other data science instructors.","PeriodicalId":15636,"journal":{"name":"Journal of data science","volume":"44 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135585729","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
The Philosophy of Copula Modeling: A Conversation with ChatGPT Copula建模的哲学:与ChatGPT的对话
Pub Date : 2023-01-01 DOI: 10.6339/23-jds1114
Marius Hofert
In the form of a scholarly exchange with ChatGPT, we cover fundamentals of modeling stochastic dependence with copulas. The conversation is aimed at a broad audience and provides a light introduction to the topic of copula modeling, a field of potential relevance in all areas where more than one random variable appears in the modeling process. Topics covered include the definition, Sklar’s theorem, the invariance principle, pseudo-observations, tail dependence and stochastic representations. The conversation also shows to what degree it can be useful (or not) to learn about such concepts by interacting with the current version of a chatbot.
在与ChatGPT的学术交流中,我们介绍了用copula建模随机依赖的基本原理。对话针对的是广泛的受众,并提供了对copula建模主题的简单介绍,这是一个在建模过程中出现多个随机变量的所有领域中潜在的相关领域。涵盖的主题包括定义、斯克拉定理、不变性原理、伪观测、尾依赖和随机表示。对话还显示了通过与当前版本的聊天机器人交互来了解这些概念在多大程度上是有用的(或无用的)。
{"title":"The Philosophy of Copula Modeling: A Conversation with ChatGPT","authors":"Marius Hofert","doi":"10.6339/23-jds1114","DOIUrl":"https://doi.org/10.6339/23-jds1114","url":null,"abstract":"In the form of a scholarly exchange with ChatGPT, we cover fundamentals of modeling stochastic dependence with copulas. The conversation is aimed at a broad audience and provides a light introduction to the topic of copula modeling, a field of potential relevance in all areas where more than one random variable appears in the modeling process. Topics covered include the definition, Sklar’s theorem, the invariance principle, pseudo-observations, tail dependence and stochastic representations. The conversation also shows to what degree it can be useful (or not) to learn about such concepts by interacting with the current version of a chatbot.","PeriodicalId":15636,"journal":{"name":"Journal of data science","volume":"51 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136209899","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Exploring Racial and Ethnic Differences in US Home Ownership with Bayesian Beta-Binomial Regression 用贝叶斯β -二项回归探讨美国住房所有权的种族差异
Pub Date : 2023-01-01 DOI: 10.6339/23-jds1113
Jhonatan Medri, Tejasvi Channagiri, Lu Lu
Racial and ethnic representation in home ownership rates is an important public policy topic for addressing inequality within society. Although more than half of the households in the US are owned, rather than rented, the representation of home ownership is unequal among different racial and ethnic groups. Here we analyze the US Census Bureau’s American Community Survey data to conduct an exploratory and statistical analysis of home ownership in the US, and find sociodemographic factors that are associated with differences in home ownership rates. We use binomial and beta-binomial generalized linear models (GLMs) with 2020 county-level data to model the home ownership rate, and fit the beta-binomial models with Bayesian estimation. We determine that race/ethnic group, geographic region, and income all have significant associations with the home ownership rate. To make the data and results accessible to the public, we develop an Shiny web application in R with exploratory plots and model predictions.
住房自有率中的种族和民族代表性是解决社会不平等问题的重要公共政策主题。尽管美国有一半以上的家庭是自有住房,而不是租房,但不同种族和族裔群体的住房拥有率是不平等的。在这里,我们分析了美国人口普查局的美国社区调查数据,对美国的住房拥有率进行了探索性和统计分析,并找到了与住房拥有率差异相关的社会人口因素。本文采用二项和β -二项广义线性模型(GLMs)对2020年县级住房自有率进行建模,并用贝叶斯估计对β -二项模型进行拟合。我们确定种族/民族、地理区域和收入都与住房自有率有显著关联。为了让公众可以访问数据和结果,我们用R语言开发了一个Shiny的web应用程序,其中包含探索性图和模型预测。
{"title":"Exploring Racial and Ethnic Differences in US Home Ownership with Bayesian Beta-Binomial Regression","authors":"Jhonatan Medri, Tejasvi Channagiri, Lu Lu","doi":"10.6339/23-jds1113","DOIUrl":"https://doi.org/10.6339/23-jds1113","url":null,"abstract":"Racial and ethnic representation in home ownership rates is an important public policy topic for addressing inequality within society. Although more than half of the households in the US are owned, rather than rented, the representation of home ownership is unequal among different racial and ethnic groups. Here we analyze the US Census Bureau’s American Community Survey data to conduct an exploratory and statistical analysis of home ownership in the US, and find sociodemographic factors that are associated with differences in home ownership rates. We use binomial and beta-binomial generalized linear models (GLMs) with 2020 county-level data to model the home ownership rate, and fit the beta-binomial models with Bayesian estimation. We determine that race/ethnic group, geographic region, and income all have significant associations with the home ownership rate. To make the data and results accessible to the public, we develop an Shiny web application in R with exploratory plots and model predictions.","PeriodicalId":15636,"journal":{"name":"Journal of data science","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135313124","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Legendary Career and Colorful Life: A Conversation with Dr. Bob Riffenburgh 传奇的事业和丰富多彩的生活:与鲍勃·里芬伯格博士对话
Pub Date : 2023-01-01 DOI: 10.6339/23-jds1115
Haim Bar, Jun Yan
In 2022 the American Statistical Association established the Riffenburgh Award, which recognizes exceptional innovation in extending statistical methods across diverse fields. Simultaneously, the Department of Statistics at the University of Connecticut proudly commemorated six decades of excellence, having evolved into a preeminent hub for academic, industrial, and governmental statistical grooming. To honor this legacy, a captivating virtual dialogue was conducted with the department’s visionary founder, Dr. Robert H. Riffenburgh, delving into his extraordinary career trajectory, profound insights into the statistical vocation, and heartfelt accounts from the faculty and students he personally nurtured. This multifaceted narrative documents the conversation with more detailed background information on each topic covered by the interview than what is presented in the video recording on YouTube.
2022年,美国统计协会设立了里芬伯格奖,以表彰在不同领域扩展统计方法的杰出创新。同时,康涅狄格大学统计系自豪地纪念了60年来的卓越成就,该系已发展成为学术、工业和政府统计培训的卓越中心。为了纪念这一遗产,我们与该部门富有远见的创始人罗伯特·h·里芬伯格博士进行了一场引人入胜的虚拟对话,深入探讨了他非凡的职业轨迹,对统计职业的深刻见解,以及他亲自培养的教职员工和学生的衷心叙述。这种多方面的叙述记录了谈话,对采访中涉及的每个话题都有更详细的背景信息,而不是YouTube上的视频记录。
{"title":"Legendary Career and Colorful Life: A Conversation with Dr. Bob Riffenburgh","authors":"Haim Bar, Jun Yan","doi":"10.6339/23-jds1115","DOIUrl":"https://doi.org/10.6339/23-jds1115","url":null,"abstract":"In 2022 the American Statistical Association established the Riffenburgh Award, which recognizes exceptional innovation in extending statistical methods across diverse fields. Simultaneously, the Department of Statistics at the University of Connecticut proudly commemorated six decades of excellence, having evolved into a preeminent hub for academic, industrial, and governmental statistical grooming. To honor this legacy, a captivating virtual dialogue was conducted with the department’s visionary founder, Dr. Robert H. Riffenburgh, delving into his extraordinary career trajectory, profound insights into the statistical vocation, and heartfelt accounts from the faculty and students he personally nurtured. This multifaceted narrative documents the conversation with more detailed background information on each topic covered by the interview than what is presented in the video recording on YouTube.","PeriodicalId":15636,"journal":{"name":"Journal of data science","volume":"53 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136301956","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of data science
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1