Frontiers in Artificial Intelligence最新文献_第5页

Segmenting female students' perceptions about Fintech using Explainable AI. 利用可解释人工智能细分女学生对金融科技的看法。

IF 3 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Frontiers in Artificial Intelligence

Pub Date : 2024-12-12 eCollection Date: 2024-01-01 DOI: 10.3389/frai.2024.1504963

Christos Adam

The use of Financial Technology (Fintech) has been proposed as a promising way to bridge the gender gap, both financially and socially. However, there is evidence that Fintech is far from achieving this objective, and that women's perceptions of Fintech usages are not clear. Therefore, the main objective of the this study is to segment women's perceptions toward Fintech tools and interpret these segments using machine learning methods. Two primary segments of women were produced, namely a "Fintech-friendly" group and a "Fintech-sceptical" group. The importance and reasonings behind the aforementioned segmentation are then examined. The most prominent factors affecting a woman being in the "Fintech-friendly" group are the perceived benefits of Fintech tools compared to the traditional ones, such as ease of usage, time-space convenience, and its advantageous nature. Finally, for Fintech stakeholders, implications for usability, ease, Fintech education, and tailored experiences may be advantageous approaches.

金融科技（Fintech）的使用被认为是弥合经济和社会性别差距的一种有希望的方式。然而，有证据表明，金融科技远未实现这一目标，女性对金融科技用途的看法也不清楚。因此，本研究的主要目的是细分女性对金融科技工具的看法，并使用机器学习方法解释这些细分。研究产生了两个主要的女性群体，即“金融科技友好”群体和“金融科技怀疑”群体。然后检查上述分割背后的重要性和原因。影响女性加入“金融科技友好型”群体的最重要因素是，与传统工具相比，金融科技工具的感知优势，如易用性、时空便捷性及其优势性质。最后，对于金融科技利益相关者来说，可用性、易用性、金融科技教育和量身定制的体验可能是有利的方法。

引用次数: 0

A hybrid deep learning-based approach for optimal genotype by environment selection. 基于混合深度学习的环境最优基因型选择方法。

IF 3 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Frontiers in Artificial Intelligence

Pub Date : 2024-12-11 eCollection Date: 2024-01-01 DOI: 10.3389/frai.2024.1312115

Zahra Khalilzadeh, Motahareh Kashanian, Saeed Khaki, Lizhi Wang

The ability to accurately predict the yields of different crop genotypes in response to weather variability is crucial for developing climate resilient crop cultivars. Genotype-environment interactions introduce large variations in crop-climate responses, and are hard to factor in to breeding programs. Data-driven approaches, particularly those based on machine learning, can help guide breeding efforts by factoring in genotype-environment interactions when making yield predictions. Using a new yield dataset containing 93,028 records of soybean hybrids across 159 locations, 28 states, and 13 years, with 5,838 distinct genotypes and daily weather data over a 214-day growing season, we developed two convolutional neural network (CNN) models: one that integrates CNN and fully-connected neural networks (CNN model), and another that incorporates a long short-term memory (LSTM) layer after the CNN component (CNN-LSTM model). By applying the Generalized Ensemble Method (GEM), we combined the CNN-based models and optimized their weights to improve overall predictive performance. The dataset provided unique genotype information on seeds, enabling an investigation into the potential of planting different genotypes based on weather variables. We employed the proposed GEM model to identify the best-performing genotypes across various locations and weather conditions, making yield predictions for all potential genotypes in each specific setting. To assess the performance of the GEM model, we evaluated it on unseen genotype-location combinations, simulating real-world scenarios where new genotypes are introduced. By combining the base models, the GEM ensemble approach provided much better prediction accuracy compared to using the CNN-LSTM model alone and slightly better accuracy than the CNN model, as measured by both RMSE and MAE on the validation and test sets. The proposed data-driven approach can be valuable for genotype selection in scenarios with limited testing years. In addition, we explored the impact of incorporating state-level soil data alongside the weather, location, genotype and year variables. Due to data constraints, including the absence of latitude and longitude details, we used uniform soil variables for all locations within the same state. This limitation restricted our spatial information to state-level knowledge. Our findings suggested that integrating state-level soil variables did not substantially enhance the predictive capabilities of the models. We also performed a feature importance analysis using RMSE change to identify crucial predictors. Location showed the highest RMSE change, followed by genotype and year. Among weather variables, maximum direct normal irradiance (MDNI) and average precipitation (AP) displayed higher RMSE changes, indicating their importance.

根据天气变化准确预测不同作物基因型产量的能力对于开发适应气候变化的作物品种至关重要。基因型与环境的相互作用会导致作物对气候的反应发生巨大变化，而且很难在育种计划中加以考虑。数据驱动的方法，特别是那些基于机器学习的方法，可以通过在进行产量预测时考虑基因型-环境相互作用来帮助指导育种工作。利用一个新的产量数据集，该数据集包含159个地区、28个州、13年、5838种不同基因型的93028份大豆杂交记录，以及214天生长季节的每日天气数据，我们开发了两个卷积神经网络（CNN）模型：一个集成了CNN和全连接神经网络（CNN模型），另一个在CNN组件之后合并了长短期记忆（LSTM）层（CNN-LSTM模型）。采用广义集成方法（GEM），结合基于cnn的模型，优化其权重，提高整体预测性能。该数据集提供了关于种子的独特基因型信息，从而能够根据天气变量调查种植不同基因型的潜力。我们采用所提出的GEM模型来确定在不同地点和天气条件下表现最佳的基因型，并在每种特定环境下对所有潜在基因型进行产量预测。为了评估GEM模型的性能，我们对未见过的基因型-位置组合进行了评估，模拟了引入新基因型的现实世界场景。通过结合基本模型，GEM集成方法比单独使用CNN- lstm模型提供了更好的预测精度，并且在验证集和测试集上通过RMSE和MAE测量的精度略高于CNN模型。在测试年份有限的情况下，提出的数据驱动方法对基因型选择很有价值。此外，我们还探讨了将州级土壤数据与天气、地点、基因型和年份变量结合起来的影响。由于数据的限制，包括缺少纬度和经度细节，我们对同一州内的所有位置使用统一的土壤变量。这一限制将我们的空间信息局限于国家层面的知识。我们的研究结果表明，整合国家级土壤变量并没有显著提高模型的预测能力。我们还使用RMSE变化进行了特征重要性分析，以确定关键的预测因子。地点的RMSE变化最大，其次是基因型和年份。在气象变量中，最大直接正常辐照度（MDNI）和平均降水量（AP）的RMSE变化较大，说明它们的重要性。

{"title":"A hybrid deep learning-based approach for optimal genotype by environment selection.","authors":"Zahra Khalilzadeh, Motahareh Kashanian, Saeed Khaki, Lizhi Wang","doi":"10.3389/frai.2024.1312115","DOIUrl":"10.3389/frai.2024.1312115","url":null,"abstract":"The ability to accurately predict the yields of different crop genotypes in response to weather variability is crucial for developing climate resilient crop cultivars. Genotype-environment interactions introduce large variations in crop-climate responses, and are hard to factor in to breeding programs. Data-driven approaches, particularly those based on machine learning, can help guide breeding efforts by factoring in genotype-environment interactions when making yield predictions. Using a new yield dataset containing 93,028 records of soybean hybrids across 159 locations, 28 states, and 13 years, with 5,838 distinct genotypes and daily weather data over a 214-day growing season, we developed two convolutional neural network (CNN) models: one that integrates CNN and fully-connected neural networks (CNN model), and another that incorporates a long short-term memory (LSTM) layer after the CNN component (CNN-LSTM model). By applying the Generalized Ensemble Method (GEM), we combined the CNN-based models and optimized their weights to improve overall predictive performance. The dataset provided unique genotype information on seeds, enabling an investigation into the potential of planting different genotypes based on weather variables. We employed the proposed GEM model to identify the best-performing genotypes across various locations and weather conditions, making yield predictions for all potential genotypes in each specific setting. To assess the performance of the GEM model, we evaluated it on unseen genotype-location combinations, simulating real-world scenarios where new genotypes are introduced. By combining the base models, the GEM ensemble approach provided much better prediction accuracy compared to using the CNN-LSTM model alone and slightly better accuracy than the CNN model, as measured by both RMSE and MAE on the validation and test sets. The proposed data-driven approach can be valuable for genotype selection in scenarios with limited testing years. In addition, we explored the impact of incorporating state-level soil data alongside the weather, location, genotype and year variables. Due to data constraints, including the absence of latitude and longitude details, we used uniform soil variables for all locations within the same state. This limitation restricted our spatial information to state-level knowledge. Our findings suggested that integrating state-level soil variables did not substantially enhance the predictive capabilities of the models. We also performed a feature importance analysis using RMSE change to identify crucial predictors. Location showed the highest RMSE change, followed by genotype and year. Among weather variables, maximum direct normal irradiance (MDNI) and average precipitation (AP) displayed higher RMSE changes, indicating their importance.","PeriodicalId":33315,"journal":{"name":"Frontiers in Artificial Intelligence","volume":"7 ","pages":"1312115"},"PeriodicalIF":3.0,"publicationDate":"2024-12-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11670329/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142898665","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

The use of artificial intelligence for automatic analysis and reporting of software defects. 使用人工智能对软件缺陷进行自动分析和报告。

IF 3 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Frontiers in Artificial Intelligence

Pub Date : 2024-12-11 eCollection Date: 2024-01-01 DOI: 10.3389/frai.2024.1443956

Mark Esposito, Saman Sarbazvatan, Terence Tse, Gabriel Silva-Atencio

The COVID-19 pandemic marked a before and after in the business world, causing a growing demand for applications that streamline operations, reduce delivery times and costs, and improve the quality of products. In this context, artificial intelligence (AI) has taken a relevant role in improving these processes, since it incorporates mathematical models that allow analyzing the logical structure of the systems to detect and reduce errors or failures in real-time. This study aimed to determine the most relevant aspects to be considered for detecting software defects using AI. The methodology used was qualitative, with an exploratory, descriptive, and non-experimental approach. The technique involved a documentary review of 79 bibliometric references. The most relevant finding was the use of regression testing techniques and automated log files, in machine learning (ML) and robotic process automation (RPA) environments. These techniques help reduce the time required to identify failures, thereby enhancing efficiency and effectiveness in the lifecycle of applications. In conclusion, companies that incorporate AI algorithms will be able to include an agile model in their lifecycle, as they will reduce the rate of failures, errors, and breakdowns allowing cost savings, and ensuring quality.

COVID-19 大流行标志着商业世界的前前后后，导致对能够简化操作、缩短交货时间、降低成本和提高产品质量的应用程序的需求不断增长。在这种情况下，人工智能（AI）在改善这些流程方面发挥了重要作用，因为它结合了数学模型，可以分析系统的逻辑结构，实时检测并减少错误或故障。本研究旨在确定使用人工智能检测软件缺陷时需要考虑的最相关方面。所采用的方法是定性方法，具有探索性、描述性和非实验性。研究技术包括对 79 篇文献计量学参考文献进行文献综述。最相关的发现是在机器学习（ML）和机器人流程自动化（RPA）环境中使用回归测试技术和自动日志文件。这些技术有助于缩短识别故障所需的时间，从而提高应用程序生命周期的效率和有效性。总之，采用人工智能算法的公司将能够在其生命周期中采用敏捷模式，因为它们将降低故障、错误和崩溃率，从而节约成本并确保质量。

{"title":"The use of artificial intelligence for automatic analysis and reporting of software defects.","authors":"Mark Esposito, Saman Sarbazvatan, Terence Tse, Gabriel Silva-Atencio","doi":"10.3389/frai.2024.1443956","DOIUrl":"10.3389/frai.2024.1443956","url":null,"abstract":"The COVID-19 pandemic marked a before and after in the business world, causing a growing demand for applications that streamline operations, reduce delivery times and costs, and improve the quality of products. In this context, artificial intelligence (AI) has taken a relevant role in improving these processes, since it incorporates mathematical models that allow analyzing the logical structure of the systems to detect and reduce errors or failures in real-time. This study aimed to determine the most relevant aspects to be considered for detecting software defects using AI. The methodology used was qualitative, with an exploratory, descriptive, and non-experimental approach. The technique involved a documentary review of 79 bibliometric references. The most relevant finding was the use of regression testing techniques and automated log files, in machine learning (ML) and robotic process automation (RPA) environments. These techniques help reduce the time required to identify failures, thereby enhancing efficiency and effectiveness in the lifecycle of applications. In conclusion, companies that incorporate AI algorithms will be able to include an agile model in their lifecycle, as they will reduce the rate of failures, errors, and breakdowns allowing cost savings, and ensuring quality.","PeriodicalId":33315,"journal":{"name":"Frontiers in Artificial Intelligence","volume":"7 ","pages":"1443956"},"PeriodicalIF":3.0,"publicationDate":"2024-12-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11668792/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142898684","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Editorial: Generative AI in education. 编辑：教育中的生成式人工智能。

IF 3 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Frontiers in Artificial Intelligence

Pub Date : 2024-12-10 eCollection Date: 2024-01-01 DOI: 10.3389/frai.2024.1532896

Diego Zapata-Rivera, Ilaria Torre, Chien-Sing Lee, Antonio Sarasa-Cabezuelo, Ioana Ghergulescu, Paul Libbrecht

引用次数: 0

Navigating the unseen peril: safeguarding medical imaging in the age of AI. 在看不见的危险中航行：在人工智能时代保护医学成像。

IF 3 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Frontiers in Artificial Intelligence

Pub Date : 2024-12-09 eCollection Date: 2024-01-01 DOI: 10.3389/frai.2024.1400732

Alexandra Maertens, Steve Brykman, Thomas Hartung, Andrei Gafita, Harrison Bai, David Hoelzer, Ed Skoudis, Channing Judith Paller

In response to the increasing significance of artificial intelligence (AI) in healthcare, there has been increased attention - including a Presidential executive order to create an AI Safety Institute - to the potential threats posed by AI. While much attention has been given to the conventional risks AI poses to cybersecurity, and critical infrastructure, here we provide an overview of some unique challenges of AI for the medical community. Above and beyond obvious concerns about vetting algorithms that impact patient care, there are additional subtle yet equally important things to consider: the potential harm AI poses to its own integrity and the broader medical information ecosystem. Recognizing the role of healthcare professionals as both consumers and contributors to AI training data, this article advocates for a proactive approach in understanding and shaping the data that underpins AI systems, emphasizing the need for informed engagement to maximize the benefits of AI while mitigating the risks.

为了应对人工智能（AI）在医疗保健领域日益重要的意义，人们越来越关注人工智能构成的潜在威胁，包括总统行政命令创建人工智能安全研究所。虽然人们对人工智能对网络安全和关键基础设施构成的传统风险给予了很多关注，但在这里，我们概述了人工智能对医学界的一些独特挑战。除了对影响患者护理的审查算法的明显担忧之外，还有一些微妙但同样重要的事情需要考虑：人工智能对其自身完整性和更广泛的医疗信息生态系统构成的潜在危害。认识到医疗保健专业人员既是人工智能培训数据的消费者，也是贡献者，本文提倡采取积极主动的方法来理解和塑造支撑人工智能系统的数据，强调需要知情参与，以最大限度地发挥人工智能的好处，同时降低风险。

{"title":"Navigating the unseen peril: safeguarding medical imaging in the age of AI.","authors":"Alexandra Maertens, Steve Brykman, Thomas Hartung, Andrei Gafita, Harrison Bai, David Hoelzer, Ed Skoudis, Channing Judith Paller","doi":"10.3389/frai.2024.1400732","DOIUrl":"10.3389/frai.2024.1400732","url":null,"abstract":"In response to the increasing significance of artificial intelligence (AI) in healthcare, there has been increased attention - including a Presidential executive order to create an AI Safety Institute - to the potential threats posed by AI. While much attention has been given to the conventional risks AI poses to cybersecurity, and critical infrastructure, here we provide an overview of some unique challenges of AI for the medical community. Above and beyond obvious concerns about vetting algorithms that impact patient care, there are additional subtle yet equally important things to consider: the potential harm AI poses to its own integrity and the broader medical information ecosystem. Recognizing the role of healthcare professionals as both consumers and contributors to AI training data, this article advocates for a proactive approach in understanding and shaping the data that underpins AI systems, emphasizing the need for informed engagement to maximize the benefits of AI while mitigating the risks.","PeriodicalId":33315,"journal":{"name":"Frontiers in Artificial Intelligence","volume":"7 ","pages":"1400732"},"PeriodicalIF":3.0,"publicationDate":"2024-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11665297/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142883166","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Frugal innovation in the business environment: a literature review and future perspectives. 商业环境下的节俭创新：文献综述及未来展望。

IF 3 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Frontiers in Artificial Intelligence

Pub Date : 2024-12-06 eCollection Date: 2024-01-01 DOI: 10.3389/frai.2024.1385522

Carlos Escudero-Cipriani, Julio García-Del Junco, Raquel Chafloque-Céspedes, Aldo Alvarez-Risco

Introduction: This research aims to explore the growing field of frugal innovation within the business environment, particularly its intersection with sustainability and artificial intelligence.

Methods: Through a comprehensive literature review, the study analyzes key research trends and methodologies from 420 scholarly articles published between 2012 and August 2024. A bibliometric review traces the evolution of frugal innovation, while a content analysis provides insights into its practical applications across various industries, especially in resource-constrained settings.

Results: The findings highlight the significant role of frugal innovation in addressing global challenges, such as reducing environmental impact and promoting social inclusion, especially through the adoption of cleaner technologies and socially responsible business practices. The study also emphasizes the transformative potential of AI in enhancing the scalability and efficiency of frugal solutions.

Discussion: This research contributes to the ongoing conversation on sustainable development by identifying knowledge gaps and proposing future strategies for leveraging frugal innovation to drive inclusive growth. The implications of this research are valuable for academics, practitioners, and policymakers aiming to foster sustainable innovation in diverse socio-economic contexts.

本研究旨在探索商业环境中日益增长的节约型创新领域，特别是其与可持续性和人工智能的交叉。方法：通过文献综述，对2012年至2024年8月发表的420篇学术论文的主要研究趋势和研究方法进行分析。文献计量学回顾追踪了节俭创新的演变，而内容分析提供了其在不同行业的实际应用的见解，特别是在资源受限的环境中。结果：研究结果强调了节约型创新在应对全球挑战方面的重要作用，例如减少环境影响和促进社会包容，特别是通过采用更清洁的技术和对社会负责的商业实践。该研究还强调了人工智能在提高节俭解决方案的可扩展性和效率方面的变革潜力。讨论：本研究通过确定知识差距并提出利用节约型创新推动包容性增长的未来战略，为可持续发展的持续对话做出了贡献。本研究对旨在促进不同社会经济背景下可持续创新的学者、从业者和政策制定者具有重要意义。

{"title":"Frugal innovation in the business environment: a literature review and future perspectives.","authors":"Carlos Escudero-Cipriani, Julio García-Del Junco, Raquel Chafloque-Céspedes, Aldo Alvarez-Risco","doi":"10.3389/frai.2024.1385522","DOIUrl":"10.3389/frai.2024.1385522","url":null,"abstract":"Introduction: This research aims to explore the growing field of frugal innovation within the business environment, particularly its intersection with sustainability and artificial intelligence.Methods: Through a comprehensive literature review, the study analyzes key research trends and methodologies from 420 scholarly articles published between 2012 and August 2024. A bibliometric review traces the evolution of frugal innovation, while a content analysis provides insights into its practical applications across various industries, especially in resource-constrained settings.Results: The findings highlight the significant role of frugal innovation in addressing global challenges, such as reducing environmental impact and promoting social inclusion, especially through the adoption of cleaner technologies and socially responsible business practices. The study also emphasizes the transformative potential of AI in enhancing the scalability and efficiency of frugal solutions.Discussion: This research contributes to the ongoing conversation on sustainable development by identifying knowledge gaps and proposing future strategies for leveraging frugal innovation to drive inclusive growth. The implications of this research are valuable for academics, practitioners, and policymakers aiming to foster sustainable innovation in diverse socio-economic contexts.","PeriodicalId":33315,"journal":{"name":"Frontiers in Artificial Intelligence","volume":"7 ","pages":"1385522"},"PeriodicalIF":3.0,"publicationDate":"2024-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11662995/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142877973","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Causal contextual bandits with one-shot data integration. 具有一次性数据集成的因果上下文强盗。

IF 3 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Frontiers in Artificial Intelligence

Pub Date : 2024-12-06 eCollection Date: 2024-01-01 DOI: 10.3389/frai.2024.1346700

Chandrasekar Subramanian, Balaraman Ravindran

We study a contextual bandit setting where the agent has access to causal side information, in addition to the ability to perform multiple targeted experiments corresponding to potentially different context-action pairs-simultaneously in one-shot within a budget. This new formalism provides a natural model for several real-world scenarios where parallel targeted experiments can be conducted and where some domain knowledge of causal relationships is available. We propose a new algorithm that utilizes a novel entropy-like measure that we introduce. We perform several experiments, both using purely synthetic data and using a real-world dataset. In addition, we study sensitivity of our algorithm's performance to various aspects of the problem setting. The results show that our algorithm performs better than baselines in all of the experiments. We also show that the algorithm is sound; that is, as budget increases, the learned policy eventually converges to an optimal policy. Further, we theoretically bound our algorithm's regret under additional assumptions. Finally, we provide ways to achieve two popular notions of fairness, namely counterfactual fairness and demographic parity, with our algorithm.

我们研究了一个上下文强盗设置，其中代理除了能够在预算内一次性执行对应于潜在不同上下文-动作对的多个目标实验之外，还可以访问因果侧信息。这种新的形式为几个现实世界的场景提供了一个自然的模型，在这些场景中可以进行平行的目标实验，并且可以获得一些因果关系的领域知识。我们提出了一种新的算法，它利用了我们引入的一种新的类似熵的度量。我们进行了几个实验，既使用纯合成数据，也使用真实世界的数据集。此外，我们还研究了算法性能对问题设置各个方面的敏感性。结果表明，该算法在所有实验中都优于基线。我们还证明了该算法是合理的；也就是说，随着预算的增加，学习到的策略最终收敛到最优策略。此外，我们在理论上将算法的遗憾约束在附加假设下。最后，我们提供了用我们的算法实现两种流行的公平概念的方法，即反事实公平和人口均等。

{"title":"Causal contextual bandits with one-shot data integration.","authors":"Chandrasekar Subramanian, Balaraman Ravindran","doi":"10.3389/frai.2024.1346700","DOIUrl":"10.3389/frai.2024.1346700","url":null,"abstract":"We study a contextual bandit setting where the agent has access to causal side information, in addition to the ability to perform multiple targeted experiments corresponding to potentially different context-action pairs-simultaneously in one-shot within a budget. This new formalism provides a natural model for several real-world scenarios where parallel targeted experiments can be conducted and where some domain knowledge of causal relationships is available. We propose a new algorithm that utilizes a novel entropy-like measure that we introduce. We perform several experiments, both using purely synthetic data and using a real-world dataset. In addition, we study sensitivity of our algorithm's performance to various aspects of the problem setting. The results show that our algorithm performs better than baselines in all of the experiments. We also show that the algorithm is sound; that is, as budget increases, the learned policy eventually converges to an optimal policy. Further, we theoretically bound our algorithm's regret under additional assumptions. Finally, we provide ways to achieve two popular notions of fairness, namely counterfactual fairness and demographic parity, with our algorithm.","PeriodicalId":33315,"journal":{"name":"Frontiers in Artificial Intelligence","volume":"7 ","pages":"1346700"},"PeriodicalIF":3.0,"publicationDate":"2024-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11659213/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142877969","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

What is in a food store name? Leveraging large language models to enhance food environment data. 食品店的名字里有什么？利用大型语言模型增强食品环境数据。

IF 3 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Frontiers in Artificial Intelligence

Pub Date : 2024-12-06 eCollection Date: 2024-01-01 DOI: 10.3389/frai.2024.1476950

Analee J Etheredge, Samuel Hosmer, Aldo Crossa, Rachel Suss, Mark Torrey

Introduction: It is not uncommon to repurpose administrative food data to create food environment datasets in the health department and research settings; however, the available administrative data are rarely categorized in a way that supports meaningful insight or action, and ground-truthing or manually reviewing an entire city or neighborhood is rate-limiting to essential operations and analysis. We show that such categorizations should be viewed as a classification problem well addressed by recent advances in natural language processing and deep learning-with the advent of large language models (LLMs).

Methods: To demonstrate how to automate the process of categorizing food stores, we use the foundation model BERT to give a first approximation to such categorizations: a best guess by store name. First, 10 food retail classes were developed to comprehensively categorize food store types from a public health perspective.

Results: Based on this rubric, the model was tuned and evaluated (F1_micro = 0.710, F1_macro = 0.709) on an extensive storefront directory of New York City. Second, the model was applied to infer insights from a large, unlabeled dataset using store names alone, aiming to replicate known temporospatial patterns. Finally, a complimentary application of the model as a data quality enhancement tool was demonstrated on a secondary, pre-labeled restaurant dataset.

Discussion: This novel application of an LLM to the enumeration of the food environment allowed for marked gains in efficiency compared to manual, in-person methods, addressing a known challenge to research and operations in a local health department.

在卫生部门和研究机构中，重新利用行政食品数据来创建食品环境数据集并不罕见；然而，可用的行政数据很少以一种支持有意义的见解或行动的方式进行分类，并且实地调查或手动审查整个城市或社区限制了基本的操作和分析。我们表明，随着大型语言模型（llm）的出现，这种分类应该被视为自然语言处理和深度学习的最新进展很好地解决的分类问题。方法：为了演示如何自动化对食品商店进行分类的过程，我们使用基础模型BERT来给出这种分类的第一个近似：根据商店名称进行最佳猜测。首先，开发了10个食品零售类别，从公共卫生的角度对食品商店类型进行了综合分类。结果：基于这个标题，模型被调整和评估（F1micro = 0.710, F1macro = 0.709）在纽约市的一个广泛的店面目录。其次，该模型被应用于仅使用商店名称从大型未标记数据集中推断见解，旨在复制已知的时空模式。最后，在一个次要的、预先标记的餐馆数据集上演示了该模型作为数据质量增强工具的免费应用。讨论：与人工、现场方法相比，这种将法学硕士应用于食品环境枚举的新颖应用可以显著提高效率，解决了当地卫生部门研究和操作的已知挑战。

{"title":"What is in a food store name? Leveraging large language models to enhance food environment data.","authors":"Analee J Etheredge, Samuel Hosmer, Aldo Crossa, Rachel Suss, Mark Torrey","doi":"10.3389/frai.2024.1476950","DOIUrl":"10.3389/frai.2024.1476950","url":null,"abstract":"Introduction: It is not uncommon to repurpose administrative food data to create food environment datasets in the health department and research settings; however, the available administrative data are rarely categorized in a way that supports meaningful insight or action, and ground-truthing or manually reviewing an entire city or neighborhood is rate-limiting to essential operations and analysis. We show that such categorizations should be viewed as a classification problem well addressed by recent advances in natural language processing and deep learning-with the advent of large language models (LLMs).Methods: To demonstrate how to automate the process of categorizing food stores, we use the foundation model BERT to give a first approximation to such categorizations: a best guess by store name. First, 10 food retail classes were developed to comprehensively categorize food store types from a public health perspective.Results: Based on this rubric, the model was tuned and evaluated (F1micro = 0.710, F1macro = 0.709) on an extensive storefront directory of New York City. Second, the model was applied to infer insights from a large, unlabeled dataset using store names alone, aiming to replicate known temporospatial patterns. Finally, a complimentary application of the model as a data quality enhancement tool was demonstrated on a secondary, pre-labeled restaurant dataset.Discussion: This novel application of an LLM to the enumeration of the food environment allowed for marked gains in efficiency compared to manual, in-person methods, addressing a known challenge to research and operations in a local health department.","PeriodicalId":33315,"journal":{"name":"Frontiers in Artificial Intelligence","volume":"7 ","pages":"1476950"},"PeriodicalIF":3.0,"publicationDate":"2024-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11660183/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142877974","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Explainable machine learning for predicting recurrence-free survival in endometrial carcinosarcoma patients. 可解释的机器学习预测子宫内膜癌肉瘤患者无复发生存期。

IF 3 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Frontiers in Artificial Intelligence

Pub Date : 2024-12-06 eCollection Date: 2024-01-01 DOI: 10.3389/frai.2024.1388188

Samantha Bove, Francesca Arezzo, Gennaro Cormio, Erica Silvestris, Alessia Cafforio, Maria Colomba Comes, Annarita Fanizzi, Giuseppe Accogli, Gerardo Cazzato, Giorgio De Nunzio, Brigida Maiorano, Emanuele Naglieri, Andrea Lupo, Elsa Vitale, Vera Loizzi, Raffaella Massafra

Objectives: Endometrial carcinosarcoma is a rare, aggressive high-grade endometrial cancer, accounting for about 5% of all uterine cancers and 15% of deaths from uterine cancers. The treatment can be complex, and the prognosis is poor. Its increasing incidence underscores the urgent requirement for personalized approaches in managing such challenging diseases.

Method: In this work, we designed an explainable machine learning approach to predict recurrence-free survival in patients affected by endometrial carcinosarcoma. For this purpose, we exploited the predictive power of clinical and histopathological data, as well as chemotherapy and surgical information collected for a cohort of 80 patients monitored over time. Among these patients, 32.5% have experienced the appearance of a recurrence.

Results: The designed model was able to well describe the observed sequence of events, providing a reliable ranking of the survival times based on the individual risk scores, and achieving a C-index equals to 70.00% (95% CI, 59.38-84.74).

Conclusion: Accordingly, machine learning methods could support clinicians in discriminating between endometrial carcinosarcoma patients at low-risk or high-risk of recurrence, in a non-invasive and inexpensive way. To the best of our knowledge, this is the first study proposing a preliminary approach addressing this task.

目的：子宫内膜癌肉瘤是一种罕见的侵袭性高级别子宫内膜癌，约占所有子宫癌的5%，占子宫癌死亡人数的15%。治疗可能很复杂，预后很差。其发病率不断增加，强调迫切需要采取个性化方法来管理这类具有挑战性的疾病。方法：在这项工作中，我们设计了一种可解释的机器学习方法来预测子宫内膜癌肉瘤患者的无复发生存。为此，我们利用临床和组织病理学数据的预测能力，以及对80名患者进行长期监测的化疗和手术信息收集。在这些患者中，有32.5%出现了复发。结果：设计的模型能够很好地描述观察到的事件序列，根据个体风险评分提供可靠的生存时间排序，c指数达到70.00% （95% CI, 59.38-84.74）。结论：因此，机器学习方法可以支持临床医生以无创和廉价的方式区分低风险或高风险复发的子宫内膜癌肉瘤患者。据我们所知，这是第一个提出解决这一任务的初步方法的研究。

{"title":"Explainable machine learning for predicting recurrence-free survival in endometrial carcinosarcoma patients.","authors":"Samantha Bove, Francesca Arezzo, Gennaro Cormio, Erica Silvestris, Alessia Cafforio, Maria Colomba Comes, Annarita Fanizzi, Giuseppe Accogli, Gerardo Cazzato, Giorgio De Nunzio, Brigida Maiorano, Emanuele Naglieri, Andrea Lupo, Elsa Vitale, Vera Loizzi, Raffaella Massafra","doi":"10.3389/frai.2024.1388188","DOIUrl":"10.3389/frai.2024.1388188","url":null,"abstract":"Objectives: Endometrial carcinosarcoma is a rare, aggressive high-grade endometrial cancer, accounting for about 5% of all uterine cancers and 15% of deaths from uterine cancers. The treatment can be complex, and the prognosis is poor. Its increasing incidence underscores the urgent requirement for personalized approaches in managing such challenging diseases.Method: In this work, we designed an explainable machine learning approach to predict recurrence-free survival in patients affected by endometrial carcinosarcoma. For this purpose, we exploited the predictive power of clinical and histopathological data, as well as chemotherapy and surgical information collected for a cohort of 80 patients monitored over time. Among these patients, 32.5% have experienced the appearance of a recurrence.Results: The designed model was able to well describe the observed sequence of events, providing a reliable ranking of the survival times based on the individual risk scores, and achieving a C-index equals to 70.00% (95% CI, 59.38-84.74).Conclusion: Accordingly, machine learning methods could support clinicians in discriminating between endometrial carcinosarcoma patients at low-risk or high-risk of recurrence, in a non-invasive and inexpensive way. To the best of our knowledge, this is the first study proposing a preliminary approach addressing this task.","PeriodicalId":33315,"journal":{"name":"Frontiers in Artificial Intelligence","volume":"7 ","pages":"1388188"},"PeriodicalIF":3.0,"publicationDate":"2024-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11659245/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142877970","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

AI-assisted human clinical reasoning in the ICU: beyond "to err is human". 人工智能在ICU中辅助人类临床推理：超越“犯错即是人”。

IF 3 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Frontiers in Artificial Intelligence

Pub Date : 2024-12-04 eCollection Date: 2024-01-01 DOI: 10.3389/frai.2024.1506676

Khalil El Gharib, Bakr Jundi, David Furfaro, Raja-Elie E Abdulnour

Diagnostic errors pose a significant public health challenge, affecting nearly 800,000 Americans annually, with even higher rates globally. In the ICU, these errors are particularly prevalent, leading to substantial morbidity and mortality. The clinical reasoning process aims to reduce diagnostic uncertainty and establish a plausible differential diagnosis but is often hindered by cognitive load, patient complexity, and clinician burnout. These factors contribute to cognitive biases that compromise diagnostic accuracy. Emerging technologies like large language models (LLMs) offer potential solutions to enhance clinical reasoning and improve diagnostic precision. In this perspective article, we explore the roles of LLMs, such as GPT-4, in addressing diagnostic challenges in critical care settings through a case study of a critically ill patient managed with LLM assistance.

诊断错误构成了重大的公共卫生挑战，每年影响近80万美国人，在全球范围内甚至更高。在ICU，这些错误尤其普遍，导致大量的发病率和死亡率。临床推理过程旨在减少诊断的不确定性，并建立一个合理的鉴别诊断，但往往受阻于认知负荷，病人的复杂性，和临床医生的倦怠。这些因素会导致认知偏差，从而影响诊断的准确性。大型语言模型（llm）等新兴技术为增强临床推理和提高诊断精度提供了潜在的解决方案。在这篇观点文章中，我们探讨了法学硕士（如GPT-4）在解决重症监护环境中的诊断挑战方面的作用，通过一个由法学硕士协助管理的危重病患者的案例研究。

引用次数: 0