Pub Date : 2024-12-12eCollection Date: 2024-01-01DOI: 10.3389/frai.2024.1504963
Christos Adam
The use of Financial Technology (Fintech) has been proposed as a promising way to bridge the gender gap, both financially and socially. However, there is evidence that Fintech is far from achieving this objective, and that women's perceptions of Fintech usages are not clear. Therefore, the main objective of the this study is to segment women's perceptions toward Fintech tools and interpret these segments using machine learning methods. Two primary segments of women were produced, namely a "Fintech-friendly" group and a "Fintech-sceptical" group. The importance and reasonings behind the aforementioned segmentation are then examined. The most prominent factors affecting a woman being in the "Fintech-friendly" group are the perceived benefits of Fintech tools compared to the traditional ones, such as ease of usage, time-space convenience, and its advantageous nature. Finally, for Fintech stakeholders, implications for usability, ease, Fintech education, and tailored experiences may be advantageous approaches.
{"title":"Segmenting female students' perceptions about Fintech using Explainable AI.","authors":"Christos Adam","doi":"10.3389/frai.2024.1504963","DOIUrl":"10.3389/frai.2024.1504963","url":null,"abstract":"<p><p>The use of Financial Technology (Fintech) has been proposed as a promising way to bridge the gender gap, both financially and socially. However, there is evidence that Fintech is far from achieving this objective, and that women's perceptions of Fintech usages are not clear. Therefore, the main objective of the this study is to segment women's perceptions toward Fintech tools and interpret these segments using machine learning methods. Two primary segments of women were produced, namely a \"Fintech-friendly\" group and a \"Fintech-sceptical\" group. The importance and reasonings behind the aforementioned segmentation are then examined. The most prominent factors affecting a woman being in the \"Fintech-friendly\" group are the perceived benefits of Fintech tools compared to the traditional ones, such as ease of usage, time-space convenience, and its advantageous nature. Finally, for Fintech stakeholders, implications for usability, ease, Fintech education, and tailored experiences may be advantageous approaches.</p>","PeriodicalId":33315,"journal":{"name":"Frontiers in Artificial Intelligence","volume":"7 ","pages":"1504963"},"PeriodicalIF":3.0,"publicationDate":"2024-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11670257/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142898679","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-12-11eCollection Date: 2024-01-01DOI: 10.3389/frai.2024.1312115
Zahra Khalilzadeh, Motahareh Kashanian, Saeed Khaki, Lizhi Wang
The ability to accurately predict the yields of different crop genotypes in response to weather variability is crucial for developing climate resilient crop cultivars. Genotype-environment interactions introduce large variations in crop-climate responses, and are hard to factor in to breeding programs. Data-driven approaches, particularly those based on machine learning, can help guide breeding efforts by factoring in genotype-environment interactions when making yield predictions. Using a new yield dataset containing 93,028 records of soybean hybrids across 159 locations, 28 states, and 13 years, with 5,838 distinct genotypes and daily weather data over a 214-day growing season, we developed two convolutional neural network (CNN) models: one that integrates CNN and fully-connected neural networks (CNN model), and another that incorporates a long short-term memory (LSTM) layer after the CNN component (CNN-LSTM model). By applying the Generalized Ensemble Method (GEM), we combined the CNN-based models and optimized their weights to improve overall predictive performance. The dataset provided unique genotype information on seeds, enabling an investigation into the potential of planting different genotypes based on weather variables. We employed the proposed GEM model to identify the best-performing genotypes across various locations and weather conditions, making yield predictions for all potential genotypes in each specific setting. To assess the performance of the GEM model, we evaluated it on unseen genotype-location combinations, simulating real-world scenarios where new genotypes are introduced. By combining the base models, the GEM ensemble approach provided much better prediction accuracy compared to using the CNN-LSTM model alone and slightly better accuracy than the CNN model, as measured by both RMSE and MAE on the validation and test sets. The proposed data-driven approach can be valuable for genotype selection in scenarios with limited testing years. In addition, we explored the impact of incorporating state-level soil data alongside the weather, location, genotype and year variables. Due to data constraints, including the absence of latitude and longitude details, we used uniform soil variables for all locations within the same state. This limitation restricted our spatial information to state-level knowledge. Our findings suggested that integrating state-level soil variables did not substantially enhance the predictive capabilities of the models. We also performed a feature importance analysis using RMSE change to identify crucial predictors. Location showed the highest RMSE change, followed by genotype and year. Among weather variables, maximum direct normal irradiance (MDNI) and average precipitation (AP) displayed higher RMSE changes, indicating their importance.
{"title":"A hybrid deep learning-based approach for optimal genotype by environment selection.","authors":"Zahra Khalilzadeh, Motahareh Kashanian, Saeed Khaki, Lizhi Wang","doi":"10.3389/frai.2024.1312115","DOIUrl":"10.3389/frai.2024.1312115","url":null,"abstract":"<p><p>The ability to accurately predict the yields of different crop genotypes in response to weather variability is crucial for developing climate resilient crop cultivars. Genotype-environment interactions introduce large variations in crop-climate responses, and are hard to factor in to breeding programs. Data-driven approaches, particularly those based on machine learning, can help guide breeding efforts by factoring in genotype-environment interactions when making yield predictions. Using a new yield dataset containing 93,028 records of soybean hybrids across 159 locations, 28 states, and 13 years, with 5,838 distinct genotypes and daily weather data over a 214-day growing season, we developed two convolutional neural network (CNN) models: one that integrates CNN and fully-connected neural networks (CNN model), and another that incorporates a long short-term memory (LSTM) layer after the CNN component (CNN-LSTM model). By applying the Generalized Ensemble Method (GEM), we combined the CNN-based models and optimized their weights to improve overall predictive performance. The dataset provided unique genotype information on seeds, enabling an investigation into the potential of planting different genotypes based on weather variables. We employed the proposed GEM model to identify the best-performing genotypes across various locations and weather conditions, making yield predictions for all potential genotypes in each specific setting. To assess the performance of the GEM model, we evaluated it on unseen genotype-location combinations, simulating real-world scenarios where new genotypes are introduced. By combining the base models, the GEM ensemble approach provided much better prediction accuracy compared to using the CNN-LSTM model alone and slightly better accuracy than the CNN model, as measured by both RMSE and MAE on the validation and test sets. The proposed data-driven approach can be valuable for genotype selection in scenarios with limited testing years. In addition, we explored the impact of incorporating state-level soil data alongside the weather, location, genotype and year variables. Due to data constraints, including the absence of latitude and longitude details, we used uniform soil variables for all locations within the same state. This limitation restricted our spatial information to state-level knowledge. Our findings suggested that integrating state-level soil variables did not substantially enhance the predictive capabilities of the models. We also performed a feature importance analysis using RMSE change to identify crucial predictors. Location showed the highest RMSE change, followed by genotype and year. Among weather variables, maximum direct normal irradiance (MDNI) and average precipitation (AP) displayed higher RMSE changes, indicating their importance.</p>","PeriodicalId":33315,"journal":{"name":"Frontiers in Artificial Intelligence","volume":"7 ","pages":"1312115"},"PeriodicalIF":3.0,"publicationDate":"2024-12-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11670329/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142898665","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-12-11eCollection Date: 2024-01-01DOI: 10.3389/frai.2024.1443956
Mark Esposito, Saman Sarbazvatan, Terence Tse, Gabriel Silva-Atencio
The COVID-19 pandemic marked a before and after in the business world, causing a growing demand for applications that streamline operations, reduce delivery times and costs, and improve the quality of products. In this context, artificial intelligence (AI) has taken a relevant role in improving these processes, since it incorporates mathematical models that allow analyzing the logical structure of the systems to detect and reduce errors or failures in real-time. This study aimed to determine the most relevant aspects to be considered for detecting software defects using AI. The methodology used was qualitative, with an exploratory, descriptive, and non-experimental approach. The technique involved a documentary review of 79 bibliometric references. The most relevant finding was the use of regression testing techniques and automated log files, in machine learning (ML) and robotic process automation (RPA) environments. These techniques help reduce the time required to identify failures, thereby enhancing efficiency and effectiveness in the lifecycle of applications. In conclusion, companies that incorporate AI algorithms will be able to include an agile model in their lifecycle, as they will reduce the rate of failures, errors, and breakdowns allowing cost savings, and ensuring quality.
{"title":"The use of artificial intelligence for automatic analysis and reporting of software defects.","authors":"Mark Esposito, Saman Sarbazvatan, Terence Tse, Gabriel Silva-Atencio","doi":"10.3389/frai.2024.1443956","DOIUrl":"10.3389/frai.2024.1443956","url":null,"abstract":"<p><p>The COVID-19 pandemic marked a before and after in the business world, causing a growing demand for applications that streamline operations, reduce delivery times and costs, and improve the quality of products. In this context, artificial intelligence (AI) has taken a relevant role in improving these processes, since it incorporates mathematical models that allow analyzing the logical structure of the systems to detect and reduce errors or failures in real-time. This study aimed to determine the most relevant aspects to be considered for detecting software defects using AI. The methodology used was qualitative, with an exploratory, descriptive, and non-experimental approach. The technique involved a documentary review of 79 bibliometric references. The most relevant finding was the use of regression testing techniques and automated log files, in machine learning (ML) and robotic process automation (RPA) environments. These techniques help reduce the time required to identify failures, thereby enhancing efficiency and effectiveness in the lifecycle of applications. In conclusion, companies that incorporate AI algorithms will be able to include an agile model in their lifecycle, as they will reduce the rate of failures, errors, and breakdowns allowing cost savings, and ensuring quality.</p>","PeriodicalId":33315,"journal":{"name":"Frontiers in Artificial Intelligence","volume":"7 ","pages":"1443956"},"PeriodicalIF":3.0,"publicationDate":"2024-12-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11668792/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142898684","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-12-10eCollection Date: 2024-01-01DOI: 10.3389/frai.2024.1532896
Diego Zapata-Rivera, Ilaria Torre, Chien-Sing Lee, Antonio Sarasa-Cabezuelo, Ioana Ghergulescu, Paul Libbrecht
{"title":"Editorial: Generative AI in education.","authors":"Diego Zapata-Rivera, Ilaria Torre, Chien-Sing Lee, Antonio Sarasa-Cabezuelo, Ioana Ghergulescu, Paul Libbrecht","doi":"10.3389/frai.2024.1532896","DOIUrl":"https://doi.org/10.3389/frai.2024.1532896","url":null,"abstract":"","PeriodicalId":33315,"journal":{"name":"Frontiers in Artificial Intelligence","volume":"7 ","pages":"1532896"},"PeriodicalIF":3.0,"publicationDate":"2024-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11667108/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142886265","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-12-09eCollection Date: 2024-01-01DOI: 10.3389/frai.2024.1400732
Alexandra Maertens, Steve Brykman, Thomas Hartung, Andrei Gafita, Harrison Bai, David Hoelzer, Ed Skoudis, Channing Judith Paller
In response to the increasing significance of artificial intelligence (AI) in healthcare, there has been increased attention - including a Presidential executive order to create an AI Safety Institute - to the potential threats posed by AI. While much attention has been given to the conventional risks AI poses to cybersecurity, and critical infrastructure, here we provide an overview of some unique challenges of AI for the medical community. Above and beyond obvious concerns about vetting algorithms that impact patient care, there are additional subtle yet equally important things to consider: the potential harm AI poses to its own integrity and the broader medical information ecosystem. Recognizing the role of healthcare professionals as both consumers and contributors to AI training data, this article advocates for a proactive approach in understanding and shaping the data that underpins AI systems, emphasizing the need for informed engagement to maximize the benefits of AI while mitigating the risks.
{"title":"Navigating the unseen peril: safeguarding medical imaging in the age of AI.","authors":"Alexandra Maertens, Steve Brykman, Thomas Hartung, Andrei Gafita, Harrison Bai, David Hoelzer, Ed Skoudis, Channing Judith Paller","doi":"10.3389/frai.2024.1400732","DOIUrl":"10.3389/frai.2024.1400732","url":null,"abstract":"<p><p>In response to the increasing significance of artificial intelligence (AI) in healthcare, there has been increased attention - including a Presidential executive order to create an AI Safety Institute - to the potential threats posed by AI. While much attention has been given to the conventional risks AI poses to cybersecurity, and critical infrastructure, here we provide an overview of some unique challenges of AI for the medical community. Above and beyond obvious concerns about vetting algorithms that impact patient care, there are additional subtle yet equally important things to consider: the potential harm AI poses to its own integrity and the broader medical information ecosystem. Recognizing the role of healthcare professionals as both consumers and contributors to AI training data, this article advocates for a proactive approach in understanding and shaping the data that underpins AI systems, emphasizing the need for informed engagement to maximize the benefits of AI while mitigating the risks.</p>","PeriodicalId":33315,"journal":{"name":"Frontiers in Artificial Intelligence","volume":"7 ","pages":"1400732"},"PeriodicalIF":3.0,"publicationDate":"2024-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11665297/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142883166","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-12-06eCollection Date: 2024-01-01DOI: 10.3389/frai.2024.1385522
Carlos Escudero-Cipriani, Julio García-Del Junco, Raquel Chafloque-Céspedes, Aldo Alvarez-Risco
Introduction: This research aims to explore the growing field of frugal innovation within the business environment, particularly its intersection with sustainability and artificial intelligence.
Methods: Through a comprehensive literature review, the study analyzes key research trends and methodologies from 420 scholarly articles published between 2012 and August 2024. A bibliometric review traces the evolution of frugal innovation, while a content analysis provides insights into its practical applications across various industries, especially in resource-constrained settings.
Results: The findings highlight the significant role of frugal innovation in addressing global challenges, such as reducing environmental impact and promoting social inclusion, especially through the adoption of cleaner technologies and socially responsible business practices. The study also emphasizes the transformative potential of AI in enhancing the scalability and efficiency of frugal solutions.
Discussion: This research contributes to the ongoing conversation on sustainable development by identifying knowledge gaps and proposing future strategies for leveraging frugal innovation to drive inclusive growth. The implications of this research are valuable for academics, practitioners, and policymakers aiming to foster sustainable innovation in diverse socio-economic contexts.
{"title":"Frugal innovation in the business environment: a literature review and future perspectives.","authors":"Carlos Escudero-Cipriani, Julio García-Del Junco, Raquel Chafloque-Céspedes, Aldo Alvarez-Risco","doi":"10.3389/frai.2024.1385522","DOIUrl":"10.3389/frai.2024.1385522","url":null,"abstract":"<p><strong>Introduction: </strong>This research aims to explore the growing field of frugal innovation within the business environment, particularly its intersection with sustainability and artificial intelligence.</p><p><strong>Methods: </strong>Through a comprehensive literature review, the study analyzes key research trends and methodologies from 420 scholarly articles published between 2012 and August 2024. A bibliometric review traces the evolution of frugal innovation, while a content analysis provides insights into its practical applications across various industries, especially in resource-constrained settings.</p><p><strong>Results: </strong>The findings highlight the significant role of frugal innovation in addressing global challenges, such as reducing environmental impact and promoting social inclusion, especially through the adoption of cleaner technologies and socially responsible business practices. The study also emphasizes the transformative potential of AI in enhancing the scalability and efficiency of frugal solutions.</p><p><strong>Discussion: </strong>This research contributes to the ongoing conversation on sustainable development by identifying knowledge gaps and proposing future strategies for leveraging frugal innovation to drive inclusive growth. The implications of this research are valuable for academics, practitioners, and policymakers aiming to foster sustainable innovation in diverse socio-economic contexts.</p>","PeriodicalId":33315,"journal":{"name":"Frontiers in Artificial Intelligence","volume":"7 ","pages":"1385522"},"PeriodicalIF":3.0,"publicationDate":"2024-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11662995/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142877973","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-12-06eCollection Date: 2024-01-01DOI: 10.3389/frai.2024.1346700
Chandrasekar Subramanian, Balaraman Ravindran
We study a contextual bandit setting where the agent has access to causal side information, in addition to the ability to perform multiple targeted experiments corresponding to potentially different context-action pairs-simultaneously in one-shot within a budget. This new formalism provides a natural model for several real-world scenarios where parallel targeted experiments can be conducted and where some domain knowledge of causal relationships is available. We propose a new algorithm that utilizes a novel entropy-like measure that we introduce. We perform several experiments, both using purely synthetic data and using a real-world dataset. In addition, we study sensitivity of our algorithm's performance to various aspects of the problem setting. The results show that our algorithm performs better than baselines in all of the experiments. We also show that the algorithm is sound; that is, as budget increases, the learned policy eventually converges to an optimal policy. Further, we theoretically bound our algorithm's regret under additional assumptions. Finally, we provide ways to achieve two popular notions of fairness, namely counterfactual fairness and demographic parity, with our algorithm.
{"title":"Causal contextual bandits with one-shot data integration.","authors":"Chandrasekar Subramanian, Balaraman Ravindran","doi":"10.3389/frai.2024.1346700","DOIUrl":"10.3389/frai.2024.1346700","url":null,"abstract":"<p><p>We study a contextual bandit setting where the agent has access to causal side information, in addition to the ability to perform multiple targeted experiments corresponding to potentially different context-action pairs-simultaneously in one-shot within a budget. This new formalism provides a natural model for several real-world scenarios where parallel targeted experiments can be conducted and where some domain knowledge of causal relationships is available. We propose a new algorithm that utilizes a novel entropy-like measure that we introduce. We perform several experiments, both using purely synthetic data and using a real-world dataset. In addition, we study sensitivity of our algorithm's performance to various aspects of the problem setting. The results show that our algorithm performs better than baselines in all of the experiments. We also show that the algorithm is sound; that is, as budget increases, the learned policy eventually converges to an optimal policy. Further, we theoretically bound our algorithm's regret under additional assumptions. Finally, we provide ways to achieve two popular notions of fairness, namely counterfactual fairness and demographic parity, with our algorithm.</p>","PeriodicalId":33315,"journal":{"name":"Frontiers in Artificial Intelligence","volume":"7 ","pages":"1346700"},"PeriodicalIF":3.0,"publicationDate":"2024-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11659213/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142877969","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-12-06eCollection Date: 2024-01-01DOI: 10.3389/frai.2024.1476950
Analee J Etheredge, Samuel Hosmer, Aldo Crossa, Rachel Suss, Mark Torrey
Introduction: It is not uncommon to repurpose administrative food data to create food environment datasets in the health department and research settings; however, the available administrative data are rarely categorized in a way that supports meaningful insight or action, and ground-truthing or manually reviewing an entire city or neighborhood is rate-limiting to essential operations and analysis. We show that such categorizations should be viewed as a classification problem well addressed by recent advances in natural language processing and deep learning-with the advent of large language models (LLMs).
Methods: To demonstrate how to automate the process of categorizing food stores, we use the foundation model BERT to give a first approximation to such categorizations: a best guess by store name. First, 10 food retail classes were developed to comprehensively categorize food store types from a public health perspective.
Results: Based on this rubric, the model was tuned and evaluated (F1micro = 0.710, F1macro = 0.709) on an extensive storefront directory of New York City. Second, the model was applied to infer insights from a large, unlabeled dataset using store names alone, aiming to replicate known temporospatial patterns. Finally, a complimentary application of the model as a data quality enhancement tool was demonstrated on a secondary, pre-labeled restaurant dataset.
Discussion: This novel application of an LLM to the enumeration of the food environment allowed for marked gains in efficiency compared to manual, in-person methods, addressing a known challenge to research and operations in a local health department.
{"title":"What is in a food store name? Leveraging large language models to enhance food environment data.","authors":"Analee J Etheredge, Samuel Hosmer, Aldo Crossa, Rachel Suss, Mark Torrey","doi":"10.3389/frai.2024.1476950","DOIUrl":"10.3389/frai.2024.1476950","url":null,"abstract":"<p><strong>Introduction: </strong>It is not uncommon to repurpose administrative food data to create food environment datasets in the health department and research settings; however, the available administrative data are rarely categorized in a way that supports meaningful insight or action, and ground-truthing or manually reviewing an entire city or neighborhood is rate-limiting to essential operations and analysis. We show that such categorizations should be viewed as a classification problem well addressed by recent advances in natural language processing and deep learning-with the advent of large language models (LLMs).</p><p><strong>Methods: </strong>To demonstrate how to automate the process of categorizing food stores, we use the foundation model BERT to give a first approximation to such categorizations: a best guess by store name. First, 10 food retail classes were developed to comprehensively categorize food store types from a public health perspective.</p><p><strong>Results: </strong>Based on this rubric, the model was tuned and evaluated (F1<sub>micro</sub> = 0.710, F1<sub>macro</sub> = 0.709) on an extensive storefront directory of New York City. Second, the model was applied to infer insights from a large, unlabeled dataset using store names alone, aiming to replicate known temporospatial patterns. Finally, a complimentary application of the model as a data quality enhancement tool was demonstrated on a secondary, pre-labeled restaurant dataset.</p><p><strong>Discussion: </strong>This novel application of an LLM to the enumeration of the food environment allowed for marked gains in efficiency compared to manual, in-person methods, addressing a known challenge to research and operations in a local health department.</p>","PeriodicalId":33315,"journal":{"name":"Frontiers in Artificial Intelligence","volume":"7 ","pages":"1476950"},"PeriodicalIF":3.0,"publicationDate":"2024-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11660183/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142877974","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-12-06eCollection Date: 2024-01-01DOI: 10.3389/frai.2024.1388188
Samantha Bove, Francesca Arezzo, Gennaro Cormio, Erica Silvestris, Alessia Cafforio, Maria Colomba Comes, Annarita Fanizzi, Giuseppe Accogli, Gerardo Cazzato, Giorgio De Nunzio, Brigida Maiorano, Emanuele Naglieri, Andrea Lupo, Elsa Vitale, Vera Loizzi, Raffaella Massafra
Objectives: Endometrial carcinosarcoma is a rare, aggressive high-grade endometrial cancer, accounting for about 5% of all uterine cancers and 15% of deaths from uterine cancers. The treatment can be complex, and the prognosis is poor. Its increasing incidence underscores the urgent requirement for personalized approaches in managing such challenging diseases.
Method: In this work, we designed an explainable machine learning approach to predict recurrence-free survival in patients affected by endometrial carcinosarcoma. For this purpose, we exploited the predictive power of clinical and histopathological data, as well as chemotherapy and surgical information collected for a cohort of 80 patients monitored over time. Among these patients, 32.5% have experienced the appearance of a recurrence.
Results: The designed model was able to well describe the observed sequence of events, providing a reliable ranking of the survival times based on the individual risk scores, and achieving a C-index equals to 70.00% (95% CI, 59.38-84.74).
Conclusion: Accordingly, machine learning methods could support clinicians in discriminating between endometrial carcinosarcoma patients at low-risk or high-risk of recurrence, in a non-invasive and inexpensive way. To the best of our knowledge, this is the first study proposing a preliminary approach addressing this task.
{"title":"Explainable machine learning for predicting recurrence-free survival in endometrial carcinosarcoma patients.","authors":"Samantha Bove, Francesca Arezzo, Gennaro Cormio, Erica Silvestris, Alessia Cafforio, Maria Colomba Comes, Annarita Fanizzi, Giuseppe Accogli, Gerardo Cazzato, Giorgio De Nunzio, Brigida Maiorano, Emanuele Naglieri, Andrea Lupo, Elsa Vitale, Vera Loizzi, Raffaella Massafra","doi":"10.3389/frai.2024.1388188","DOIUrl":"10.3389/frai.2024.1388188","url":null,"abstract":"<p><strong>Objectives: </strong>Endometrial carcinosarcoma is a rare, aggressive high-grade endometrial cancer, accounting for about 5% of all uterine cancers and 15% of deaths from uterine cancers. The treatment can be complex, and the prognosis is poor. Its increasing incidence underscores the urgent requirement for personalized approaches in managing such challenging diseases.</p><p><strong>Method: </strong>In this work, we designed an explainable machine learning approach to predict recurrence-free survival in patients affected by endometrial carcinosarcoma. For this purpose, we exploited the predictive power of clinical and histopathological data, as well as chemotherapy and surgical information collected for a cohort of 80 patients monitored over time. Among these patients, 32.5% have experienced the appearance of a recurrence.</p><p><strong>Results: </strong>The designed model was able to well describe the observed sequence of events, providing a reliable ranking of the survival times based on the individual risk scores, and achieving a C-index equals to 70.00% (95% CI, 59.38-84.74).</p><p><strong>Conclusion: </strong>Accordingly, machine learning methods could support clinicians in discriminating between endometrial carcinosarcoma patients at low-risk or high-risk of recurrence, in a non-invasive and inexpensive way. To the best of our knowledge, this is the first study proposing a preliminary approach addressing this task.</p>","PeriodicalId":33315,"journal":{"name":"Frontiers in Artificial Intelligence","volume":"7 ","pages":"1388188"},"PeriodicalIF":3.0,"publicationDate":"2024-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11659245/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142877970","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-12-04eCollection Date: 2024-01-01DOI: 10.3389/frai.2024.1506676
Khalil El Gharib, Bakr Jundi, David Furfaro, Raja-Elie E Abdulnour
Diagnostic errors pose a significant public health challenge, affecting nearly 800,000 Americans annually, with even higher rates globally. In the ICU, these errors are particularly prevalent, leading to substantial morbidity and mortality. The clinical reasoning process aims to reduce diagnostic uncertainty and establish a plausible differential diagnosis but is often hindered by cognitive load, patient complexity, and clinician burnout. These factors contribute to cognitive biases that compromise diagnostic accuracy. Emerging technologies like large language models (LLMs) offer potential solutions to enhance clinical reasoning and improve diagnostic precision. In this perspective article, we explore the roles of LLMs, such as GPT-4, in addressing diagnostic challenges in critical care settings through a case study of a critically ill patient managed with LLM assistance.
{"title":"AI-assisted human clinical reasoning in the ICU: beyond \"to err is human\".","authors":"Khalil El Gharib, Bakr Jundi, David Furfaro, Raja-Elie E Abdulnour","doi":"10.3389/frai.2024.1506676","DOIUrl":"10.3389/frai.2024.1506676","url":null,"abstract":"<p><p>Diagnostic errors pose a significant public health challenge, affecting nearly 800,000 Americans annually, with even higher rates globally. In the ICU, these errors are particularly prevalent, leading to substantial morbidity and mortality. The clinical reasoning process aims to reduce diagnostic uncertainty and establish a plausible differential diagnosis but is often hindered by cognitive load, patient complexity, and clinician burnout. These factors contribute to cognitive biases that compromise diagnostic accuracy. Emerging technologies like large language models (LLMs) offer potential solutions to enhance clinical reasoning and improve diagnostic precision. In this perspective article, we explore the roles of LLMs, such as GPT-4, in addressing diagnostic challenges in critical care settings through a case study of a critically ill patient managed with LLM assistance.</p>","PeriodicalId":33315,"journal":{"name":"Frontiers in Artificial Intelligence","volume":"7 ","pages":"1506676"},"PeriodicalIF":3.0,"publicationDate":"2024-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11659639/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142877968","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}