Pub Date : 2025-04-01Epub Date: 2025-04-30DOI: 10.4258/hir.2025.31.2.166
Jean Seo, Sumin Park, Sungjoo Byun, Jinwook Choi, Jinho Choi, Hyopil Shin
Objectives: Developing large language models (LLMs) in biomedicine requires access to high-quality training and alignment tuning datasets. However, publicly available Korean medical preference datasets are scarce, hindering the advancement of Korean medical LLMs. This study constructs and evaluates the efficacy of the Korean Medical Preference Dataset (KoMeP), an alignment tuning dataset constructed with an automated pipeline, minimizing the high costs of human annotation.
Methods: KoMeP was generated using the DAHL score, an automated hallucination evaluation metric. Five LLMs (Dolly-v2-3B, MPT-7B, GPT-4o, Qwen-2-7B, Llama-3-8B) produced responses to 8,573 biomedical examination questions, from which 5,551 preference pairs were extracted. Each pair consisted of a "chosen" response and a "rejected" response, as determined by their DAHL scores. The dataset was evaluated when trained through two different alignment tuning methods, direct preference optimization (DPO) and odds ratio preference optimization (ORPO) respectively across five different models. The KorMedMCQA benchmark was employed to assess the effectiveness of alignment tuning.
Results: Models trained with DPO consistently improved KorMedMCQA performance; notably, Llama-3.1-8B showed a 43.96% increase. In contrast, ORPO training produced inconsistent results. Additionally, English-to-Korean transfer learning proved effective, particularly for English-centric models like Gemma-2, whereas Korean-to-English transfer learning achieved limited success. Instruction tuning with KoMeP yielded mixed outcomes, which suggests challenges in dataset formatting.
Conclusions: KoMeP is the first publicly available Korean medical preference dataset and significantly improves alignment tuning performance in LLMs. The DPO method outperforms ORPO in alignment tuning. Future work should focus on expanding KoMeP, developing a Korean-native dataset, and refining alignment tuning methods to produce safer and more reliable Korean medical LLMs.
{"title":"Advancing Korean Medical Large Language Models: Automated Pipeline for Korean Medical Preference Dataset Construction.","authors":"Jean Seo, Sumin Park, Sungjoo Byun, Jinwook Choi, Jinho Choi, Hyopil Shin","doi":"10.4258/hir.2025.31.2.166","DOIUrl":"10.4258/hir.2025.31.2.166","url":null,"abstract":"<p><strong>Objectives: </strong>Developing large language models (LLMs) in biomedicine requires access to high-quality training and alignment tuning datasets. However, publicly available Korean medical preference datasets are scarce, hindering the advancement of Korean medical LLMs. This study constructs and evaluates the efficacy of the Korean Medical Preference Dataset (KoMeP), an alignment tuning dataset constructed with an automated pipeline, minimizing the high costs of human annotation.</p><p><strong>Methods: </strong>KoMeP was generated using the DAHL score, an automated hallucination evaluation metric. Five LLMs (Dolly-v2-3B, MPT-7B, GPT-4o, Qwen-2-7B, Llama-3-8B) produced responses to 8,573 biomedical examination questions, from which 5,551 preference pairs were extracted. Each pair consisted of a \"chosen\" response and a \"rejected\" response, as determined by their DAHL scores. The dataset was evaluated when trained through two different alignment tuning methods, direct preference optimization (DPO) and odds ratio preference optimization (ORPO) respectively across five different models. The KorMedMCQA benchmark was employed to assess the effectiveness of alignment tuning.</p><p><strong>Results: </strong>Models trained with DPO consistently improved KorMedMCQA performance; notably, Llama-3.1-8B showed a 43.96% increase. In contrast, ORPO training produced inconsistent results. Additionally, English-to-Korean transfer learning proved effective, particularly for English-centric models like Gemma-2, whereas Korean-to-English transfer learning achieved limited success. Instruction tuning with KoMeP yielded mixed outcomes, which suggests challenges in dataset formatting.</p><p><strong>Conclusions: </strong>KoMeP is the first publicly available Korean medical preference dataset and significantly improves alignment tuning performance in LLMs. The DPO method outperforms ORPO in alignment tuning. Future work should focus on expanding KoMeP, developing a Korean-native dataset, and refining alignment tuning methods to produce safer and more reliable Korean medical LLMs.</p>","PeriodicalId":12947,"journal":{"name":"Healthcare Informatics Research","volume":"31 2","pages":"166-174"},"PeriodicalIF":2.3,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12086433/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144093370","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-04-01Epub Date: 2025-04-30DOI: 10.4258/hir.2025.31.2.125
Kingsley F Attai, Constance Amannah, Moses Ekpenyong, Daniel E Asuquo, Oryina K Akputu, Okure U Obot, Peterben C Ajuga, Jeremiah C Obi, Omosivie Maduka, Christie Akwaowo, Faith-Michael Uzoka
Objectives: This study proposes a mobile-based explainable artificial intelligence (XAI) platform designed for diagnosing febrile illnesses.
Methods: We integrated the interpretability offered by local interpretable model-agnostic explanations (LIME) and the explainability provided by generative pre-trained transformers (GPT) to bridge the gap in understanding and trust often created by machine learning models in critical healthcare decision-making. The developed system employed random forest for disease diagnosis, LIME for interpretation of the results, and GPT-3.5 for generating explanations in easy-to-understand language.
Results: Our model demonstrated robust performance in detecting malaria, achieving precision, recall, and F1-scores of 85%, 91%, and 88%, respectively. It performed moderately well in detecting urinary tract and respiratory tract infections, with precision, recall, and F1-scores of 80%, 65%, and 72%, and 77%, 68%, and 72%, respectively, maintaining an effective balance between sensitivity and specificity. However, the model exhibited limitations in detecting typhoid fever and human immunodeficiency virus/acquired immune deficiency syndrome, achieving lower precision, recall, and F1-scores of 69%, 53%, and 60%, and 75%, 39%, and 51%, respectively. These results indicate missed true-positive cases, necessitating further model fine-tuning. LIME and GPT-3.5 were integrated to enhance transparency and provide natural language explanations, thereby aiding decision-making and improving user comprehension of the diagnoses.
Conclusions: The LIME plots revealed key symptoms influencing the diagnoses, with bitter taste in the mouth and fever showing the highest negative influence on predictions, and GPT-3.5 provided natural language explanations that increased the reliability and trustworthiness of the system, promoting improved patient outcomes and reducing the healthcare burden.
{"title":"Developing an Explainable Artificial Intelligence System for the Mobile-Based Diagnosis of Febrile Diseases Using Random Forest, LIME, and GPT.","authors":"Kingsley F Attai, Constance Amannah, Moses Ekpenyong, Daniel E Asuquo, Oryina K Akputu, Okure U Obot, Peterben C Ajuga, Jeremiah C Obi, Omosivie Maduka, Christie Akwaowo, Faith-Michael Uzoka","doi":"10.4258/hir.2025.31.2.125","DOIUrl":"10.4258/hir.2025.31.2.125","url":null,"abstract":"<p><strong>Objectives: </strong>This study proposes a mobile-based explainable artificial intelligence (XAI) platform designed for diagnosing febrile illnesses.</p><p><strong>Methods: </strong>We integrated the interpretability offered by local interpretable model-agnostic explanations (LIME) and the explainability provided by generative pre-trained transformers (GPT) to bridge the gap in understanding and trust often created by machine learning models in critical healthcare decision-making. The developed system employed random forest for disease diagnosis, LIME for interpretation of the results, and GPT-3.5 for generating explanations in easy-to-understand language.</p><p><strong>Results: </strong>Our model demonstrated robust performance in detecting malaria, achieving precision, recall, and F1-scores of 85%, 91%, and 88%, respectively. It performed moderately well in detecting urinary tract and respiratory tract infections, with precision, recall, and F1-scores of 80%, 65%, and 72%, and 77%, 68%, and 72%, respectively, maintaining an effective balance between sensitivity and specificity. However, the model exhibited limitations in detecting typhoid fever and human immunodeficiency virus/acquired immune deficiency syndrome, achieving lower precision, recall, and F1-scores of 69%, 53%, and 60%, and 75%, 39%, and 51%, respectively. These results indicate missed true-positive cases, necessitating further model fine-tuning. LIME and GPT-3.5 were integrated to enhance transparency and provide natural language explanations, thereby aiding decision-making and improving user comprehension of the diagnoses.</p><p><strong>Conclusions: </strong>The LIME plots revealed key symptoms influencing the diagnoses, with bitter taste in the mouth and fever showing the highest negative influence on predictions, and GPT-3.5 provided natural language explanations that increased the reliability and trustworthiness of the system, promoting improved patient outcomes and reducing the healthcare burden.</p>","PeriodicalId":12947,"journal":{"name":"Healthcare Informatics Research","volume":"31 2","pages":"125-135"},"PeriodicalIF":2.3,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12086442/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144093374","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-04-01Epub Date: 2025-04-30DOI: 10.4258/hir.2025.31.2.136
Junseo Kim, Seok Jun Kim, Junseok Ahn, Suehyun Lee
Objectives: This research aimed to develop a retrieval-augmented generation (RAG) based large language model (LLM) system that offers personalized and reliable responses to a wide range of concerns raised by Korean adolescents. Our work focuses on building a culturally reflective dataset and on designing and validating the system's effectiveness by comparing the answer quality of RAG-based models with non-RAG models.
Methods: Data were collected from the NAVER Knowledge iN platform, concentrating on posts that featured adolescents' questions and corresponding expert responses during the period 2014-2024. The dataset comprises 3,874 cases, categorized by key negative emotions and the primary sources of worry. The data were processed to remove irrelevant or redundant content and then classified into general and detailed causes. The RAG-based model employed FAISS for similarity-based retrieval of the top three reference cases and used GPT-4o mini for response generation. The responses generated with and without RAG were evaluated using several metrics.
Results: RAG-based responses outperformed non-RAG responses across all evaluation metrics. Key findings indicate that RAG-based responses delivered more specific, empathetic, and actionable guidance, particularly when addressing complex emotional and situational concerns. The analysis revealed that family relationships, peer interactions, and academic stress are significant factors affecting adolescents' worries, with depression and stress frequently co-occurring.
Conclusions: This study demonstrates the potential of RAG-based LLMs to address the diverse and culture-specific worries of Korean adolescents. By integrating external knowledge and offering personalized support, the proposed system provides a scalable approach to enhancing mental health interventions for adolescents. Future research should concentrate on expanding the dataset and improving multiturn conversational capabilities to deliver even more comprehensive support.
{"title":"LLM-Based Response Generation for Korean Adolescents: A Study Using the NAVER Knowledge iN Q&A Dataset with RAG.","authors":"Junseo Kim, Seok Jun Kim, Junseok Ahn, Suehyun Lee","doi":"10.4258/hir.2025.31.2.136","DOIUrl":"10.4258/hir.2025.31.2.136","url":null,"abstract":"<p><strong>Objectives: </strong>This research aimed to develop a retrieval-augmented generation (RAG) based large language model (LLM) system that offers personalized and reliable responses to a wide range of concerns raised by Korean adolescents. Our work focuses on building a culturally reflective dataset and on designing and validating the system's effectiveness by comparing the answer quality of RAG-based models with non-RAG models.</p><p><strong>Methods: </strong>Data were collected from the NAVER Knowledge iN platform, concentrating on posts that featured adolescents' questions and corresponding expert responses during the period 2014-2024. The dataset comprises 3,874 cases, categorized by key negative emotions and the primary sources of worry. The data were processed to remove irrelevant or redundant content and then classified into general and detailed causes. The RAG-based model employed FAISS for similarity-based retrieval of the top three reference cases and used GPT-4o mini for response generation. The responses generated with and without RAG were evaluated using several metrics.</p><p><strong>Results: </strong>RAG-based responses outperformed non-RAG responses across all evaluation metrics. Key findings indicate that RAG-based responses delivered more specific, empathetic, and actionable guidance, particularly when addressing complex emotional and situational concerns. The analysis revealed that family relationships, peer interactions, and academic stress are significant factors affecting adolescents' worries, with depression and stress frequently co-occurring.</p><p><strong>Conclusions: </strong>This study demonstrates the potential of RAG-based LLMs to address the diverse and culture-specific worries of Korean adolescents. By integrating external knowledge and offering personalized support, the proposed system provides a scalable approach to enhancing mental health interventions for adolescents. Future research should concentrate on expanding the dataset and improving multiturn conversational capabilities to deliver even more comprehensive support.</p>","PeriodicalId":12947,"journal":{"name":"Healthcare Informatics Research","volume":"31 2","pages":"136-145"},"PeriodicalIF":2.3,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12086440/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144093452","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-01-01Epub Date: 2025-01-31DOI: 10.4258/hir.2025.31.1.66
Keni Lee, Ramzi Argoubi, Halley Costantino
Objectives: To identify the right interventions for the right heart failure (HF) patients in the real-world setting using machine learning (ML) trained on individual-level clinical data linked with social determinants of health (SDOH) data.
Methods: In this retrospective cohort study, point-of-care claims data from Komodo Health and SDOH data from the National Health and Wellness Survey (NHWS), from January 2014-December 2020, were linked. Data mining was conducted using K-means clustering, an ML tool. Komodo Health data were used to access longitudinal data for the selected patient cohorts and crosssectional data from NHWS for additional patient information. The primary outcome was HF-related hospitalizations; secondary outcomes, all-cause hospitalization and all-cause mortality. Use of digital healthcare (DHC)/non-DHC interventions and related outcomes were also assessed.
Results: The study population included 353 HF patients (mean age, 63.5 years; 57.2% women). The use of non-DHC (75.9%-81.9%) and DHC (4.0%-9.1%) interventions increased from baseline to followup. Overall, 17.0% of patients had HF-related hospitalizations (DHC, 6.9%; non-DHC, 16.5%) and 45.0% had all-cause hospitalization (DHC, 75.0%; non-DHC, 50.9%). Two archetypes with distinct patient profiles were identified. Archetype 1 (vs. 2) characterised by older age, greater disease severity, more comorbidities, more medication use, took steps to prevent heart attack/problems, had better lifestyle, higher HF-related hospitalizations (18.3% vs. 16.3%) and lower all-cause hospitalizations (42.9% vs. 46.3%). The trends remained the same regardless of the intervention type.
Conclusions: Identification of patient archetypes with distinct profiles can be useful to understand underlying disease subtypes, identify specific interventions, predict clinical outcomes, and define the right intervention for the right patient.
{"title":"Data Mining to Identify the Right Interventions for the Right Patient for Heart Failure: A Real-World Study.","authors":"Keni Lee, Ramzi Argoubi, Halley Costantino","doi":"10.4258/hir.2025.31.1.66","DOIUrl":"10.4258/hir.2025.31.1.66","url":null,"abstract":"<p><strong>Objectives: </strong>To identify the right interventions for the right heart failure (HF) patients in the real-world setting using machine learning (ML) trained on individual-level clinical data linked with social determinants of health (SDOH) data.</p><p><strong>Methods: </strong>In this retrospective cohort study, point-of-care claims data from Komodo Health and SDOH data from the National Health and Wellness Survey (NHWS), from January 2014-December 2020, were linked. Data mining was conducted using K-means clustering, an ML tool. Komodo Health data were used to access longitudinal data for the selected patient cohorts and crosssectional data from NHWS for additional patient information. The primary outcome was HF-related hospitalizations; secondary outcomes, all-cause hospitalization and all-cause mortality. Use of digital healthcare (DHC)/non-DHC interventions and related outcomes were also assessed.</p><p><strong>Results: </strong>The study population included 353 HF patients (mean age, 63.5 years; 57.2% women). The use of non-DHC (75.9%-81.9%) and DHC (4.0%-9.1%) interventions increased from baseline to followup. Overall, 17.0% of patients had HF-related hospitalizations (DHC, 6.9%; non-DHC, 16.5%) and 45.0% had all-cause hospitalization (DHC, 75.0%; non-DHC, 50.9%). Two archetypes with distinct patient profiles were identified. Archetype 1 (vs. 2) characterised by older age, greater disease severity, more comorbidities, more medication use, took steps to prevent heart attack/problems, had better lifestyle, higher HF-related hospitalizations (18.3% vs. 16.3%) and lower all-cause hospitalizations (42.9% vs. 46.3%). The trends remained the same regardless of the intervention type.</p><p><strong>Conclusions: </strong>Identification of patient archetypes with distinct profiles can be useful to understand underlying disease subtypes, identify specific interventions, predict clinical outcomes, and define the right intervention for the right patient.</p>","PeriodicalId":12947,"journal":{"name":"Healthcare Informatics Research","volume":"31 1","pages":"66-87"},"PeriodicalIF":2.3,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11854618/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143457610","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-01-01Epub Date: 2025-01-31DOI: 10.4258/hir.2025.31.1.4
Eun-Gee Park, Min Jung Kim, Jinseo Kim, Kichul Shin, Borim Ryu
Objectives: We aimed to derive observational research evidence on treatment patterns through a scoping review of common data model (CDM)-based publications.
Methods: We searched the medical literature databases PubMed and EMBASE, as well as the Observational Health Data Sciences and Informatics (OHDSI) website, for papers published between January 1, 2010 and August 21, 2023 to identify research papers relevant to our topic.
Results: Eighteen articles satisfied the inclusion criteria for this scoping review. We summarized study characteristics such as phenotypes, patient numbers, data periods, countries, Observational Medical Outcomes Partnership (OMOP) CDM databases, and definitions of index date and target cohort. Type 2 diabetes mellitus emerged as the most frequently studied disease, covered in five articles, followed by hypertension and depression, each addressed in four articles. Biguanides, with metformin as the primary drug, were the most commonly prescribed first-line treatments for type 2 diabetes mellitus. Most studies utilized sunburst plots to visualize treatment patterns, whereas two studies used Sankey plots. Various software tools were employed for treatment pattern analysis, including JavaScript, the open-source ATLAS by OHDSI, R code, and the R package "TreatmentPatterns."
Conclusions: This study provides a comprehensive overview of research on treatment patterns using the CDM, highlighting the growing importance of OMOP CDM in enabling multinational observational network studies and advancing collaborative research in this field.
{"title":"Utility of Treatment Pattern Analysis Using a Common Data Model: A Scoping Review.","authors":"Eun-Gee Park, Min Jung Kim, Jinseo Kim, Kichul Shin, Borim Ryu","doi":"10.4258/hir.2025.31.1.4","DOIUrl":"10.4258/hir.2025.31.1.4","url":null,"abstract":"<p><strong>Objectives: </strong>We aimed to derive observational research evidence on treatment patterns through a scoping review of common data model (CDM)-based publications.</p><p><strong>Methods: </strong>We searched the medical literature databases PubMed and EMBASE, as well as the Observational Health Data Sciences and Informatics (OHDSI) website, for papers published between January 1, 2010 and August 21, 2023 to identify research papers relevant to our topic.</p><p><strong>Results: </strong>Eighteen articles satisfied the inclusion criteria for this scoping review. We summarized study characteristics such as phenotypes, patient numbers, data periods, countries, Observational Medical Outcomes Partnership (OMOP) CDM databases, and definitions of index date and target cohort. Type 2 diabetes mellitus emerged as the most frequently studied disease, covered in five articles, followed by hypertension and depression, each addressed in four articles. Biguanides, with metformin as the primary drug, were the most commonly prescribed first-line treatments for type 2 diabetes mellitus. Most studies utilized sunburst plots to visualize treatment patterns, whereas two studies used Sankey plots. Various software tools were employed for treatment pattern analysis, including JavaScript, the open-source ATLAS by OHDSI, R code, and the R package \"TreatmentPatterns.\"</p><p><strong>Conclusions: </strong>This study provides a comprehensive overview of research on treatment patterns using the CDM, highlighting the growing importance of OMOP CDM in enabling multinational observational network studies and advancing collaborative research in this field.</p>","PeriodicalId":12947,"journal":{"name":"Healthcare Informatics Research","volume":"31 1","pages":"4-15"},"PeriodicalIF":2.3,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11854637/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143457776","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-01-01Epub Date: 2025-01-31DOI: 10.4258/hir.2025.31.1.96
Nazlee Sharmin, Shahram Houshyar, Thomas R Stevenson, Ava K Chow
Objectives: A growing number of health professional institutions around the world are embracing innovative technologies to increase student engagement, primarily to improve clinical and simulated learning experiences. Didactic learning is an essential component of dental and medical curricula. However, limited research is available regarding the implementation of technology-infused teaching in classroom settings. We developed self-paced interactive learning content using the HTML5 Package (H5P) to promote student engagement in a didactic course within a dental hygiene program.
Methods: A total of 52 interactive artifacts were created and administered to students as supplementary learning material. A descriptive study was conducted to explore student perceptions and engagement with the H5P content, as well as to evaluate the impact of these artifacts on academic performance.
Results: Students performed significantly better on exam questions associated with interactive H5P content posted in the learning management system compared to other questions. Most students were highly engaged with the H5P content during the week leading up to each summative assessment. However, two of the three students with the highest course grades demonstrated consistent engagement with this content throughout the course.
Conclusions: Our results highlight the effectiveness of interactive content created using the H5P platform in fostering student engagement. The development of self-paced interactive materials may benefit various aspects of didactic teaching, including both synchronous and asynchronous online learning.
{"title":"Interactive Engagement with Self-Paced Learning Content in a Didactic Course.","authors":"Nazlee Sharmin, Shahram Houshyar, Thomas R Stevenson, Ava K Chow","doi":"10.4258/hir.2025.31.1.96","DOIUrl":"10.4258/hir.2025.31.1.96","url":null,"abstract":"<p><strong>Objectives: </strong>A growing number of health professional institutions around the world are embracing innovative technologies to increase student engagement, primarily to improve clinical and simulated learning experiences. Didactic learning is an essential component of dental and medical curricula. However, limited research is available regarding the implementation of technology-infused teaching in classroom settings. We developed self-paced interactive learning content using the HTML5 Package (H5P) to promote student engagement in a didactic course within a dental hygiene program.</p><p><strong>Methods: </strong>A total of 52 interactive artifacts were created and administered to students as supplementary learning material. A descriptive study was conducted to explore student perceptions and engagement with the H5P content, as well as to evaluate the impact of these artifacts on academic performance.</p><p><strong>Results: </strong>Students performed significantly better on exam questions associated with interactive H5P content posted in the learning management system compared to other questions. Most students were highly engaged with the H5P content during the week leading up to each summative assessment. However, two of the three students with the highest course grades demonstrated consistent engagement with this content throughout the course.</p><p><strong>Conclusions: </strong>Our results highlight the effectiveness of interactive content created using the H5P platform in fostering student engagement. The development of self-paced interactive materials may benefit various aspects of didactic teaching, including both synchronous and asynchronous online learning.</p>","PeriodicalId":12947,"journal":{"name":"Healthcare Informatics Research","volume":"31 1","pages":"96-106"},"PeriodicalIF":2.3,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11854625/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143457641","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-01-01Epub Date: 2025-01-31DOI: 10.4258/hir.2025.31.1.1
Jisan Lee, Taehoon Ko, Kwangmo Yang, Younghee Lee
{"title":"Review of the 2024 Fall Conference of the Korean Society of Medical Informatics-AI's Role in Shaping Modern Healthcare.","authors":"Jisan Lee, Taehoon Ko, Kwangmo Yang, Younghee Lee","doi":"10.4258/hir.2025.31.1.1","DOIUrl":"10.4258/hir.2025.31.1.1","url":null,"abstract":"","PeriodicalId":12947,"journal":{"name":"Healthcare Informatics Research","volume":"31 1","pages":"1-3"},"PeriodicalIF":2.3,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11854613/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143457695","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-01-01Epub Date: 2025-01-31DOI: 10.4258/hir.2025.31.1.37
R Bavatharani, V Supriya, Julius Xavier Scott, Suresh Sankaranarayanan
Objectives: Conventional face-to-face nutrition counseling has played a crucial role in promoting healthy habits. However, the emergence of digital health technologies has introduced mobile app-based nutrition counseling as an effective alternative. This research aims to develop and evaluate the usability and effectiveness of the FIT4PEDON mobile nutrition counseling application in promoting healthy dietary behaviors and lifestyle modifications among childhood cancer survivors (CCS).
Methods: This study employed a mixed-methods approach, incorporating both survey and qualitative and quantitative analyses. A total of 33 health care professional experts participated. The reliability of the questionnaire was assessed using the Kuder-Richardson method, and its content validity was confirmed through expert evaluation. Usability testing was conducted with a validated questionnaire.
Results: The development process resulted in two applications: an Android mobile application and an admin web application. The findings indicated that a significant proportion of experts endorsed the app for dietary management. Statistical analysis showed significant differences between "yes" and "no" responses. However, no significant differences were found when comparing responses across different sex or age groups.
Conclusions: The FIT4PEDON application shows promise in supporting CCS to adopt healthier lifestyles. Nevertheless, the study underscores the necessity for further research, particularly focusing on specific age groups of experts with relevant experience, to achieve more conclusive results. Leveraging technology through mobile apps has the potential to improve the quality of survivorship care and foster sustained engagement in long-term care for pediatric cancer survivors.
{"title":"FIT4PEDON: Mobile Nutrition Counseling Application Effectiveness and Usability for Childhood Cancer Survivors.","authors":"R Bavatharani, V Supriya, Julius Xavier Scott, Suresh Sankaranarayanan","doi":"10.4258/hir.2025.31.1.37","DOIUrl":"10.4258/hir.2025.31.1.37","url":null,"abstract":"<p><strong>Objectives: </strong>Conventional face-to-face nutrition counseling has played a crucial role in promoting healthy habits. However, the emergence of digital health technologies has introduced mobile app-based nutrition counseling as an effective alternative. This research aims to develop and evaluate the usability and effectiveness of the FIT4PEDON mobile nutrition counseling application in promoting healthy dietary behaviors and lifestyle modifications among childhood cancer survivors (CCS).</p><p><strong>Methods: </strong>This study employed a mixed-methods approach, incorporating both survey and qualitative and quantitative analyses. A total of 33 health care professional experts participated. The reliability of the questionnaire was assessed using the Kuder-Richardson method, and its content validity was confirmed through expert evaluation. Usability testing was conducted with a validated questionnaire.</p><p><strong>Results: </strong>The development process resulted in two applications: an Android mobile application and an admin web application. The findings indicated that a significant proportion of experts endorsed the app for dietary management. Statistical analysis showed significant differences between \"yes\" and \"no\" responses. However, no significant differences were found when comparing responses across different sex or age groups.</p><p><strong>Conclusions: </strong>The FIT4PEDON application shows promise in supporting CCS to adopt healthier lifestyles. Nevertheless, the study underscores the necessity for further research, particularly focusing on specific age groups of experts with relevant experience, to achieve more conclusive results. Leveraging technology through mobile apps has the potential to improve the quality of survivorship care and foster sustained engagement in long-term care for pediatric cancer survivors.</p>","PeriodicalId":12947,"journal":{"name":"Healthcare Informatics Research","volume":"31 1","pages":"37-47"},"PeriodicalIF":2.3,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11854622/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143457703","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-01-01Epub Date: 2025-01-31DOI: 10.4258/hir.2025.31.1.16
Lailil Muflikhah, Tirana Noor Fatyanosa, Nashi Widodo, Rizal Setya Perdana, Solimun, Hana Ratnawati
Objectives: Hypertension, commonly known as high blood pressure, is a prevalent and serious condition affecting a significant portion of the adult population globally. It is a chronic medical issue that, if left unaddressed, can lead to severe health complications, including kidney problems, heart disease, and stroke. This study aims to develop a feature selection model using the XGBoost algorithm to identify specific single nucleotide polymorphisms (SNPs) as biomarkers for detecting hypertension risk.
Methods: We propose using the high dimensionality of genetic variations (i.e., SNPs) to build a classifier model for prediction. In this study, SNPs were used as markers for hypertension in patients. We utilized the OpenSNP dataset, which includes 19,697 SNPs from 2,052 samples. Extreme gradient boosting (XGBoost) is an ensemble machine learning method employed here for feature selection, which incrementally adjusts weights in a series of steps.
Results: The experimental results identified 292 SNPs that exhibited high performance, with an F1-score of 98.55%, precision of 98.73%, recall of 98.38%, and overall accuracy of 98%. This study provides compelling evidence that the XGBoost feature selection method outperforms other representative feature selection methods, such as genetic algorithms, analysis of variance, chi-square, and principal component analysis, in predicting hypertension risk, demonstrating its effectiveness.
Conclusions: We developed a model for predicting hypertension using the SNPs dataset. The high dimensionality of SNP data was effectively managed to identify significant features as biomarkers using the XGBoost feature selection method. The results indicate high performance in predicting the risk of hypertension.
{"title":"Feature Selection for Hypertension Risk Prediction Using XGBoost on Single Nucleotide Polymorphism Data.","authors":"Lailil Muflikhah, Tirana Noor Fatyanosa, Nashi Widodo, Rizal Setya Perdana, Solimun, Hana Ratnawati","doi":"10.4258/hir.2025.31.1.16","DOIUrl":"10.4258/hir.2025.31.1.16","url":null,"abstract":"<p><strong>Objectives: </strong>Hypertension, commonly known as high blood pressure, is a prevalent and serious condition affecting a significant portion of the adult population globally. It is a chronic medical issue that, if left unaddressed, can lead to severe health complications, including kidney problems, heart disease, and stroke. This study aims to develop a feature selection model using the XGBoost algorithm to identify specific single nucleotide polymorphisms (SNPs) as biomarkers for detecting hypertension risk.</p><p><strong>Methods: </strong>We propose using the high dimensionality of genetic variations (i.e., SNPs) to build a classifier model for prediction. In this study, SNPs were used as markers for hypertension in patients. We utilized the OpenSNP dataset, which includes 19,697 SNPs from 2,052 samples. Extreme gradient boosting (XGBoost) is an ensemble machine learning method employed here for feature selection, which incrementally adjusts weights in a series of steps.</p><p><strong>Results: </strong>The experimental results identified 292 SNPs that exhibited high performance, with an F1-score of 98.55%, precision of 98.73%, recall of 98.38%, and overall accuracy of 98%. This study provides compelling evidence that the XGBoost feature selection method outperforms other representative feature selection methods, such as genetic algorithms, analysis of variance, chi-square, and principal component analysis, in predicting hypertension risk, demonstrating its effectiveness.</p><p><strong>Conclusions: </strong>We developed a model for predicting hypertension using the SNPs dataset. The high dimensionality of SNP data was effectively managed to identify significant features as biomarkers using the XGBoost feature selection method. The results indicate high performance in predicting the risk of hypertension.</p>","PeriodicalId":12947,"journal":{"name":"Healthcare Informatics Research","volume":"31 1","pages":"16-22"},"PeriodicalIF":2.3,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11854617/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143457532","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-01-01Epub Date: 2025-01-31DOI: 10.4258/hir.2025.31.1.23
Hyun A Shin, Hyeonji Kang, Mona Choi
Objectives: Emergency department (ED) overcrowding significantly impacts healthcare efficiency, safety, and resource management. Predictive models that utilize triage information can streamline the admission process. This review evaluates existing hospital admission prediction models that have been developed or validated using triage data for adult ED patients.
Methods: A systematic search of PubMed, Embase, CINAHL, Web of Science, and the Cochrane Library was conducted. Studies were selected if they developed or validated predictive models for hospital admission using triage data from adult ED patients. Data extraction adhered to the CHARMS (Checklist for Critical Appraisal and Data Extraction for Systematic Reviews of Prediction Modelling Studies), and the risk of bias was evaluated using PROBAST (Prediction model Risk of Bias Assessment Tool).
Results: Twenty studies met the inclusion criteria, employing logistic regression and machine learning techniques. Logistic regression was noted for its traditional use and clinical interpretability, whereas machine learning provided enhanced flexibility and potential for better predictive accuracy. Common predictors included patient demographics, triage category, vital signs, and mode of arrival. The area under the curve values for model performance ranged from 0.80 to 0.89, demonstrating strong discriminatory ability. However, external validation was limited, and there was variability in outcome definitions and model generalizability.
Conclusions: Predictive models based on triage data show promise in supporting ED operations by facilitating early predictions of hospital admissions, which could help decrease boarding times and enhance patient flow. Further research is necessary to validate these models in various settings to confirm their applicability and reliability.
目的:急诊科(ED)过度拥挤严重影响医疗效率、安全性和资源管理。利用分诊信息的预测模型可以简化入院过程。本综述评估了现有的住院预测模型,这些模型是利用成人急诊科患者的分诊数据开发或验证的。方法:系统检索PubMed、Embase、CINAHL、Web of Science、Cochrane Library。如果研究开发或验证了使用成人急诊科患者分诊数据的住院预测模型,则选择研究。数据提取遵循CHARMS(预测模型研究系统评价关键评价和数据提取清单),并使用PROBAST(预测模型偏倚风险评估工具)评估偏倚风险。结果:20项研究符合纳入标准,采用逻辑回归和机器学习技术。逻辑回归以其传统用途和临床可解释性而闻名,而机器学习提供了增强的灵活性和更好的预测准确性的潜力。常见的预测因素包括患者人口统计、分诊类别、生命体征和到达方式。模型性能曲线下面积在0.80 ~ 0.89之间,具有较强的判别能力。然而,外部验证是有限的,并且在结果定义和模型推广方面存在可变性。结论:基于分诊数据的预测模型通过促进住院情况的早期预测,有望支持急诊科手术,有助于减少住院时间并提高患者流量。需要进一步的研究来验证这些模型在不同环境下的适用性和可靠性。
{"title":"Triage Data-Driven Prediction Models for Hospital Admission of Emergency Department Patients: A Systematic Review.","authors":"Hyun A Shin, Hyeonji Kang, Mona Choi","doi":"10.4258/hir.2025.31.1.23","DOIUrl":"10.4258/hir.2025.31.1.23","url":null,"abstract":"<p><strong>Objectives: </strong>Emergency department (ED) overcrowding significantly impacts healthcare efficiency, safety, and resource management. Predictive models that utilize triage information can streamline the admission process. This review evaluates existing hospital admission prediction models that have been developed or validated using triage data for adult ED patients.</p><p><strong>Methods: </strong>A systematic search of PubMed, Embase, CINAHL, Web of Science, and the Cochrane Library was conducted. Studies were selected if they developed or validated predictive models for hospital admission using triage data from adult ED patients. Data extraction adhered to the CHARMS (Checklist for Critical Appraisal and Data Extraction for Systematic Reviews of Prediction Modelling Studies), and the risk of bias was evaluated using PROBAST (Prediction model Risk of Bias Assessment Tool).</p><p><strong>Results: </strong>Twenty studies met the inclusion criteria, employing logistic regression and machine learning techniques. Logistic regression was noted for its traditional use and clinical interpretability, whereas machine learning provided enhanced flexibility and potential for better predictive accuracy. Common predictors included patient demographics, triage category, vital signs, and mode of arrival. The area under the curve values for model performance ranged from 0.80 to 0.89, demonstrating strong discriminatory ability. However, external validation was limited, and there was variability in outcome definitions and model generalizability.</p><p><strong>Conclusions: </strong>Predictive models based on triage data show promise in supporting ED operations by facilitating early predictions of hospital admissions, which could help decrease boarding times and enhance patient flow. Further research is necessary to validate these models in various settings to confirm their applicability and reliability.</p>","PeriodicalId":12947,"journal":{"name":"Healthcare Informatics Research","volume":"31 1","pages":"23-36"},"PeriodicalIF":2.3,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11854635/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143457773","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}