Critical View of Safety (CVS) is the accepted strategy to avoid bile duct injury during Laparoscopic Cholecystectomy (LC). In this study, we sought to investigate the accuracy and performance of a trained Artificial Intelligent (AI) model in validation of the CVS achievement during elective LC in a real time operating room setting.
A deep learning neural network which was previously trained on annotated segments of 700 LC videos to identify the CVS criteria, was integrated into the operating room laparoscopic video system, for continuous monitoring and real-time validation of CVS achievement during elective LC procedures. The system's feedback and surgeon's report were recorded and compared, as well as the overall rate of CVS achievement.
Of 40 consecutive LC, CVS was reported by the surgeons in 34 (85 %). In all the 6 cases where CVS was not achieved due to severe inflammation or anatomy distortion, the AI model agreed with surgeon's report and did not identify CVS. Out of the 34 cases where CVS was achieved, the AI model identified 33. Thus, the AI model detected the CVS achievement with a specificity of 100 % [95%-CI 98.1 %, 100 %] and sensitivity of 97 % [95%-CI 96.1 %, 98.2 %].
A trained AI model can identify CVS during elective LC with very high accuracy in a real time OR setting. Additionally, its use may result in high rates of CVS achievement, thereby improving LC procedure's safety and outcome.
Reinforcement learning (RL) has helped improve decision-making in several domains but can be challenging to apply; this is the case for rehabilitation of people with a spinal cord injury (SCI). Among other factors, applying RL in this domain is difficult because there are many possible treatments (i.e., large action space) and few detailed records of longitudinal treatments and outcomes (i.e., limited training data). Applying Fitted Q Iteration in this domain with linear models and the most natural state and action representation results in problems with convergence and overfitting. However, isolating treatments from one another can mitigate the convergence issue, and treatments for SCIs have meaningful groupings that can be used to combat overfitting. We propose two approaches to grouping treatments so that an RL agent can learn effectively from limited data. One relies on domain knowledge of SCI rehabilitation and the other learns similarities among treatments using an embedding technique. After re-interpreting the data using these treatment grouping approaches in conjunction with our process that isolates the treatment groups, we use Fitted Q Iteration to train an agent that learns to select better treatments. Through a simulation study designed to reflect the properties of SCI rehabilitation, we find that agents trained after using either grouping method can help improve the treatment decisions of individual physiotherapists, but the approach based on domain knowledge offers better performance. Our findings provide a proof of concept that applying RL has the potential to help improve the treatment of those with an SCI and indicates that continued efforts to gather data and apply RL to this domain are worthwhile.
Breast cancer is the most common cancer worldwide. The standard imaging modality for breast cancer screening is X-ray mammography, which suffers from low sensitivities in women with dense breasts and can potentially cause cancers despite a low radiation dosage. Diffuse Optical Tomography (DOT) is a noninvasive imaging technique that can potentially be employed to improve breast cancer early detection. However, conventional model-based algorithms for reconstructing DOT images usually produce low-quality images with limited resolution and low reconstruction accuracy. We propose to integrate deep neural networks (DNNs) with the conventional DOT reconstruction methods. This hybrid framework significantly enhances image quality. The DNNs have been trained and tested with sample data derived from clinically relevant breast models. The sample dataset contains blood vessel structures from breast structures and artificially created vessels using the Lindenmayer-system algorithm. By comparing the hybrid reconstruction with the ground truth image, we demonstrated a multi scale - structural similarity index measure (MS-SSIM) score of 0.80–0.90. Whereas using conventional reconstruction, MS-SSIM provided a much inferior score of 0.36–0.59. In terms of DOT image quality, both qualitative and quantitative assessments of the reconstructed images signify that the hybrid approach is superior to conventional methods. This improvement suggests that DOT can potentially become a viable alternative to breast cancer screening, providing a step towards the next-generation device for optical mammography.
Recent advances in Artificial Intelligence (AI) offer promising opportunities for its use in pediatric healthcare. This is especially true for early identification of developmental problems where timely intervention is essential, but developmental assessments are resource-intensive. AI carries potential as a valuable tool in the early detection of such developmental issues. In this systematic review, we aim to synthesize and evaluate the current literature on AI-usage in monitoring child development, including possible clinical outcomes, and acceptability of such technologies by different stakeholders.
The systematic review is based on a literature search comprising the databases PubMed, Cochrane Library, Scopus, Web of Science, Science Direct, PsycInfo, ACM and Google Scholar (time interval 1996–2022). All articles addressing AI-usage in monitoring child development or describing respective clinical outcomes and opinions were included.
Out of 2814 identified articles, finally 71 were included. 70 reported on AI usage and one study dealt with users’ acceptance of AI. No article reported on potential clinical outcomes of AI applications. Articles showed a peak from 2020 to 2022. The majority of studies were from the US, China and India (n = 45) and mostly used pre-existing datasets such as electronic health records or speech and video recordings. The most used AI methods were support vector machines and deep learning.
A few well-proven AI applications in developmental monitoring exist. However, the majority has not been evaluated in clinical practice. The subdomains of cognitive, social and language development are particularly well-represented. Another focus is on early detection of autism. Potential clinical outcomes of AI usage and user's acceptance have rarely been considered yet. While the increase of publications in recent years suggests an increasing interest in AI implementation in child development monitoring, future research should focus on clinical practice application and stakeholder's needs.
Advances in machine and deep learning techniques provide a novel approach in understanding complex patterns within large datasets, leading to an implementation of personalized medicine approaches to support clinical decision making. Results from recent clinical trials (TENT-A1 and TENT-A2 studies; clinicaltrials.gov: NCT02669069 and NCT03530306) support that a novel bimodal neuromodulation approach could be a breakthrough treatment for patients with tinnitus, which adversely affects 10–15 % of the population. Given the heterogeneity of symptoms, it is important to identify whether treatment has an optimal effect on specific subgroups of tinnitus patients. The current study is a first look at the feasibility of using deep learning modelling on patient reported data to predict treatment outcomes in individuals with tinnitus, and highlights what features are most beneficial for clinical decision making.
Vital sign assessment is an examination that indicates changes in health. Direct contact during vital signs assessment can increase the risk of disease transmission. This research aimed to develop a contactless vital sign monitoring prototype that includes heart rate, respiratory rate, blood pressure, and oxygen saturation using a digital camera based on remote photoplethysmography with an adaptive region of interest. The adaptive region-of-interest method uses face detection and skin segmentation to generate red-green-blue signals, taking only the skin pixels of the patients while also minimising the effect of motion artefacts. The hybrid processing method combines several vital sign extraction methods to filter external irrelevant factors and produce heart rate, respiratory rate, blood pressure, and blood oxygen saturation values. In addition, the prototype was tested on 50 participants using standard vital sign assessment tools for comparison. The technical specification test of the prototype concluded that the optimal distance of this prototype was up to 2 m with a processing time of 2 s for every 1-s video. The vital signs results were presented using Bland-Altman, which showed that although the Bland-Altman plots revealed a substantial variance in the limits of agreement (±15–20 mmHg for blood pressure, ±15–17 bpm for heart rate, ±4–6 bpm for respiratory rate, and ±1–3 % for blood oxygen saturation), the mean differences for all vital signs were small (±0.7–5 mmHg for blood pressure, ±0.4–0.6 bpm for heart rate, ±0.5–0.7 bpm for respiratory rate, ±0.4–0.6 for blood oxygen saturation) and most data points were within the limits. While further clinical studies are needed to assess its reliability in monitoring specific medical conditions, the prototype has shown an acceptable agreement in assessing vital signs compared to the conventional methods, making it feasible for further development into a medical device.
The paper aims to address the problem of massive unlabeled patients in electronic health records (EHR) who potentially have undiagnosed diabetic retinopathy (DR). It is desired to estimate the actual DR prevalence in EHR with 96 % missing labels.
The Cerner Health Facts data are used in the study, with 3749 labeled DR patients and 97,876 unlabeled diabetic patients. This extensive dataset spans the demographics of the United States over the past two decades. We implemented state-of-art positive-unlabeled learning methods, including ensemble-based support vector machine, ensemble-based random forest, and Bayesian finite mixture modeling.
The estimated DR prevalence in the population represented by Cerner EHR is approximately 25 % and the classification techniques generally achieve an AUC of around 87 %. As a by-product, a predictive inference on the risk of DR based on a patient's personalized medical information is derived.
Missing labels is a common issue for EHR data quality. Ignoring these missing labels can lead to biased results in the analyses of EHR data. The problem is especially severe in the context of DR. It is thus important to use machine learning or statistical tools to identify the unlabeled patients. The tool in this paper helps both data analysts and clinicians in their practices.