首页 > 最新文献

Proceedings. IEEE International Conference on Bioinformatics and Biomedicine最新文献

英文 中文
Determining and Validating Population Differences in Magnetic Resonance Angiography Using Sparse Representation. 利用稀疏表示确定和验证磁共振血管造影的人群差异。
Pub Date : 2022-12-01 DOI: 10.1109/bibm55620.2022.9994989
Steve Mendoza, Fabien Scalzo, Aichi Chien

Goal: Identifying population differences can serve as an insightful tool for diagnostic radiology. To do so, a reliable preprocessing framework and data representation are vital.

Methods: We build a machine learning model to visualize gender differences in the circle of Willis (CoW), an integral part of the brain's vasculature. We start with a dataset of 570 individuals and process them for analysis using 389 for the final analysis.

Results: We find statistical differences between male and female patients in one image plane and visualize where they are. We can see differences between the right and left-hand sides of the brain confirmed using Support Vector Machines (SVM).

Conclusion: This process can be applied to detect population variations in the vasculature automatically.

Significance: It can guide debugging and inferring complex machine learning algorithms such as SVM and deep learning models.

目的:确定人群差异可以作为诊断放射学的一个有见地的工具。为此,可靠的预处理框架和数据表示是至关重要的。方法:我们建立了一个机器学习模型来可视化威利斯圈(CoW)的性别差异,威利斯圈是大脑脉管系统的组成部分。我们从570个人的数据集开始,使用389个人进行最终分析。结果:我们发现男性和女性患者在一个图像平面上的统计差异,并可视化他们的位置。通过支持向量机(SVM),我们可以看到左右脑的差异。结论:该方法可用于血管种群变异的自动检测。意义:对SVM、深度学习模型等复杂机器学习算法的调试和推理具有指导意义。
{"title":"Determining and Validating Population Differences in Magnetic Resonance Angiography Using Sparse Representation.","authors":"Steve Mendoza,&nbsp;Fabien Scalzo,&nbsp;Aichi Chien","doi":"10.1109/bibm55620.2022.9994989","DOIUrl":"https://doi.org/10.1109/bibm55620.2022.9994989","url":null,"abstract":"<p><strong>Goal: </strong>Identifying population differences can serve as an insightful tool for diagnostic radiology. To do so, a reliable preprocessing framework and data representation are vital.</p><p><strong>Methods: </strong>We build a machine learning model to visualize gender differences in the circle of Willis (CoW), an integral part of the brain's vasculature. We start with a dataset of 570 individuals and process them for analysis using 389 for the final analysis.</p><p><strong>Results: </strong>We find statistical differences between male and female patients in one image plane and visualize where they are. We can see differences between the right and left-hand sides of the brain confirmed using Support Vector Machines (SVM).</p><p><strong>Conclusion: </strong>This process can be applied to detect population variations in the vasculature automatically.</p><p><strong>Significance: </strong>It can guide debugging and inferring complex machine learning algorithms such as SVM and deep learning models.</p>","PeriodicalId":74563,"journal":{"name":"Proceedings. IEEE International Conference on Bioinformatics and Biomedicine","volume":"2022 ","pages":"3101-3108"},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10170968/pdf/nihms-1889670.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9460588","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Using Twitter Data Analysis to Understand the Perceptions, Awareness, and Barriers to the Wide Use of Pre-Exposure Prophylaxis in the United States. 使用Twitter数据分析了解美国广泛使用暴露前预防的认知、意识和障碍。
Pub Date : 2022-12-01 DOI: 10.1109/bibm55620.2022.9995568
Arslan Erdengasileng, Shubo Tian, Sara S Green, Sylvie Naar, Zhe He

User-generated social media posts such as tweets can provide insights about the public's perception, cognitive, and behavioral responses to health-related issues. Pre-Exposure Prophylaxis (PrEP) is one of the most effective ways to reduce the risk of HIV infection. However, its utilization is low in the US, especially among populations disproportionately affected by HIV such as the age group of under 24 years old. It is therefore important to understand the barriers to the wider use of PrEP in the US using social media posts. In this study, we collected tweets from Twitter about PrEP in the past 4 years to identify such barriers by first identifying tweets about personal discussions, and then performing textual analysis using word analysis, UMLS semantic type analysis, and topic modeling. We found that the public often discussed advocacy, risks/benefits, access, pricing, insurance coverage, legislation, stigma, health education, and prevention of HIV. This result is consistent with the literature and can help identify strategies for promoting the use of PrEP, especially among young adults.

用户生成的社交媒体帖子,如推文,可以提供公众对健康相关问题的感知、认知和行为反应的见解。暴露前预防(PrEP)是降低艾滋病毒感染风险的最有效方法之一。然而,在美国,它的使用率很低,尤其是在受艾滋病毒影响不成比例的人群中,如24岁以下年龄组。因此,通过社交媒体帖子了解在美国广泛使用PrEP的障碍是很重要的。在本研究中,我们收集了过去4年Twitter上关于PrEP的推文,通过首先识别关于个人讨论的推文,然后使用单词分析、UMLS语义类型分析和主题建模进行文本分析,来识别这些障碍。我们发现公众经常讨论宣传、风险/收益、获取、定价、保险范围、立法、污名、健康教育和艾滋病毒预防。这一结果与文献一致,可以帮助确定促进PrEP使用的策略,特别是在年轻人中。
{"title":"Using Twitter Data Analysis to Understand the Perceptions, Awareness, and Barriers to the Wide Use of Pre-Exposure Prophylaxis in the United States.","authors":"Arslan Erdengasileng,&nbsp;Shubo Tian,&nbsp;Sara S Green,&nbsp;Sylvie Naar,&nbsp;Zhe He","doi":"10.1109/bibm55620.2022.9995568","DOIUrl":"https://doi.org/10.1109/bibm55620.2022.9995568","url":null,"abstract":"<p><p>User-generated social media posts such as tweets can provide insights about the public's perception, cognitive, and behavioral responses to health-related issues. Pre-Exposure Prophylaxis (PrEP) is one of the most effective ways to reduce the risk of HIV infection. However, its utilization is low in the US, especially among populations disproportionately affected by HIV such as the age group of under 24 years old. It is therefore important to understand the barriers to the wider use of PrEP in the US using social media posts. In this study, we collected tweets from Twitter about PrEP in the past 4 years to identify such barriers by first identifying tweets about personal discussions, and then performing textual analysis using word analysis, UMLS semantic type analysis, and topic modeling. We found that the public often discussed advocacy, risks/benefits, access, pricing, insurance coverage, legislation, stigma, health education, and prevention of HIV. This result is consistent with the literature and can help identify strategies for promoting the use of PrEP, especially among young adults.</p>","PeriodicalId":74563,"journal":{"name":"Proceedings. IEEE International Conference on Bioinformatics and Biomedicine","volume":"2022 ","pages":"3000-3007"},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9937556/pdf/nihms-1871903.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9549806","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Deep Learning Based MS2 Feature Detection for Data-Independent Shotgun Proteomics. 基于深度学习的 MS2 特征检测,用于与数据无关的射枪蛋白质组学。
Pub Date : 2022-12-01 Epub Date: 2023-01-02 DOI: 10.1109/bibm55620.2022.9995258
Jonathan He, Olivia Liu, Xuan Guo

Accuracy of peptide identification in LC-MS analysis is crucial for information regarding the aspects of proteins that aid in biomarker discovery and the profiling of complex proteomes. The detection of peptide fragment ions in tandem mass spectrometry is still challenging given that current tools were not created or tested for the low-abundance, low-peak fragments of peptides found in MS2 data. Feature detection, a crucial pre-processing step in the LC-MS analysis pipeline that quantifies peptides by their mass-to-charge ratio, retention time, and intensity, is particularly challenging due to the overlapping nature of peptides and weak signals that are often indistinguishable from noises, thus creating a reliance on rigid mathematical structures and heuristics. In this study, we developed a deep-learning-based model with an innovative sliding window process that enables high-resolution processing of quantitative MS/MS data to conduct MS2 feature detection. Experimental results show that our model can produce more accurate values and identifications than existing feature detection tools, as well as a high rate of true positive features quantified. Therefore, we believe that our model illustrates the advantages of deep learning techniques applied towards computational proteomics.

液相色谱-质谱分析中肽段鉴定的准确性对于蛋白质方面的信息至关重要,有助于生物标记物的发现和复杂蛋白质组的分析。在串联质谱中检测肽片段离子仍然是一项挑战,因为目前的工具并不是针对 MS2 数据中发现的低丰度、低峰值肽片段而开发或测试的。特征检测是液相色谱-质谱分析流水线中一个关键的预处理步骤,它通过肽段的质量电荷比、保留时间和强度对肽段进行量化,但由于肽段的重叠性以及弱信号往往无法与噪声区分开来,因此对僵化的数学结构和启发式方法产生了依赖,这一点尤其具有挑战性。在本研究中,我们开发了一种基于深度学习的模型,该模型具有创新的滑动窗口过程,可对定量 MS/MS 数据进行高分辨率处理,从而进行 MS2 特征检测。实验结果表明,与现有的特征检测工具相比,我们的模型能得出更准确的数值和识别结果,而且量化特征的真阳性率也很高。因此,我们认为我们的模型体现了深度学习技术在计算蛋白质组学方面的优势。
{"title":"Deep Learning Based MS2 Feature Detection for Data-Independent Shotgun Proteomics.","authors":"Jonathan He, Olivia Liu, Xuan Guo","doi":"10.1109/bibm55620.2022.9995258","DOIUrl":"10.1109/bibm55620.2022.9995258","url":null,"abstract":"<p><p>Accuracy of peptide identification in LC-MS analysis is crucial for information regarding the aspects of proteins that aid in biomarker discovery and the profiling of complex proteomes. The detection of peptide fragment ions in tandem mass spectrometry is still challenging given that current tools were not created or tested for the low-abundance, low-peak fragments of peptides found in MS2 data. Feature detection, a crucial pre-processing step in the LC-MS analysis pipeline that quantifies peptides by their mass-to-charge ratio, retention time, and intensity, is particularly challenging due to the overlapping nature of peptides and weak signals that are often indistinguishable from noises, thus creating a reliance on rigid mathematical structures and heuristics. In this study, we developed a deep-learning-based model with an innovative sliding window process that enables high-resolution processing of quantitative MS/MS data to conduct MS2 feature detection. Experimental results show that our model can produce more accurate values and identifications than existing feature detection tools, as well as a high rate of true positive features quantified. Therefore, we believe that our model illustrates the advantages of deep learning techniques applied towards computational proteomics.</p>","PeriodicalId":74563,"journal":{"name":"Proceedings. IEEE International Conference on Bioinformatics and Biomedicine","volume":"2022 ","pages":"2342-2348"},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10457098/pdf/nihms-1874655.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10107810","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
FineFDR: Fine-grained Taxonomy-specific False Discovery Rates Control in Metaproteomics. 细粒度分类特异性错误发现率控制在元蛋白质组学。
Pub Date : 2022-12-01 DOI: 10.1109/bibm55620.2022.9995401
Shengze Wang, Shichao Feng, Chongle Pan, Xuan Guo

Microbial community proteomics, also termed metaproteomics, investigates all proteins expressed by a microbiota. Tandem mass spectrometry (MS/MS) is the typical method for identifying proteins in metaproteomics, which involves searching the mass spectra against a protein sequence database. A major post-analysis step is controlling the false discovery rate (FDR), i.e., the ratio of false positives to the total number of annotations. The current popular target-decoy FDR estimation method treats all the peptides and proteins equally and overlooks that they could have varied probabilities of being identified. In this study, we report FineFDR, a framework for FDR assessment at fine-grained levels with taxonomy information considered. FineFDR groups the identified peptide-spectrum matches, peptides, and proteins from different taxonomic units and estimates the FDR in each group separately. Empirical experiments on the simulated and real-world data sets demonstrate that our FineFDR achieved higher precision and more peptide and protein identifications when compared to the state-of-the-art methods, such as Comet, Percolator, TIDD, and Tailor. FineFDR is freely available under the GNU GPL license at https://github.com/Biocomputing-Research-Group/FDR.

微生物群落蛋白质组学,也称为元蛋白质组学,研究微生物群表达的所有蛋白质。串联质谱法(MS/MS)是元蛋白质组学中鉴定蛋白质的典型方法,它涉及到根据蛋白质序列数据库搜索质谱。分析后的一个主要步骤是控制错误发现率(FDR),即误报率与注释总数的比率。目前流行的目标-诱饵FDR估计方法对所有肽和蛋白质都一视同仁,忽略了它们可能具有不同的被识别概率。在这项研究中,我们报告了FineFDR,一个细粒度级别的FDR评估框架,考虑了分类信息。FineFDR将鉴定出的肽谱匹配、多肽和来自不同分类单位的蛋白质进行分组,并分别估计每组的FDR。在模拟和真实数据集上的经验实验表明,与Comet、Percolator、TIDD和Tailor等最先进的方法相比,我们的FineFDR实现了更高的精度和更多的肽和蛋白质鉴定。FineFDR在GNU GPL许可下可在https://github.com/Biocomputing-Research-Group/FDR免费获得。
{"title":"FineFDR: Fine-grained Taxonomy-specific False Discovery Rates Control in Metaproteomics.","authors":"Shengze Wang,&nbsp;Shichao Feng,&nbsp;Chongle Pan,&nbsp;Xuan Guo","doi":"10.1109/bibm55620.2022.9995401","DOIUrl":"https://doi.org/10.1109/bibm55620.2022.9995401","url":null,"abstract":"<p><p>Microbial community proteomics, also termed metaproteomics, investigates all proteins expressed by a microbiota. Tandem mass spectrometry (MS/MS) is the typical method for identifying proteins in metaproteomics, which involves searching the mass spectra against a protein sequence database. A major post-analysis step is controlling the false discovery rate (FDR), i.e., the ratio of false positives to the total number of annotations. The current popular target-decoy FDR estimation method treats all the peptides and proteins equally and overlooks that they could have varied probabilities of being identified. In this study, we report FineFDR, a framework for FDR assessment at fine-grained levels with taxonomy information considered. FineFDR groups the identified peptide-spectrum matches, peptides, and proteins from different taxonomic units and estimates the FDR in each group separately. Empirical experiments on the simulated and real-world data sets demonstrate that our FineFDR achieved higher precision and more peptide and protein identifications when compared to the state-of-the-art methods, such as Comet, Percolator, TIDD, and Tailor. FineFDR is freely available under the GNU GPL license at https://github.com/Biocomputing-Research-Group/FDR.</p>","PeriodicalId":74563,"journal":{"name":"Proceedings. IEEE International Conference on Bioinformatics and Biomedicine","volume":"2022 ","pages":"287-292"},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9998077/pdf/nihms-1868490.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9112293","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
AutoMed: Automated Medical Risk Predictive Modeling on Electronic Health Records. AutoMed:电子健康记录上的自动医疗风险预测建模。
Pub Date : 2022-12-01 DOI: 10.1109/bibm55620.2022.9995209
Suhan Cui, Jiaqi Wang, Xinning Gui, Ting Wang, Fenglong Ma

Electronic health records (EHR) have been widely applied to various tasks in the medical domain such as risk predictive modeling, which aims to predict further health conditions by analyzing patients' historical EHR. Existing work mainly focuses on modeling the sequential and temporal characteristics of EHR data with advanced deep learning techniques. However, the network architectures of these models are all manually designed based on experts' prior knowledge, which largely impedes non-experts from exploring this task. To address this issue, in this paper, we propose a novel automated risk prediction model named AutoMed to automatically search the optimal model architecture for modeling the complex EHR data and improving the performance of the risk prediction task. In particular, we follow the idea of neural architecture search to design a search space that contains three separate searchable modules. Two of them are used for analyzing sequential and temporal features of EHR data, respectively. The third is to automatically fuse both features together. Besides these three modules, AutoMed contains an embedding module and a prediction module. All the three searchable modules are jointly optimized in the search stage to derive the optimal model architecture. In such a way, the model design can be automatically achieved with few human interventions. Experimental results on three real-world datasets show that AutoMed outperforms state-of-the-art baselines in terms of PR-AUC, F1, and Cohen's Kappa. Moreover, the ablation study shows that AutoMed can obtain reasonable model architectures and offer useful insights to the future risk prediction model design.

电子健康记录(EHR)已被广泛应用于医疗领域的各种任务,如风险预测建模,其目的是通过分析患者的历史EHR来预测进一步的健康状况。现有工作主要侧重于利用先进的深度学习技术对电子病历数据的顺序和时间特征进行建模。然而,这些模型的网络架构都是基于专家的先验知识手动设计的,这在很大程度上阻碍了非专业人员对这一任务的探索。为解决这一问题,我们在本文中提出了一种名为 AutoMed 的新型自动风险预测模型,以自动搜索最佳模型架构,对复杂的电子病历数据建模,提高风险预测任务的性能。具体而言,我们遵循神经架构搜索的理念,设计了一个包含三个独立可搜索模块的搜索空间。其中两个模块分别用于分析电子病历数据的顺序特征和时间特征。第三个模块用于将这两种特征自动融合在一起。除这三个模块外,AutoMed 还包含一个嵌入模块和一个预测模块。在搜索阶段,所有三个可搜索模块将共同优化,以得出最佳模型架构。这样,只需少量人工干预,就能自动完成模型设计。在三个真实世界数据集上的实验结果表明,AutoMed 在 PR-AUC、F1 和 Cohen's Kappa 方面都优于最先进的基线。此外,消融研究表明,AutoMed 可以获得合理的模型架构,并为未来的风险预测模型设计提供有益的启示。
{"title":"AutoMed: Automated Medical Risk Predictive Modeling on Electronic Health Records.","authors":"Suhan Cui, Jiaqi Wang, Xinning Gui, Ting Wang, Fenglong Ma","doi":"10.1109/bibm55620.2022.9995209","DOIUrl":"10.1109/bibm55620.2022.9995209","url":null,"abstract":"<p><p>Electronic health records (EHR) have been widely applied to various tasks in the medical domain such as risk predictive modeling, which aims to predict further health conditions by analyzing patients' historical EHR. Existing work mainly focuses on modeling the sequential and temporal characteristics of EHR data with advanced deep learning techniques. However, the network architectures of these models are all manually designed based on experts' prior knowledge, which largely impedes non-experts from exploring this task. To address this issue, in this paper, we propose a novel automated risk prediction model named AutoMed to automatically search the optimal model architecture for modeling the complex EHR data and improving the performance of the risk prediction task. In particular, we follow the idea of neural architecture search to design a search space that contains three separate searchable modules. Two of them are used for analyzing sequential and temporal features of EHR data, respectively. The third is to automatically fuse both features together. Besides these three modules, AutoMed contains an embedding module and a prediction module. All the three searchable modules are jointly optimized in the search stage to derive the optimal model architecture. In such a way, the model design can be automatically achieved with few human interventions. Experimental results on three real-world datasets show that AutoMed outperforms state-of-the-art baselines in terms of PR-AUC, F1, and Cohen's Kappa. Moreover, the ablation study shows that AutoMed can obtain reasonable model architectures and offer useful insights to the future risk prediction model design.</p>","PeriodicalId":74563,"journal":{"name":"Proceedings. IEEE International Conference on Bioinformatics and Biomedicine","volume":"2022 ","pages":"948-953"},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10102833/pdf/nihms-1889654.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9379505","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
KIT-LSTM: Knowledge-guided Time-aware LSTM for Continuous Clinical Risk Prediction. KIT-LSTM:用于临床风险持续预测的知识导向时间感知LSTM。
Pub Date : 2022-12-01 DOI: 10.1109/bibm55620.2022.9994931
Lucas Jing Liu, Victor Ortiz-Soriano, Javier A Neyra, Jin Chen

Rapid accumulation of temporal Electronic Health Record (EHR) data and recent advances in deep learning have shown high potential in precisely and timely predicting patients' risks using AI. However, most existing risk prediction approaches ignore the complex asynchronous and irregular problems in real-world EHR data. This paper proposes a novel approach called Knowledge-guIded Time-aware LSTM (KIT-LSTM) for continuous mortality predictions using EHR. KIT-LSTM extends LSTM with two time-aware gates and a knowledge-aware gate to better model EHR and interprets results. Experiments on real-world data for patients with acute kidney injury with dialysis (AKI-D) demonstrate that KIT-LSTM performs better than the state-of-the-art methods for predicting patients' risk trajectories and model interpretation. KIT-LSTM can better support timely decision-making for clinicians.

时间电子健康记录(EHR)数据的快速积累和深度学习的最新进展表明,利用人工智能精确、及时地预测患者的风险具有很大的潜力。然而,现有的风险预测方法大多忽略了现实电子病历数据中复杂的异步和不规则问题。本文提出了一种新的方法,称为知识引导的时间感知LSTM (KIT-LSTM),用于使用电子病历进行连续死亡率预测。KIT-LSTM对LSTM进行了扩展,增加了两个时间感知门和一个知识感知门,以更好地建模EHR并解释结果。对急性肾损伤透析患者(AKI-D)的真实数据进行的实验表明,KIT-LSTM在预测患者风险轨迹和模型解释方面比最先进的方法表现更好。KIT-LSTM可以更好地支持临床医生的及时决策。
{"title":"KIT-LSTM: Knowledge-guided Time-aware LSTM for Continuous Clinical Risk Prediction.","authors":"Lucas Jing Liu, Victor Ortiz-Soriano, Javier A Neyra, Jin Chen","doi":"10.1109/bibm55620.2022.9994931","DOIUrl":"10.1109/bibm55620.2022.9994931","url":null,"abstract":"<p><p>Rapid accumulation of temporal Electronic Health Record (EHR) data and recent advances in deep learning have shown high potential in precisely and timely predicting patients' risks using AI. However, most existing risk prediction approaches ignore the complex asynchronous and irregular problems in real-world EHR data. This paper proposes a novel approach called Knowledge-guIded Time-aware LSTM (KIT-LSTM) for continuous mortality predictions using EHR. KIT-LSTM extends LSTM with two time-aware gates and a knowledge-aware gate to better model EHR and interprets results. Experiments on real-world data for patients with acute kidney injury with dialysis (AKI-D) demonstrate that KIT-LSTM performs better than the state-of-the-art methods for predicting patients' risk trajectories and model interpretation. KIT-LSTM can better support timely decision-making for clinicians.</p>","PeriodicalId":74563,"journal":{"name":"Proceedings. IEEE International Conference on Bioinformatics and Biomedicine","volume":"2022 ","pages":"1086-1091"},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10151119/pdf/nihms-1893292.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9465727","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Effective Subject Representation based on Multi-omics Disease Networks using Graph Embedding. 基于图嵌入的多组学疾病网络的有效主题表示。
Pub Date : 2022-12-01 DOI: 10.1109/bibm55620.2022.9995707
Sundous Hussein, Thao Vu, Leslie Lange, Russell P Bowler, Katerina J Kechris, Farnoush Banaei-Kashani

The study of complex behavior of biological systems has become increasingly dependent on evolutionary network modeling. In particular, multi-omics networks capture interactions between biomolecules such as proteins and metabolites, providing a basis for predicting relationships between such biomolecules and various phenotypic traits of complex diseases. In this paper, we introduce an integrative framework that given a multi-omics network representing a cohort of subjects, learns expressive representations for network nodes, and combines the learned nodes representations with the biological profiles of individual subjects for enriched representation of the subjects. With extensive empirical evaluation using real-world multi-omics networks, we show that our proposed framework significantly outperforms existing and baseline methods in terms of subject representation accuracy, particularly when the multi-omics network representing the cohort is sparse and structured and therefore, more informative.

生物系统复杂行为的研究越来越依赖于进化网络模型。特别是,多组学网络捕获生物分子(如蛋白质和代谢物)之间的相互作用,为预测这些生物分子与复杂疾病的各种表型性状之间的关系提供了基础。在本文中,我们引入了一个集成框架,该框架给定一个代表一组受试者的多组学网络,学习网络节点的表达表示,并将学习到的节点表示与个体受试者的生物学概况相结合,以丰富受试者的表示。通过使用真实世界的多组学网络进行广泛的实证评估,我们表明,我们提出的框架在受试者表示准确性方面显着优于现有和基线方法,特别是当代表队列的多组学网络稀疏且结构化时,因此信息更丰富。
{"title":"Effective Subject Representation based on Multi-omics Disease Networks using Graph Embedding.","authors":"Sundous Hussein, Thao Vu, Leslie Lange, Russell P Bowler, Katerina J Kechris, Farnoush Banaei-Kashani","doi":"10.1109/bibm55620.2022.9995707","DOIUrl":"10.1109/bibm55620.2022.9995707","url":null,"abstract":"<p><p>The study of complex behavior of biological systems has become increasingly dependent on evolutionary network modeling. In particular, multi-omics networks capture interactions between biomolecules such as proteins and metabolites, providing a basis for predicting relationships between such biomolecules and various phenotypic traits of complex diseases. In this paper, we introduce an integrative framework that given a multi-omics network representing a cohort of subjects, learns expressive representations for network nodes, and combines the learned nodes representations with the biological profiles of individual subjects for enriched representation of the subjects. With extensive empirical evaluation using real-world multi-omics networks, we show that our proposed framework significantly outperforms existing and baseline methods in terms of subject representation accuracy, particularly when the multi-omics network representing the cohort is sparse and structured and therefore, more informative.</p>","PeriodicalId":74563,"journal":{"name":"Proceedings. IEEE International Conference on Bioinformatics and Biomedicine","volume":"2022 ","pages":"1911-1918"},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9916186/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9274804","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep ensemble learning over the microbial phylogenetic tree (DeepEn-Phy). 微生物系统发育树的深度集合学习(DeepEn-Phy)。
Pub Date : 2021-12-01 DOI: 10.1109/bibm52615.2021.9669654
Wodan Ling, Youran Qi, Xing Hua, Michael C Wu

Successful prediction of clinical outcomes facilitates tailored diagnosis and treatment. The microbiome has been shown to be an important biomarker to predict host clinical outcomes. Further, the incorporation of microbial phylogeny, the evolutionary relationship among microbes, has been demonstrated to improve prediction accuracy. We propose a phylogeny-driven deep neural network (PhyNN) and develop an ensemble method, DeepEn-Phy, for host clinical outcome prediction. The method is designed to optimally extract features from phylogeny, thereby take full advantage of the information in phylogeny while harnessing the core principles of phylogeny (in contrast to taxonomy). We apply DeepEn-Phy to a real large microbiome data set to predict both categorical and continuous clinical outcomes. DeepEn-Phy demonstrates superior prediction performance to existing machine learning and deep learning approaches. Overall, DeepEn-Phy provides a new strategy for designing deep neural network architectures within the context of phylogeny-constrained microbiome data.

成功预测临床结果有助于进行有针对性的诊断和治疗。微生物组已被证明是预测宿主临床结果的重要生物标志物。此外,纳入微生物系统发育(微生物之间的进化关系)已被证明可提高预测的准确性。我们提出了一种系统发育驱动的深度神经网络(PhyNN),并开发了一种用于宿主临床结果预测的集合方法 DeepEn-Phy。该方法旨在从系统发育中优化提取特征,从而充分利用系统发育中的信息,同时利用系统发育的核心原理(与分类学相反)。我们将 DeepEn-Phy 应用于一个真实的大型微生物组数据集,以预测分类和连续的临床结果。与现有的机器学习和深度学习方法相比,DeepEn-Phy 的预测性能更胜一筹。总之,DeepEn-Phy 为在系统发育受限的微生物组数据背景下设计深度神经网络架构提供了一种新策略。
{"title":"Deep ensemble learning over the microbial phylogenetic tree (DeepEn-Phy).","authors":"Wodan Ling, Youran Qi, Xing Hua, Michael C Wu","doi":"10.1109/bibm52615.2021.9669654","DOIUrl":"10.1109/bibm52615.2021.9669654","url":null,"abstract":"<p><p>Successful prediction of clinical outcomes facilitates tailored diagnosis and treatment. The microbiome has been shown to be an important biomarker to predict host clinical outcomes. Further, the incorporation of microbial phylogeny, the evolutionary relationship among microbes, has been demonstrated to improve prediction accuracy. We propose a phylogeny-driven deep neural network (PhyNN) and develop an ensemble method, DeepEn-Phy, for host clinical outcome prediction. The method is designed to optimally extract features from phylogeny, thereby take full advantage of the information in phylogeny while harnessing the core principles of phylogeny (in contrast to taxonomy). We apply DeepEn-Phy to a real large microbiome data set to predict both categorical and continuous clinical outcomes. DeepEn-Phy demonstrates superior prediction performance to existing machine learning and deep learning approaches. Overall, DeepEn-Phy provides a new strategy for designing deep neural network architectures within the context of phylogeny-constrained microbiome data.</p>","PeriodicalId":74563,"journal":{"name":"Proceedings. IEEE International Conference on Bioinformatics and Biomedicine","volume":"2021 ","pages":"470-477"},"PeriodicalIF":0.0,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9875567/pdf/nihms-1860461.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10686576","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Therapeutic Claims in Cannabidiol (CBD) Marketing Messages on Twitter. 推特上大麻二酚(CBD)营销信息的治疗声明。
Pub Date : 2021-12-01 Epub Date: 2022-01-14 DOI: 10.1109/bibm52615.2021.9669404
Mohammad Soleymanpour, Sofia Saderholm, Ramakanth Kavuluru

Although the U.S. FDA has only approved exactly one cannabidiol (CBD) drug product (specifically to treat seizures), CBD products are proliferating rapidly through different modes of usage including food products, cosmetics, vaping pods, and supplements (typically, oils). Despite the FDA clearly warning consumers about unproven health claims made by manufacturers selling CBD products over the counter, the CBD market share was nearly 3 billion USD in 2020 and is expected to top 55 billion USD in 2028. In this context, it is important to assess the presence of health claims being made on social media, especially claims that are part of marketing messages. To this end, we collected over two million English tweets discussing CBD themes. We created a hand-labeled dataset and built machine learned classifiers to identify marketing tweets from regular tweets that may be generated by consumers. The best classifier achieved 85% precision, 83% recall, and 84% F-score. Our analyses showed that pain, anxiety disorders, sleep disorders, and stress are the four main therapeutic claims made constituting 31.67%, 27.11%, 13.77%, and 10.37% of all medical claims made on Twitter, respectively. Also, more than 93% of advertised CBD products are edibles or oil/tinctures. Our effort is the first to demonstrate the feasibility of surveillance of marketing claims for CBD products. We believe this could pave way for more explorations into this indispensable task in the current landscape of social media driven health (mis)information and communication.

尽管美国食品和药物管理局只批准了一种大麻二酚(CBD)药物产品(专门用于治疗癫痫发作),但CBD产品正在通过不同的使用方式迅速增加,包括食品、化妆品、电子烟和补充剂(通常是油)。尽管FDA明确警告消费者,非处方销售CBD产品的制造商所做的未经证实的健康声明,但CBD的市场份额在2020年接近30亿美元,预计到2028年将超过550亿美元。在这种情况下,重要的是评估社交媒体上健康声明的存在,特别是作为营销信息一部分的声明。为此,我们收集了超过200万条讨论CBD主题的英文推文。我们创建了一个手工标记的数据集,并构建了机器学习分类器,从可能由消费者生成的常规推文中识别营销推文。最好的分类器达到85%的准确率,83%的召回率和84%的f分。我们的分析显示,疼痛、焦虑症、睡眠障碍和压力是四种主要的治疗声明,分别占Twitter上所有医疗声明的31.67%、27.11%、13.77%和10.37%。此外,超过93%的广告CBD产品是可食用的或油/酊剂。我们的努力首次证明了对CBD产品的营销声明进行监管的可行性。我们相信,在当前社交媒体驱动的健康(mis)信息和沟通环境中,这可以为更多探索这一不可或缺的任务铺平道路。
{"title":"Therapeutic Claims in Cannabidiol (CBD) Marketing Messages on Twitter.","authors":"Mohammad Soleymanpour,&nbsp;Sofia Saderholm,&nbsp;Ramakanth Kavuluru","doi":"10.1109/bibm52615.2021.9669404","DOIUrl":"https://doi.org/10.1109/bibm52615.2021.9669404","url":null,"abstract":"<p><p>Although the U.S. FDA has only approved exactly one cannabidiol (CBD) drug product (specifically to treat seizures), CBD products are proliferating rapidly through different modes of usage including food products, cosmetics, vaping pods, and supplements (typically, oils). Despite the FDA clearly warning consumers about unproven health claims made by manufacturers selling CBD products over the counter, the CBD market share was nearly 3 billion USD in 2020 and is expected to top 55 billion USD in 2028. In this context, it is important to assess the presence of health claims being made on social media, especially claims that are part of marketing messages. To this end, we collected over two million English tweets discussing CBD themes. We created a hand-labeled dataset and built machine learned classifiers to identify marketing tweets from regular tweets that may be generated by consumers. The best classifier achieved 85% precision, 83% recall, and 84% F-score. Our analyses showed that <i>pain</i>, <i>anxiety disorders</i>, <i>sleep disorders</i>, and <i>stress</i> are the four main therapeutic claims made constituting 31.67%, 27.11%, 13.77%, and 10.37% of all medical claims made on Twitter, respectively. Also, more than 93% of advertised CBD products are <i>edibles</i> or <i>oil/tinctures</i>. Our effort is the first to demonstrate the feasibility of surveillance of marketing claims for CBD products. We believe this could pave way for more explorations into this indispensable task in the current landscape of social media driven health (mis)information and communication.</p>","PeriodicalId":74563,"journal":{"name":"Proceedings. IEEE International Conference on Bioinformatics and Biomedicine","volume":" ","pages":"3083-3088"},"PeriodicalIF":0.0,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8794048/pdf/nihms-1772253.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39734700","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
Interpretable temporal graph neural network for prognostic prediction of Alzheimer's disease using longitudinal neuroimaging data. 利用纵向神经影像数据对阿尔茨海默病预后预测的可解释时间图神经网络。
Pub Date : 2021-12-01 DOI: 10.1109/bibm52615.2021.9669504
Mansu Kim, Jaesik Kim, Jeffrey Qu, Heng Huang, Qi Long, Kyung-Ah Sohn, Dokyoon Kim, Li Shen

Alzheimer's disease (AD) is a progressive neurodegenerative brain disorder characterized by memory loss and cognitive decline. Early detection and accurate prognosis of AD is an important research topic, and numerous machine learning methods have been proposed to solve this problem. However, traditional machine learning models are facing challenges in effectively integrating longitudinal neuroimaging data and biologically meaningful structure and knowledge to build accurate and interpretable prognostic predictors. To bridge this gap, we propose an interpretable graph neural network (GNN) model for AD prognostic prediction based on longitudinal neuroimaging data while embracing the valuable knowledge of structural brain connectivity. In our empirical study, we demonstrate that 1) the proposed model outperforms several competing models (i.e., DNN, SVM) in terms of prognostic prediction accuracy, and 2) our model can capture neuroanatomical contribution to the prognostic predictor and yield biologically meaningful interpretation to facilitate better mechanistic understanding of the Alzheimer's disease. Source code is available at https://github.com/JaesikKim/temporal-GNN.

阿尔茨海默病(AD)是一种以记忆力减退和认知能力下降为特征的进行性神经退行性脑部疾病。阿尔茨海默病的早期检测和准确预后是一个重要的研究课题,人们提出了许多机器学习方法来解决这一问题。然而,传统的机器学习模型在有效整合纵向神经影像数据和具有生物学意义的结构和知识以建立准确且可解释的预后预测指标方面面临挑战。为了弥补这一差距,我们提出了一种可解释的图神经网络(GNN)模型,该模型基于纵向神经影像数据,同时包含了宝贵的大脑连接结构知识,用于艾滋病预后预测。在实证研究中,我们证明:1)所提出的模型在预后预测准确性方面优于几种竞争模型(如 DNN、SVM);2)我们的模型可以捕捉神经解剖学对预后预测的贡献,并产生具有生物学意义的解释,从而促进对阿尔茨海默病机理的更好理解。源代码见 https://github.com/JaesikKim/temporal-GNN。
{"title":"Interpretable temporal graph neural network for prognostic prediction of Alzheimer's disease using longitudinal neuroimaging data.","authors":"Mansu Kim, Jaesik Kim, Jeffrey Qu, Heng Huang, Qi Long, Kyung-Ah Sohn, Dokyoon Kim, Li Shen","doi":"10.1109/bibm52615.2021.9669504","DOIUrl":"10.1109/bibm52615.2021.9669504","url":null,"abstract":"<p><p>Alzheimer's disease (AD) is a progressive neurodegenerative brain disorder characterized by memory loss and cognitive decline. Early detection and accurate prognosis of AD is an important research topic, and numerous machine learning methods have been proposed to solve this problem. However, traditional machine learning models are facing challenges in effectively integrating longitudinal neuroimaging data and biologically meaningful structure and knowledge to build accurate and interpretable prognostic predictors. To bridge this gap, we propose an interpretable graph neural network (GNN) model for AD prognostic prediction based on longitudinal neuroimaging data while embracing the valuable knowledge of structural brain connectivity. In our empirical study, we demonstrate that 1) the proposed model outperforms several competing models (i.e., DNN, SVM) in terms of prognostic prediction accuracy, and 2) our model can capture neuroanatomical contribution to the prognostic predictor and yield biologically meaningful interpretation to facilitate better mechanistic understanding of the Alzheimer's disease. Source code is available at https://github.com/JaesikKim/temporal-GNN.</p>","PeriodicalId":74563,"journal":{"name":"Proceedings. IEEE International Conference on Bioinformatics and Biomedicine","volume":"2021 ","pages":"1381-1384"},"PeriodicalIF":0.0,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8922159/pdf/nihms-1784457.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10617208","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Proceedings. IEEE International Conference on Bioinformatics and Biomedicine
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1