首页 > 最新文献

2016 IEEE 6th International Conference on Computational Advances in Bio and Medical Sciences (ICCABS)最新文献

英文 中文
Automatic analysis of neonatal video data to evaluate resuscitation prformance 新生儿视频数据自动分析评估复苏表现
Yue Guo, Johan Wrammert, Kavita Singh, A. Kc, K. Bradford, Ashok K. Krishnamurthy
Approximately 3% of births require neonatal resuscitation, which has a direct impact on the immediate survival of these infants. This report proposes an automatic video analysis method for neonatal resuscitation performance evaluation, which helps improve the quality of this procedure. More specifically, we design a deep learning based action model which incorporates motion and spatial information in order to classify neonatal resuscitation actions in videos. First, we use a Convolutional Neural Network to select regions containing infants and only keep those that are motion salient. Second, we extract deep spatial-temporal features to train a linear SVM classifier. Finally, we propose a pair-wise model to ensure consistent classification in consecutive frames. We evaluate the proposed method on a dataset consisting of 17 videos and compare the result against the state-of-the-art method for action classification in videos. To our best knowledge, this work is the first to attempt automatic evaluation of neonatal resuscitation videos and identifies several issues that require further work.
大约3%的新生儿需要新生儿复苏,这直接影响到这些婴儿的即时生存。本文提出了一种用于新生儿复苏性能评价的自动视频分析方法,有助于提高新生儿复苏质量。更具体地说,我们设计了一个基于深度学习的动作模型,该模型结合了运动和空间信息,以便对视频中的新生儿复苏动作进行分类。首先,我们使用卷积神经网络选择包含婴儿的区域,并只保留那些运动显著的区域。其次,提取深度时空特征,训练线性支持向量机分类器。最后,我们提出了一个成对模型,以确保连续帧的一致性分类。我们在包含17个视频的数据集上评估了所提出的方法,并将结果与视频中最先进的动作分类方法进行了比较。据我们所知,这项工作是第一次尝试对新生儿复苏视频进行自动评估,并确定了需要进一步工作的几个问题。
{"title":"Automatic analysis of neonatal video data to evaluate resuscitation prformance","authors":"Yue Guo, Johan Wrammert, Kavita Singh, A. Kc, K. Bradford, Ashok K. Krishnamurthy","doi":"10.1109/ICCABS.2016.7802775","DOIUrl":"https://doi.org/10.1109/ICCABS.2016.7802775","url":null,"abstract":"Approximately 3% of births require neonatal resuscitation, which has a direct impact on the immediate survival of these infants. This report proposes an automatic video analysis method for neonatal resuscitation performance evaluation, which helps improve the quality of this procedure. More specifically, we design a deep learning based action model which incorporates motion and spatial information in order to classify neonatal resuscitation actions in videos. First, we use a Convolutional Neural Network to select regions containing infants and only keep those that are motion salient. Second, we extract deep spatial-temporal features to train a linear SVM classifier. Finally, we propose a pair-wise model to ensure consistent classification in consecutive frames. We evaluate the proposed method on a dataset consisting of 17 videos and compare the result against the state-of-the-art method for action classification in videos. To our best knowledge, this work is the first to attempt automatic evaluation of neonatal resuscitation videos and identifies several issues that require further work.","PeriodicalId":306466,"journal":{"name":"2016 IEEE 6th International Conference on Computational Advances in Bio and Medical Sciences (ICCABS)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2016-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131628862","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Assessing how multiple mutations affect protein stability using rigid cluster size distributions 使用刚性簇大小分布评估多重突变如何影响蛋白质稳定性
E. Andersson, Rebecca Hsieh, Howard Szeto, R. Farhoodi, Nurit Haspel, F. Jagodzinski
Predicting how amino acid substitutions affect the stability of a protein has relevance to drug design and may help elucidate the mechanisms of disease-causing protein variants. Unfortunately, wet-lab experiments are time intensive, and to the best of our knowledge there are no efficient computational techniques to asses the effect of multiple mutations. In this work we present a new approach for inferring the effects of single and multiple mutations on a protein's structure. Our rMutant algorithm generates in silico mutants with single or multiple amino acid substitutions. We use a graph-theoretic rigidity analysis approach to compute the distributions of rigid cluster sizes of the wild type and mutant structures which we then analyze to infer the effect of the amino acid substitutions. We successfully predict the effects of multiple mutations for which our previous methods were unsuccessful. We validate the predictions of our computational approach against experimental ΔΔG data. To demonstrate the utility of using rigid cluster size distributions to infer the effects of mutations, we also present a Random Forest Machine Learning approach that relies on rigidity data to predict which residues are critical to the stability of a protein. We predict the destabilizing effects of a single or multiple mutations with over 86% accuracy.
预测氨基酸取代如何影响蛋白质的稳定性与药物设计相关,并可能有助于阐明致病蛋白质变异的机制。不幸的是,湿实验室实验非常耗时,而且据我们所知,目前还没有有效的计算技术来评估多重突变的影响。在这项工作中,我们提出了一种新的方法来推断单个和多个突变对蛋白质结构的影响。我们的突变体算法生成具有单个或多个氨基酸取代的硅突变体。我们使用图理论的刚性分析方法来计算野生型和突变结构的刚性簇大小的分布,然后我们分析来推断氨基酸取代的影响。我们成功地预测了多种突变的影响,而我们以前的方法是不成功的。我们根据实验ΔΔG数据验证了我们的计算方法的预测。为了证明使用刚性簇大小分布来推断突变影响的效用,我们还提出了一种随机森林机器学习方法,该方法依赖于刚性数据来预测哪些残基对蛋白质的稳定性至关重要。我们预测单个或多个突变的不稳定效应的准确率超过86%。
{"title":"Assessing how multiple mutations affect protein stability using rigid cluster size distributions","authors":"E. Andersson, Rebecca Hsieh, Howard Szeto, R. Farhoodi, Nurit Haspel, F. Jagodzinski","doi":"10.1109/ICCABS.2016.7802777","DOIUrl":"https://doi.org/10.1109/ICCABS.2016.7802777","url":null,"abstract":"Predicting how amino acid substitutions affect the stability of a protein has relevance to drug design and may help elucidate the mechanisms of disease-causing protein variants. Unfortunately, wet-lab experiments are time intensive, and to the best of our knowledge there are no efficient computational techniques to asses the effect of multiple mutations. In this work we present a new approach for inferring the effects of single and multiple mutations on a protein's structure. Our rMutant algorithm generates in silico mutants with single or multiple amino acid substitutions. We use a graph-theoretic rigidity analysis approach to compute the distributions of rigid cluster sizes of the wild type and mutant structures which we then analyze to infer the effect of the amino acid substitutions. We successfully predict the effects of multiple mutations for which our previous methods were unsuccessful. We validate the predictions of our computational approach against experimental ΔΔG data. To demonstrate the utility of using rigid cluster size distributions to infer the effects of mutations, we also present a Random Forest Machine Learning approach that relies on rigidity data to predict which residues are critical to the stability of a protein. We predict the destabilizing effects of a single or multiple mutations with over 86% accuracy.","PeriodicalId":306466,"journal":{"name":"2016 IEEE 6th International Conference on Computational Advances in Bio and Medical Sciences (ICCABS)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2016-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131744072","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
JULiP: An efficient model for accurate intron selection from multiple RNA-seq samples JULiP:从多个RNA-seq样本中精确选择内含子的有效模型
Guangyu Yang, L. Florea
Accurate alternative splicing detection and transcript reconstruction are essential to characterize gene regulation and function and to understand development and disease. However, current methods for extracting splicing variation from RNA-seq data only analyze signals from a single sample, which limits transcript reconstruction and fails to detect a complete set of alternative splicing events. We developed a novel feature selection method, JULiP, that analyzes information across multiple samples to identify alternative splicing variation in the form of splice junctions (introns). It formulates the selection problem as a regularized program, utilizing the latent information from multiple RNA-seq samples to construct an accurate and comprehensive intron set. JULiP is highly accurate, and could detect thousands more introns in any one sample, >30% more than the most sensitive single-sample method, and 10% more introns than in the cumulative set of samples, at higher or comparable precision (>98%). Tested assemblers included Cufflinks, CLASS2, StringTie and FlipFlop, and the multi-sample assembler ISP. JULiP is multi-threaded and parallelized, taking only one minute to analyze up to 100 data sets on a multi-computer cluster, and can easily scale up to allow analyses of hundreds and thousands of RNA-seq samples.
准确的选择性剪接检测和转录本重建对于表征基因调控和功能以及了解发育和疾病至关重要。然而,目前从RNA-seq数据中提取剪接变化的方法仅分析来自单个样本的信号,这限制了转录物的重建,并且无法检测到一整套可选剪接事件。我们开发了一种新的特征选择方法,JULiP,分析多个样本的信息,以确定剪接连接(内含子)形式的可选剪接变化。它将选择问题表述为一个正则化程序,利用来自多个RNA-seq样本的潜在信息构建一个准确而全面的内含子集。JULiP非常准确,在任何一个样品中都可以检测到数千个内含子,比最灵敏的单样品方法多30%,比累积样品多10%,精度更高或相当(98%)。测试的汇编程序包括袖扣,CLASS2, StringTie和FlipFlop,以及多样本汇编程序ISP。JULiP是多线程和并行的,只需要一分钟就可以在多台计算机集群上分析多达100个数据集,并且可以轻松扩展以允许分析数百和数千个RNA-seq样本。
{"title":"JULiP: An efficient model for accurate intron selection from multiple RNA-seq samples","authors":"Guangyu Yang, L. Florea","doi":"10.1109/ICCABS.2016.7802790","DOIUrl":"https://doi.org/10.1109/ICCABS.2016.7802790","url":null,"abstract":"Accurate alternative splicing detection and transcript reconstruction are essential to characterize gene regulation and function and to understand development and disease. However, current methods for extracting splicing variation from RNA-seq data only analyze signals from a single sample, which limits transcript reconstruction and fails to detect a complete set of alternative splicing events. We developed a novel feature selection method, JULiP, that analyzes information across multiple samples to identify alternative splicing variation in the form of splice junctions (introns). It formulates the selection problem as a regularized program, utilizing the latent information from multiple RNA-seq samples to construct an accurate and comprehensive intron set. JULiP is highly accurate, and could detect thousands more introns in any one sample, >30% more than the most sensitive single-sample method, and 10% more introns than in the cumulative set of samples, at higher or comparable precision (>98%). Tested assemblers included Cufflinks, CLASS2, StringTie and FlipFlop, and the multi-sample assembler ISP. JULiP is multi-threaded and parallelized, taking only one minute to analyze up to 100 data sets on a multi-computer cluster, and can easily scale up to allow analyses of hundreds and thousands of RNA-seq samples.","PeriodicalId":306466,"journal":{"name":"2016 IEEE 6th International Conference on Computational Advances in Bio and Medical Sciences (ICCABS)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2016-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126198990","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Identifying hotspots in five year survival electronic health records of older adults 识别老年人5年生存电子健康记录中的热点
Ankit Agrawal, J. S. Mathias, D. Baker, A. Choudhary
Understanding the prognosis of older adults is a big challenge in healthcare research, especially since very little is known about how different comorbidities interact and influence the prognosis. Recently, a electronic healthcare records dataset of 24 patient attributes from Northwestern Memorial Hospital was used to develop predictive models for five year survival outcome. In this study we analyze the same data for discovering hotspots with respect to five year survival using association rule mining techniques. The goal here is to identify characteristics of patient segments where the five year survival fraction is significantly lower/higher than the survival fraction across the entire dataset. A two-stage post-processing procedure was used to identify non-redundant rules. The resulting rules conform with existing biomedical knowledge and provide interesting insights into prognosis of older adults. Incorporating such information into clinical decision making could advance person-centered healthcare by encouraging optimal use of healthcare services to those patients most likely to benefit.
了解老年人的预后在医疗保健研究中是一个很大的挑战,特别是因为我们对不同的合并症如何相互作用和影响预后知之甚少。最近,来自西北纪念医院的24个患者属性的电子医疗记录数据集被用于开发五年生存结果的预测模型。在本研究中,我们使用关联规则挖掘技术分析相同的数据,以发现有关五年生存的热点。这里的目标是确定五年生存率明显低于/高于整个数据集生存率的患者部分的特征。采用两阶段后处理程序识别非冗余规则。结果与现有的生物医学知识一致,并为老年人的预后提供了有趣的见解。将这些信息纳入临床决策可以通过鼓励那些最有可能受益的患者最佳地使用医疗保健服务来推进以人为本的医疗保健。
{"title":"Identifying hotspots in five year survival electronic health records of older adults","authors":"Ankit Agrawal, J. S. Mathias, D. Baker, A. Choudhary","doi":"10.1109/ICCABS.2016.7802773","DOIUrl":"https://doi.org/10.1109/ICCABS.2016.7802773","url":null,"abstract":"Understanding the prognosis of older adults is a big challenge in healthcare research, especially since very little is known about how different comorbidities interact and influence the prognosis. Recently, a electronic healthcare records dataset of 24 patient attributes from Northwestern Memorial Hospital was used to develop predictive models for five year survival outcome. In this study we analyze the same data for discovering hotspots with respect to five year survival using association rule mining techniques. The goal here is to identify characteristics of patient segments where the five year survival fraction is significantly lower/higher than the survival fraction across the entire dataset. A two-stage post-processing procedure was used to identify non-redundant rules. The resulting rules conform with existing biomedical knowledge and provide interesting insights into prognosis of older adults. Incorporating such information into clinical decision making could advance person-centered healthcare by encouraging optimal use of healthcare services to those patients most likely to benefit.","PeriodicalId":306466,"journal":{"name":"2016 IEEE 6th International Conference on Computational Advances in Bio and Medical Sciences (ICCABS)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2016-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124925662","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
HRVCam: A software for real-time feedback of heart rate and HRV HRVCam:一个实时反馈心率和HRV的软件
P. H. Souza, J. O. Ferreira, T. M. D. A. Barbosa, A. Rocha
The heart rate (HR) and its variability, known as Heart Rate Variability (HRV), are indispensable measurements for cardiorespiratory monitoring, recognition and quantification of emotions, detection of abnormalities, and heart disease control. In general, the acquisition systems for HR and HRV require a contact area for sensor's installation and positioning, creating restrictions and/or obstructions on user's movements. This paper proposes a noninvasive and noncontact technique for HR and HRV acquisition using a camera. The purposed technique consists in the automatic detection of the user's face and utilization of an Independent Component Analysis (ICA) algorithm to separate the necessary signals to determine the HR and HRV. The experiments have shown more than 95% of similarity between the results of the proposed software (HRVCam) in comparison to the results of the photoplethysmography sensor (PPG).
心率(HR)及其变异性,称为心率变异性(HRV),是心肺监测、情绪识别和量化、异常检测和心脏病控制不可或缺的测量方法。一般来说,HR和HRV的采集系统需要一个接触区域来安装和定位传感器,这对用户的移动产生了限制和/或障碍。本文提出了一种无创、非接触的相机HR和HRV采集技术。目的技术包括对用户面部的自动检测,并利用独立分量分析(ICA)算法分离必要的信号以确定HR和HRV。实验表明,与光电容积脉搏波传感器(PPG)的结果相比,所提出的软件(HRVCam)的结果相似性超过95%。
{"title":"HRVCam: A software for real-time feedback of heart rate and HRV","authors":"P. H. Souza, J. O. Ferreira, T. M. D. A. Barbosa, A. Rocha","doi":"10.1109/ICCABS.2016.7802767","DOIUrl":"https://doi.org/10.1109/ICCABS.2016.7802767","url":null,"abstract":"The heart rate (HR) and its variability, known as Heart Rate Variability (HRV), are indispensable measurements for cardiorespiratory monitoring, recognition and quantification of emotions, detection of abnormalities, and heart disease control. In general, the acquisition systems for HR and HRV require a contact area for sensor's installation and positioning, creating restrictions and/or obstructions on user's movements. This paper proposes a noninvasive and noncontact technique for HR and HRV acquisition using a camera. The purposed technique consists in the automatic detection of the user's face and utilization of an Independent Component Analysis (ICA) algorithm to separate the necessary signals to determine the HR and HRV. The experiments have shown more than 95% of similarity between the results of the proposed software (HRVCam) in comparison to the results of the photoplethysmography sensor (PPG).","PeriodicalId":306466,"journal":{"name":"2016 IEEE 6th International Conference on Computational Advances in Bio and Medical Sciences (ICCABS)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2016-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124958823","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
NanoBLASTer: Fast alignment and characterization of Oxford Nanopore single molecule sequencing reads NanoBLASTer:牛津纳米孔单分子测序reads的快速定位和表征
M. R. Amin, S. Skiena, M. Schatz
The quality of the Oxford Nanopores long DNA sequence reads has been, to date, lower than other technologies, causing great interest to develop new algorithms that can make use of the data. So far, alignment methods including LAST, BLAST, BWA-MEM and GraphMap have been used to analyze these sequences. However, each of these tools has significant challenges to use with these data: LAST and BLAST require considerable processing time for high sensitivity, BWA-MEM has the smallest average alignment length, and GraphMap aligns many random strings with moderate accuracy. To address these challenges we developed a new read aligner called NanoBLASTer specifically designed for long nanopore reads. In experiments resequencing the well-studied S. cerevisiae (yeast) and Escherichia coli (E. coli) genomes, we show that our algorithm produces longer alignments with higher overall sensitivity than LAST, BLAST and BWA-MEM. We also show that the runtime of NanoBLASTer is faster than GraphMap, BLAST and BWA-MEM.
迄今为止,牛津纳米孔长DNA序列读取的质量一直低于其他技术,这引起了人们对开发可以利用这些数据的新算法的极大兴趣。目前已使用LAST、BLAST、BWA-MEM和GraphMap等比对方法对这些序列进行分析。然而,这些工具在处理这些数据时都面临着重大挑战:LAST和BLAST需要相当长的处理时间才能获得高灵敏度,BWA-MEM具有最小的平均对齐长度,而GraphMap以中等精度对齐许多随机字符串。为了解决这些挑战,我们开发了一种名为NanoBLASTer的新型读取校准器,专门用于长纳米孔读取。在对酿酒酵母和大肠杆菌基因组进行重测序的实验中,我们发现我们的算法比LAST、BLAST和BWA-MEM产生更长的序列,总体灵敏度更高。我们还表明,NanoBLASTer的运行速度比GraphMap、BLAST和BWA-MEM快。
{"title":"NanoBLASTer: Fast alignment and characterization of Oxford Nanopore single molecule sequencing reads","authors":"M. R. Amin, S. Skiena, M. Schatz","doi":"10.1109/ICCABS.2016.7802776","DOIUrl":"https://doi.org/10.1109/ICCABS.2016.7802776","url":null,"abstract":"The quality of the Oxford Nanopores long DNA sequence reads has been, to date, lower than other technologies, causing great interest to develop new algorithms that can make use of the data. So far, alignment methods including LAST, BLAST, BWA-MEM and GraphMap have been used to analyze these sequences. However, each of these tools has significant challenges to use with these data: LAST and BLAST require considerable processing time for high sensitivity, BWA-MEM has the smallest average alignment length, and GraphMap aligns many random strings with moderate accuracy. To address these challenges we developed a new read aligner called NanoBLASTer specifically designed for long nanopore reads. In experiments resequencing the well-studied S. cerevisiae (yeast) and Escherichia coli (E. coli) genomes, we show that our algorithm produces longer alignments with higher overall sensitivity than LAST, BLAST and BWA-MEM. We also show that the runtime of NanoBLASTer is faster than GraphMap, BLAST and BWA-MEM.","PeriodicalId":306466,"journal":{"name":"2016 IEEE 6th International Conference on Computational Advances in Bio and Medical Sciences (ICCABS)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2016-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123139483","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
An accurate and customizable text classification algorithm: Two applications in healthcare 准确和可定制的文本分类算法:医疗保健中的两个应用
Mohammed D. Aldhoayan, Leming Zhou
Text classification is an important step in many data analysis procedures. The demand on text classification algorithm is booming due to the increase of the amount of digital data, especially in the healthcare field. A customizable and accurate algorithm is expected to produce positive impact on the efficiency of many data analysis procedures. In this work, we proposed a novel algorithm for accurately classifying data entries in huge text files into several pre-determined categories. We built the algorithm with multiple rules according to text similarity, frequency, and weight. For different classification tasks, the algorithm can be conveniently adjusted to process the corresponding data sets. Data sets related to healthcare cost analysis (hospital discharge summary) and medical classification systems (ICD-9) are used to evaluate the algorithm. When the algorithm is used on the ICD-9 data, the overall accuracy of the algorithm was 100%. After the algorithm was used on 7480 healthcare cost entries, the results were then compared with the ones processed manually by a physician, and the accuracy was between 86%–91.6%, and the difference is from different classification of ambiguous entries, which is hard to determine the correct category even when it is done manually because those entries were documented improperly. This new classification algorithm is 3 to 5 times faster than the manual process on the same data set. Therefore, this customizable and accurate text classification algorithm is effective in saving time compared to the manual classification methods.
文本分类是许多数据分析过程中的一个重要步骤。随着数字数据量的不断增加,特别是在医疗保健领域,对文本分类算法的需求日益增长。一种可定制且精确的算法有望对许多数据分析程序的效率产生积极影响。在这项工作中,我们提出了一种新的算法来准确地将大量文本文件中的数据条目分类为几个预先确定的类别。我们根据文本相似度、频率和权重构建了多个规则的算法。对于不同的分类任务,可以方便地调整算法来处理相应的数据集。使用与医疗成本分析(出院摘要)和医疗分类系统(ICD-9)相关的数据集来评估该算法。将该算法应用于ICD-9数据时,该算法的总体准确率为100%。将该算法应用于7480个医疗成本条目后,将结果与医生人工处理的结果进行比较,准确率在86%-91.6%之间,差异来自于对歧义条目的不同分类,由于这些条目的记录不正确,即使人工进行分类也难以确定正确的类别。这种新的分类算法在相同的数据集上比人工处理快3到5倍。因此,与人工分类方法相比,这种可定制且准确的文本分类算法有效地节省了时间。
{"title":"An accurate and customizable text classification algorithm: Two applications in healthcare","authors":"Mohammed D. Aldhoayan, Leming Zhou","doi":"10.1109/ICCABS.2016.7802778","DOIUrl":"https://doi.org/10.1109/ICCABS.2016.7802778","url":null,"abstract":"Text classification is an important step in many data analysis procedures. The demand on text classification algorithm is booming due to the increase of the amount of digital data, especially in the healthcare field. A customizable and accurate algorithm is expected to produce positive impact on the efficiency of many data analysis procedures. In this work, we proposed a novel algorithm for accurately classifying data entries in huge text files into several pre-determined categories. We built the algorithm with multiple rules according to text similarity, frequency, and weight. For different classification tasks, the algorithm can be conveniently adjusted to process the corresponding data sets. Data sets related to healthcare cost analysis (hospital discharge summary) and medical classification systems (ICD-9) are used to evaluate the algorithm. When the algorithm is used on the ICD-9 data, the overall accuracy of the algorithm was 100%. After the algorithm was used on 7480 healthcare cost entries, the results were then compared with the ones processed manually by a physician, and the accuracy was between 86%–91.6%, and the difference is from different classification of ambiguous entries, which is hard to determine the correct category even when it is done manually because those entries were documented improperly. This new classification algorithm is 3 to 5 times faster than the manual process on the same data set. Therefore, this customizable and accurate text classification algorithm is effective in saving time compared to the manual classification methods.","PeriodicalId":306466,"journal":{"name":"2016 IEEE 6th International Conference on Computational Advances in Bio and Medical Sciences (ICCABS)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2016-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132498048","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Curvelet-based texture classification of critical Gleason patterns of prostate histological images 基于曲线的前列腺组织学图像关键Gleason模式纹理分类
Wen-Chyi Lin, Ching-Chung Li, J. Epstein, R. Veltri
This paper presents our new result of a study on machine-aided classification of four critical Gleason patterns with curvelet-based texture descriptors extracted from prostatic histological section images. The reliable recognition of these patterns between Gleason score 6 and Gleason score 8 is of crucial importance that will affect the appropriate treatment and patient's quality of life. Higher-order statistical moments of fine scale curvelet coefficients are selected as discriminative features. A two-level classifier consisting of two Gaussian kernel support vector machines, each incorporated with a pertinent voting mechanism by multiple windowed patches in an image for final decision making, has been developed. A set of Tissue MicroArray (TMA) images of four prominent Gleason scores (GS) 3 + 3, 3 + 4, 4 + 3 and 4 + 4 has been studied in machine learning and testing. The testing result has achieved an average accuracy of 93.75% for 4 classes, an outstanding performance when compared with other published works.
本文介绍了我们利用从前列腺组织学切片图像中提取的基于曲线的纹理描述符对四种关键Gleason模式进行机器辅助分类的新结果。对Gleason评分6分和Gleason评分8分之间的这些模式的可靠识别至关重要,这将影响到适当的治疗和患者的生活质量。选取细尺度曲线系数的高阶统计矩作为判别特征。提出了一种由两个高斯核支持向量机组成的两级分类器,每个高斯核支持向量机通过图像中的多个窗口补丁与相关投票机制相结合,以进行最终决策。在机器学习和测试中研究了四组突出的Gleason评分(GS) 3 + 3,3 + 4,4 + 3和4 + 4的组织微阵列(TMA)图像。测试结果表明,4个类别的平均准确率达到了93.75%,与其他已发表的作品相比表现优异。
{"title":"Curvelet-based texture classification of critical Gleason patterns of prostate histological images","authors":"Wen-Chyi Lin, Ching-Chung Li, J. Epstein, R. Veltri","doi":"10.1109/ICCABS.2016.7802768","DOIUrl":"https://doi.org/10.1109/ICCABS.2016.7802768","url":null,"abstract":"This paper presents our new result of a study on machine-aided classification of four critical Gleason patterns with curvelet-based texture descriptors extracted from prostatic histological section images. The reliable recognition of these patterns between Gleason score 6 and Gleason score 8 is of crucial importance that will affect the appropriate treatment and patient's quality of life. Higher-order statistical moments of fine scale curvelet coefficients are selected as discriminative features. A two-level classifier consisting of two Gaussian kernel support vector machines, each incorporated with a pertinent voting mechanism by multiple windowed patches in an image for final decision making, has been developed. A set of Tissue MicroArray (TMA) images of four prominent Gleason scores (GS) 3 + 3, 3 + 4, 4 + 3 and 4 + 4 has been studied in machine learning and testing. The testing result has achieved an average accuracy of 93.75% for 4 classes, an outstanding performance when compared with other published works.","PeriodicalId":306466,"journal":{"name":"2016 IEEE 6th International Conference on Computational Advances in Bio and Medical Sciences (ICCABS)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2016-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114062993","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
A deep learning-based segmentation method for brain tumor in MR images 基于深度学习的MR图像脑肿瘤分割方法
Zhe Xiao, Ruohan Huang, Yi Ding, Tian Lan, Rongfen Dong, Zhiguang Qin, Xinjie Zhang, Wei Wang
Accurate tumor segmentation is an essential and crucial step for computer-aided brain tumor diagnosis and surgical planning. Subjective segmentations are widely adopted in clinical diagnosis and treating, but they are neither accurate nor reliable. An automatical and objective system for brain tumor segmentation is strongly expected. But they are still facing some challenges such as lower segmentation accuracy, demanding a priori knowledge or requiring the human intervention. In this paper, a novel and new coarse-to-fine method is proposed to segment the brain tumor. This hierarchical framework consists of preprocessing, deep learning network based classification and post-processing. The preprocessing is used to extract image patches for each MR image and obtains the gray level sequences of image patches as the input of the deep learning network. The deep learning network based classification is implemented by a stacked auto-encoder network to extract the high level abstract feature from the input, and utilizes the extracted feature to classify image patches. After mapping the classification result to a binary image, the post-processing is implemented by a morphological filter to get the final segmentation result. In order to evaluate the proposed method, the experiment was applied to segment the brain tumor for the real patient dataset. The final performance shows that the proposed brain tumor segmentation method is more accurate and efficient.
准确的肿瘤分割是计算机辅助脑肿瘤诊断和手术计划的关键步骤。主观分割在临床诊断和治疗中被广泛采用,但它既不准确也不可靠。一个自动、客观的脑肿瘤分割系统是人们热切期待的。但它们仍然面临着分割精度较低、需要先验知识或需要人工干预等挑战。本文提出了一种新的从粗到精的脑肿瘤分割方法。该框架由预处理、基于深度学习网络的分类和后处理三部分组成。预处理对每张MR图像提取图像patch,得到图像patch的灰度序列作为深度学习网络的输入。基于深度学习网络的分类是通过堆叠自编码器网络从输入中提取高级抽象特征,并利用提取的特征对图像patch进行分类。将分类结果映射到二值图像后,通过形态学滤波进行后处理,得到最终的分割结果。为了验证所提出的方法,将该实验应用于真实患者数据集的脑肿瘤分割。最后的性能表明,所提出的脑肿瘤分割方法更加准确和高效。
{"title":"A deep learning-based segmentation method for brain tumor in MR images","authors":"Zhe Xiao, Ruohan Huang, Yi Ding, Tian Lan, Rongfen Dong, Zhiguang Qin, Xinjie Zhang, Wei Wang","doi":"10.1109/ICCABS.2016.7802771","DOIUrl":"https://doi.org/10.1109/ICCABS.2016.7802771","url":null,"abstract":"Accurate tumor segmentation is an essential and crucial step for computer-aided brain tumor diagnosis and surgical planning. Subjective segmentations are widely adopted in clinical diagnosis and treating, but they are neither accurate nor reliable. An automatical and objective system for brain tumor segmentation is strongly expected. But they are still facing some challenges such as lower segmentation accuracy, demanding a priori knowledge or requiring the human intervention. In this paper, a novel and new coarse-to-fine method is proposed to segment the brain tumor. This hierarchical framework consists of preprocessing, deep learning network based classification and post-processing. The preprocessing is used to extract image patches for each MR image and obtains the gray level sequences of image patches as the input of the deep learning network. The deep learning network based classification is implemented by a stacked auto-encoder network to extract the high level abstract feature from the input, and utilizes the extracted feature to classify image patches. After mapping the classification result to a binary image, the post-processing is implemented by a morphological filter to get the final segmentation result. In order to evaluate the proposed method, the experiment was applied to segment the brain tumor for the real patient dataset. The final performance shows that the proposed brain tumor segmentation method is more accurate and efficient.","PeriodicalId":306466,"journal":{"name":"2016 IEEE 6th International Conference on Computational Advances in Bio and Medical Sciences (ICCABS)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2016-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125854459","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 60
期刊
2016 IEEE 6th International Conference on Computational Advances in Bio and Medical Sciences (ICCABS)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1