首页 > 最新文献

2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)最新文献

英文 中文
Predicting disease-microbe association by random walking on the heterogeneous network 异质网络随机行走预测疾病-微生物关联
Pub Date : 2016-12-01 DOI: 10.1109/BIBM.2016.7822619
Xianjun Shen, Yao Chen, Xingpeng Jiang, Xiaohua Hu, Tingting He, Jincai Yang
The microbiota living in the human body plays a very important role in our health and disease, so the identification of microbes associated with diseases will contribute to improving medical care and to better understanding of microbe functions, interactions. However, the known associations between the diseases and microbes are very less. We proposed a new method for prioritization of candidate microbes to predict disease-microbe relationships that based on the random walking on the heterogeneous network. Here, we first constructed a heterogeneous network by connecting the disease network and microbe network using the disease-microbe relationship information, then extended the random walk to the heterogeneous network, finally we used leave-one-out cross-validation to evaluate the method and ranked the candidate disease-causing microbes. We used the algorithm to disclose some potential association between disease and microbe that cannot be found by microbe network or disease network alone. Furthermore, we studied three representative diseases, Type 2 diabetes, Asthma and Psoriasis, and presented the potential microbes associated with these diseases, respectively. We confirmed that the discovery of the associations will be a good clinical solution for disease mechanism understanding, diagnosis and therapy.
生活在人体内的微生物群在我们的健康和疾病中起着非常重要的作用,因此识别与疾病相关的微生物将有助于改善医疗保健,更好地了解微生物的功能和相互作用。然而,已知的疾病和微生物之间的联系非常少。我们提出了一种基于异构网络随机行走的候选微生物优先级预测疾病-微生物关系的新方法。本文首先利用疾病-微生物关系信息将疾病网络和微生物网络连接起来,构建了一个异构网络,然后将随机漫步扩展到异构网络中,最后使用留一交叉验证对方法进行评价,并对候选致病微生物进行排序。我们使用该算法揭示了微生物网络或疾病网络无法单独发现的疾病与微生物之间的一些潜在关联。此外,我们研究了3种代表性疾病,2型糖尿病、哮喘和牛皮癣,并分别提出了与这些疾病相关的潜在微生物。我们证实,这些关联的发现将为了解疾病机制、诊断和治疗提供良好的临床解决方案。
{"title":"Predicting disease-microbe association by random walking on the heterogeneous network","authors":"Xianjun Shen, Yao Chen, Xingpeng Jiang, Xiaohua Hu, Tingting He, Jincai Yang","doi":"10.1109/BIBM.2016.7822619","DOIUrl":"https://doi.org/10.1109/BIBM.2016.7822619","url":null,"abstract":"The microbiota living in the human body plays a very important role in our health and disease, so the identification of microbes associated with diseases will contribute to improving medical care and to better understanding of microbe functions, interactions. However, the known associations between the diseases and microbes are very less. We proposed a new method for prioritization of candidate microbes to predict disease-microbe relationships that based on the random walking on the heterogeneous network. Here, we first constructed a heterogeneous network by connecting the disease network and microbe network using the disease-microbe relationship information, then extended the random walk to the heterogeneous network, finally we used leave-one-out cross-validation to evaluate the method and ranked the candidate disease-causing microbes. We used the algorithm to disclose some potential association between disease and microbe that cannot be found by microbe network or disease network alone. Furthermore, we studied three representative diseases, Type 2 diabetes, Asthma and Psoriasis, and presented the potential microbes associated with these diseases, respectively. We confirmed that the discovery of the associations will be a good clinical solution for disease mechanism understanding, diagnosis and therapy.","PeriodicalId":345384,"journal":{"name":"2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124340208","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 18
Interpretable models to predict Breast Cancer 预测乳腺癌的可解释模型
Pub Date : 2016-12-01 DOI: 10.1109/BIBM.2016.7822745
Pedro Ferreira, I. Dutra, R. Salvini, E. Burnside
Several works in the literature use propositional (“black box”) approaches to generate prediction models. In this work we employ the Inductive Logic Programming technique, whose prediction model is based on first order rules, to the domain of breast cancer. These rules have the advantage of being interpretable and convenient to be used as a common language between the computer scientists and the medical experts. We also explore the relevance of some of variables usually collected to predict breast cancer. We compare our results with a propositional classifier that was considered best for the same dataset studied in this paper.
文献中的一些作品使用命题(“黑箱”)方法来生成预测模型。在这项工作中,我们采用归纳逻辑编程技术,其预测模型是基于一阶规则,到乳腺癌领域。这些规则具有可解释性强、易于作为计算机科学家和医学专家之间的共同语言使用的优点。我们还探讨了通常用于预测乳腺癌的一些变量的相关性。我们将我们的结果与命题分类器进行比较,该分类器被认为是本文研究的同一数据集的最佳分类器。
{"title":"Interpretable models to predict Breast Cancer","authors":"Pedro Ferreira, I. Dutra, R. Salvini, E. Burnside","doi":"10.1109/BIBM.2016.7822745","DOIUrl":"https://doi.org/10.1109/BIBM.2016.7822745","url":null,"abstract":"Several works in the literature use propositional (“black box”) approaches to generate prediction models. In this work we employ the Inductive Logic Programming technique, whose prediction model is based on first order rules, to the domain of breast cancer. These rules have the advantage of being interpretable and convenient to be used as a common language between the computer scientists and the medical experts. We also explore the relevance of some of variables usually collected to predict breast cancer. We compare our results with a propositional classifier that was considered best for the same dataset studied in this paper.","PeriodicalId":345384,"journal":{"name":"2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"198 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124372613","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
Concod: Accurate consensus-based approach of calling deletions from high-throughput sequencing data Concod:精确的基于共识的方法,从高通量测序数据中调用删除
Pub Date : 2016-12-01 DOI: 10.1109/BIBM.2016.7822495
Xiaodong Zhang, Chong Chu, Yao Zhang, Y. Wu, Jingyang Gao
Accurate calling of structural variations such as deletions with short sequence reads from high-throughput sequencing is an important but challenging problem in the field of genome analysis. There are many existing methods for calling deletions. At present, not a single method clearly outperforms all other methods in precision and sensitivity. A popular strategy used by several authors is combining different signatures left by deletions in order to achieve more accurate deletion calling. However, most existing methods using the combining approach are heuristic and the called deletions by these tools still contain many wrongly called deletions. In this paper, we present Concod, a machine learning based framework for calling deletions with consensus, which is able to more accurately detect and distinguish true deletions from falsely called ones. First, Concod collects candidate deletions by merging the output of multiple existing deletion calling tools. Then, features of each candidate are extracted from aligned reads based on multiple detection theories. Finally, a machine learning model is trained with these features and used to classify the true and false candidates. We test our approach on different coverage of real data and compare with existing tools, including Pindel, SVseq2, BreakDancer, and DELLY. Results show that Concod improves both precision and sensitivity of deletion calling significantly.
从高通量测序中准确识别短序列缺失等结构变异是基因组分析领域的一个重要但具有挑战性的问题。有许多调用删除的现有方法。目前,没有一种方法在精度和灵敏度上明显优于所有其他方法。一些作者使用的一种流行策略是将删除留下的不同签名组合起来,以实现更准确的删除调用。然而,现有的组合方法大多是启发式的,这些工具的所谓删除仍然包含许多错误的删除。在本文中,我们提出了Concod,一个基于机器学习的共识删除调用框架,它能够更准确地检测和区分真正的删除和错误的删除调用。首先,Concod通过合并多个现有删除调用工具的输出来收集候选删除。然后,基于多种检测理论,从对齐的reads中提取每个候选的特征。最后,使用这些特征训练机器学习模型,并用于对真假候选对象进行分类。我们在真实数据的不同覆盖范围上测试了我们的方法,并与现有工具(包括Pindel, SVseq2, BreakDancer和DELLY)进行了比较。结果表明,Concod显著提高了缺失调用的精度和灵敏度。
{"title":"Concod: Accurate consensus-based approach of calling deletions from high-throughput sequencing data","authors":"Xiaodong Zhang, Chong Chu, Yao Zhang, Y. Wu, Jingyang Gao","doi":"10.1109/BIBM.2016.7822495","DOIUrl":"https://doi.org/10.1109/BIBM.2016.7822495","url":null,"abstract":"Accurate calling of structural variations such as deletions with short sequence reads from high-throughput sequencing is an important but challenging problem in the field of genome analysis. There are many existing methods for calling deletions. At present, not a single method clearly outperforms all other methods in precision and sensitivity. A popular strategy used by several authors is combining different signatures left by deletions in order to achieve more accurate deletion calling. However, most existing methods using the combining approach are heuristic and the called deletions by these tools still contain many wrongly called deletions. In this paper, we present Concod, a machine learning based framework for calling deletions with consensus, which is able to more accurately detect and distinguish true deletions from falsely called ones. First, Concod collects candidate deletions by merging the output of multiple existing deletion calling tools. Then, features of each candidate are extracted from aligned reads based on multiple detection theories. Finally, a machine learning model is trained with these features and used to classify the true and false candidates. We test our approach on different coverage of real data and compare with existing tools, including Pindel, SVseq2, BreakDancer, and DELLY. Results show that Concod improves both precision and sensitivity of deletion calling significantly.","PeriodicalId":345384,"journal":{"name":"2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124436582","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Learning to diagnose cirrhosis via combined liver capsule and parenchyma ultrasound image features 学习通过肝包膜和肝实质联合超声图像特征诊断肝硬化
Pub Date : 2016-12-01 DOI: 10.1109/BIBM.2016.7822627
Shuohong Wang, Xiang Liu, Jingwen Zhao, J. Song, J. Zhang, Y. Chen
This paper proposes a novel cirrhosis diagnosis method using high-frequency ultrasound imaging that is able to not only diagnose cirrhosis, but also determine its stage. We propose combined features extracted from both liver capsule and parenchyma texture to avoid the bias caused by considering only one aspect. The liver capsule is localized using a multi-scale, multi-objective optimization method and indices are proposed to measure the smoothness and continuity of the capsule. The parenchyma texture is modeled with Gaussian mixture model (GMM), and the lesions in the parenchyma are detected by a scale-space defect detection algorithm. The degree of pathological changes of the liver is quantitatively evaluated by 7 features describing morphology of the capsule and lesions in the parenchyma. Then SVM classifiers are trained to classify the samples into different cirrhosis stages. Experiment results demonstrate the effectiveness of the proposed method, which outperforms other 4 state-of-the-art methods and the proposed method that solely uses capsule or parenchyma texture features.
本文提出了一种新的肝硬化诊断方法,利用高频超声成像不仅可以诊断肝硬化,而且可以确定其分期。我们提出结合肝包膜和实质纹理的特征提取,以避免只考虑一个方面造成的偏差。采用多尺度、多目标优化方法对肝包膜进行定位,并提出了衡量包膜平滑度和连续性的指标。采用高斯混合模型(GMM)对薄壁组织纹理进行建模,采用尺度空间缺陷检测算法对薄壁组织中的病变进行检测。肝脏的病理改变程度通过描述包膜形态和实质病变的7个特征来定量评估。然后训练SVM分类器将样本划分为不同的肝硬化阶段。实验结果证明了该方法的有效性,优于其他4种最先进的方法以及仅使用胶囊或薄壁纹理特征的方法。
{"title":"Learning to diagnose cirrhosis via combined liver capsule and parenchyma ultrasound image features","authors":"Shuohong Wang, Xiang Liu, Jingwen Zhao, J. Song, J. Zhang, Y. Chen","doi":"10.1109/BIBM.2016.7822627","DOIUrl":"https://doi.org/10.1109/BIBM.2016.7822627","url":null,"abstract":"This paper proposes a novel cirrhosis diagnosis method using high-frequency ultrasound imaging that is able to not only diagnose cirrhosis, but also determine its stage. We propose combined features extracted from both liver capsule and parenchyma texture to avoid the bias caused by considering only one aspect. The liver capsule is localized using a multi-scale, multi-objective optimization method and indices are proposed to measure the smoothness and continuity of the capsule. The parenchyma texture is modeled with Gaussian mixture model (GMM), and the lesions in the parenchyma are detected by a scale-space defect detection algorithm. The degree of pathological changes of the liver is quantitatively evaluated by 7 features describing morphology of the capsule and lesions in the parenchyma. Then SVM classifiers are trained to classify the samples into different cirrhosis stages. Experiment results demonstrate the effectiveness of the proposed method, which outperforms other 4 state-of-the-art methods and the proposed method that solely uses capsule or parenchyma texture features.","PeriodicalId":345384,"journal":{"name":"2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"268 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114565788","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
A deep tongue image features analysis model for medical application 一种医学应用深舌图像特征分析模型
Pub Date : 2016-12-01 DOI: 10.1109/BIBM.2016.7822815
Dan Meng, Guitao Cao, Y. Duan, Minghua Zhu, Liping Tu, Jia-tuo Xu, Dong-Guo Xu
With the improvement of people's living standards, there is no doubt that people are paying more and more attention to their health. However, shortage of medical resources is a critical global problem. As a result, an intelligent prognostics system has a great potential to play important roles in computer aided diagnosis. Numerous papers reported that tongue features have been closely related to a human's state. Among them, the majority of the existing tongue image analyses and classification methods are based on the low-level features, which may not provide a holistic view of the tongue. Inspired by a deep convolutional neural network (CNN), we propose a deep tongue image feature analysis system to extract unbiased features and reduce human labor for tongue diagnosis. With the unbalanced sample distribution, it is hard to form a balanced classification model based on feature representations obtained by existing low-level and high-level methods. Our proposed deep tongue image feature analysis model learns high-level features and provide more classification information during training time, which may result in higher accuracy when predicting testing samples. We tested the proposed system on a set of 267 gastritis patients, and a control group of 48 healthy volunteers (labeled according to Western medical practices). Test results show that the proposed deep tongue image feature analysis model can classify a given tongue image into healthy and diseased state with an average accuracy of 91.49%, which demonstrates the relationship between human body's state and its deep tongue image features.
随着人们生活水平的提高,毫无疑问,人们越来越重视自己的健康。然而,医疗资源短缺是一个严重的全球性问题。因此,智能预测系统在计算机辅助诊断中具有很大的潜力。许多论文报道,舌头的特征与人类的状态密切相关。其中,现有的舌头图像分析和分类方法大多是基于底层特征,可能无法提供舌头的整体视图。受深度卷积神经网络(CNN)的启发,我们提出了一种深度舌图像特征分析系统,以提取无偏特征,减少舌诊断的人工劳动。由于样本分布不平衡,现有的低级和高级方法得到的特征表示很难形成一个平衡的分类模型。我们提出的深舌图像特征分析模型在训练过程中学习了高级特征,提供了更多的分类信息,可以提高测试样本预测的准确率。我们在267名胃炎患者和48名健康志愿者(根据西方医学实践标记)的对照组中测试了所提出的系统。实验结果表明,所提出的深舌图像特征分析模型能够以91.49%的平均准确率将给定的舌图像分为健康状态和病变状态,证明了人体状态与深舌图像特征之间的关系。
{"title":"A deep tongue image features analysis model for medical application","authors":"Dan Meng, Guitao Cao, Y. Duan, Minghua Zhu, Liping Tu, Jia-tuo Xu, Dong-Guo Xu","doi":"10.1109/BIBM.2016.7822815","DOIUrl":"https://doi.org/10.1109/BIBM.2016.7822815","url":null,"abstract":"With the improvement of people's living standards, there is no doubt that people are paying more and more attention to their health. However, shortage of medical resources is a critical global problem. As a result, an intelligent prognostics system has a great potential to play important roles in computer aided diagnosis. Numerous papers reported that tongue features have been closely related to a human's state. Among them, the majority of the existing tongue image analyses and classification methods are based on the low-level features, which may not provide a holistic view of the tongue. Inspired by a deep convolutional neural network (CNN), we propose a deep tongue image feature analysis system to extract unbiased features and reduce human labor for tongue diagnosis. With the unbalanced sample distribution, it is hard to form a balanced classification model based on feature representations obtained by existing low-level and high-level methods. Our proposed deep tongue image feature analysis model learns high-level features and provide more classification information during training time, which may result in higher accuracy when predicting testing samples. We tested the proposed system on a set of 267 gastritis patients, and a control group of 48 healthy volunteers (labeled according to Western medical practices). Test results show that the proposed deep tongue image feature analysis model can classify a given tongue image into healthy and diseased state with an average accuracy of 91.49%, which demonstrates the relationship between human body's state and its deep tongue image features.","PeriodicalId":345384,"journal":{"name":"2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115068263","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
HPTree: Reconstructing phylogenetic trees for ultra-large unaligned DNA sequences via NJ model and Hadoop HPTree:通过NJ模型和Hadoop重建超大未对齐DNA序列的系统发育树
Pub Date : 2016-12-01 DOI: 10.1109/BIBM.2016.7822492
Q. Zou, Shixiang Wan, Xiangxiang Zeng
Constructing phylogenetic tree for ultra-large sequences (eg. Files more than 1GB) is quite difficult, especially for the unaligned DNA sequences. It is meaningless and impracticable to do multiple sequence alignment for large diverse DNA sequences. We try to do clustering firstly for the mounts of DNA sequences, and divide them into several clusters. Then each cluster is aligned and phylogenetic analysed in parallel. Hadoop, which is the most popular parallel platform in cloud computing, is employed for this process. Our software tool HPTree can handle the >1GB DNA sequence file or more than 1,000,000 DNA sequences in few hours. Users could try HPTree in the cloud computing platform (eg. Amazon) or their own clusters for the big data phylogenetic tree reconstruction. No super machine or large memory is required. HPTree could benefit the users who focus on population evolution or long common genes (eg. 16s rRNA) evolution. The software tool along with its codes and datasets are accessible at http://lab.malab.cn/soft/HPtree/.
构建超大序列的系统发育树。文件超过1GB)是相当困难的,特别是对于未对齐的DNA序列。对大量不同的DNA序列进行多序列比对是没有意义和不可行的。我们首先尝试对DNA序列进行聚类,并将它们分成若干个聚类。然后对每个簇进行排列,并并行进行系统发育分析。Hadoop是云计算中最流行的并行平台,它被用于这个过程。我们的软件工具HPTree可以在几个小时内处理>1GB的DNA序列文件或超过1,000,000个DNA序列。用户可以在云计算平台(例如:Amazon)或自己的集群进行大数据系统发育树重建。不需要超级机器或大内存。HPTree可以使关注群体进化或长共同基因的用户受益。16s rRNA)进化。该软件工具及其代码和数据集可在http://lab.malab.cn/soft/HPtree/上访问。
{"title":"HPTree: Reconstructing phylogenetic trees for ultra-large unaligned DNA sequences via NJ model and Hadoop","authors":"Q. Zou, Shixiang Wan, Xiangxiang Zeng","doi":"10.1109/BIBM.2016.7822492","DOIUrl":"https://doi.org/10.1109/BIBM.2016.7822492","url":null,"abstract":"Constructing phylogenetic tree for ultra-large sequences (eg. Files more than 1GB) is quite difficult, especially for the unaligned DNA sequences. It is meaningless and impracticable to do multiple sequence alignment for large diverse DNA sequences. We try to do clustering firstly for the mounts of DNA sequences, and divide them into several clusters. Then each cluster is aligned and phylogenetic analysed in parallel. Hadoop, which is the most popular parallel platform in cloud computing, is employed for this process. Our software tool HPTree can handle the >1GB DNA sequence file or more than 1,000,000 DNA sequences in few hours. Users could try HPTree in the cloud computing platform (eg. Amazon) or their own clusters for the big data phylogenetic tree reconstruction. No super machine or large memory is required. HPTree could benefit the users who focus on population evolution or long common genes (eg. 16s rRNA) evolution. The software tool along with its codes and datasets are accessible at http://lab.malab.cn/soft/HPtree/.","PeriodicalId":345384,"journal":{"name":"2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"318 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123503179","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Dependency-based convolutional neural network for drug-drug interaction extraction 基于依赖的卷积神经网络药物-药物相互作用提取
Pub Date : 2016-12-01 DOI: 10.1109/BIBM.2016.7822671
Shengyu Liu, Kai Chen, Qingcai Chen, Buzhou Tang
Drug-drug interactions (DDIs) are crucial for healthcare. Besides DDIs reported in medical knowledge bases such as DrugBank, a large number of latest DDI findings are also reported in unstructured biomedical literature. Extracting DDIs from unstructured biomedical literature is a worthy addition to the existing knowledge bases. Currently, convolutional neural network (CNN) is a state-of-the-art method for DDI extraction. One limitation of CNN is that it neglects long distance dependencies between words in candidate DDI instances, which may be helpful for DDI extraction. In order to incorporate the long distance dependencies between words in candidate DDI instances, in this work, we propose a dependency-based convolutional neural network (DCNN) for DDI extraction. Experiments conducted on the DDIExtraction 2013 corpus show that DCNN using a public state-of-the-art dependency parser achieves an F-score of 70.19%, outperforming CNN by 0.44%. By analyzing errors of DCNN, we find that errors from dependency parsers are propagated into DCNN and affect the performance of DCNN. To reduce error propagation, we design a simple rule to combine CNN with DCNN, that is, using DCNN to extract DDIs in short sentences and CNN to extract DDIs in long distances as most dependency parsers work well for short sentences but bad for long sentences. Finally, our system that combines CNN and DCNN achieves an F-score of 70.81%, outperforming CNN by 1.06% and DNN by 0.62% on the DDIExtraction 2013 corpus.
药物-药物相互作用(ddi)对医疗保健至关重要。除了在DrugBank等医学知识库中报道DDI外,非结构化生物医学文献中也报道了大量最新的DDI发现。从非结构化生物医学文献中提取ddi是对现有知识库的一个有价值的补充。卷积神经网络(CNN)是目前最先进的DDI提取方法。CNN的一个限制是它忽略了候选DDI实例中单词之间的长距离依赖关系,这可能有助于DDI提取。为了结合候选DDI实例中词之间的长距离依赖关系,本文提出了一种基于依赖关系的卷积神经网络(DCNN)用于DDI提取。在DDIExtraction 2013语料库上进行的实验表明,使用最先进的公共依赖解析器的DCNN达到了70.19%的f分,比CNN高出0.44%。通过对DCNN的误差分析,我们发现依赖解析器的误差会传播到DCNN中,影响DCNN的性能。为了减少错误传播,我们设计了一个简单的规则将CNN和DCNN结合起来,即使用DCNN提取短句子中的ddi,使用CNN提取长距离中的ddi,因为大多数依赖解析器对短句子效果很好,但对长句子效果不好。最后,我们的系统结合了CNN和DCNN,在DDIExtraction 2013语料库上取得了70.81%的f分,比CNN高1.06%,比DNN高0.62%。
{"title":"Dependency-based convolutional neural network for drug-drug interaction extraction","authors":"Shengyu Liu, Kai Chen, Qingcai Chen, Buzhou Tang","doi":"10.1109/BIBM.2016.7822671","DOIUrl":"https://doi.org/10.1109/BIBM.2016.7822671","url":null,"abstract":"Drug-drug interactions (DDIs) are crucial for healthcare. Besides DDIs reported in medical knowledge bases such as DrugBank, a large number of latest DDI findings are also reported in unstructured biomedical literature. Extracting DDIs from unstructured biomedical literature is a worthy addition to the existing knowledge bases. Currently, convolutional neural network (CNN) is a state-of-the-art method for DDI extraction. One limitation of CNN is that it neglects long distance dependencies between words in candidate DDI instances, which may be helpful for DDI extraction. In order to incorporate the long distance dependencies between words in candidate DDI instances, in this work, we propose a dependency-based convolutional neural network (DCNN) for DDI extraction. Experiments conducted on the DDIExtraction 2013 corpus show that DCNN using a public state-of-the-art dependency parser achieves an F-score of 70.19%, outperforming CNN by 0.44%. By analyzing errors of DCNN, we find that errors from dependency parsers are propagated into DCNN and affect the performance of DCNN. To reduce error propagation, we design a simple rule to combine CNN with DCNN, that is, using DCNN to extract DDIs in short sentences and CNN to extract DDIs in long distances as most dependency parsers work well for short sentences but bad for long sentences. Finally, our system that combines CNN and DCNN achieves an F-score of 70.81%, outperforming CNN by 1.06% and DNN by 0.62% on the DDIExtraction 2013 corpus.","PeriodicalId":345384,"journal":{"name":"2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122104250","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 40
DMcompress: Dynamic Markov models for bacterial genome compression 细菌基因组压缩的动态马尔可夫模型
Pub Date : 2016-12-01 DOI: 10.1109/BIBM.2016.7822621
Rongjie Wang, Mingxiang Teng, Yang Bai, Tianyi Zang, Yadong Wang
Genome data increasing exponentially since the last decade, compressing genome with Markov models has been proposed as an effective statistical method. However, existing methods set a static order-k Markov models to compress various genomes. Employing static order-k Markov model could result in a sub-optimal orders on some genomes. In this paper, we propose a compression method that relies on a pre-analysis of the data before compression, with the aim of estimating Markov models order k, yielding improvements over static Markov models. Experimental results on the latest complete bacterial genome data show that our method could effectively compress genome with a better performance than the state-of-the-art method. The codes of DMcompress are available at https://rongjiewang.github.io/DMcompress
近十年来,基因组数据呈指数级增长,利用马尔可夫模型压缩基因组已被提出作为一种有效的统计方法。然而,现有的方法设置一个静态的k阶马尔可夫模型来压缩不同的基因组。采用静态k阶马尔可夫模型可能导致某些基因组的次优序。在本文中,我们提出了一种压缩方法,该方法依赖于压缩前对数据的预分析,目的是估计马尔可夫模型的k阶,从而优于静态马尔可夫模型。在最新的细菌全基因组数据上的实验结果表明,我们的方法可以有效地压缩基因组,并且具有比现有方法更好的性能。DMcompress的代码可在https://rongjiewang.github.io/DMcompress上获得
{"title":"DMcompress: Dynamic Markov models for bacterial genome compression","authors":"Rongjie Wang, Mingxiang Teng, Yang Bai, Tianyi Zang, Yadong Wang","doi":"10.1109/BIBM.2016.7822621","DOIUrl":"https://doi.org/10.1109/BIBM.2016.7822621","url":null,"abstract":"Genome data increasing exponentially since the last decade, compressing genome with Markov models has been proposed as an effective statistical method. However, existing methods set a static order-k Markov models to compress various genomes. Employing static order-k Markov model could result in a sub-optimal orders on some genomes. In this paper, we propose a compression method that relies on a pre-analysis of the data before compression, with the aim of estimating Markov models order k, yielding improvements over static Markov models. Experimental results on the latest complete bacterial genome data show that our method could effectively compress genome with a better performance than the state-of-the-art method. The codes of DMcompress are available at https://rongjiewang.github.io/DMcompress","PeriodicalId":345384,"journal":{"name":"2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"11 1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128539204","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Clinical text mining for efficient extraction of drug-allergy reactions 有效提取药物过敏反应的临床文本挖掘
Pub Date : 2016-12-01 DOI: 10.1109/BIBM.2016.7822651
Arantza Casillas, Koldo Gojenola, Alicia Pérez, M. Oronoz
This work focuses on the extraction of allergic drug reactions in electronic health records. The goal is to annotate a sub-class of cause-effect events, those in which drugs are causing allergies. Little work has carried out in this field, seldom for Spanish clinical text mining, which is, indeed, the aim of this work. We present two approaches: a rule-based method and another one based on machine learning. Both approaches incorporate semantic knowledge derived from FreeLing-Med, a software explicitly developed to parse texts in the medical domain. Having recognised the medical entities for a given record, the challenge stands on triggering the underlying allergies. To this end, the knowledge is expressed as a set of semantic, syntactic and structural features. Our best system, based on machine learning, obtained a precision of 0.90 with a recall of 0.87, outperforming a rule-based approach.
本研究的重点是电子病历中药物过敏反应的提取。目标是注释因果事件的一个子类,即那些药物引起过敏的事件。在这一领域开展的工作很少,很少用于西班牙临床文本挖掘,这确实是本工作的目的。我们提出了两种方法:基于规则的方法和另一种基于机器学习的方法。这两种方法都结合了源自FreeLing-Med的语义知识,FreeLing-Med是一种明确开发用于解析医学领域文本的软件。在确认了给定记录的医疗实体之后,挑战在于触发潜在的过敏。为此,知识被表达为一组语义、句法和结构特征。我们最好的系统,基于机器学习,获得了0.90的精度和0.87的召回率,优于基于规则的方法。
{"title":"Clinical text mining for efficient extraction of drug-allergy reactions","authors":"Arantza Casillas, Koldo Gojenola, Alicia Pérez, M. Oronoz","doi":"10.1109/BIBM.2016.7822651","DOIUrl":"https://doi.org/10.1109/BIBM.2016.7822651","url":null,"abstract":"This work focuses on the extraction of allergic drug reactions in electronic health records. The goal is to annotate a sub-class of cause-effect events, those in which drugs are causing allergies. Little work has carried out in this field, seldom for Spanish clinical text mining, which is, indeed, the aim of this work. We present two approaches: a rule-based method and another one based on machine learning. Both approaches incorporate semantic knowledge derived from FreeLing-Med, a software explicitly developed to parse texts in the medical domain. Having recognised the medical entities for a given record, the challenge stands on triggering the underlying allergies. To this end, the knowledge is expressed as a set of semantic, syntactic and structural features. Our best system, based on machine learning, obtained a precision of 0.90 with a recall of 0.87, outperforming a rule-based approach.","PeriodicalId":345384,"journal":{"name":"2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130075288","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Fluorescence and bright-field 3D image fusion based on sinogram unification for optical projection tomography 基于正弦图统一的光学投影断层成像荧光与亮场三维图像融合
Pub Date : 2016-12-01 DOI: 10.1109/BIBM.2016.7822552
Xiaoqin Tang, M. V. Hoff, J. Hoogenboom, Yuanhao Guo, Fuyu Cai, G. Lamers, F. Verbeek
In order to preserve sufficient fluorescence intensity and improve the quality of fluorescence images in optical projection tomography (OPT) imaging, a feasible acquisition solution is to temporally formalize the fluorescence and bright-field imaging procedure as two consecutive phases. To be specific, fluorescence images are acquired first, in a full axial-view revolution, followed by the bright-field images. Due to the mechanical drift, this approach, however, may suffer from a deviation of center of rotation (COR) for the two imaging phases, resulting in irregular 3D image fusion, with which gene or protein activity may be located inaccurately. In this paper, we address this problem and consider it into a framework based on sinogram unification so as to precisely fuse 3D images from different channels for CORs between channels that are not coincident or if COR is not in the center of sinogram. The former case corresponds to the COR deviation above; while the latter one correlates with COR alignment, without which artefacts will be introduced in the reconstructed results. After sinogram unification, inverse radon transform can be implemented on each channel to reconstruct the 3D image. The fusion results are acquired by mapping the 3D images from different channels into a common space. Experimental results indicate that the proposed framework gains excellent performance in 3D image fusion from different channels. For the COR alignment, a new automated method based on interest point detection and included in sinogram unification, is presented. It outperforms traditional COR alignment approaches in combination of effectiveness and computational complexity.
为了在光学投影层析成像(OPT)中保持足够的荧光强度并提高荧光图像的质量,一种可行的采集方案是将荧光和明场成像过程暂时形式化为两个连续的阶段。具体地说,荧光图像是首先获得的,在一个完整的轴向视图的革命,其次是亮场图像。然而,由于机械漂移,这种方法可能存在两个成像阶段的旋转中心(COR)偏差,导致不规则的3D图像融合,从而可能不准确地定位基因或蛋白质的活性。在本文中,我们解决了这一问题,并将其考虑到一个基于sinogram统一的框架中,以便在不重合通道之间或当COR不在sinogram中心时,对不同通道的3D图像进行精确的CORs融合。前一种情况对应于上述COR偏差;而后者与COR对齐相关,如果没有COR对齐,重构结果中将引入伪影。在正弦图统一后,对每个通道进行逆radon变换,重建三维图像。通过将不同通道的三维图像映射到一个公共空间,获得融合结果。实验结果表明,该框架在不同通道的三维图像融合中取得了较好的效果。提出了一种基于兴趣点检测并包含正弦图统一的自动对齐方法。它在有效性和计算复杂度方面优于传统的COR对齐方法。
{"title":"Fluorescence and bright-field 3D image fusion based on sinogram unification for optical projection tomography","authors":"Xiaoqin Tang, M. V. Hoff, J. Hoogenboom, Yuanhao Guo, Fuyu Cai, G. Lamers, F. Verbeek","doi":"10.1109/BIBM.2016.7822552","DOIUrl":"https://doi.org/10.1109/BIBM.2016.7822552","url":null,"abstract":"In order to preserve sufficient fluorescence intensity and improve the quality of fluorescence images in optical projection tomography (OPT) imaging, a feasible acquisition solution is to temporally formalize the fluorescence and bright-field imaging procedure as two consecutive phases. To be specific, fluorescence images are acquired first, in a full axial-view revolution, followed by the bright-field images. Due to the mechanical drift, this approach, however, may suffer from a deviation of center of rotation (COR) for the two imaging phases, resulting in irregular 3D image fusion, with which gene or protein activity may be located inaccurately. In this paper, we address this problem and consider it into a framework based on sinogram unification so as to precisely fuse 3D images from different channels for CORs between channels that are not coincident or if COR is not in the center of sinogram. The former case corresponds to the COR deviation above; while the latter one correlates with COR alignment, without which artefacts will be introduced in the reconstructed results. After sinogram unification, inverse radon transform can be implemented on each channel to reconstruct the 3D image. The fusion results are acquired by mapping the 3D images from different channels into a common space. Experimental results indicate that the proposed framework gains excellent performance in 3D image fusion from different channels. For the COR alignment, a new automated method based on interest point detection and included in sinogram unification, is presented. It outperforms traditional COR alignment approaches in combination of effectiveness and computational complexity.","PeriodicalId":345384,"journal":{"name":"2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130925631","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
期刊
2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1