首页 > 最新文献

2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)最新文献

英文 中文
A disease module detection algorithm for lung adenocarcinoma tumor network with significance of connections and network controllability methodology 基于连接意义和网络可控性的肺腺癌肿瘤网络疾病模块检测算法
Pub Date : 2016-12-01 DOI: 10.1109/BIBM.2016.7822797
Guimin Qin, Yi-Bo Hou, Bao-Guo Yu, Xi-Yang Liu
The protein phosphorylation modifications are important to protein activities and functions. It has been widely recognized that dysfunctional phosphorylation modifications are related to cancer. Specifically, some single amino acid variations could disrupt existing phosphorylation kinase-substrate relationships and create novel kinase-substrate relationships. Besides, numerous network-based methods have been proposed to identify meaningful disease modules, which are locally dense subnetworks. In this work, we proposed a new network clustering method to uncover disease modules, which are correlated with the specific disease, based on significance of connections instead of local density. Specially, we build a weighted tumor network of lung adenocarcinoma with kinase-substrate relationships, tissue-specific gene regulatory network, pairwise gene expression data and mutation data. With appropriate parameters decided by a machine learning method, our method identified 9 disease modules. We found that these disease modules could effectively discriminate tumor samples from normal samples. Some significantly important genes in these modules have been identified as target genes of drugs recently. Our results provide insights into the disease mechanism underlying, and help identify more target genes of drugs in the era of precision medicine.
蛋白质磷酸化修饰对蛋白质的活性和功能具有重要意义。人们普遍认为功能失调的磷酸化修饰与癌症有关。具体来说,一些单一氨基酸的变化可能会破坏现有的磷酸化激酶-底物关系,并产生新的激酶-底物关系。此外,已经提出了许多基于网络的方法来识别有意义的疾病模块,这些模块是局部密集的子网络。在这项工作中,我们提出了一种新的网络聚类方法来发现与特定疾病相关的疾病模块,该方法基于连接的显著性而不是局部密度。特别地,我们构建了一个包含激酶-底物关系、组织特异性基因调控网络、成对基因表达数据和突变数据的肺腺癌加权肿瘤网络。通过机器学习方法确定适当的参数,我们的方法识别了9个疾病模块。我们发现这些疾病模块可以有效地区分肿瘤样本和正常样本。近年来,这些模块中一些重要的基因已被确定为药物的靶基因。我们的研究结果有助于深入了解潜在的疾病机制,并有助于在精准医疗时代发现更多的药物靶基因。
{"title":"A disease module detection algorithm for lung adenocarcinoma tumor network with significance of connections and network controllability methodology","authors":"Guimin Qin, Yi-Bo Hou, Bao-Guo Yu, Xi-Yang Liu","doi":"10.1109/BIBM.2016.7822797","DOIUrl":"https://doi.org/10.1109/BIBM.2016.7822797","url":null,"abstract":"The protein phosphorylation modifications are important to protein activities and functions. It has been widely recognized that dysfunctional phosphorylation modifications are related to cancer. Specifically, some single amino acid variations could disrupt existing phosphorylation kinase-substrate relationships and create novel kinase-substrate relationships. Besides, numerous network-based methods have been proposed to identify meaningful disease modules, which are locally dense subnetworks. In this work, we proposed a new network clustering method to uncover disease modules, which are correlated with the specific disease, based on significance of connections instead of local density. Specially, we build a weighted tumor network of lung adenocarcinoma with kinase-substrate relationships, tissue-specific gene regulatory network, pairwise gene expression data and mutation data. With appropriate parameters decided by a machine learning method, our method identified 9 disease modules. We found that these disease modules could effectively discriminate tumor samples from normal samples. Some significantly important genes in these modules have been identified as target genes of drugs recently. Our results provide insights into the disease mechanism underlying, and help identify more target genes of drugs in the era of precision medicine.","PeriodicalId":345384,"journal":{"name":"2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"102 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125086921","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Effects of Bo's abdominal acupuncture on post-stroke fatigue: A pilot study 博氏腹针对脑卒中后疲劳的影响:一项初步研究
Pub Date : 2016-12-01 DOI: 10.1109/BIBM.2016.7822717
Zhen Huang, Jie Zhan, Ruihuan Pan, Youhua Guo, Mingfeng He, Hongxia Chen, Lechang Zhan
Background: Post-stroke fatigue (PSF) is a frequently reported complication of stroke. The current drugs play a limited effect on PSF. Bo's abdominal acupuncture (BAA) has been used for decades to treat stroke in China, however, few studies have used the western clinical evaluation approach to verify the efficacy of BAA. Objective: This study aimed to investigate the safety and effectiveness of BAA on PSF. Methods: Seventy stroke patients with fatigue were randomly allocated into the BAA group (n=35) or the control group (n=35). Patients in the control group received conventional rehabilitation treatment, while patients in the BAA group were given 30 additional minutes of BAA treatment each day. The level of patients' fatigue was evaluated by Fatigue Severity Scale (FSS) and the energy domain of the Stroke Specific Quality of Life (SS-QOL-E). Besides, the activity of daily living of patients was assessed by Barthel Index (BI). All adverse events were clearly written during the whole trial. Results: 70 patients with PSF accomplished this study. The mean age of patients was 60.7 years and 47 (67%) were males. At baseline, no significant difference can be observed between two groups in FSS, SS-QOL-E, and BI. After 2-week treatment, both groups signified an increase on SS-QOL-E and BI scores, a decrease on FSS scores; and the SS-QOL-E scores of BAA group increased more than that of the control group (p< 0.05), but the changes of FSS and BI scores between two groups had no significant difference after treatment (p> 0.05). Serious adverse events were not reported. Conclusion: This study suggested the integrative program of BAA and conventional rehabilitation treatment maybe more effective in promoting the recovery of PSF. Further explorations on the treatment of PSF are needed.
背景:卒中后疲劳(PSF)是卒中后常见的并发症。目前的药物对PSF的作用有限。博腹针(BAA)治疗脑卒中在中国已有几十年的历史,但很少有研究采用西方临床评价方法来验证BAA的疗效。目的:探讨BAA治疗PSF的安全性和有效性。方法:70例脑卒中疲乏患者随机分为BAA组(n=35)和对照组(n=35)。对照组患者接受常规康复治疗,BAA组患者每天增加BAA治疗30分钟。采用疲劳严重程度量表(FSS)和脑卒中特定生活质量能量域量表(SS-QOL-E)评价患者的疲劳水平。采用Barthel指数(Barthel Index, BI)评价患者的日常生活活动能力。在整个试验过程中,所有不良事件都被清楚地记录下来。结果:70例PSF患者完成了本研究。患者平均年龄60.7岁,男性47例(67%)。在基线时,两组间FSS、SS-QOL-E和BI无显著差异。治疗2周后,两组患者SS-QOL-E和BI评分均升高,FSS评分降低;BAA组SS-QOL-E评分高于对照组(p< 0.05),治疗后两组FSS、BI评分变化无显著性差异(p> 0.05)。未见严重不良事件的报道。结论:本研究提示BAA与常规康复治疗相结合可能对促进PSF的康复更有效。PSF的治疗方法有待进一步探索。
{"title":"Effects of Bo's abdominal acupuncture on post-stroke fatigue: A pilot study","authors":"Zhen Huang, Jie Zhan, Ruihuan Pan, Youhua Guo, Mingfeng He, Hongxia Chen, Lechang Zhan","doi":"10.1109/BIBM.2016.7822717","DOIUrl":"https://doi.org/10.1109/BIBM.2016.7822717","url":null,"abstract":"Background: Post-stroke fatigue (PSF) is a frequently reported complication of stroke. The current drugs play a limited effect on PSF. Bo's abdominal acupuncture (BAA) has been used for decades to treat stroke in China, however, few studies have used the western clinical evaluation approach to verify the efficacy of BAA. Objective: This study aimed to investigate the safety and effectiveness of BAA on PSF. Methods: Seventy stroke patients with fatigue were randomly allocated into the BAA group (n=35) or the control group (n=35). Patients in the control group received conventional rehabilitation treatment, while patients in the BAA group were given 30 additional minutes of BAA treatment each day. The level of patients' fatigue was evaluated by Fatigue Severity Scale (FSS) and the energy domain of the Stroke Specific Quality of Life (SS-QOL-E). Besides, the activity of daily living of patients was assessed by Barthel Index (BI). All adverse events were clearly written during the whole trial. Results: 70 patients with PSF accomplished this study. The mean age of patients was 60.7 years and 47 (67%) were males. At baseline, no significant difference can be observed between two groups in FSS, SS-QOL-E, and BI. After 2-week treatment, both groups signified an increase on SS-QOL-E and BI scores, a decrease on FSS scores; and the SS-QOL-E scores of BAA group increased more than that of the control group (p< 0.05), but the changes of FSS and BI scores between two groups had no significant difference after treatment (p> 0.05). Serious adverse events were not reported. Conclusion: This study suggested the integrative program of BAA and conventional rehabilitation treatment maybe more effective in promoting the recovery of PSF. Further explorations on the treatment of PSF are needed.","PeriodicalId":345384,"journal":{"name":"2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"51 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122421908","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Reconstructing gene regulatory network based on candidate auto selection method 基于候选基因自动选择方法的基因调控网络重构
Pub Date : 2016-12-01 DOI: 10.1109/BIBM.2016.7822524
L. Xing, Maozu Guo, Xiaoyan Liu, Chunyu Wang, Lei Wang, Yin Zhang
The reconstruction of gene regulatory network (GRN) is a great challenge in systems biology and bioinformatics, and methods based on Bayesian network (BN) draw most of attention because of its inherent probability characteristics. As NP-hard problems, most of the BN methods often adopt the heuristic search, but they are time-consuming for biological networks with a large number of nodes. To solve this problem, this paper presents a Candidate Auto Selection algorithm (CAS) based on mutual information and breakpoint detection to limit the search space in order to accelerate the learning process. The proposed algorithm automatically restricts the neighbors of each node to a small set of candidates before structure learning. Then based on CAS algorithm, we propose a globally optimal greedy search method (CAS+G), which focuses on finding the high-scoring network structure, and a local learning method (CAS+L), which focuses on faster learning the structure with small loss of quality. Results show that the proposed CAS algorithm can effectively identify the neighbor nodes of each node. In the experiments, the CAS+G method outperforms the state-of-the-art method on simulation data for inferring GRNs, and the CAS+L method is significantly faster than the state-of-the-art method with little loss of accuracy. Hence, the CAS based algorithms are more suitable for GRN inference.
基因调控网络(GRN)的重构是系统生物学和生物信息学领域的一个重大挑战,而基于贝叶斯网络(BN)的方法因其固有的概率特性而备受关注。作为NP-hard问题,大多数BN方法通常采用启发式搜索,但对于具有大量节点的生物网络,这种方法耗时较长。为了解决这一问题,本文提出了一种基于互信息和断点检测的候选自动选择算法(CAS),以限制搜索空间,从而加快学习过程。该算法在进行结构学习之前,自动将每个节点的邻居限制在一个小的候选集合中。然后在CAS算法的基础上,提出了全局最优贪心搜索法(CAS+G)和局部学习法(CAS+L),前者侧重于寻找高分网络结构,后者侧重于以较小的质量损失更快地学习结构。结果表明,所提出的CAS算法可以有效地识别每个节点的邻居节点。在实验中,CAS+G方法在模拟数据上优于最先进的方法来推断grn, CAS+L方法明显快于最先进的方法,而且精度损失很小。因此,基于CAS的算法更适合于GRN推理。
{"title":"Reconstructing gene regulatory network based on candidate auto selection method","authors":"L. Xing, Maozu Guo, Xiaoyan Liu, Chunyu Wang, Lei Wang, Yin Zhang","doi":"10.1109/BIBM.2016.7822524","DOIUrl":"https://doi.org/10.1109/BIBM.2016.7822524","url":null,"abstract":"The reconstruction of gene regulatory network (GRN) is a great challenge in systems biology and bioinformatics, and methods based on Bayesian network (BN) draw most of attention because of its inherent probability characteristics. As NP-hard problems, most of the BN methods often adopt the heuristic search, but they are time-consuming for biological networks with a large number of nodes. To solve this problem, this paper presents a Candidate Auto Selection algorithm (CAS) based on mutual information and breakpoint detection to limit the search space in order to accelerate the learning process. The proposed algorithm automatically restricts the neighbors of each node to a small set of candidates before structure learning. Then based on CAS algorithm, we propose a globally optimal greedy search method (CAS+G), which focuses on finding the high-scoring network structure, and a local learning method (CAS+L), which focuses on faster learning the structure with small loss of quality. Results show that the proposed CAS algorithm can effectively identify the neighbor nodes of each node. In the experiments, the CAS+G method outperforms the state-of-the-art method on simulation data for inferring GRNs, and the CAS+L method is significantly faster than the state-of-the-art method with little loss of accuracy. Hence, the CAS based algorithms are more suitable for GRN inference.","PeriodicalId":345384,"journal":{"name":"2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122729916","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Identifying patterns of associated-conditions through topic models of Electronic Medical Records 通过电子医疗记录的主题模型识别相关条件的模式
Pub Date : 2016-12-01 DOI: 10.1109/BIBM.2016.7822561
Moumita Bhattacharya, C. Jurkovitz, H. Shatkay
Multiple adverse health conditions co-occurring in a patient are typically associated with poor prognosis and increased office or hospital visits. Developing methods to identify patterns of co-occurring conditions can assist in diagnosis. Thus, identifying patterns of association among co-occurring conditions is of growing interest. In this paper, we report preliminary results from a data-driven study, in which we apply a machine learning method, namely, topic modeling, to Electronic Medical Records (EMRs), aiming to identify patterns of associated conditions. Specifically, we use the well-established Latent Dirichlet Allocation (LDA), a method based on the idea that documents can be modeled as a mixture of latent topics, where each topic is a distribution over words. In our study, we adapt the LDA model to identify latent topics in patients' EMRs. We evaluate the performance of our method both qualitatively and quantitatively, and show that the obtained topics indeed align well with distinct medical phenomena characterized by co-occurring conditions.
患者同时出现多种不良健康状况通常与预后不良和办公室或医院就诊次数增加有关。开发方法来识别共同发生的病症的模式可以帮助诊断。因此,确定共同发生的条件之间的关联模式是越来越感兴趣的。在本文中,我们报告了一项数据驱动研究的初步结果,其中我们将机器学习方法,即主题建模应用于电子病历(emr),旨在识别相关条件的模式。具体来说,我们使用了公认的潜在狄利克雷分配(Latent Dirichlet Allocation, LDA),这是一种基于这样一种思想的方法,即文档可以建模为潜在主题的混合物,其中每个主题是单词的分布。在我们的研究中,我们采用LDA模型来识别患者电子病历中的潜在话题。我们定性和定量地评估了我们的方法的性能,并表明所获得的主题确实与以共同发生的条件为特征的不同医学现象很好地一致。
{"title":"Identifying patterns of associated-conditions through topic models of Electronic Medical Records","authors":"Moumita Bhattacharya, C. Jurkovitz, H. Shatkay","doi":"10.1109/BIBM.2016.7822561","DOIUrl":"https://doi.org/10.1109/BIBM.2016.7822561","url":null,"abstract":"Multiple adverse health conditions co-occurring in a patient are typically associated with poor prognosis and increased office or hospital visits. Developing methods to identify patterns of co-occurring conditions can assist in diagnosis. Thus, identifying patterns of association among co-occurring conditions is of growing interest. In this paper, we report preliminary results from a data-driven study, in which we apply a machine learning method, namely, topic modeling, to Electronic Medical Records (EMRs), aiming to identify patterns of associated conditions. Specifically, we use the well-established Latent Dirichlet Allocation (LDA), a method based on the idea that documents can be modeled as a mixture of latent topics, where each topic is a distribution over words. In our study, we adapt the LDA model to identify latent topics in patients' EMRs. We evaluate the performance of our method both qualitatively and quantitatively, and show that the obtained topics indeed align well with distinct medical phenomena characterized by co-occurring conditions.","PeriodicalId":345384,"journal":{"name":"2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"36 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127669516","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
A shape model for contour extraction of Drosophila embryos 果蝇胚胎轮廓提取的形状模型
Pub Date : 2016-12-01 DOI: 10.1109/BIBM.2016.7822544
Qi Li, Y. Gong
Drosophila embryonic images provide valuable spatial and temporal information of gene expression. Extraction of the contour of a targeting embryo in an embryonic image is a fundamental step of a computational system for the study of gene-gene interaction on Drosophila. In this paper, we propose a shape model for contour extraction of Drosophila embryos. The shape model is built on connected components of edge pixels. It approximates a connected component of edge pixels by a polygon that can be either convex or concave. The main contribution of the proposed shape model is its ability of segmenting embryos touching each other. Moreover, the proposed shape model is adaptable to a wide range of applications on contour extraction.
果蝇胚胎图像为基因表达提供了宝贵的时空信息。胚胎图像中目标胚胎轮廓的提取是果蝇基因-基因相互作用计算系统研究的基本步骤。本文提出了一种用于果蝇胚胎轮廓提取的形状模型。形状模型建立在边缘像素的连接组件上。它通过一个可以是凸的或凹的多边形来近似边缘像素的连接分量。所提出的形状模型的主要贡献在于它能够分割彼此接触的胚胎。此外,所提出的形状模型适用于广泛的轮廓提取应用。
{"title":"A shape model for contour extraction of Drosophila embryos","authors":"Qi Li, Y. Gong","doi":"10.1109/BIBM.2016.7822544","DOIUrl":"https://doi.org/10.1109/BIBM.2016.7822544","url":null,"abstract":"Drosophila embryonic images provide valuable spatial and temporal information of gene expression. Extraction of the contour of a targeting embryo in an embryonic image is a fundamental step of a computational system for the study of gene-gene interaction on Drosophila. In this paper, we propose a shape model for contour extraction of Drosophila embryos. The shape model is built on connected components of edge pixels. It approximates a connected component of edge pixels by a polygon that can be either convex or concave. The main contribution of the proposed shape model is its ability of segmenting embryos touching each other. Moreover, the proposed shape model is adaptable to a wide range of applications on contour extraction.","PeriodicalId":345384,"journal":{"name":"2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"42 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127674586","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Practical approach for disease similarity calculation based on disease phenotype, etiology, and locational clues in disease names 基于疾病表型,病因学和疾病名称中的位置线索的疾病相似度计算的实用方法
Pub Date : 2016-12-01 DOI: 10.1109/BIBM.2016.7822659
Mai Omura, N. Sonehara, T. Okumura
Disease similarity is a useful measure to improve clinical decision support systems wherein it allows continuous presentation of similar diseases. In a previous study, we demonstrated that etiological and symptomatic information of diseases provide a reasonable approximation for the similarity of diseases. This study extends the previously proposed approach by incorporating the locational information of diseases, which may improve the performance against the baseline achieved only by the etiological and symptomatic features.
疾病相似性是改善临床决策支持系统的有用措施,其中它允许连续呈现类似疾病。在之前的研究中,我们证明了疾病的病因学和症状信息为疾病的相似性提供了合理的近似。本研究通过纳入疾病的位置信息来扩展先前提出的方法,这可能会提高仅通过病因和症状特征实现的基线的性能。
{"title":"Practical approach for disease similarity calculation based on disease phenotype, etiology, and locational clues in disease names","authors":"Mai Omura, N. Sonehara, T. Okumura","doi":"10.1109/BIBM.2016.7822659","DOIUrl":"https://doi.org/10.1109/BIBM.2016.7822659","url":null,"abstract":"Disease similarity is a useful measure to improve clinical decision support systems wherein it allows continuous presentation of similar diseases. In a previous study, we demonstrated that etiological and symptomatic information of diseases provide a reasonable approximation for the similarity of diseases. This study extends the previously proposed approach by incorporating the locational information of diseases, which may improve the performance against the baseline achieved only by the etiological and symptomatic features.","PeriodicalId":345384,"journal":{"name":"2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127682761","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Prediction of the efficacy of Wuji Pills by machine learning methods 用机器学习方法预测五忌丸的疗效
Pub Date : 2016-12-01 DOI: 10.1109/BIBM.2016.7822720
Haiqing Li, Guozheng Li, William Yang, Ying Chen, Xiaoxin Zhu, Mary Yang
Efficacy prediction is an inseparable part of TCM. We firstly analyze the correlation between indicators and efficacy, and max blood-drug concentration(Cmax) is chosen as the target to reflect the efficacy of drugs. Then we apply linear regression(LR), support vector regression(SVR) as well as artificial neural networks(ANNs) to predict the efficacy of Wuji pills. The results of the leave-one-out method show that SVR performs better than other methods for label Cmax, and appears to be a good method for this task. In order to find the relationship between each component of Wuji Pills, several visualization methods are adopted to deal with this problem. The web server of prediction is available at http://data.jindengtai.cn/#/case/drug for public usage.
疗效预测是中医不可分割的一部分。我们首先分析指标与疗效的相关性,选择最大血药浓度(max blood drug concentration, Cmax)作为反映药物疗效的指标。然后应用线性回归(LR)、支持向量回归(SVR)和人工神经网络(ann)对五极丸的疗效进行预测。留一方法的结果表明,对于标签Cmax, SVR的性能优于其他方法,是一种很好的方法。为了找到无忌丸各成分之间的关系,采用了几种可视化方法来处理这一问题。预测网络服务器可在http://data.jindengtai.cn/#/case/drug上公开使用。
{"title":"Prediction of the efficacy of Wuji Pills by machine learning methods","authors":"Haiqing Li, Guozheng Li, William Yang, Ying Chen, Xiaoxin Zhu, Mary Yang","doi":"10.1109/BIBM.2016.7822720","DOIUrl":"https://doi.org/10.1109/BIBM.2016.7822720","url":null,"abstract":"Efficacy prediction is an inseparable part of TCM. We firstly analyze the correlation between indicators and efficacy, and max blood-drug concentration(Cmax) is chosen as the target to reflect the efficacy of drugs. Then we apply linear regression(LR), support vector regression(SVR) as well as artificial neural networks(ANNs) to predict the efficacy of Wuji pills. The results of the leave-one-out method show that SVR performs better than other methods for label Cmax, and appears to be a good method for this task. In order to find the relationship between each component of Wuji Pills, several visualization methods are adopted to deal with this problem. The web server of prediction is available at http://data.jindengtai.cn/#/case/drug for public usage.","PeriodicalId":345384,"journal":{"name":"2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"240 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132565257","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A novel method to identify pre-microRNA in various species knowledge base 一种在不同物种知识库中鉴定前microrna的新方法
Pub Date : 2016-12-01 DOI: 10.1109/BIBM.2016.7822771
Tianyi Zhao, Ningyi Zhang, Jun Ren, Peigang Xu, Zhiyan Liu, Liang Cheng, Yang Hu
More than 1/3 of human genes are regulated by microRNAs. The identification of microRNA (miRNA) is the precondition of discovering the regulatory mechanism of miRNA and developing the cure for genetic diseases. The traditional identification method is biological experiment, but it has the defects of long period, high cost, and missing the miRNAs that only exist in a specific period or low expression level. Therefore, to overcome these defects, machine learning method is applied to identify miRNAs. In this study, for identifying real and pseudo miRNAs and classifying different species, we extracted 98 dimensional features based on the primary and secondary structure, then we proposed the BP-Adaboost method to figure out the overfitting phenomenon of BP neural network by constructing multiple BP neural network classifiers and distributed weights to these classifiers. The novel method we proposed raised the accuracy and the stability. In this study, we verified the effectiveness and superiority over other methods by experiments.
超过1/3的人类基因受microrna调控。microRNA (miRNA)的鉴定是发现miRNA的调控机制和开展遗传病治疗的前提。传统的鉴定方法是生物实验,但存在周期长、成本高、缺失只存在于特定时期或低表达水平的mirna等缺陷。因此,为了克服这些缺陷,我们采用机器学习方法来识别mirna。在本研究中,为了识别真实和伪mirna并对不同物种进行分类,我们基于一级和二级结构提取了98个维度的特征,然后我们提出了BP- adaboost方法,通过构建多个BP神经网络分类器并对这些分类器分配权重来找出BP神经网络的过拟合现象。提出的新方法提高了检测的精度和稳定性。在本研究中,我们通过实验验证了该方法的有效性和优越性。
{"title":"A novel method to identify pre-microRNA in various species knowledge base","authors":"Tianyi Zhao, Ningyi Zhang, Jun Ren, Peigang Xu, Zhiyan Liu, Liang Cheng, Yang Hu","doi":"10.1109/BIBM.2016.7822771","DOIUrl":"https://doi.org/10.1109/BIBM.2016.7822771","url":null,"abstract":"More than 1/3 of human genes are regulated by microRNAs. The identification of microRNA (miRNA) is the precondition of discovering the regulatory mechanism of miRNA and developing the cure for genetic diseases. The traditional identification method is biological experiment, but it has the defects of long period, high cost, and missing the miRNAs that only exist in a specific period or low expression level. Therefore, to overcome these defects, machine learning method is applied to identify miRNAs. In this study, for identifying real and pseudo miRNAs and classifying different species, we extracted 98 dimensional features based on the primary and secondary structure, then we proposed the BP-Adaboost method to figure out the overfitting phenomenon of BP neural network by constructing multiple BP neural network classifiers and distributed weights to these classifiers. The novel method we proposed raised the accuracy and the stability. In this study, we verified the effectiveness and superiority over other methods by experiments.","PeriodicalId":345384,"journal":{"name":"2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133417593","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Estimating isoform abundance by Particle Swarm Optimization 基于粒子群算法的异构体丰度估计
Pub Date : 2016-12-01 DOI: 10.1109/BIBM.2016.7822512
Jin Zhao, Haodi Feng
Gene controls biological character by various proteins that are formed by isoforms. Through alternative splicing, gene can express multiple isoforms. The next generation of high-throughput RNA sequencing has provided facilitation for quantifying isoform expression level. Extensive efforts have been made in stimulating isoform abundance from RNA-Seq data, but the accuracy still needs to be improved. In this article, we propose a statistical method combined with Particle Swarm Optimization to estimate isoform abundance from RNA-Seq data. After a series of statistical analysis and experiments, we decided on the forms and values of coefficients in Particle Swarm Optimization model. We analyzed the performance of our approach on both simulated and real datasets. Experiment results showed that comparing to Cufflinks our approach makes acceptable improvement on accuracy and is more sensitive to condition changes in most cases.
基因通过异构体形成的各种蛋白质来控制生物学特性。通过选择性剪接,基因可以表达多种同种异构体。下一代高通量RNA测序为定量分析异构体表达水平提供了便利。从RNA-Seq数据中激发异构体丰度已经做了大量的工作,但准确性仍有待提高。在本文中,我们提出了一种结合粒子群优化的统计方法来估计RNA-Seq数据的异构体丰度。经过一系列的统计分析和实验,我们确定了粒子群优化模型中系数的形式和取值。我们分析了我们的方法在模拟和真实数据集上的性能。实验结果表明,与袖扣相比,我们的方法在精度上有了可接受的提高,并且在大多数情况下对条件变化更敏感。
{"title":"Estimating isoform abundance by Particle Swarm Optimization","authors":"Jin Zhao, Haodi Feng","doi":"10.1109/BIBM.2016.7822512","DOIUrl":"https://doi.org/10.1109/BIBM.2016.7822512","url":null,"abstract":"Gene controls biological character by various proteins that are formed by isoforms. Through alternative splicing, gene can express multiple isoforms. The next generation of high-throughput RNA sequencing has provided facilitation for quantifying isoform expression level. Extensive efforts have been made in stimulating isoform abundance from RNA-Seq data, but the accuracy still needs to be improved. In this article, we propose a statistical method combined with Particle Swarm Optimization to estimate isoform abundance from RNA-Seq data. After a series of statistical analysis and experiments, we decided on the forms and values of coefficients in Particle Swarm Optimization model. We analyzed the performance of our approach on both simulated and real datasets. Experiment results showed that comparing to Cufflinks our approach makes acceptable improvement on accuracy and is more sensitive to condition changes in most cases.","PeriodicalId":345384,"journal":{"name":"2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133669141","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Compression-based distance methods as an alternative to statistical methods for constructing phylogenetic trees 基于压缩的距离方法作为构建系统发育树的统计方法的替代方法
Pub Date : 2016-12-01 DOI: 10.1109/BIBM.2016.7822676
Mohamed El-Dirany, Forrest Wang, J. Furst, J. Rogers, D. Raicu
Distance based methods for constructing phylogenetic trees have long been considered inconsistent and inferior to the more dominant statistical methods. However, use of compression methods specific to DNA could prove valuable in improving the effectiveness of distance based methods. To demonstrate the validity of distance-based methods when utilizing current DNA compression algorithms, such as MFCompress, we have applied such a method to datasets of closely related species of fish from the suborder Labroidei and to strains of Ebola. In both cases, we have managed to produce trees that are either very similar or identical to published trees produced using statistically based methods. This suggests that distance based methods can perform comparably to statistically based methods without requiring as much pre-processing of original DNA sequences or system resources. Additionally, the results also stress the importance of using accurate methods of calculating species distance due to the way that one specific DNA compression algorithm, MFCompress, consistently and convincingly managed to outperform other popular, general use compression algorithms.
基于距离的构建系统发育树的方法长期以来被认为不一致,而且不如更占优势的统计方法。然而,使用特定于DNA的压缩方法在提高基于距离的方法的有效性方面可能证明是有价值的。为了证明基于距离的方法在利用当前DNA压缩算法(如MFCompress)时的有效性,我们将这种方法应用于Labroidei亚目密切相关的鱼类物种和埃博拉病毒菌株的数据集。在这两种情况下,我们都成功地生成了与使用基于统计的方法生成的已发表的树非常相似或相同的树。这表明基于距离的方法可以与基于统计的方法相媲美,而不需要对原始DNA序列或系统资源进行过多的预处理。此外,研究结果还强调了使用精确方法计算物种距离的重要性,因为一种特定的DNA压缩算法MFCompress一贯且令人信服地优于其他流行的通用压缩算法。
{"title":"Compression-based distance methods as an alternative to statistical methods for constructing phylogenetic trees","authors":"Mohamed El-Dirany, Forrest Wang, J. Furst, J. Rogers, D. Raicu","doi":"10.1109/BIBM.2016.7822676","DOIUrl":"https://doi.org/10.1109/BIBM.2016.7822676","url":null,"abstract":"Distance based methods for constructing phylogenetic trees have long been considered inconsistent and inferior to the more dominant statistical methods. However, use of compression methods specific to DNA could prove valuable in improving the effectiveness of distance based methods. To demonstrate the validity of distance-based methods when utilizing current DNA compression algorithms, such as MFCompress, we have applied such a method to datasets of closely related species of fish from the suborder Labroidei and to strains of Ebola. In both cases, we have managed to produce trees that are either very similar or identical to published trees produced using statistically based methods. This suggests that distance based methods can perform comparably to statistically based methods without requiring as much pre-processing of original DNA sequences or system resources. Additionally, the results also stress the importance of using accurate methods of calculating species distance due to the way that one specific DNA compression algorithm, MFCompress, consistently and convincingly managed to outperform other popular, general use compression algorithms.","PeriodicalId":345384,"journal":{"name":"2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"207 ","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133719883","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1