首页 > 最新文献

2011 IEEE International Conference on Bioinformatics and Biomedicine Workshops (BIBMW)最新文献

英文 中文
Regularization of sequence data for machine learning 用于机器学习的序列数据正则化
Bryan Bai, S. C. Kremer
We examine the problem of classifying biological sequences, and in particular the challenge of generalizing results to novel input data. We observe that the high-dimensionality of sequence data representations results in an extremely sparsely populated input space. This motivates a need for regularization (a form of inductive bias), in order to achieve generalization. We discuss regularization in the context of regular neural networks, deep belief networks and support vector machines, and provide experimental results for these architectures. Our results support the importance of using an effective regularization method and identify which methods work well on a real-world dataset.
我们研究了分类生物序列的问题,特别是将结果推广到新输入数据的挑战。我们观察到,序列数据表示的高维导致了一个极其稀疏的输入空间。这激发了对正则化(归纳偏差的一种形式)的需求,以实现泛化。我们在规则神经网络、深度信念网络和支持向量机的背景下讨论了正则化,并提供了这些架构的实验结果。我们的结果支持使用有效的正则化方法的重要性,并确定哪些方法在真实数据集上工作得很好。
{"title":"Regularization of sequence data for machine learning","authors":"Bryan Bai, S. C. Kremer","doi":"10.1109/BIBMW.2011.6112350","DOIUrl":"https://doi.org/10.1109/BIBMW.2011.6112350","url":null,"abstract":"We examine the problem of classifying biological sequences, and in particular the challenge of generalizing results to novel input data. We observe that the high-dimensionality of sequence data representations results in an extremely sparsely populated input space. This motivates a need for regularization (a form of inductive bias), in order to achieve generalization. We discuss regularization in the context of regular neural networks, deep belief networks and support vector machines, and provide experimental results for these architectures. Our results support the importance of using an effective regularization method and identify which methods work well on a real-world dataset.","PeriodicalId":6345,"journal":{"name":"2011 IEEE International Conference on Bioinformatics and Biomedicine Workshops (BIBMW)","volume":"13 1","pages":"19-25"},"PeriodicalIF":0.0,"publicationDate":"2011-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87631016","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Active Protein Interaction Network and Its Application on Protein Complex Detection 活性蛋白相互作用网络及其在蛋白复合物检测中的应用
Jianxin Wang, Xiaoqing Peng, Min Li, Yong Luo, Yi Pan
In recent years, more and more attentions are focused on modelling and analyzing dynamic network. Some researchers attempted to extract dynamic network by combining the dynamic information from gene expression data or sub cellular localization data with protein network. However, the dynamics of proteins' presence does not guarantee the dynamics of interactions, since the presence of a protein does not indicate the protein's activity. The activity of a protein is closely connected with its function. Thus only the dynamics of proteins activity ensure the dynamics of interaction. The gene expression of a cellular process or cycle carries more information than only the dynamics of proteins' presence. We assume that a protein is active when its expression values are near its maximum expression value, since the expression quantity will decrease after it has performed its function that leads a feedback for controlling the expression quantity. In this paper, we proposed a method to identify active time points for each protein in a cellular process or cycle by using a 3-sigma principle to compute an active threshold for each gene according to the characteristics of its expression curve. Combined the activity information and protein interaction network, we can construct an active protein interaction network (APPI). To demonstrate the efficiency of APPI network model, we applied it on complex detection. Compared with single threshold time series networks, APPI network achieves a better performance on protein complex prediction.
近年来,动态网络的建模和分析越来越受到人们的关注。一些研究者试图将基因表达数据或亚细胞定位数据中的动态信息与蛋白质网络相结合来提取动态网络。然而,蛋白质存在的动态并不能保证相互作用的动态,因为蛋白质的存在并不表明蛋白质的活性。蛋白质的活性与其功能密切相关。因此,只有蛋白质活性的动态才能保证相互作用的动态。细胞过程或周期的基因表达携带的信息比仅仅是蛋白质存在的动态更多。我们假设一个蛋白在其表达值接近其最大表达值时是有活性的,因为它在完成其功能后,表达量会减少,导致控制表达量的反馈。在本文中,我们提出了一种识别细胞过程或周期中每个蛋白质的活性时间点的方法,该方法使用3-sigma原理根据每个基因的表达曲线特征计算每个基因的活性阈值。将活性信息与蛋白质相互作用网络相结合,构建活性蛋白质相互作用网络(APPI)。为了验证APPI网络模型的有效性,我们将其应用于复杂检测。与单阈值时间序列网络相比,APPI网络在蛋白质复合体预测上取得了更好的性能。
{"title":"Active Protein Interaction Network and Its Application on Protein Complex Detection","authors":"Jianxin Wang, Xiaoqing Peng, Min Li, Yong Luo, Yi Pan","doi":"10.1109/BIBM.2011.45","DOIUrl":"https://doi.org/10.1109/BIBM.2011.45","url":null,"abstract":"In recent years, more and more attentions are focused on modelling and analyzing dynamic network. Some researchers attempted to extract dynamic network by combining the dynamic information from gene expression data or sub cellular localization data with protein network. However, the dynamics of proteins' presence does not guarantee the dynamics of interactions, since the presence of a protein does not indicate the protein's activity. The activity of a protein is closely connected with its function. Thus only the dynamics of proteins activity ensure the dynamics of interaction. The gene expression of a cellular process or cycle carries more information than only the dynamics of proteins' presence. We assume that a protein is active when its expression values are near its maximum expression value, since the expression quantity will decrease after it has performed its function that leads a feedback for controlling the expression quantity. In this paper, we proposed a method to identify active time points for each protein in a cellular process or cycle by using a 3-sigma principle to compute an active threshold for each gene according to the characteristics of its expression curve. Combined the activity information and protein interaction network, we can construct an active protein interaction network (APPI). To demonstrate the efficiency of APPI network model, we applied it on complex detection. Compared with single threshold time series networks, APPI network achieves a better performance on protein complex prediction.","PeriodicalId":6345,"journal":{"name":"2011 IEEE International Conference on Bioinformatics and Biomedicine Workshops (BIBMW)","volume":"12 1","pages":"37-42"},"PeriodicalIF":0.0,"publicationDate":"2011-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"90168797","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 17
Protein docking with information on evolutionary conserved interfaces 蛋白质与进化保守界面信息的对接
I. Hashmi, Bahar Akbal-Delibas, Nurit Haspel, Amarda Shehu
Structural modeling of molecular assemblies lies at the heart of understanding molecular interactions and biological function. We present a method for docking protein molecules and elucidating native-like structures of protein dimers. Our method is based on geometric hashing to ensure the feasibility of searching the combined conformational space of dimeric structures. The search space is narrowed by focusing the sought rigid-body transformations around surface areas with evolutionary-conserved amino-acids. Recent analysis of protein assemblies reveals that many functional interfaces are significantly conserved throughout evolution. We test our method on a broad list of sixteen diverse protein dimers and compare the structures found to have lowest lRMSD to the known native dimeric structures to those reported by other groups. Our results show that focusing the search around evolutionary-conserved interfaces results in lower lRMSDs.
分子组装的结构建模是理解分子相互作用和生物功能的核心。我们提出了一种对接蛋白质分子和阐明蛋白质二聚体的天然样结构的方法。该方法基于几何哈希,保证了二聚体结构组合构象空间搜索的可行性。通过将寻找的刚体转换集中在具有进化保守氨基酸的表面区域周围,可以缩小搜索空间。最近对蛋白质组装的分析表明,许多功能界面在整个进化过程中都是显着保守的。我们在16种不同蛋白质二聚体的广泛列表上测试了我们的方法,并将发现的具有最低lRMSD的结构与已知的天然二聚体结构与其他小组报道的结构进行了比较。我们的研究结果表明,围绕进化保守的界面进行搜索可以降低lrmsd。
{"title":"Protein docking with information on evolutionary conserved interfaces","authors":"I. Hashmi, Bahar Akbal-Delibas, Nurit Haspel, Amarda Shehu","doi":"10.1109/BIBMW.2011.6112399","DOIUrl":"https://doi.org/10.1109/BIBMW.2011.6112399","url":null,"abstract":"Structural modeling of molecular assemblies lies at the heart of understanding molecular interactions and biological function. We present a method for docking protein molecules and elucidating native-like structures of protein dimers. Our method is based on geometric hashing to ensure the feasibility of searching the combined conformational space of dimeric structures. The search space is narrowed by focusing the sought rigid-body transformations around surface areas with evolutionary-conserved amino-acids. Recent analysis of protein assemblies reveals that many functional interfaces are significantly conserved throughout evolution. We test our method on a broad list of sixteen diverse protein dimers and compare the structures found to have lowest lRMSD to the known native dimeric structures to those reported by other groups. Our results show that focusing the search around evolutionary-conserved interfaces results in lower lRMSDs.","PeriodicalId":6345,"journal":{"name":"2011 IEEE International Conference on Bioinformatics and Biomedicine Workshops (BIBMW)","volume":"10 1","pages":"358-365"},"PeriodicalIF":0.0,"publicationDate":"2011-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73634890","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Probabilistic Signal Network Models from Multiple Replicates of Sparse Time-Course Data 稀疏时程数据多重重复的概率信号网络模型
Kristopher L. Patton, D. J. John, J. Norris
has sparse data with the number of time points being less than the number of proteins. Usually, each replicate is modeled separately, however, here all the information in each of the replicates is used to make a composite inference about the signal network. The composite inference comes from combining well structured Bayesian probabilistic modeling with a multi-faceted Markov Chain Monte Carlo algorithm. Based on simulations which investigate many different types of network interactions and experimental variabilities, the composite examination uncovers many important relationships within the network.
具有时间点数量小于蛋白质数量的稀疏数据。通常,每个复制都是单独建模的,但是在这里,每个复制中的所有信息都用于对信号网络进行复合推断。复合推理是将结构良好的贝叶斯概率模型与多面马尔可夫链蒙特卡罗算法相结合的结果。基于模拟研究了许多不同类型的网络相互作用和实验变量,复合检验揭示了网络中许多重要的关系。
{"title":"Probabilistic Signal Network Models from Multiple Replicates of Sparse Time-Course Data","authors":"Kristopher L. Patton, D. J. John, J. Norris","doi":"10.1109/BIBM.2011.78","DOIUrl":"https://doi.org/10.1109/BIBM.2011.78","url":null,"abstract":"has sparse data with the number of time points being less than the number of proteins. Usually, each replicate is modeled separately, however, here all the information in each of the replicates is used to make a composite inference about the signal network. The composite inference comes from combining well structured Bayesian probabilistic modeling with a multi-faceted Markov Chain Monte Carlo algorithm. Based on simulations which investigate many different types of network interactions and experimental variabilities, the composite examination uncovers many important relationships within the network.","PeriodicalId":6345,"journal":{"name":"2011 IEEE International Conference on Bioinformatics and Biomedicine Workshops (BIBMW)","volume":"37 1","pages":"450-455"},"PeriodicalIF":0.0,"publicationDate":"2011-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74039993","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Identifying medicine bottles by incorporating RFID and video analysis 通过结合RFID和视频分析来识别药瓶
Faiz M. Hasanuzzaman, Yingli Tian, Qingshan Liu
In this paper, we present a new framework of identifying medicine bottles using a combination of a video camera and Radio Frequency Identification (RFID) sensors for applications of monitoring the elderly's activities of daily living (ADLs) at home. RFID tags are attached to medicine bottles and first detected by RFID readers from the antenna. However, the RFID detection can only detect RFID tags within a certain range of the antenna. Once a medicine bottle is moved out of the range of the RFID antenna, a camera will be activated to continue detecting and tracking the medicine bottle for further action analysis based on moving object detection and color model of the medicine bottle. The experimental results demonstrate 100% detection accuracy for identifying medicine bottles.
在本文中,我们提出了一种使用摄像机和射频识别(RFID)传感器的组合来识别药瓶的新框架,用于监测老年人的日常生活活动(adl)。RFID标签贴在药瓶上,首先由天线上的RFID读取器检测到。但是,RFID检测只能检测到天线一定范围内的RFID标签。一旦药瓶移动到RFID天线的范围之外,就会激活摄像头,继续检测和跟踪药瓶,并根据药瓶的运动物体检测和颜色模型进行进一步的动作分析。实验结果表明,该方法对药瓶的检测准确率为100%。
{"title":"Identifying medicine bottles by incorporating RFID and video analysis","authors":"Faiz M. Hasanuzzaman, Yingli Tian, Qingshan Liu","doi":"10.1109/BIBMW.2011.6112424","DOIUrl":"https://doi.org/10.1109/BIBMW.2011.6112424","url":null,"abstract":"In this paper, we present a new framework of identifying medicine bottles using a combination of a video camera and Radio Frequency Identification (RFID) sensors for applications of monitoring the elderly's activities of daily living (ADLs) at home. RFID tags are attached to medicine bottles and first detected by RFID readers from the antenna. However, the RFID detection can only detect RFID tags within a certain range of the antenna. Once a medicine bottle is moved out of the range of the RFID antenna, a camera will be activated to continue detecting and tracking the medicine bottle for further action analysis based on moving object detection and color model of the medicine bottle. The experimental results demonstrate 100% detection accuracy for identifying medicine bottles.","PeriodicalId":6345,"journal":{"name":"2011 IEEE International Conference on Bioinformatics and Biomedicine Workshops (BIBMW)","volume":"11 1","pages":"528-529"},"PeriodicalIF":0.0,"publicationDate":"2011-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73242657","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Populating Local Minima in the Protein Conformational Space 填充蛋白质构象空间的局部极小值
Brian S. Olson, Amarda Shehu
Protein Modeling conceptualizes the protein energy landscape as a funnel with the native structure at the low-energy minimum. Current protein structure prediction algorithms seek the global minimum by searching for low-energy conformations in the hope that some of these reside in local minima near the native structure. The search techniques employed, however, fail to explicitly model these local minima. This work proposes a memetic algorithm which combines methods from evolutionary computation with cutting-edge structure prediction protocols. The Protein Local Optima Walk (PLOW) algorithm proposed here explores the space of local minima by explicitly projecting each move in the conformation space to a nearby local minimum. This allows PLOW to jump over local energy barriers and more effectively sample near-native conformations. Analysis across a broad range of proteins shows that PLOW outperforms an MMC-based method and compares favorably against other published abini to structure prediction algorithms.
蛋白质建模将蛋白质能量景观概念化为具有低能量最小值的天然结构的漏斗。目前的蛋白质结构预测算法通过寻找低能构象来寻求全局最小值,并希望其中一些位于本地结构附近的局部最小值。然而,所采用的搜索技术不能明确地模拟这些局部最小值。本文提出了一种模因算法,该算法将进化计算方法与前沿结构预测协议相结合。本文提出的蛋白质局部最优行走(PLOW)算法通过显式地将构象空间中的每次移动映射到附近的局部最小值来探索局部最小值空间。这使得PLOW能够跳过局部能量障碍,更有效地采样接近原生的构象。对多种蛋白质的分析表明,PLOW优于基于mmc的方法,并且与其他已发表的abini - to - structure预测算法相比具有优势。
{"title":"Populating Local Minima in the Protein Conformational Space","authors":"Brian S. Olson, Amarda Shehu","doi":"10.1109/BIBM.2011.22","DOIUrl":"https://doi.org/10.1109/BIBM.2011.22","url":null,"abstract":"Protein Modeling conceptualizes the protein energy landscape as a funnel with the native structure at the low-energy minimum. Current protein structure prediction algorithms seek the global minimum by searching for low-energy conformations in the hope that some of these reside in local minima near the native structure. The search techniques employed, however, fail to explicitly model these local minima. This work proposes a memetic algorithm which combines methods from evolutionary computation with cutting-edge structure prediction protocols. The Protein Local Optima Walk (PLOW) algorithm proposed here explores the space of local minima by explicitly projecting each move in the conformation space to a nearby local minimum. This allows PLOW to jump over local energy barriers and more effectively sample near-native conformations. Analysis across a broad range of proteins shows that PLOW outperforms an MMC-based method and compares favorably against other published abini to structure prediction algorithms.","PeriodicalId":6345,"journal":{"name":"2011 IEEE International Conference on Bioinformatics and Biomedicine Workshops (BIBMW)","volume":"93 1","pages":"114-117"},"PeriodicalIF":0.0,"publicationDate":"2011-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77205213","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 16
Literature based Bayesian analysis of gene expression data 基于文献的基因表达数据贝叶斯分析
Lijing Xu, R. Homayouni, E. George
Recent research has focused on incorporating biological function and pathway information into the analysis of gene expression data, partly as a means of compensating for insufficient experimental replications, low signal to noise, lack of reproducibility and/or multiple testing confounds. A Bayesian approach seems to be ideal for incorporating functional information into gene expression data analysis. In this study, we tested the feasibility of using literature derived gene relationships in a Bayesian model to analyze gene expression data. Prior distributions were constructed based on gene associations derived from the biomedical literature using Latent Semantic Indexing (LSI). The LSI model was built using more than 1 million Medline abstracts corresponding to 22,000 human and mouse genes. A key advantage of LSI is that both explicit and implicit gene relationships can be derived from the literature. Gene neighborhoods were determined using latent Gaussian Markov random fields and logistic transformation of the latent variables. We tested the procedure on a microarray dataset for interferon-stimulated genes in mouse embryonic fibroblasts. By integrating functional information from literature, Bayesian approach identified relevant genes that previously did not meet the 0.05 significance level. In comparison to a standard mixture model, spatial mixture model has more power for identifying direct and indirect interferon regulated genes. The spatial model enhanced the ranks of some genes which are known to be affected by interferon treatment, such as Nmi (NMI N-myc and STAT interactor) and ifi35 (interferon-induced protein 35). It also identified some genes that previously were ignored because of the marginal p-values, such as dpysl2, map2k1, msn, Psck5, and Il6st. Interestingly, these genes appear to be indirectly related to interferon treatment. In summary, we show that our procedure increases statistical power and produces more biologically meaningful gene lists. These results suggest that Bayesian methods which incorporate functional information from the literature may improve analysis of gene expression data.
最近的研究集中在将生物学功能和途径信息纳入基因表达数据的分析中,部分作为补偿实验重复不足、低信号噪声、缺乏可重复性和/或多重测试混淆的手段。贝叶斯方法似乎是将功能信息纳入基因表达数据分析的理想方法。在本研究中,我们测试了在贝叶斯模型中使用文献导出的基因关系来分析基因表达数据的可行性。使用潜在语义索引(LSI)构建基于生物医学文献中基因关联的先验分布。LSI模型是使用超过100万份Medline摘要建立的,这些摘要对应22,000个人类和小鼠基因。LSI的一个关键优势是显性和隐性基因关系都可以从文献中得到。利用隐高斯马尔科夫随机场和隐变量的逻辑变换确定基因邻域。我们在小鼠胚胎成纤维细胞中干扰素刺激基因的微阵列数据集上测试了该程序。通过整合文献中的功能信息,贝叶斯方法识别出之前未达到0.05显著性水平的相关基因。与标准混合模型相比,空间混合模型在识别干扰素直接和间接调控基因方面具有更强的能力。空间模型提高了一些已知受干扰素治疗影响的基因的等级,如Nmi (Nmi N-myc和STAT相互作用因子)和ifi35(干扰素诱导蛋白35)。它还发现了一些以前由于边际p值而被忽略的基因,如dpysl2、map2k1、msn、Psck5和Il6st。有趣的是,这些基因似乎与干扰素治疗间接相关。总之,我们表明我们的程序提高了统计能力,并产生了更有生物学意义的基因列表。这些结果表明,贝叶斯方法结合了文献中的功能信息,可以改善基因表达数据的分析。
{"title":"Literature based Bayesian analysis of gene expression data","authors":"Lijing Xu, R. Homayouni, E. George","doi":"10.1109/BIBMW.2011.6112549","DOIUrl":"https://doi.org/10.1109/BIBMW.2011.6112549","url":null,"abstract":"Recent research has focused on incorporating biological function and pathway information into the analysis of gene expression data, partly as a means of compensating for insufficient experimental replications, low signal to noise, lack of reproducibility and/or multiple testing confounds. A Bayesian approach seems to be ideal for incorporating functional information into gene expression data analysis. In this study, we tested the feasibility of using literature derived gene relationships in a Bayesian model to analyze gene expression data. Prior distributions were constructed based on gene associations derived from the biomedical literature using Latent Semantic Indexing (LSI). The LSI model was built using more than 1 million Medline abstracts corresponding to 22,000 human and mouse genes. A key advantage of LSI is that both explicit and implicit gene relationships can be derived from the literature. Gene neighborhoods were determined using latent Gaussian Markov random fields and logistic transformation of the latent variables. We tested the procedure on a microarray dataset for interferon-stimulated genes in mouse embryonic fibroblasts. By integrating functional information from literature, Bayesian approach identified relevant genes that previously did not meet the 0.05 significance level. In comparison to a standard mixture model, spatial mixture model has more power for identifying direct and indirect interferon regulated genes. The spatial model enhanced the ranks of some genes which are known to be affected by interferon treatment, such as Nmi (NMI N-myc and STAT interactor) and ifi35 (interferon-induced protein 35). It also identified some genes that previously were ignored because of the marginal p-values, such as dpysl2, map2k1, msn, Psck5, and Il6st. Interestingly, these genes appear to be indirectly related to interferon treatment. In summary, we show that our procedure increases statistical power and produces more biologically meaningful gene lists. These results suggest that Bayesian methods which incorporate functional information from the literature may improve analysis of gene expression data.","PeriodicalId":6345,"journal":{"name":"2011 IEEE International Conference on Bioinformatics and Biomedicine Workshops (BIBMW)","volume":"14 1","pages":"1032-1032"},"PeriodicalIF":0.0,"publicationDate":"2011-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74476114","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A chi-square test for detecting multiple joint genetic variants in genome-wide association studies 在全基因组关联研究中检测多个联合遗传变异的卡方检验
Iksoo Huh, Sohee Oh, T. Park
As a result of genotyping technologies, genome-wide association studies (GWAS) have been widely used to identify genetic variants associated with common complex traits. While most GWAS have focused on associations with single genetic variants, the investigation of multiple joint genetic variants is essential for understanding genetic architecture of complex traits because common complex traits are associated with multiple genetic variants. However, it is not easy to conduct the multiple joint genetic variants analysis and to identify high order interactions using a number of genetic variants in GWAS. In this study, we propose a stepwise method based on the Chi-square test in order to identify causal joint multiple genetic variants in GWAS. Through simulation studies, we examine the properties of the stepwise method and then apply the proposed method to a GWA data for detecting joint multiple genetic variants for age-related macular degeneration.
由于基因分型技术的发展,全基因组关联研究(GWAS)已被广泛用于鉴定与常见复杂性状相关的遗传变异。虽然大多数GWAS都集中在与单一遗传变异的关联上,但由于常见的复杂性状与多个遗传变异相关,因此对多个联合遗传变异的研究对于理解复杂性状的遗传结构至关重要。然而,在GWAS中进行多联合遗传变异分析和利用多个遗传变异识别高阶相互作用并不容易。在本研究中,我们提出了一种基于卡方检验的逐步方法,以确定GWAS的因果联合多遗传变异。通过仿真研究,我们检验了逐步方法的特性,然后将所提出的方法应用于GWA数据,用于检测关节多遗传变异的年龄相关性黄斑变性。
{"title":"A chi-square test for detecting multiple joint genetic variants in genome-wide association studies","authors":"Iksoo Huh, Sohee Oh, T. Park","doi":"10.1109/BIBMW.2011.6112457","DOIUrl":"https://doi.org/10.1109/BIBMW.2011.6112457","url":null,"abstract":"As a result of genotyping technologies, genome-wide association studies (GWAS) have been widely used to identify genetic variants associated with common complex traits. While most GWAS have focused on associations with single genetic variants, the investigation of multiple joint genetic variants is essential for understanding genetic architecture of complex traits because common complex traits are associated with multiple genetic variants. However, it is not easy to conduct the multiple joint genetic variants analysis and to identify high order interactions using a number of genetic variants in GWAS. In this study, we propose a stepwise method based on the Chi-square test in order to identify causal joint multiple genetic variants in GWAS. Through simulation studies, we examine the properties of the stepwise method and then apply the proposed method to a GWA data for detecting joint multiple genetic variants for age-related macular degeneration.","PeriodicalId":6345,"journal":{"name":"2011 IEEE International Conference on Bioinformatics and Biomedicine Workshops (BIBMW)","volume":"9 1","pages":"708-713"},"PeriodicalIF":0.0,"publicationDate":"2011-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76189876","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Module detection for bacteria based on spectral clustering of protein-protein functional association networks 基于蛋白质-蛋白质功能关联网络光谱聚类的细菌模块检测
Hongwei Wu, Yaming Lin, Fun Choi Chan, R. Alba-Flores
Network analysis-based module detection has significant implications in many fields. In cellular/ molecular biology, module detection based on analyses of metabolic/regulatory networks will not only help us understand more about the function and evolution of cellular machinery of an organism, but will also provide tractable contextual information for potential drug targets and facilitate improvements in drug designs. We here present our preliminary study on the module detection for bacteria based on the spectral clustering of the protein-protein functional association networks. We first examined how the parameter of the spectral clustering algorithm (i.e., the number of clusters) affects our module detection results, and demonstrated that when the number of clusters was set too small or too large the resulting module collection deteriorate in terms of gene coverage and intra-module association. We then compared our predicted modules against the randomly generated modules, and demonstrated that our modules (i) have a higher ratio of the intra-module to inter-module gene-gene functional association scores and (ii) can better capture the modularization information inherent in the experimentally verified modules. Finally we compared the module collections of seven bacterial organisms, and observed that modules related to membrane transport and cell motility are among those that are conserved among multiple organisms. Because it is desirable from both scientific and technical points of view to study functional modules at various resolution levels, we believe that the spectral clustering algorithm, with the flexibility rendered by different parameter settings, provides an appropriate solution in terms of capturing the modularization properties of networks and computational affordability.
基于网络分析的模块检测在许多领域具有重要意义。在细胞/分子生物学中,基于代谢/调控网络分析的模块检测不仅有助于我们更多地了解生物体细胞机制的功能和进化,而且还将为潜在的药物靶点提供可处理的上下文信息,并促进药物设计的改进。本文提出了基于蛋白质-蛋白质功能关联网络光谱聚类的细菌模块检测方法的初步研究。我们首先研究了谱聚类算法的参数(即聚类数量)如何影响我们的模块检测结果,并证明当聚类数量设置得太小或太大时,所得到的模块收集在基因覆盖和模块内关联方面会恶化。然后,我们将我们的预测模块与随机生成的模块进行了比较,并证明我们的模块(i)具有更高的模块内与模块间基因-基因功能关联评分比例,并且(ii)可以更好地捕获实验验证模块中固有的模块化信息。最后,我们比较了7种细菌的模块集合,并观察到与膜运输和细胞运动相关的模块在多种生物中是保守的。由于从科学和技术的角度来看,研究不同分辨率的功能模块是可取的,我们认为频谱聚类算法具有不同参数设置所带来的灵活性,在捕获网络的模块化特性和计算负担能力方面提供了合适的解决方案。
{"title":"Module detection for bacteria based on spectral clustering of protein-protein functional association networks","authors":"Hongwei Wu, Yaming Lin, Fun Choi Chan, R. Alba-Flores","doi":"10.1109/BIBMW.2011.6112415","DOIUrl":"https://doi.org/10.1109/BIBMW.2011.6112415","url":null,"abstract":"Network analysis-based module detection has significant implications in many fields. In cellular/ molecular biology, module detection based on analyses of metabolic/regulatory networks will not only help us understand more about the function and evolution of cellular machinery of an organism, but will also provide tractable contextual information for potential drug targets and facilitate improvements in drug designs. We here present our preliminary study on the module detection for bacteria based on the spectral clustering of the protein-protein functional association networks. We first examined how the parameter of the spectral clustering algorithm (i.e., the number of clusters) affects our module detection results, and demonstrated that when the number of clusters was set too small or too large the resulting module collection deteriorate in terms of gene coverage and intra-module association. We then compared our predicted modules against the randomly generated modules, and demonstrated that our modules (i) have a higher ratio of the intra-module to inter-module gene-gene functional association scores and (ii) can better capture the modularization information inherent in the experimentally verified modules. Finally we compared the module collections of seven bacterial organisms, and observed that modules related to membrane transport and cell motility are among those that are conserved among multiple organisms. Because it is desirable from both scientific and technical points of view to study functional modules at various resolution levels, we believe that the spectral clustering algorithm, with the flexibility rendered by different parameter settings, provides an appropriate solution in terms of capturing the modularization properties of networks and computational affordability.","PeriodicalId":6345,"journal":{"name":"2011 IEEE International Conference on Bioinformatics and Biomedicine Workshops (BIBMW)","volume":"12 1","pages":"465-472"},"PeriodicalIF":0.0,"publicationDate":"2011-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76261021","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A hybrid approach for hemorrhage segmentation in pelvic CT scans 骨盆CT扫描出血分割的混合方法
Pavani Davuluri, Jie Wu, Ashwin Belle, Charles Cockrell, Yang Tang, Kevin Ward, K. Najarian, R. H. Hargraves
Hemorrhage is the leading cause of death in patients with severe pelvic fractures within the first 24 hours after the injury. Hence, it is vital for physicians to quickly identify hemorrhage and assess bleeding severity. However, it is rather time consuming for physicians to evaluate all the CT images. Therefore, an automated hemorrhage segmentation system is needed to assist physicians. This paper proposes a hybrid approach for hemorrhage segmentation from pelvic CT scans. This approach utilizes region growing technique with integration of contrast information from the previous and subsequent slices. The results show that the method is able to segment hemorrhage well with acceptable results. Hemorrhage volume is also determined. A statistical t-test is conducted to determine if the calculated hemorrhage volume using the proposed method is significantly different from the manually detected volume.
出血是严重骨盆骨折患者在受伤后24小时内死亡的主要原因。因此,对医生来说,快速识别出血和评估出血严重程度是至关重要的。然而,医生对所有CT图像进行评估是相当耗时的。因此,需要一个自动出血分割系统来辅助医生。本文提出了一种骨盆CT扫描出血分割的混合方法。该方法利用区域增长技术,整合了前后切片的对比度信息。结果表明,该方法能较好地分割出血,效果良好。同时测定出血量。采用统计t检验来确定采用所提出的方法计算的出血量是否与人工检测的出血量有显著差异。
{"title":"A hybrid approach for hemorrhage segmentation in pelvic CT scans","authors":"Pavani Davuluri, Jie Wu, Ashwin Belle, Charles Cockrell, Yang Tang, Kevin Ward, K. Najarian, R. H. Hargraves","doi":"10.1109/BIBMW.2011.6112428","DOIUrl":"https://doi.org/10.1109/BIBMW.2011.6112428","url":null,"abstract":"Hemorrhage is the leading cause of death in patients with severe pelvic fractures within the first 24 hours after the injury. Hence, it is vital for physicians to quickly identify hemorrhage and assess bleeding severity. However, it is rather time consuming for physicians to evaluate all the CT images. Therefore, an automated hemorrhage segmentation system is needed to assist physicians. This paper proposes a hybrid approach for hemorrhage segmentation from pelvic CT scans. This approach utilizes region growing technique with integration of contrast information from the previous and subsequent slices. The results show that the method is able to segment hemorrhage well with acceptable results. Hemorrhage volume is also determined. A statistical t-test is conducted to determine if the calculated hemorrhage volume using the proposed method is significantly different from the manually detected volume.","PeriodicalId":6345,"journal":{"name":"2011 IEEE International Conference on Bioinformatics and Biomedicine Workshops (BIBMW)","volume":"195 1","pages":"548-554"},"PeriodicalIF":0.0,"publicationDate":"2011-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76924767","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
期刊
2011 IEEE International Conference on Bioinformatics and Biomedicine Workshops (BIBMW)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1