Extracting Gene-Disease Relations from Text to Support Biomarker Discovery

Proceedings of the 2017 International Conference on Digital Health. Pub Date: 2017-07-02. DOI: 10.1145/3079452.3079472

Paul Thompson, S. Ananiadou

Citations: 6

Abstract

The biomedical literature constitutes a rich source of evidence to support the discovery of biomarkers. However, locating evidence in huge volumes of text can be difficult, as typical keyword queries cannot account for the meaning and structure of text. Text mining (TM) methods carry out automated semantic analysis of documents, to facilitate structured searching that can more precisely match users' information needs. We describe our TM approach to the detection of sentence-level associations between genes and diseases, as a first step towards developing a sophisticated search system targeted at locating biomarker evidence in the literature. We vary the sophistication of our detection methodology according to sentence complexity, using either co-occurring mentions of genes and diseases, or linguistic patterns obtained using evidence from approximately 1 million biomedical abstracts. We demonstrate that this method can detect associations more successfully than applying a single technique, with an accuracy that compares highly favourably to related efforts. We also show that the identified relations can complement those detected using alternative approaches.
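The two-tier idea in the abstract (co-occurrence for simpler sentences, linguistic patterns for more complex ones) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the toy `GENES` and `DISEASES` lexicons stand in for real named-entity recognition, the "one gene and one disease" rule stands in for the authors' unstated complexity criterion, and the hand-written `TRIGGER` phrases stand in for patterns derived from the abstract corpus.

```python
import re
from typing import List, Tuple

# Hypothetical toy lexicons; a real system would use named-entity recognition.
GENES = {"BRCA1", "TP53", "EGFR"}
DISEASES = {"breast cancer", "lung cancer", "glioma"}

# Hypothetical trigger phrases standing in for corpus-derived linguistic patterns.
TRIGGER = r"(?:is associated with|is implicated in|is a biomarker for|mutations cause)"


def find_mentions(sentence: str, lexicon: set) -> List[str]:
    """Return lexicon terms found in the sentence (case-insensitive, whole words)."""
    found = []
    for term in sorted(lexicon):
        if re.search(r"\b" + re.escape(term) + r"\b", sentence, flags=re.IGNORECASE):
            found.append(term)
    return found


def extract_relations(sentence: str) -> List[Tuple[str, str, str]]:
    """Return (gene, disease, method) triples for a single sentence."""
    genes = find_mentions(sentence, GENES)
    diseases = find_mentions(sentence, DISEASES)
    if not genes or not diseases:
        return []

    # Assumed complexity test: exactly one gene and one disease counts as "simple",
    # so plain co-occurrence is accepted as evidence of an association.
    if len(genes) == 1 and len(diseases) == 1:
        return [(genes[0], diseases[0], "co-occurrence")]

    # Otherwise require an explicit "<gene> <trigger> <disease>" pattern.
    relations = []
    for gene in genes:
        for disease in diseases:
            pattern = (
                r"\b" + re.escape(gene) + r"\b\s+" + TRIGGER
                + r"\s+" + re.escape(disease) + r"\b"
            )
            if re.search(pattern, sentence, flags=re.IGNORECASE):
                relations.append((gene, disease, "pattern"))
    return relations


if __name__ == "__main__":
    print(extract_relations("BRCA1 is frequently mutated in breast cancer."))
    print(extract_relations(
        "EGFR is associated with lung cancer, whereas TP53 was not examined in glioma."
    ))
```

In the second sentence, co-occurrence alone would pair every gene with every disease, whereas the pattern tier keeps only the pair that is explicitly linked. This is the kind of trade-off the abstract alludes to when it says the detection method is varied with sentence complexity.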