Chenyang Song , Zheni Zeng , Changyao Tian , Kuai Li , Yuan Yao , Suncong Zheng , Zhiyuan Liu , Maosong Sun
{"title":"关系感知深度神经网络让从海量文献中获取生物医学知识更高效","authors":"Chenyang Song , Zheni Zeng , Changyao Tian , Kuai Li , Yuan Yao , Suncong Zheng , Zhiyuan Liu , Maosong Sun","doi":"10.1016/j.aiopen.2024.08.002","DOIUrl":null,"url":null,"abstract":"<div><p>Biomedical knowledge is typically organized in a relational scheme, such as chemical-disease relation, gene-disease relation, and gene-pathway relation. Biomedical scientists heavily rely on search engines to acquire up-to-date relational knowledge from massive biomedical articles. The navigation efficiency of the retrieval process, however, is significantly restricted by keyword matching techniques unaware of the biomedical relations of these keywords in articles. To bridge the gap between existing retrieval techniques and practical access demands for relational knowledge, we present a novel framework, <strong>Bio</strong>medical <strong>R</strong>elation-<strong>A</strong>ware <strong>D</strong>ocument <strong>R</strong>anking (BioRADR), capable of retrieving articles expressing specific relations with respect to the queried entity pair. Based on a deep neural network, BioRADR can be trained from large-scale data automatically annotated via distant supervision, and empirical evaluation reveals that it outperforms the strongest baseline by over 8 points in NDCG@1. We implement an online system (<span><span>http://bioradr.ai.thunlp.org/</span><svg><path></path></svg></span>) based on BioRADR, enabling more efficient relation-oriented retrieval of biomedical articles.</p></div>","PeriodicalId":100068,"journal":{"name":"AI Open","volume":"5 ","pages":"Pages 104-114"},"PeriodicalIF":0.0000,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666651024000123/pdfft?md5=0371d6da4f7cdd9c7adbcc0dac99a13d&pid=1-s2.0-S2666651024000123-main.pdf","citationCount":"0","resultStr":"{\"title\":\"Relation-aware deep neural network enables more efficient biomedical knowledge acquisition from massive literature\",\"authors\":\"Chenyang Song , Zheni Zeng , Changyao Tian , Kuai Li , Yuan Yao , Suncong Zheng , Zhiyuan Liu , Maosong Sun\",\"doi\":\"10.1016/j.aiopen.2024.08.002\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Biomedical knowledge is typically organized in a relational scheme, such as chemical-disease relation, gene-disease relation, and gene-pathway relation. Biomedical scientists heavily rely on search engines to acquire up-to-date relational knowledge from massive biomedical articles. The navigation efficiency of the retrieval process, however, is significantly restricted by keyword matching techniques unaware of the biomedical relations of these keywords in articles. To bridge the gap between existing retrieval techniques and practical access demands for relational knowledge, we present a novel framework, <strong>Bio</strong>medical <strong>R</strong>elation-<strong>A</strong>ware <strong>D</strong>ocument <strong>R</strong>anking (BioRADR), capable of retrieving articles expressing specific relations with respect to the queried entity pair. Based on a deep neural network, BioRADR can be trained from large-scale data automatically annotated via distant supervision, and empirical evaluation reveals that it outperforms the strongest baseline by over 8 points in NDCG@1. We implement an online system (<span><span>http://bioradr.ai.thunlp.org/</span><svg><path></path></svg></span>) based on BioRADR, enabling more efficient relation-oriented retrieval of biomedical articles.</p></div>\",\"PeriodicalId\":100068,\"journal\":{\"name\":\"AI Open\",\"volume\":\"5 \",\"pages\":\"Pages 104-114\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S2666651024000123/pdfft?md5=0371d6da4f7cdd9c7adbcc0dac99a13d&pid=1-s2.0-S2666651024000123-main.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"AI Open\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2666651024000123\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"AI Open","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666651024000123","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Relation-aware deep neural network enables more efficient biomedical knowledge acquisition from massive literature
Biomedical knowledge is typically organized in a relational scheme, such as chemical-disease relation, gene-disease relation, and gene-pathway relation. Biomedical scientists heavily rely on search engines to acquire up-to-date relational knowledge from massive biomedical articles. The navigation efficiency of the retrieval process, however, is significantly restricted by keyword matching techniques unaware of the biomedical relations of these keywords in articles. To bridge the gap between existing retrieval techniques and practical access demands for relational knowledge, we present a novel framework, Biomedical Relation-Aware Document Ranking (BioRADR), capable of retrieving articles expressing specific relations with respect to the queried entity pair. Based on a deep neural network, BioRADR can be trained from large-scale data automatically annotated via distant supervision, and empirical evaluation reveals that it outperforms the strongest baseline by over 8 points in NDCG@1. We implement an online system (http://bioradr.ai.thunlp.org/) based on BioRADR, enabling more efficient relation-oriented retrieval of biomedical articles.