C. Li, H. Gurulingappa, P. Karmalkar, J. Raab, Aastha Vij, Gerard Megaro, C. Henke
{"title":"Automate Clinical Evidence Synthesis by Linking Trials to Publications with Text Analytics","authors":"C. Li, H. Gurulingappa, P. Karmalkar, J. Raab, Aastha Vij, Gerard Megaro, C. Henke","doi":"10.1145/3459104.3459168","DOIUrl":null,"url":null,"abstract":"Randomized clinical trials are the core data source for meta- analyses and relative efficacy analyses. As of today, there are missing links between large portions of trials registered in clinical trial registries and their reported outcomes published in scientific journals. This missing citation information makes it difficult to identify all relevant publications for evidence synthesis and decision support. Therefore, we propose a novel natural language processing-based system to establish links between clinical trials and their published outcomes in literature. Different approaches leveraging information retrieval and machine learning with embedding features were systematically developed and evaluated. Results show that shallow machine learning approach with embeddings provide promising results indicating the value it can add to circumvent the limitations of manual search and analyses.","PeriodicalId":142284,"journal":{"name":"2021 International Symposium on Electrical, Electronics and Information Engineering","volume":"103 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-02-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Symposium on Electrical, Electronics and Information Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3459104.3459168","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Randomized clinical trials are the core data source for meta- analyses and relative efficacy analyses. As of today, there are missing links between large portions of trials registered in clinical trial registries and their reported outcomes published in scientific journals. This missing citation information makes it difficult to identify all relevant publications for evidence synthesis and decision support. Therefore, we propose a novel natural language processing-based system to establish links between clinical trials and their published outcomes in literature. Different approaches leveraging information retrieval and machine learning with embedding features were systematically developed and evaluated. Results show that shallow machine learning approach with embeddings provide promising results indicating the value it can add to circumvent the limitations of manual search and analyses.