Geervani Koneti, Shyam Sundar Das, J. Bahl, P. Ranjan, N. Ramamurthi
{"title":"Discovering the Knowledge in Unstructured Early Drug Development Data Using NLP and Advanced Analytics","authors":"Geervani Koneti, Shyam Sundar Das, J. Bahl, P. Ranjan, N. Ramamurthi","doi":"10.1109/BIBM55620.2022.9995435","DOIUrl":null,"url":null,"abstract":"Discovering the knowledge in unstructured archived data is an area of active interest in academia and industry, as it offers opportunities to learn, confirm and eventually, address R&D productivity challenges. Our interest in this area prompted us to investigate an Natural Language Processing based approach to extract unstructured Pharmacokinetics (PK) and Pharmacodynamics (PD) data from PK-PD study reports, and perform analytics using in-house developed compartmental and non-compartmental analytics engines. For this purpose, we have developed a dictionary of two thousand twenty-one (2321) PK-PD keywords based on published study reports. Details of our approach and its applications in discovering the knowledge in the unstructured archived early drug development reports, is the subject of this paper.","PeriodicalId":210337,"journal":{"name":"2022 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"8 4","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BIBM55620.2022.9995435","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Discovering the knowledge in unstructured archived data is an area of active interest in academia and industry, as it offers opportunities to learn, confirm and eventually, address R&D productivity challenges. Our interest in this area prompted us to investigate an Natural Language Processing based approach to extract unstructured Pharmacokinetics (PK) and Pharmacodynamics (PD) data from PK-PD study reports, and perform analytics using in-house developed compartmental and non-compartmental analytics engines. For this purpose, we have developed a dictionary of two thousand twenty-one (2321) PK-PD keywords based on published study reports. Details of our approach and its applications in discovering the knowledge in the unstructured archived early drug development reports, is the subject of this paper.