{"title":"BERTopic Modelling with P53 in Ovarian Cancer","authors":"R. O. Oveh, M. Adewunmi, G. Aziken","doi":"10.1109/ITED56637.2022.10051483","DOIUrl":null,"url":null,"abstract":"Ovarian cancer is the cancerous growth that begins in the ovaries. It has been identified as the most common cause of cancer related death around the world. It is known for its complexity and low survival rate due to late diagnosis and ineffective early detection mechanism. The mutation of p53 tumour suppressor gene is prevalent in High Grade Serious Ovarian Cancer (HGSOC). In this paper BERTopic Topic modelling an unsupervised machine learning technique was used to extract the keywords p53 and ovarian cancer from PubMed database using the Entrez Global Query Cross-Database Search System. The resulting data was then processed using the regex approach and the Natural Language Tool Kit (NLTK). The result showed useful insight in p53 ovarian cancer topic areas.","PeriodicalId":246041,"journal":{"name":"2022 5th Information Technology for Education and Development (ITED)","volume":"29 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 5th Information Technology for Education and Development (ITED)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ITED56637.2022.10051483","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
Ovarian cancer is the cancerous growth that begins in the ovaries. It has been identified as the most common cause of cancer related death around the world. It is known for its complexity and low survival rate due to late diagnosis and ineffective early detection mechanism. The mutation of p53 tumour suppressor gene is prevalent in High Grade Serious Ovarian Cancer (HGSOC). In this paper BERTopic Topic modelling an unsupervised machine learning technique was used to extract the keywords p53 and ovarian cancer from PubMed database using the Entrez Global Query Cross-Database Search System. The resulting data was then processed using the regex approach and the Natural Language Tool Kit (NLTK). The result showed useful insight in p53 ovarian cancer topic areas.