利用自然语言处理方法开发大肠癌粪便微生物诊断标记集。

IF 2.3 4区 医学 Q3 BIOTECHNOLOGY & APPLIED MICROBIOLOGY International Journal of Biological Markers Pub Date : 2024-03-01 Epub Date: 2023-12-21 DOI:10.1177/03936155231210881
Houcong Liu, Changpu Song, Jidong Wang, Zhufang Chen, Xiaohong Zhang, Hekai Zhou, Linhong Yao, Dan Chen, Wenhao Gu, Rui-Kun Huang, Bing-Kun Huang, Bo-Wei Han, Jihui Du
{"title":"利用自然语言处理方法开发大肠癌粪便微生物诊断标记集。","authors":"Houcong Liu, Changpu Song, Jidong Wang, Zhufang Chen, Xiaohong Zhang, Hekai Zhou, Linhong Yao, Dan Chen, Wenhao Gu, Rui-Kun Huang, Bing-Kun Huang, Bo-Wei Han, Jihui Du","doi":"10.1177/03936155231210881","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Cancer screening and early detection greatly increase the chances of successful treatment. However, most cancer types lack effective early screening biomarkers. In recent years, natural language processing (NLP)-based text-mining methods have proven effective in searching the scientific literature and identifying promising associations between potential biomarkers and disease, but unfortunately few are widely used.</p><p><strong>Methods: </strong>In this study, we used an NLP-enabled text-mining system, MarkerGenie, to identify potential stool bacterial markers for early detection and screening of colorectal cancer. After filtering markers based on text-mining results, we validated bacterial markers using multiplex digital droplet polymerase chain reaction (ddPCR). Classifiers were built based on ddPCR results, and sensitivity, specificity, and area under the curve (AUC) were used to evaluate the performance.</p><p><strong>Results: </strong>A total of 7 of the 14 bacterial markers showed significantly increased abundance in the stools of colorectal cancer patients. A five-bacteria classifier for colorectal cancer diagnosis was built, and achieved an AUC of 0.852, with a sensitivity of 0.692 and specificity of 0.935. When combined with the fecal immunochemical test (FIT), our classifier achieved an AUC of 0.959 and increased the sensitivity of FIT (0.929 vs. 0.872) at a specificity of 0.900.</p><p><strong>Conclusions: </strong>Our study provides a valuable case example of the use of NLP-based marker mining for biomarker identification.</p>","PeriodicalId":50334,"journal":{"name":"International Journal of Biological Markers","volume":null,"pages":null},"PeriodicalIF":2.3000,"publicationDate":"2024-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Development of fecal microbial diagnostic marker sets of colorectal cancer using natural language processing method.\",\"authors\":\"Houcong Liu, Changpu Song, Jidong Wang, Zhufang Chen, Xiaohong Zhang, Hekai Zhou, Linhong Yao, Dan Chen, Wenhao Gu, Rui-Kun Huang, Bing-Kun Huang, Bo-Wei Han, Jihui Du\",\"doi\":\"10.1177/03936155231210881\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Cancer screening and early detection greatly increase the chances of successful treatment. However, most cancer types lack effective early screening biomarkers. In recent years, natural language processing (NLP)-based text-mining methods have proven effective in searching the scientific literature and identifying promising associations between potential biomarkers and disease, but unfortunately few are widely used.</p><p><strong>Methods: </strong>In this study, we used an NLP-enabled text-mining system, MarkerGenie, to identify potential stool bacterial markers for early detection and screening of colorectal cancer. After filtering markers based on text-mining results, we validated bacterial markers using multiplex digital droplet polymerase chain reaction (ddPCR). Classifiers were built based on ddPCR results, and sensitivity, specificity, and area under the curve (AUC) were used to evaluate the performance.</p><p><strong>Results: </strong>A total of 7 of the 14 bacterial markers showed significantly increased abundance in the stools of colorectal cancer patients. A five-bacteria classifier for colorectal cancer diagnosis was built, and achieved an AUC of 0.852, with a sensitivity of 0.692 and specificity of 0.935. When combined with the fecal immunochemical test (FIT), our classifier achieved an AUC of 0.959 and increased the sensitivity of FIT (0.929 vs. 0.872) at a specificity of 0.900.</p><p><strong>Conclusions: </strong>Our study provides a valuable case example of the use of NLP-based marker mining for biomarker identification.</p>\",\"PeriodicalId\":50334,\"journal\":{\"name\":\"International Journal of Biological Markers\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":2.3000,\"publicationDate\":\"2024-03-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Biological Markers\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1177/03936155231210881\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2023/12/21 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q3\",\"JCRName\":\"BIOTECHNOLOGY & APPLIED MICROBIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Biological Markers","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1177/03936155231210881","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2023/12/21 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"BIOTECHNOLOGY & APPLIED MICROBIOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

背景:癌症筛查和早期检测可大大提高成功治疗的几率。然而,大多数癌症类型都缺乏有效的早期筛查生物标志物。近年来,基于自然语言处理(NLP)的文本挖掘方法已被证明能有效地搜索科学文献并识别潜在生物标志物与疾病之间的关联,但遗憾的是,这些方法很少得到广泛应用:在这项研究中,我们使用了一个支持 NLP 的文本挖掘系统 MarkerGenie 来识别潜在的粪便细菌标记物,以用于结直肠癌的早期检测和筛查。根据文本挖掘结果筛选标记物后,我们使用多重数字液滴聚合酶链反应(ddPCR)验证了细菌标记物。我们根据 ddPCR 结果建立了分类器,并使用灵敏度、特异性和曲线下面积(AUC)来评估其性能:结果:在 14 种细菌标记物中,共有 7 种在结直肠癌患者粪便中的含量明显增加。建立的用于诊断结直肠癌的五种细菌分类器的AUC为0.852,灵敏度为0.692,特异度为0.935。当与粪便免疫化学检验(FIT)相结合时,我们的分类器的AUC达到了0.959,提高了FIT的灵敏度(0.929对0.872),特异性为0.900:我们的研究为基于 NLP 的生物标记物挖掘在生物标记物鉴定中的应用提供了一个有价值的案例。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Development of fecal microbial diagnostic marker sets of colorectal cancer using natural language processing method.

Background: Cancer screening and early detection greatly increase the chances of successful treatment. However, most cancer types lack effective early screening biomarkers. In recent years, natural language processing (NLP)-based text-mining methods have proven effective in searching the scientific literature and identifying promising associations between potential biomarkers and disease, but unfortunately few are widely used.

Methods: In this study, we used an NLP-enabled text-mining system, MarkerGenie, to identify potential stool bacterial markers for early detection and screening of colorectal cancer. After filtering markers based on text-mining results, we validated bacterial markers using multiplex digital droplet polymerase chain reaction (ddPCR). Classifiers were built based on ddPCR results, and sensitivity, specificity, and area under the curve (AUC) were used to evaluate the performance.

Results: A total of 7 of the 14 bacterial markers showed significantly increased abundance in the stools of colorectal cancer patients. A five-bacteria classifier for colorectal cancer diagnosis was built, and achieved an AUC of 0.852, with a sensitivity of 0.692 and specificity of 0.935. When combined with the fecal immunochemical test (FIT), our classifier achieved an AUC of 0.959 and increased the sensitivity of FIT (0.929 vs. 0.872) at a specificity of 0.900.

Conclusions: Our study provides a valuable case example of the use of NLP-based marker mining for biomarker identification.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
International Journal of Biological Markers
International Journal of Biological Markers 医学-生物工程与应用微生物
CiteScore
4.10
自引率
0.00%
发文量
43
期刊介绍: IJBM is an international, online only, peer-reviewed Journal, which publishes original research and critical reviews primarily focused on cancer biomarkers. IJBM targets advanced topics regarding the application of biomarkers in oncology and is dedicated to solid tumors in adult subjects. The clinical scenarios of interests are screening and early diagnosis of cancer, prognostic assessment, prediction of the response to and monitoring of treatment.
期刊最新文献
Circulating exosomal miRNA-451 as an effective diagnostic biomarker and prognostic indicator for multiple myeloma. Inflammatory markers related to survival in breast cancer patients: Peru. Alteration of lncRNA RHPN1-AS1 predicts clinical prognosis and regulates the progression of bladder cancer via modulating miR-485-5p. Serum LINC00339 is a promising biomarker for prognosis prediction of nasopharyngeal carcinoma. Value of the HOTAIR expression assay in predicting therapy target in hepatocellular carcinoma: A meta-analysis and bioinformatics analysis.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1