Empowering standardization of cancer vaccines through ontology: enhanced modeling and data analysis.

IF 1.6 3区 工程技术 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Journal of Biomedical Semantics Pub Date : 2024-06-19 DOI:10.1186/s13326-024-00312-3
Jie Zheng, Xingxian Li, Anna Maria Masci, Hayleigh Kahn, Anthony Huffman, Eliyas Asfaw, Yuanyi Pan, Jinjing Guo, Virginia He, Justin Song, Andrey I Seleznev, Asiyah Yu Lin, Yongqun He
{"title":"Empowering standardization of cancer vaccines through ontology: enhanced modeling and data analysis.","authors":"Jie Zheng, Xingxian Li, Anna Maria Masci, Hayleigh Kahn, Anthony Huffman, Eliyas Asfaw, Yuanyi Pan, Jinjing Guo, Virginia He, Justin Song, Andrey I Seleznev, Asiyah Yu Lin, Yongqun He","doi":"10.1186/s13326-024-00312-3","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>The exploration of cancer vaccines has yielded a multitude of studies, resulting in a diverse collection of information. The heterogeneity of cancer vaccine data significantly impedes effective integration and analysis. While CanVaxKB serves as a pioneering database for over 670 manually annotated cancer vaccines, it is important to distinguish that a database, on its own, does not offer the structured relationships and standardized definitions found in an ontology. Recognizing this, we expanded the Vaccine Ontology (VO) to include those cancer vaccines present in CanVaxKB that were not initially covered, enhancing VO's capacity to systematically define and interrelate cancer vaccines.</p><p><strong>Results: </strong>An ontology design pattern (ODP) was first developed and applied to semantically represent various cancer vaccines, capturing their associated entities and relations. By applying the ODP, we generated a cancer vaccine template in a tabular format and converted it into the RDF/OWL format for generation of cancer vaccine terms in the VO. '12MP vaccine' was used as an example of cancer vaccines to demonstrate the application of the ODP. VO also reuses reference ontology terms to represent entities such as cancer diseases and vaccine hosts. Description Logic (DL) and SPARQL query scripts were developed and used to query for cancer vaccines based on different vaccine's features and to demonstrate the versatility of the VO representation. Additionally, ontological modeling was applied to illustrate cancer vaccine related concepts and studies for in-depth cancer vaccine analysis. A cancer vaccine-specific VO view, referred to as \"CVO,\" was generated, and it contains 928 classes including 704 cancer vaccines. The CVO OWL file is publicly available on: http://purl.obolibrary.org/obo/vo/cvo.owl , for sharing and applications.</p><p><strong>Conclusion: </strong>To facilitate the standardization, integration, and analysis of cancer vaccine data, we expanded the Vaccine Ontology (VO) to systematically model and represent cancer vaccines. We also developed a pipeline to automate the inclusion of cancer vaccines and associated terms in the VO. This not only enriches the data's standardization and integration, but also leverages ontological modeling to deepen the analysis of cancer vaccine information, maximizing benefits for researchers and clinicians.</p><p><strong>Availability: </strong>The VO-cancer GitHub website is: https://github.com/vaccineontology/VO/tree/master/CVO .</p>","PeriodicalId":15055,"journal":{"name":"Journal of Biomedical Semantics","volume":"15 1","pages":"12"},"PeriodicalIF":1.6000,"publicationDate":"2024-06-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11186274/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Biomedical Semantics","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1186/s13326-024-00312-3","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MATHEMATICAL & COMPUTATIONAL BIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Background: The exploration of cancer vaccines has yielded a multitude of studies, resulting in a diverse collection of information. The heterogeneity of cancer vaccine data significantly impedes effective integration and analysis. While CanVaxKB serves as a pioneering database for over 670 manually annotated cancer vaccines, it is important to distinguish that a database, on its own, does not offer the structured relationships and standardized definitions found in an ontology. Recognizing this, we expanded the Vaccine Ontology (VO) to include those cancer vaccines present in CanVaxKB that were not initially covered, enhancing VO's capacity to systematically define and interrelate cancer vaccines.

Results: An ontology design pattern (ODP) was first developed and applied to semantically represent various cancer vaccines, capturing their associated entities and relations. By applying the ODP, we generated a cancer vaccine template in a tabular format and converted it into the RDF/OWL format for generation of cancer vaccine terms in the VO. '12MP vaccine' was used as an example of cancer vaccines to demonstrate the application of the ODP. VO also reuses reference ontology terms to represent entities such as cancer diseases and vaccine hosts. Description Logic (DL) and SPARQL query scripts were developed and used to query for cancer vaccines based on different vaccine's features and to demonstrate the versatility of the VO representation. Additionally, ontological modeling was applied to illustrate cancer vaccine related concepts and studies for in-depth cancer vaccine analysis. A cancer vaccine-specific VO view, referred to as "CVO," was generated, and it contains 928 classes including 704 cancer vaccines. The CVO OWL file is publicly available on: http://purl.obolibrary.org/obo/vo/cvo.owl , for sharing and applications.

Conclusion: To facilitate the standardization, integration, and analysis of cancer vaccine data, we expanded the Vaccine Ontology (VO) to systematically model and represent cancer vaccines. We also developed a pipeline to automate the inclusion of cancer vaccines and associated terms in the VO. This not only enriches the data's standardization and integration, but also leverages ontological modeling to deepen the analysis of cancer vaccine information, maximizing benefits for researchers and clinicians.

Availability: The VO-cancer GitHub website is: https://github.com/vaccineontology/VO/tree/master/CVO .

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
通过本体论促进癌症疫苗标准化:增强建模和数据分析。
背景:对癌症疫苗的探索产生了大量的研究,从而收集了各种各样的信息。癌症疫苗数据的异质性严重阻碍了有效的整合与分析。尽管 CanVaxKB 是一个包含 670 多种人工注释癌症疫苗的开创性数据库,但必须区分的是,数据库本身并不能提供本体论中的结构化关系和标准化定义。认识到这一点后,我们对疫苗本体(VO)进行了扩展,纳入了 CanVaxKB 最初未涵盖的癌症疫苗,从而增强了 VO 系统定义和相互关联癌症疫苗的能力:我们首先开发了一种本体设计模式(ODP),并将其用于从语义上表示各种癌症疫苗,捕捉其相关实体和关系。通过应用 ODP,我们以表格格式生成了癌症疫苗模板,并将其转换为 RDF/OWL 格式,以便在 VO 中生成癌症疫苗术语。我们以 "12MP 疫苗 "为例演示了 ODP 的应用。VO 还重复使用参考本体术语来表示癌症疾病和疫苗宿主等实体。我们开发了描述逻辑(DL)和 SPARQL 查询脚本,用于根据不同疫苗的特征查询癌症疫苗,以展示 VO 表示法的多功能性。此外,还应用本体论建模来说明癌症疫苗的相关概念和研究,以便对癌症疫苗进行深入分析。生成的癌症疫苗专用 VO 视图被称为 "CVO",包含 928 个类,其中包括 704 种癌症疫苗。CVO OWL 文件可在 http://purl.obolibrary.org/obo/vo/cvo.owl 上公开获取,以供共享和应用:为了促进癌症疫苗数据的标准化、集成和分析,我们扩展了疫苗本体(VO),以系统地建模和表示癌症疫苗。我们还开发了一个管道,可自动将癌症疫苗和相关术语纳入 VO。这不仅丰富了数据的标准化和集成,还利用本体论建模加深了对癌症疫苗信息的分析,为研究人员和临床医生带来了最大益处:VO-cancer GitHub 网站:https://github.com/vaccineontology/VO/tree/master/CVO 。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Journal of Biomedical Semantics
Journal of Biomedical Semantics MATHEMATICAL & COMPUTATIONAL BIOLOGY-
CiteScore
4.20
自引率
5.30%
发文量
28
审稿时长
30 weeks
期刊介绍: Journal of Biomedical Semantics addresses issues of semantic enrichment and semantic processing in the biomedical domain. The scope of the journal covers two main areas: Infrastructure for biomedical semantics: focusing on semantic resources and repositories, meta-data management and resource description, knowledge representation and semantic frameworks, the Biomedical Semantic Web, and semantic interoperability. Semantic mining, annotation, and analysis: focusing on approaches and applications of semantic resources; and tools for investigation, reasoning, prediction, and discoveries in biomedicine.
期刊最新文献
Dynamic Retrieval Augmented Generation of Ontologies using Artificial Intelligence (DRAGON-AI). MeSH2Matrix: combining MeSH keywords and machine learning for biomedical relation classification based on PubMed. Annotation of epilepsy clinic letters for natural language processing An extensible and unifying approach to retrospective clinical data modeling: the BrainTeaser Ontology. Concretizing plan specifications as realizables within the OBO foundry.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1