LitCovid-AGAC: cellular and molecular level annotation data set based on COVID-19.

Q2 Agricultural and Biological Sciences Genomics and Informatics Pub Date : 2021-09-01 Epub Date: 2021-09-30 DOI:10.5808/gi.21013
Sizhuo Ouyang, Yuxing Wang, Kaiyin Zhou, Jingbo Xia
{"title":"LitCovid-AGAC: cellular and molecular level annotation data set based on COVID-19.","authors":"Sizhuo Ouyang, Yuxing Wang, Kaiyin Zhou, Jingbo Xia","doi":"10.5808/gi.21013","DOIUrl":null,"url":null,"abstract":"<p><p>Currently, coronavirus disease 2019 (COVID-19) literature has been increasing dramatically, and the increased text amount make it possible to perform large scale text mining and knowledge discovery. Therefore, curation of these texts becomes a crucial issue for Bio-medical Natural Language Processing (BioNLP) community, so as to retrieve the important information about the mechanism of COVID-19. PubAnnotation is an aligned annotation system which provides an efficient platform for biological curators to upload their annotations or merge other external annotations. Inspired by the integration among multiple useful COVID-19 annotations, we merged three annotations resources to LitCovid data set, and constructed a cross-annotated corpus, LitCovid-AGAC. This corpus consists of 12 labels including Mutation, Species, Gene, Disease from PubTator, GO, CHEBI from OGER, Var, MPA, CPA, NegReg, PosReg, Reg from AGAC, upon 50,018 COVID-19 abstracts in LitCovid. Contain sufficient abundant information being possible to unveil the hidden knowledge in the pathological mechanism of COVID-19.</p>","PeriodicalId":36591,"journal":{"name":"Genomics and Informatics","volume":"19 3","pages":"e23"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8510875/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Genomics and Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5808/gi.21013","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2021/9/30 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"Agricultural and Biological Sciences","Score":null,"Total":0}
引用次数: 0

Abstract

Currently, coronavirus disease 2019 (COVID-19) literature has been increasing dramatically, and the increased text amount make it possible to perform large scale text mining and knowledge discovery. Therefore, curation of these texts becomes a crucial issue for Bio-medical Natural Language Processing (BioNLP) community, so as to retrieve the important information about the mechanism of COVID-19. PubAnnotation is an aligned annotation system which provides an efficient platform for biological curators to upload their annotations or merge other external annotations. Inspired by the integration among multiple useful COVID-19 annotations, we merged three annotations resources to LitCovid data set, and constructed a cross-annotated corpus, LitCovid-AGAC. This corpus consists of 12 labels including Mutation, Species, Gene, Disease from PubTator, GO, CHEBI from OGER, Var, MPA, CPA, NegReg, PosReg, Reg from AGAC, upon 50,018 COVID-19 abstracts in LitCovid. Contain sufficient abundant information being possible to unveil the hidden knowledge in the pathological mechanism of COVID-19.

Abstract Image

Abstract Image

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
LitCovid-AGAC:基于 COVID-19 的细胞和分子水平注释数据集。
目前,冠状病毒病 2019(COVID-19)的文献急剧增加,文本量的增加使得进行大规模文本挖掘和知识发现成为可能。因此,如何对这些文本进行整理,以获取有关 COVID-19 机制的重要信息,成为生物医学自然语言处理(BioNLP)领域的一个关键问题。PubAnnotation 是一个对齐注释系统,它为生物馆员上传注释或合并其他外部注释提供了一个高效的平台。受整合多种有用 COVID-19 注释的启发,我们将三种注释资源合并到 LitCovid 数据集,并构建了交叉注释语料库 LitCovid-AGAC。该语料库包括 PubTator 中的 Mutation、Species、Gene、Disease,OGER 中的 GO、CHEBI,AGAC 中的 Var、MPA、CPA、NegReg、PosReg、Reg 等 12 个标签,以及 LitCovid 中的 50,018 篇 COVID-19 摘要。其中包含足够丰富的信息,有可能揭示 COVID-19 病理机制中隐藏的知识。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Genomics and Informatics
Genomics and Informatics Agricultural and Biological Sciences-Ecology, Evolution, Behavior and Systematics
CiteScore
1.90
自引率
0.00%
发文量
0
审稿时长
12 weeks
期刊最新文献
Gut metagenomic analysis of gastric cancer patients reveals Akkermansia, Gammaproteobacteria, and Veillonella microbiota as potential non-invasive biomarkers COVID-19 progression towards ARDS: a genome wide study reveals host factors underlying critical COVID-19. Bioinformatic analyses reveal the prognostic significance and potential role of ankyrin 3 (ANK3) in kidney renal clear cell carcinoma. Comparison of digital PCR platforms using the molecular marker. Single-cell RNA sequencing identifies distinct transcriptomic signatures between PMA/ionomycin- and αCD3/αCD28-activated primary human T cells.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1