Automatic extraction of gene and protein synonyms from MEDLINE and journal articles.

Proceedings. AMIA Symposium Pub Date : 2002-01-01
Hong Yu, Vasileios Hatzivassiloglou, Carol Friedman, Andrey Rzhetsky, W John Wilbur
{"title":"Automatic extraction of gene and protein synonyms from MEDLINE and journal articles.","authors":"Hong Yu,&nbsp;Vasileios Hatzivassiloglou,&nbsp;Carol Friedman,&nbsp;Andrey Rzhetsky,&nbsp;W John Wilbur","doi":"","DOIUrl":null,"url":null,"abstract":"<p><p>Genes and proteins are often associated with multiple names, and more names are added as new functional or structural information is discovered. Because authors often alternate between these synonyms, information retrieval and extraction benefits from identifying these synonymous names. We have developed a method to extract automatically synonymous gene and protein names from MEDLINE and journal articles. We first identified patterns authors use to list synonymous gene and protein names. We developed SGPE (for synonym extraction of gene and protein names), a software program that recognizes the patterns and extracts from MEDLINE abstracts and full-text journal articles candidate synonymous terms. SGPE then applies a sequence of filters that automatically screen out those terms that are not gene and protein names. We evaluated our method to have an overall precision of 71% on both MEDLINE and journal articles, and 90% precision on the more suitable full-text articles alone</p>","PeriodicalId":79712,"journal":{"name":"Proceedings. AMIA Symposium","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2002-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2244511/pdf/procamiasymp00001-0960.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings. AMIA Symposium","FirstCategoryId":"1085","ListUrlMain":"","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Genes and proteins are often associated with multiple names, and more names are added as new functional or structural information is discovered. Because authors often alternate between these synonyms, information retrieval and extraction benefits from identifying these synonymous names. We have developed a method to extract automatically synonymous gene and protein names from MEDLINE and journal articles. We first identified patterns authors use to list synonymous gene and protein names. We developed SGPE (for synonym extraction of gene and protein names), a software program that recognizes the patterns and extracts from MEDLINE abstracts and full-text journal articles candidate synonymous terms. SGPE then applies a sequence of filters that automatically screen out those terms that are not gene and protein names. We evaluated our method to have an overall precision of 71% on both MEDLINE and journal articles, and 90% precision on the more suitable full-text articles alone

分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
从MEDLINE和期刊文章中自动提取基因和蛋白质同义词。
基因和蛋白质通常与多个名称相关联,随着新的功能或结构信息的发现,更多的名称被添加。由于作者经常在这些同义词之间交替使用,因此信息检索和提取可以从识别这些同义词中获益。我们开发了一种从MEDLINE和期刊文章中自动提取同义基因和蛋白质名称的方法。我们首先确定了作者用来列出同义基因和蛋白质名称的模式。我们开发了SGPE(用于基因和蛋白质名称的同义词提取),这是一个从MEDLINE摘要和全文期刊文章中识别模式和提取候选同义词的软件程序。然后,SGPE应用一系列过滤器,自动筛选出那些不是基因和蛋白质名称的术语。我们评估我们的方法在MEDLINE和期刊文章上的总体精度为71%,在更合适的全文文章上的精度为90%
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Electronic Patient Record Medical informatics as a market for IS/IT Perceived Information Needs and Communication Difficulties of Inpatient Physicians and Nurses Disambiguation Data: Extracting Information from Anonymized Sources The Operating Room Charge Nurse: Coordinator and Communicator
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1