From sipping on a straw to drinking from a fire hose: data integration in a public genome database

J. Richardson, J. Kadin, J. Blake, C. Bult, J. Eppig, M. Ringwald
{"title":"From sipping on a straw to drinking from a fire hose: data integration in a public genome database","authors":"J. Richardson, J. Kadin, J. Blake, C. Bult, J. Eppig, M. Ringwald","doi":"10.1109/ICDE.2004.1320050","DOIUrl":null,"url":null,"abstract":"Biology is a vast domain. The Mouse Genome Informatics (MGI) system, which focuses on the biology of the laboratory mouse, covers only a small, carefully chosen slice. Nevertheless, we deal with data of immense variety, deep complexity, and exponentially growing volume. Our role as an integration nexus is to add value by combining data sets of diverse types and origins, eliminating redundancy and resolving conflicts. We briefly describe some of the issues we face and approaches we have adopted to the integration problem.","PeriodicalId":358862,"journal":{"name":"Proceedings. 20th International Conference on Data Engineering","volume":"30 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2004-03-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings. 20th International Conference on Data Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDE.2004.1320050","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Biology is a vast domain. The Mouse Genome Informatics (MGI) system, which focuses on the biology of the laboratory mouse, covers only a small, carefully chosen slice. Nevertheless, we deal with data of immense variety, deep complexity, and exponentially growing volume. Our role as an integration nexus is to add value by combining data sets of diverse types and origins, eliminating redundancy and resolving conflicts. We briefly describe some of the issues we face and approaches we have adopted to the integration problem.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
从啜吸管到从消防水管里喝水:公共基因组数据库中的数据整合
生物学是一个广阔的领域。小鼠基因组信息学(MGI)系统专注于实验室小鼠的生物学,只涵盖了很小的、精心挑选的部分。尽管如此,我们处理的数据种类繁多,非常复杂,而且数量呈指数级增长。我们作为集成纽带的角色是通过组合不同类型和来源的数据集来增加价值,消除冗余并解决冲突。我们简要描述了我们面临的一些问题以及我们采用的解决集成问题的方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
ContextMetrics/sup /spl trade//: semantic and syntactic interoperability in cross-border trading systems EShopMonitor: a Web content monitoring tool A probabilistic approach to metasearching with adaptive probing Simple, robust and highly concurrent b-trees with node deletion Substructure clustering on sequential 3d object datasets
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1