Improving Provenance Data in Natural History Collection Databases
G. Rosenberg, M. Khoo
American Malacological Bulletin, vol. 36, no. 1, pp. 215-231. Published 2018-12-01.
DOI: 10.4003/006.036.0203 (https://doi.org/10.4003/006.036.0203)
Citations: 7
Abstract
A growing use of natural history collections is documenting changes in species distributions, morphology, phenology and genetics during historical times. Until the 20th century, however, collecting dates were not routinely recorded in collections, making it difficult to determine the time course of biotic changes. Various forms of proxy data can constrain when a particular sample might have been collected. The date of cataloguing puts an upper limit on potential collecting dates, as do dates of death of agents such as collectors, donors, and other previous owners, and dates of birth and activity of collectors give a lower limit. Archival material such as field notes and acquisition records can also provide constraints. Information in such sources should be captured in standardized biographical databases to allow automated bounds on collecting dates to be applied via fields in collection databases. Mollusks are among the best sampled metazoans, so they can serve to test the effectiveness of using biographical data to constrain collecting dates. A random sample of records in the ANSP malacology database that lack date of collection shows that when an agent is known, date of death information improves on date of cataloguing as a constraint on collecting date for 41% of records. Overall, including records that lacked agent information, 38% had improvement. If further historical information such as dates of travel, residence, employment and other affiliations were included in biographical databases, additional improvement on these bounds could be obtained. Collection databases need appropriate data structures for provenance information to track the chain of ownership of specimens more rigorously, and to allow cleaner interface with biographical databases. A survey of other large mollusk collections in the United States suggests that a similar level of improvement could be obtained more generally, affecting millions of specimens. If this result could be extended to other disciplines, a substantially increased proportion of specimens in natural history collections would be accessible for studies of biotic change. Interoperability with genealogical databases could accelerate addition of provenance data to natural history databases.
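The bounding logic described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the `Agent` record, its field names, the `collecting_year_bounds` function, and the use of whole years rather than full dates are all assumptions made for illustration. The sketch takes the upper bound as the earliest of the cataloguing year and any agent's death year, and the lower bound as the latest birth year among collectors.

```python
from dataclasses import dataclass
from typing import Optional, Sequence, Tuple


@dataclass(frozen=True)
class Agent:
    """Hypothetical record for a person in a specimen's chain of ownership."""
    name: str
    role: str  # assumed vocabulary: "collector", "donor", "owner"
    birth_year: Optional[int] = None
    death_year: Optional[int] = None


def collecting_year_bounds(
    catalogue_year: Optional[int],
    agents: Sequence[Agent],
) -> Tuple[Optional[int], Optional[int]]:
    """Return (lower, upper) bounds on the collecting year; None if unknown."""
    # Upper bound: the specimen was collected no later than it was catalogued,
    # and no later than the death of any agent who handled it.
    upper_candidates = [catalogue_year] + [a.death_year for a in agents]
    known_upper = [y for y in upper_candidates if y is not None]
    upper = min(known_upper) if known_upper else None

    # Lower bound: the specimen was collected no earlier than the birth of
    # its collector(s); only collectors constrain this side.
    births = [
        a.birth_year
        for a in agents
        if a.role == "collector" and a.birth_year is not None
    ]
    lower = max(births) if births else None
    return lower, upper


def improves_on_catalogue(catalogue_year: Optional[int], upper: Optional[int]) -> bool:
    """True when agent data gives a tighter upper bound than cataloguing alone."""
    if upper is None:
        return False
    return catalogue_year is None or upper < catalogue_year


# Made-up example values, not data from the study.
example_agents = [
    Agent("Collector A", "collector", birth_year=1820, death_year=1875),
    Agent("Donor B", "donor", death_year=1890),
]
example_bounds = collecting_year_bounds(1900, example_agents)
```

With the made-up values above, the collector's death year tightens the upper bound below the cataloguing year, which is the kind of improvement the abstract counts. Dates of travel, residence, or employment could narrow the interval further if a biographical database supplied them.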
About the journal:
The American Malacological Bulletin serves as an outlet for reporting notable contributions in malacological research. Manuscripts concerning any aspect of original, unpublished research, important short reports, and detailed reviews dealing with molluscs will be considered for publication. Recent issues have included AMS symposia, independent papers, research notes, and book reviews. All published research articles in this journal have undergone rigorous peer review, based on initial editor screening and anonymous reviewing by independent expert referees. AMS symposium papers have undergone peer review by the symposium organizer, symposium participants, and independent referees.