XLUM: an open data format for exchange and long-term preservation of luminescence data

Geochronology · IF 2.7 · JCR Q2 (Geochemistry & Geophysics) · Pub Date: 2023-06-06 · DOI: 10.5194/gchron-5-271-2023
S. Kreutzer, Steve Grehl, Michael Höhne, Oliver Simmank, K. Dornich, Grzegorz Adamiec, Christoph Burow, H. Roberts, G. Duller
Citations: 0

Abstract. The concept of open data has become the modern science meme, and major funding bodies and publishers support open data. On a daily basis, however, the open data mandate frequently encounters technical obstacles, such as the lack of a suitable data format for data sharing and long-term data preservation. Such issues are often community-specific and best addressed through community-tailored solutions. In Quaternary sciences, luminescence dating is widely used for constraining the timing of event-based processes (e.g. sediment transport). Every luminescence dating study produces a vast body of primary data that usually remains inaccessible and incompatible with future studies or adjacent scientific disciplines. To facilitate data exchange and long-term data preservation (in short, open data) in luminescence dating studies, we propose a new XML-based structured data format called XLUM. The format applies a hierarchical data storage concept consisting of a root node (node 0), a sample (node 1), a sequence (node 2), a record (node 3), and a curve (node 4). The curve level holds information on the technical component (e.g. photomultiplier, thermocouple). A finite number of curves represent a record (e.g. an optically stimulated luminescence curve). Records are part of a sequence measured for a particular sample. This design concept allows the user to retain information on a technical component level from the measurement process. The additional storage of related metadata fosters future data mining projects on large datasets. The XML-based format is less memory-efficient than binary formats; however, its focus is data exchange and preservation, and hence long-term format stability by design. XLUM is inherently stable to future updates and backwards-compatible. We support XLUM through a new R package, xlum, which facilitates the conversion of different formats into the new XLUM format. XLUM is licensed under the MIT licence and is hence free to use in open- and closed-source, commercial and non-commercial software and research projects.
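
The five-level hierarchy described in the abstract (root, sample, sequence, record, curve) can be pictured with a short Python sketch. This is only an illustration under stated assumptions: the element names (`xlum`, `sample`, `sequence`, `record`, `curve`), the attribute names (`name`, `recordType`, `component`), the helper `build_xlum_like_tree`, and the numeric values are hypothetical and are not taken from the XLUM specification or from the xlum R package.

```python
import xml.etree.ElementTree as ET


def build_xlum_like_tree(sample_name, curves_by_record):
    """Build a five-level tree: root > sample > sequence > record > curve.

    All tag and attribute names are illustrative assumptions, not the
    names defined by the XLUM format specification.
    """
    root = ET.Element("xlum")                                  # node 0: root
    sample = ET.SubElement(root, "sample", name=sample_name)   # node 1: sample
    sequence = ET.SubElement(sample, "sequence")               # node 2: sequence
    for record_type, curves in curves_by_record.items():
        # node 3: one record per measurement (e.g. an OSL curve)
        record = ET.SubElement(sequence, "record", recordType=record_type)
        for component, values in curves.items():
            # node 4: one curve per technical component
            curve = ET.SubElement(record, "curve", component=component)
            curve.text = " ".join(str(v) for v in values)
    return root


# Invented example values: one record made of two component-level curves.
example = {
    "OSL": {
        "photomultiplier": [120, 80, 45, 30],
        "thermocouple": [125, 125, 125, 125],
    }
}

root = build_xlum_like_tree("sample_01", example)
xml_text = ET.tostring(root, encoding="unicode")
print(xml_text)

# Walk back down the hierarchy and recover each component-level curve.
for record in root.iter("record"):
    for curve in record.findall("curve"):
        values = [int(v) for v in curve.text.split()]
        print(record.get("recordType"), curve.get("component"), values)
```

Storing one curve per technical component under each record is what lets a reader recover, for instance, the thermocouple trace alongside the photomultiplier signal. The text serialisation is larger than a packed binary array would be, which is the trade-off the abstract accepts in exchange for a stable, readable exchange format.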
Source journal: Geochronology (Earth and Planetary Sciences: Paleontology)
CiteScore: 6.60
Self-citation rate: 0.00%
Articles published: 35
Review time: 19 weeks