Trait-based methodologies are gaining traction in the field of ecology, providing deeper insights into ecosystem structure and functions. To this end, trait databases tailored to specific taxonomic groups have become foundational. In Taiwan, the collaborative efforts of avian researchers and dedicated citizen scientists have led to the compilation of a vast array of data. This includes web-sourced images from social media, spatial distribution records from eBird, and morphological metrics from banded birds and specimens. Enriched by peer-reviewed literature, we have meticulously assembled a comprehensive trait dataset encompassing 454 bird species across 73 families. This dataset covers a wide range of traits, including foraging ecology, morphological characteristics, territorial behaviors, breeding attributes, and the roles of bird species in ecosystem regulation. As an invaluable resource, this dataset lays the foundation for in-depth exploration of functional diversity, trait-based community ecology, ecosystem function, and critical insights needed to shape conservation strategies.
{"title":"Combining citizen science data and literature to build a traits dataset of Taiwan's birds.","authors":"Shu-Wei Fu, Meng-Chieh Feng, Po-Wei Chi, Tzung-Su Ding","doi":"10.1038/s41597-024-03928-3","DOIUrl":"https://doi.org/10.1038/s41597-024-03928-3","url":null,"abstract":"<p><p>Trait-based methodologies are gaining traction in the field of ecology, providing deeper insights into ecosystem structure and functions. To this end, trait databases tailored to specific taxonomic groups have become foundational. In Taiwan, the collaborative efforts of avian researchers and dedicated citizen scientists have led to the compilation of a vast array of data. This includes web-sourced images from social media, spatial distribution records from eBird, and morphological metrics from banded birds and specimens. Enriched by peer-reviewed literature, we have meticulously assembled a comprehensive trait dataset encompassing 454 bird species across 73 families. This dataset covers a wide range of traits, including foraging ecology, morphological characteristics, territorial behaviors, breeding attributes, and the roles of bird species in ecosystem regulation. As an invaluable resource, this dataset lays the foundation for in-depth exploration of functional diversity, trait-based community ecology, ecosystem function, and critical insights needed to shape conservation strategies.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":null,"pages":null},"PeriodicalIF":5.8,"publicationDate":"2024-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142372773","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-10-03DOI: 10.1038/s41597-024-03910-z
Connor R Barker, Eloise A Marais, Jonathan C McDowell
Satellite megaconstellation (SMC) missions are spurring rapid growth in rocket launches and anthropogenic re-entries. These events inject pollutants and carbon dioxide (CO2) in all atmospheric layers, affecting climate and stratospheric ozone. Quantification of these and other environmental impacts requires reliable inventories of emissions. We present a global, hourly, 3D, multi-year inventory of air pollutant emissions and CO2 from rocket launches and object re-entries spanning the inception and growth of SMCs (2020-2022). We use multiple reliable sources to compile information needed to build the inventory and conduct rigorous and innovative cross-checks and validations against launch livestreams and past studies. Our inventory accounts for rocket plume afterburning effects, applies object-specific ablation profiles to re-entering objects, and quantifies unablated mass of objects returning to Earth. We also identify all launches and objects associated with SMC missions, accounting for 37-41% of emissions of black carbon particles, carbon monoxide, and CO2 by 2022. The data are provided in formats for ease-of-use in atmospheric chemistry and climate models to inform regulation and space sustainability policies.
{"title":"Global 3D rocket launch and re-entry air pollutant and CO<sub>2</sub> emissions at the onset of the megaconstellation era.","authors":"Connor R Barker, Eloise A Marais, Jonathan C McDowell","doi":"10.1038/s41597-024-03910-z","DOIUrl":"https://doi.org/10.1038/s41597-024-03910-z","url":null,"abstract":"<p><p>Satellite megaconstellation (SMC) missions are spurring rapid growth in rocket launches and anthropogenic re-entries. These events inject pollutants and carbon dioxide (CO<sub>2</sub>) in all atmospheric layers, affecting climate and stratospheric ozone. Quantification of these and other environmental impacts requires reliable inventories of emissions. We present a global, hourly, 3D, multi-year inventory of air pollutant emissions and CO<sub>2</sub> from rocket launches and object re-entries spanning the inception and growth of SMCs (2020-2022). We use multiple reliable sources to compile information needed to build the inventory and conduct rigorous and innovative cross-checks and validations against launch livestreams and past studies. Our inventory accounts for rocket plume afterburning effects, applies object-specific ablation profiles to re-entering objects, and quantifies unablated mass of objects returning to Earth. We also identify all launches and objects associated with SMC missions, accounting for 37-41% of emissions of black carbon particles, carbon monoxide, and CO<sub>2</sub> by 2022. The data are provided in formats for ease-of-use in atmospheric chemistry and climate models to inform regulation and space sustainability policies.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":null,"pages":null},"PeriodicalIF":5.8,"publicationDate":"2024-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142372777","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-10-03DOI: 10.1038/s41597-024-03901-0
Hao Ma, Fawu Wang
The extreme meteorological events caused by climate change have increasingly caused serious clustered landslides. A systematic and timely inventory of triggered landslides is a prerequisite for quantifying and evaluating the impact of extreme meteorological events. In addition, a landslide inventory can provide basic data for any subsequent analysis of the event or be used innovatively in landslide risk analysis. The Rainfall-induced Landslides in Beijing (RLBJ) inventory presented here contains data on 15,383 rainfall-induced shallow landslides triggered by a single extreme precipitation event in July 2023 in an area of ~3,250 km2 in western mountainous areas of Beijing, China. High-resolution satellite images before and after this rainstorm event were used to visually analyze the landslides. All landslides were reported as vectorized points and polygon features and classified according to their motion forms. This inventory is now freely available for the benefit of international geohazard researchers.
{"title":"Inventory of shallow landslides triggered by extreme precipitation in July 2023 in Beijing, China.","authors":"Hao Ma, Fawu Wang","doi":"10.1038/s41597-024-03901-0","DOIUrl":"https://doi.org/10.1038/s41597-024-03901-0","url":null,"abstract":"<p><p>The extreme meteorological events caused by climate change have increasingly caused serious clustered landslides. A systematic and timely inventory of triggered landslides is a prerequisite for quantifying and evaluating the impact of extreme meteorological events. In addition, a landslide inventory can provide basic data for any subsequent analysis of the event or be used innovatively in landslide risk analysis. The Rainfall-induced Landslides in Beijing (RLBJ) inventory presented here contains data on 15,383 rainfall-induced shallow landslides triggered by a single extreme precipitation event in July 2023 in an area of ~3,250 km<sup>2</sup> in western mountainous areas of Beijing, China. High-resolution satellite images before and after this rainstorm event were used to visually analyze the landslides. All landslides were reported as vectorized points and polygon features and classified according to their motion forms. This inventory is now freely available for the benefit of international geohazard researchers.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":null,"pages":null},"PeriodicalIF":5.8,"publicationDate":"2024-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142372778","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The Cape hare (Lepus capensis) is among the most widely distributed hare species globally, inhabiting extensive regions across Africa, the Middle East, and Central Asia. However, evolutionary and genetic research on L. capensis was seriously impeded by the absence of a reference genome. Here, we assembled and constructed a chromosome-level genome of L. capensis (with scaffolds anchored to 25 chromosomes and a total assembled length of 2.9 Gb, achieving a contig N50 length of 124.44 Mb) using PacBio HiFi sequencing and Hi-C assembly technology. Evaluation using BUSCO indicated the genome assembly to be 98.2% complete. The de novo prediction revealed that repetitive sequences constitute 46.13% of the entire genome, and long interspersed nuclear elements (LINEs) constituted the largest portion. We annotated a total of 13, 868 protein-coding genes using transcriptomes from two tissues (muscle and skin). This high-quality reference genome serves as a valuable genomic resource for advancing genetic studies in this species.
{"title":"A chromosome-level genome assembly of Cape hare (Lepus capensis).","authors":"Xianggui Dong, Yu Liu, Yuan Chen, Xinxin Ping, Zhanjun Ren, Yuanyuan Zhang","doi":"10.1038/s41597-024-03953-2","DOIUrl":"https://doi.org/10.1038/s41597-024-03953-2","url":null,"abstract":"<p><p>The Cape hare (Lepus capensis) is among the most widely distributed hare species globally, inhabiting extensive regions across Africa, the Middle East, and Central Asia. However, evolutionary and genetic research on L. capensis was seriously impeded by the absence of a reference genome. Here, we assembled and constructed a chromosome-level genome of L. capensis (with scaffolds anchored to 25 chromosomes and a total assembled length of 2.9 Gb, achieving a contig N50 length of 124.44 Mb) using PacBio HiFi sequencing and Hi-C assembly technology. Evaluation using BUSCO indicated the genome assembly to be 98.2% complete. The de novo prediction revealed that repetitive sequences constitute 46.13% of the entire genome, and long interspersed nuclear elements (LINEs) constituted the largest portion. We annotated a total of 13, 868 protein-coding genes using transcriptomes from two tissues (muscle and skin). This high-quality reference genome serves as a valuable genomic resource for advancing genetic studies in this species.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":null,"pages":null},"PeriodicalIF":5.8,"publicationDate":"2024-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142372771","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-10-03DOI: 10.1038/s41597-024-03913-w
Jian Sun, Zhiming Gao, David Grant, Kashif Nawaz, Pengtao Wang, Cheng-Min Yang, Philip Boudreaux, Stephen Kowalski, Shean Huff
The Hewlett Packard Enterprise-Cray EX Frontier is the world's first and fastest exascale supercomputer, hosted at the Oak Ridge Leadership Computing Facility in Tennessee, United States. Frontier is a significant electricity consumer, drawing 8-30 MW; this massive energy demand produces significant waste heat, requiring extensive cooling measures. Although harnessing this waste heat for campus heating is a sustainability goal at Oak Ridge National Laboratory (ORNL), the 30 °C-38 °C waste heat temperature poses compatibility issues with standard HVAC systems. Heat pump systems, prevalent in residential settings and some industries, can efficiently upgrade low-quality heat to usable energy for buildings. Thus, heat pump technology powered by renewable electricity offers an efficient, cost-effective solution for substantial waste heat recovery. However, a major challenge is the absence of benchmark data on high-performance computing (HPC) heat generation and waste heat profiles. This paper reports power demand and waste heat measurements from an ORNL HPC data centre, aiming to guide future research on optimizing waste heat recovery in large-scale data centres, especially those of HPC calibre.
{"title":"Energy dataset of Frontier supercomputer for waste heat recovery.","authors":"Jian Sun, Zhiming Gao, David Grant, Kashif Nawaz, Pengtao Wang, Cheng-Min Yang, Philip Boudreaux, Stephen Kowalski, Shean Huff","doi":"10.1038/s41597-024-03913-w","DOIUrl":"https://doi.org/10.1038/s41597-024-03913-w","url":null,"abstract":"<p><p>The Hewlett Packard Enterprise-Cray EX Frontier is the world's first and fastest exascale supercomputer, hosted at the Oak Ridge Leadership Computing Facility in Tennessee, United States. Frontier is a significant electricity consumer, drawing 8-30 MW; this massive energy demand produces significant waste heat, requiring extensive cooling measures. Although harnessing this waste heat for campus heating is a sustainability goal at Oak Ridge National Laboratory (ORNL), the 30 °C-38 °C waste heat temperature poses compatibility issues with standard HVAC systems. Heat pump systems, prevalent in residential settings and some industries, can efficiently upgrade low-quality heat to usable energy for buildings. Thus, heat pump technology powered by renewable electricity offers an efficient, cost-effective solution for substantial waste heat recovery. However, a major challenge is the absence of benchmark data on high-performance computing (HPC) heat generation and waste heat profiles. This paper reports power demand and waste heat measurements from an ORNL HPC data centre, aiming to guide future research on optimizing waste heat recovery in large-scale data centres, especially those of HPC calibre.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":null,"pages":null},"PeriodicalIF":5.8,"publicationDate":"2024-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142372775","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-10-03DOI: 10.1038/s41597-024-03686-2
Mikhail Sofiev, Julia Palamarchuk, Rostislav Kouznetsov, Tamuna Abramidze, Beverley Adams-Groom, Célia M Antunes, Arturo H Ariño, Maximilian Bastl, Jordina Belmonte, Uwe E Berger, Maira Bonini, Nicolas Bruffaerts, Jeroen Buters, Paloma Cariñanos, Sevcan Celenk, Valentina Ceriotti, Athanasios Charalampopoulos, Yolanda Clewlow, Bernard Clot, Aslog Dahl, Athanasios Damialis, Concepción De Linares, Letty A De Weger, Lukas Dirr, Agneta Ekebom, Yalda Fatahi, María Fernández González, Delia Fernández González, Santiago Fernández-Rodríguez, Carmen Galán, Björn Gedda, Regula Gehrig, Carmi Geller Bernstein, Nestor Gonzalez Roldan, Lukasz Grewling, Lenka Hajkova, Risto Hänninen, François Hentges, Juha Jantunen, Evgeny Kadantsev, Idalia Kasprzyk, Mathilde Kloster, Katarzyna Kluska, Mieke Koenders, Janka Lafférsová, Poliana Mihaela Leru, Agnieszka Lipiec, Maria Louna-Korteniemi, Donát Magyar, Barbara Majkowska-Wojciechowska, Mika Mäkelä, Mirjana Mitrovic, Dorota Myszkowska, Gilles Oliver, Pia Östensson, Rosa Pérez-Badia, Krystyna Piotrowska-Weryszko, Marje Prank, Ewa Maria Przedpelska-Wasowicz, Sanna Pätsi, F Javier Rodríguyez Rajo, Hallvard Ramfjord, Joanna Rapiejko, Victoria Rodinkova, Jesús Rojo, Luis Ruiz-Valenzuela, Ondrej Rybnicek, Annika Saarto, Ingrida Sauliene, Andreja Kofol Seliger, Elena Severova, Valentina Shalaboda, Branko Sikoparija, Pilvi Siljamo, Joana Soares, Olga Sozinova, Anders Stangel, Barbara Stjepanović, Erik Teinemaa, Svyatoslav Tyuryakov, M Mar Trigo, Andreas Uppstu, Mart Vill, Julius Vira, Nicolas Visez, Tiina Vitikainen, Despoina Vokou, Elżbieta Weryszko-Chmielewska, Ari Karppinen
The dataset presents a 43 year-long reanalysis of pollen seasons for three major allergenic genera of trees in Europe: alder (Alnus), birch (Betula), and olive (Olea). Driven by the meteorological reanalysis ERA5, the atmospheric composition model SILAM predicted the flowering period and calculated the Europe-wide dispersion pattern of pollen for the years 1980-2022. The model applied an extended 4-dimensional variational data assimilation of in-situ observations of aerobiological networks in 34 European countries to reproduce the inter-annual variability and trends of pollen production and distribution. The control variable of the assimilation procedure was the total pollen release during each flowering season, implemented as an annual correction factor to the mean pollen production. The dataset was designed as an input to studies on climate-induced and anthropogenically driven changes in the European vegetation, biodiversity monitoring, bioaerosol modelling and assessment, as well as, in combination with intra-seasonal observations, for health-related applications.
{"title":"European pollen reanalysis, 1980-2022, for alder, birch, and olive.","authors":"Mikhail Sofiev, Julia Palamarchuk, Rostislav Kouznetsov, Tamuna Abramidze, Beverley Adams-Groom, Célia M Antunes, Arturo H Ariño, Maximilian Bastl, Jordina Belmonte, Uwe E Berger, Maira Bonini, Nicolas Bruffaerts, Jeroen Buters, Paloma Cariñanos, Sevcan Celenk, Valentina Ceriotti, Athanasios Charalampopoulos, Yolanda Clewlow, Bernard Clot, Aslog Dahl, Athanasios Damialis, Concepción De Linares, Letty A De Weger, Lukas Dirr, Agneta Ekebom, Yalda Fatahi, María Fernández González, Delia Fernández González, Santiago Fernández-Rodríguez, Carmen Galán, Björn Gedda, Regula Gehrig, Carmi Geller Bernstein, Nestor Gonzalez Roldan, Lukasz Grewling, Lenka Hajkova, Risto Hänninen, François Hentges, Juha Jantunen, Evgeny Kadantsev, Idalia Kasprzyk, Mathilde Kloster, Katarzyna Kluska, Mieke Koenders, Janka Lafférsová, Poliana Mihaela Leru, Agnieszka Lipiec, Maria Louna-Korteniemi, Donát Magyar, Barbara Majkowska-Wojciechowska, Mika Mäkelä, Mirjana Mitrovic, Dorota Myszkowska, Gilles Oliver, Pia Östensson, Rosa Pérez-Badia, Krystyna Piotrowska-Weryszko, Marje Prank, Ewa Maria Przedpelska-Wasowicz, Sanna Pätsi, F Javier Rodríguyez Rajo, Hallvard Ramfjord, Joanna Rapiejko, Victoria Rodinkova, Jesús Rojo, Luis Ruiz-Valenzuela, Ondrej Rybnicek, Annika Saarto, Ingrida Sauliene, Andreja Kofol Seliger, Elena Severova, Valentina Shalaboda, Branko Sikoparija, Pilvi Siljamo, Joana Soares, Olga Sozinova, Anders Stangel, Barbara Stjepanović, Erik Teinemaa, Svyatoslav Tyuryakov, M Mar Trigo, Andreas Uppstu, Mart Vill, Julius Vira, Nicolas Visez, Tiina Vitikainen, Despoina Vokou, Elżbieta Weryszko-Chmielewska, Ari Karppinen","doi":"10.1038/s41597-024-03686-2","DOIUrl":"https://doi.org/10.1038/s41597-024-03686-2","url":null,"abstract":"<p><p>The dataset presents a 43 year-long reanalysis of pollen seasons for three major allergenic genera of trees in Europe: alder (Alnus), birch (Betula), and olive (Olea). Driven by the meteorological reanalysis ERA5, the atmospheric composition model SILAM predicted the flowering period and calculated the Europe-wide dispersion pattern of pollen for the years 1980-2022. The model applied an extended 4-dimensional variational data assimilation of in-situ observations of aerobiological networks in 34 European countries to reproduce the inter-annual variability and trends of pollen production and distribution. The control variable of the assimilation procedure was the total pollen release during each flowering season, implemented as an annual correction factor to the mean pollen production. The dataset was designed as an input to studies on climate-induced and anthropogenically driven changes in the European vegetation, biodiversity monitoring, bioaerosol modelling and assessment, as well as, in combination with intra-seasonal observations, for health-related applications.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":null,"pages":null},"PeriodicalIF":5.8,"publicationDate":"2024-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142372776","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-10-03DOI: 10.1038/s41597-024-03917-6
Zeev Yampolsky, Yair Stolero, Nitsan Pri-Hadash, Dan Solodar, Shira Massas, Itai Savin, Itzik Klein
An inertial navigation system (INS) utilizes three orthogonal accelerometers and gyroscopes to determine platform position, velocity, and orientation. There are countless applications for INS, including robotics, autonomous platforms, and the internet of things. Recent research explores the integration of data-driven methods with INS, highlighting significant innovations, improving accuracy and efficiency. Despite the growing interest in this field and the availability of INS datasets, no datasets are available for gyro-free INS (GFINS) and multiple inertial measurement unit (MIMU) architectures. To fill this gap and to stimulate further research in this field, we designed and recorded GFINS and MIMU datasets using 54 inertial sensors grouped in nine inertial measurement units. These sensors can be used to define and evaluate different types of MIMU and GFINS architectures. The inertial sensors were arranged in three different sensor configurations and mounted on a mobile robot, a passenger car and a turntable. In total, the dataset contains 45 hours of inertial data and corresponding ground truth trajectories. The data is freely accessible through our figshare repository.
惯性导航系统(INS)利用三个正交加速度计和陀螺仪来确定平台的位置、速度和方向。惯性导航系统的应用数不胜数,包括机器人、自主平台和物联网。最近的研究探索了数据驱动方法与 INS 的整合,突出了重大创新,提高了精度和效率。尽管人们对这一领域的兴趣与日俱增,也有了 INS 数据集,但却没有无陀螺 INS(GFINS)和多惯性测量单元(MIMU)架构的数据集。为了填补这一空白并促进该领域的进一步研究,我们设计并记录了无陀螺 INS 和多惯性测量单元数据集,这些数据集使用了 54 个惯性传感器,分为九个惯性测量单元。这些传感器可用于定义和评估不同类型的 MIMU 和 GFINS 架构。惯性传感器采用三种不同的传感器配置,分别安装在移动机器人、客车和转盘上。数据集总共包含 45 小时的惯性数据和相应的地面实况轨迹。这些数据可通过我们的 figshare 存储库免费访问。
{"title":"Multiple and Gyro-Free Inertial Datasets.","authors":"Zeev Yampolsky, Yair Stolero, Nitsan Pri-Hadash, Dan Solodar, Shira Massas, Itai Savin, Itzik Klein","doi":"10.1038/s41597-024-03917-6","DOIUrl":"https://doi.org/10.1038/s41597-024-03917-6","url":null,"abstract":"<p><p>An inertial navigation system (INS) utilizes three orthogonal accelerometers and gyroscopes to determine platform position, velocity, and orientation. There are countless applications for INS, including robotics, autonomous platforms, and the internet of things. Recent research explores the integration of data-driven methods with INS, highlighting significant innovations, improving accuracy and efficiency. Despite the growing interest in this field and the availability of INS datasets, no datasets are available for gyro-free INS (GFINS) and multiple inertial measurement unit (MIMU) architectures. To fill this gap and to stimulate further research in this field, we designed and recorded GFINS and MIMU datasets using 54 inertial sensors grouped in nine inertial measurement units. These sensors can be used to define and evaluate different types of MIMU and GFINS architectures. The inertial sensors were arranged in three different sensor configurations and mounted on a mobile robot, a passenger car and a turntable. In total, the dataset contains 45 hours of inertial data and corresponding ground truth trajectories. The data is freely accessible through our figshare repository.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":null,"pages":null},"PeriodicalIF":5.8,"publicationDate":"2024-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142372779","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-10-03DOI: 10.1038/s41597-024-03924-7
Yazhen Ma, Shixiong Ding, Yingxiong Qiu
The deciduous American sweetgum (Liquidambar styraciflua, Altingiaceae) is a popular ornamental and economically valuable tree renowned for its sweet-smelling bark resin, abundant volatile substances, and spectacular fall leaf color. However, the absence of a reference genome hinders thorough investigations into the mechanisms underlying phenotypic variation, secondary metabolite synthesis and adaptation, both in this species and other Liquidambar members. In this study, we sequenced and constructed a chromosome-level assembly of the L. styraciflua genome, covering 662.48 Mb with a scaffold N50 of 39.54 Mb, by integrating PacBio, Illumina and chromosome conformation capture data. We identified 58.83% of the genome sequences as repetitive elements and 25,713 protein-coding genes, 97.28% of which were functionally annotated. The genome sequencing reads, assembly and annotation data have been deposited in publicly available repositories. This high-quality genome assembly provides valuable resources for further evolutionary and functional genomic studies in American sweetgum and other Liquidambar species.
{"title":"Chromosome-level genome assembly of American sweetgum (Liquidambar styraciflua, Altingiaceae).","authors":"Yazhen Ma, Shixiong Ding, Yingxiong Qiu","doi":"10.1038/s41597-024-03924-7","DOIUrl":"https://doi.org/10.1038/s41597-024-03924-7","url":null,"abstract":"<p><p>The deciduous American sweetgum (Liquidambar styraciflua, Altingiaceae) is a popular ornamental and economically valuable tree renowned for its sweet-smelling bark resin, abundant volatile substances, and spectacular fall leaf color. However, the absence of a reference genome hinders thorough investigations into the mechanisms underlying phenotypic variation, secondary metabolite synthesis and adaptation, both in this species and other Liquidambar members. In this study, we sequenced and constructed a chromosome-level assembly of the L. styraciflua genome, covering 662.48 Mb with a scaffold N50 of 39.54 Mb, by integrating PacBio, Illumina and chromosome conformation capture data. We identified 58.83% of the genome sequences as repetitive elements and 25,713 protein-coding genes, 97.28% of which were functionally annotated. The genome sequencing reads, assembly and annotation data have been deposited in publicly available repositories. This high-quality genome assembly provides valuable resources for further evolutionary and functional genomic studies in American sweetgum and other Liquidambar species.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":null,"pages":null},"PeriodicalIF":5.8,"publicationDate":"2024-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142372772","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Electroencephalography (EEG)-based open-access datasets are available for emotion recognition studies, where external auditory/visual stimuli are used to artificially evoke pre-defined emotions. In this study, we provide a novel EEG dataset containing the emotional information induced during a realistic human-computer interaction (HCI) using a voice user interface system that mimics natural human-to-human communication. To validate our dataset via neurophysiological investigation and binary emotion classification, we applied a series of signal processing and machine learning methods to the EEG data. The maximum classification accuracy ranged from 43.3% to 90.8% over 38 subjects and classification features could be interpreted neurophysiologically. Our EEG data could be used to develop a reliable HCI system because they were acquired in a natural HCI environment. In addition, auxiliary physiological data measured simultaneously with the EEG data also showed plausible results, i.e., electrocardiogram, photoplethysmogram, galvanic skin response, and facial images, which could be utilized for automatic emotion discrimination independently from, as well as together with the EEG data via the fusion of multi-modal physiological datasets.
{"title":"EEG Dataset for the Recognition of Different Emotions Induced in Voice-User Interaction.","authors":"Ga-Young Choi, Jong-Gyu Shin, Ji-Yoon Lee, Jun-Seok Lee, In-Seok Heo, Ha-Yeong Yoon, Wansu Lim, Jin-Woo Jeong, Sang-Ho Kim, Han-Jeong Hwang","doi":"10.1038/s41597-024-03887-9","DOIUrl":"https://doi.org/10.1038/s41597-024-03887-9","url":null,"abstract":"<p><p>Electroencephalography (EEG)-based open-access datasets are available for emotion recognition studies, where external auditory/visual stimuli are used to artificially evoke pre-defined emotions. In this study, we provide a novel EEG dataset containing the emotional information induced during a realistic human-computer interaction (HCI) using a voice user interface system that mimics natural human-to-human communication. To validate our dataset via neurophysiological investigation and binary emotion classification, we applied a series of signal processing and machine learning methods to the EEG data. The maximum classification accuracy ranged from 43.3% to 90.8% over 38 subjects and classification features could be interpreted neurophysiologically. Our EEG data could be used to develop a reliable HCI system because they were acquired in a natural HCI environment. In addition, auxiliary physiological data measured simultaneously with the EEG data also showed plausible results, i.e., electrocardiogram, photoplethysmogram, galvanic skin response, and facial images, which could be utilized for automatic emotion discrimination independently from, as well as together with the EEG data via the fusion of multi-modal physiological datasets.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":null,"pages":null},"PeriodicalIF":5.8,"publicationDate":"2024-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142372774","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-10-02DOI: 10.1038/s41597-024-03892-y
Carlos Roberto de Souza Filho, Rebecca D P M Scafutto
Remote detection and mapping of surface materials using optical sensors relies predominantly on analyzing multispectral and hyperspectral imagery employing classification algorithms. The classification process involves comparing the spectra of individual pixels within the image to spectra from reference databases, commonly referred to as spectral libraries. Here, we introduce a comprehensive compilation of spectral libraries specifically tailored for petroleum hydrocarbons (PHC), meticulously crafted under controlled laboratory conditions. This compilation includes reference spectral libraries for various PHC forms, including crude oils, mineral substrate-PHC mixtures (comprising crude oils and fuels), oil-film on water, and oil-water emulsions. Data collection was conducted within the visible, near, and shortwave IR (VNIR-SWIR - 0.35-2.5 µm) spectra and thermal IR (TIR - 3-15 µm) range. The openly accessible spectral libraries presented herein support the scientific community and industry in characterizing field samples or spectral data from onshore and offshore sites. Furthermore, these libraries are instrumental in developing and applying classification algorithms designed for processing spectral images captured by cameras coupled to multiple platforms (e.g., tripods, drones, airborne, orbital satellites).
{"title":"A Comprehensive Compilation of Spectral Libraries for Petroleum Hydrocarbons (PHC) Encompassing VNIR-SWIR-TIR Ranges.","authors":"Carlos Roberto de Souza Filho, Rebecca D P M Scafutto","doi":"10.1038/s41597-024-03892-y","DOIUrl":"10.1038/s41597-024-03892-y","url":null,"abstract":"<p><p>Remote detection and mapping of surface materials using optical sensors relies predominantly on analyzing multispectral and hyperspectral imagery employing classification algorithms. The classification process involves comparing the spectra of individual pixels within the image to spectra from reference databases, commonly referred to as spectral libraries. Here, we introduce a comprehensive compilation of spectral libraries specifically tailored for petroleum hydrocarbons (PHC), meticulously crafted under controlled laboratory conditions. This compilation includes reference spectral libraries for various PHC forms, including crude oils, mineral substrate-PHC mixtures (comprising crude oils and fuels), oil-film on water, and oil-water emulsions. Data collection was conducted within the visible, near, and shortwave IR (VNIR-SWIR - 0.35-2.5 µm) spectra and thermal IR (TIR - 3-15 µm) range. The openly accessible spectral libraries presented herein support the scientific community and industry in characterizing field samples or spectral data from onshore and offshore sites. Furthermore, these libraries are instrumental in developing and applying classification algorithms designed for processing spectral images captured by cameras coupled to multiple platforms (e.g., tripods, drones, airborne, orbital satellites).</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":null,"pages":null},"PeriodicalIF":5.8,"publicationDate":"2024-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142366443","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}