首页 > 最新文献

Scientific Data最新文献

英文 中文
Large-scale modeling for housing condition prediction using machine learning algorithms. 使用机器学习算法进行住房状况预测的大规模建模。
IF 6.9 2区 综合性期刊 Q1 MULTIDISCIPLINARY SCIENCES Pub Date : 2026-03-11 DOI: 10.1038/s41597-026-07012-w
Kyusik Kim, Tisha Holmes, Emily Powell, Christopher K Uejio

While housing price prediction is well-studied, the prediction of large-scale housing conditions remains underexplored due to data limitations. This paper addresses this gap by developing a machine-learning model to predict housing conditions across the United States. We integrated property-level data from the Warren Group with neighborhood characteristics from the U.S. Census Bureau's American Community Survey and trained three gradient-boosting algorithms: CatBoost, LightGBM, and XGBoost. Despite XGBoost's slightly higher balanced accuracy, CatBoost was selected as the best model due to its superior resistance to overfitting. The final model's predictions were aggregated to census tracts, ZIP code tabulation areas, and a 36.13 km2 resolution hexagonal grid for national-scale spatial analysis. The resulting comprehensive dataset can serve as a valuable resource for researchers and practitioners to analyze the geography of housing quality with applications in urban planning, disaster management, community resilience, public health, and more.

虽然房价预测已经得到了很好的研究,但由于数据的限制,对大规模住房状况的预测仍未得到充分的探索。本文通过开发一种机器学习模型来预测美国各地的住房状况,从而解决了这一差距。我们将Warren Group的房产级别数据与美国人口普查局美国社区调查的社区特征相结合,并训练了三种梯度增强算法:CatBoost、LightGBM和XGBoost。尽管XGBoost的平衡精度略高,但CatBoost被选为最佳模型,因为它具有优越的抗过拟合能力。最终模型的预测汇总到人口普查区,邮政编码制表区域,以及用于国家尺度空间分析的36.13平方公里分辨率六角形网格。由此产生的综合数据集可以作为研究人员和实践者分析住房质量地理的宝贵资源,并应用于城市规划、灾害管理、社区恢复力、公共卫生等领域。
{"title":"Large-scale modeling for housing condition prediction using machine learning algorithms.","authors":"Kyusik Kim, Tisha Holmes, Emily Powell, Christopher K Uejio","doi":"10.1038/s41597-026-07012-w","DOIUrl":"https://doi.org/10.1038/s41597-026-07012-w","url":null,"abstract":"<p><p>While housing price prediction is well-studied, the prediction of large-scale housing conditions remains underexplored due to data limitations. This paper addresses this gap by developing a machine-learning model to predict housing conditions across the United States. We integrated property-level data from the Warren Group with neighborhood characteristics from the U.S. Census Bureau's American Community Survey and trained three gradient-boosting algorithms: CatBoost, LightGBM, and XGBoost. Despite XGBoost's slightly higher balanced accuracy, CatBoost was selected as the best model due to its superior resistance to overfitting. The final model's predictions were aggregated to census tracts, ZIP code tabulation areas, and a 36.13 km<sup>2</sup> resolution hexagonal grid for national-scale spatial analysis. The resulting comprehensive dataset can serve as a valuable resource for researchers and practitioners to analyze the geography of housing quality with applications in urban planning, disaster management, community resilience, public health, and more.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":" ","pages":""},"PeriodicalIF":6.9,"publicationDate":"2026-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147434982","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
City boundaries for global urban water scarcity assessment. 全球城市水资源短缺评价的城市边界。
IF 6.9 2区 综合性期刊 Q1 MULTIDISCIPLINARY SCIENCES Pub Date : 2026-03-11 DOI: 10.1038/s41597-026-06933-w
Kiyoharu Kajiyama, Naota Hanasaki, Shinjiro Kanae

Rapid and continuous urbanization has increased water demand in cities worldwide. Global assessments of urban water scarcity should be conducted using gridded hydrological information, but are hampered by the lack of water-resource-based city boundary information. This study introduces HydroUrbanMap (HUM), a global gridded dataset of city boundaries for 1,604 cities at 5 arcmin resolution, incorporating hydrological attributes. HUM consists of two key components: delineation of city boundaries that include the population served by urban water services (supply and drainage), and estimation of accessible surface water sources within and outside these boundaries. The estimated city populations closely match census-based populations, with a correlation coefficient of 0.997. HUM incorporates hydrological inlets and outlets of main rivers in each city by overlaying the city boundaries with the river network. Combining our HUM dataset with outputs from global hydrological models supports city-specific water resource assessments worldwide, filling gaps in cities where data on urban water services are unavailable or incomplete.

快速和持续的城市化增加了世界各地城市的用水需求。全球城市水资源短缺评估应使用网格水文信息,但由于缺乏基于水资源的城市边界信息而受到阻碍。本研究引入了hydroourbanmap (HUM),这是一个包含水文属性的全球网格化城市边界数据集,包含1604个城市的5角分辨率。HUM由两个关键部分组成:划定城市边界,包括城市供水服务(供应和排水)所服务的人口,以及估计这些边界内外可获得的地表水来源。估计的城市人口与普查基础人口接近,相关系数为0.997。HUM通过在城市边界上覆盖河网,将每个城市主要河流的水文入口和出口整合在一起。将我们的HUM数据集与全球水文模型的输出相结合,支持全球城市特定水资源评估,填补了城市供水服务数据不可用或不完整的城市的空白。
{"title":"City boundaries for global urban water scarcity assessment.","authors":"Kiyoharu Kajiyama, Naota Hanasaki, Shinjiro Kanae","doi":"10.1038/s41597-026-06933-w","DOIUrl":"https://doi.org/10.1038/s41597-026-06933-w","url":null,"abstract":"<p><p>Rapid and continuous urbanization has increased water demand in cities worldwide. Global assessments of urban water scarcity should be conducted using gridded hydrological information, but are hampered by the lack of water-resource-based city boundary information. This study introduces HydroUrbanMap (HUM), a global gridded dataset of city boundaries for 1,604 cities at 5 arcmin resolution, incorporating hydrological attributes. HUM consists of two key components: delineation of city boundaries that include the population served by urban water services (supply and drainage), and estimation of accessible surface water sources within and outside these boundaries. The estimated city populations closely match census-based populations, with a correlation coefficient of 0.997. HUM incorporates hydrological inlets and outlets of main rivers in each city by overlaying the city boundaries with the river network. Combining our HUM dataset with outputs from global hydrological models supports city-specific water resource assessments worldwide, filling gaps in cities where data on urban water services are unavailable or incomplete.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":" ","pages":""},"PeriodicalIF":6.9,"publicationDate":"2026-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147434995","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A spatiotemporal dataset of farmland rent aligned with farming seasons across China 2021-2025. 2021-2025年中国农作季节地租时空数据集
IF 6.9 2区 综合性期刊 Q1 MULTIDISCIPLINARY SCIENCES Pub Date : 2026-03-11 DOI: 10.1038/s41597-026-07040-6
Qi Xing, Shuiyi Zhu, Daolin Zhu, Jian Wang

Accurate farmland market data plays a crucial role in decision-making for policymakers, investors and operators. We present a nationwide, parcel-level dataset of farmland rent in China, derived from eight survey waves conducted between 2021 and 2025 across 27 provinces. The dataset comprises 7,237 rigorously validated samples, covering 191 cities and 422 counties in major agricultural production regions. Each record contains georeferenced parcel attributes and detailed transaction information, including parcel size, rental price, contracting parties, contractual terms. Compared with existing sources, this dataset offers broader coverage, finer spatial resolution, and improved temporal continuity, thereby addressing long-standing limitations of farmland transfer data in China. No other publicly available dataset provides comparable scope, reliability, and detail. Its release will enhance transparency in farmland markets and enable robust analyses of spatial and temporal dynamics, cross-regional comparisons, and the effects of institutional arrangements on land transactions. The dataset provides a valuable empirical foundation for advancing research in agricultural economics, rural development, and land use policy.

准确的耕地市场数据对政策制定者、投资者和经营者的决策具有至关重要的作用。我们展示了中国全国范围内的农地租金数据集,该数据集来自于2021年至2025年间在27个省份进行的8次调查。该数据集包括7,237个经过严格验证的样本,涵盖了主要农业生产区的191个城市和422个县。每个记录包含地理参考的包裹属性和详细的交易信息,包括包裹大小、租金价格、签约方、合同条款。与现有数据源相比,该数据集覆盖范围更广,空间分辨率更高,时间连续性更好,从而解决了中国耕地流转数据长期存在的局限性。没有其他公开可用的数据集提供可比较的范围、可靠性和细节。该报告的发布将提高农田市场的透明度,使人们能够对时空动态、跨区域比较以及制度安排对土地交易的影响进行强有力的分析。该数据集为推进农业经济学、农村发展和土地利用政策研究提供了宝贵的经验基础。
{"title":"A spatiotemporal dataset of farmland rent aligned with farming seasons across China 2021-2025.","authors":"Qi Xing, Shuiyi Zhu, Daolin Zhu, Jian Wang","doi":"10.1038/s41597-026-07040-6","DOIUrl":"https://doi.org/10.1038/s41597-026-07040-6","url":null,"abstract":"<p><p>Accurate farmland market data plays a crucial role in decision-making for policymakers, investors and operators. We present a nationwide, parcel-level dataset of farmland rent in China, derived from eight survey waves conducted between 2021 and 2025 across 27 provinces. The dataset comprises 7,237 rigorously validated samples, covering 191 cities and 422 counties in major agricultural production regions. Each record contains georeferenced parcel attributes and detailed transaction information, including parcel size, rental price, contracting parties, contractual terms. Compared with existing sources, this dataset offers broader coverage, finer spatial resolution, and improved temporal continuity, thereby addressing long-standing limitations of farmland transfer data in China. No other publicly available dataset provides comparable scope, reliability, and detail. Its release will enhance transparency in farmland markets and enable robust analyses of spatial and temporal dynamics, cross-regional comparisons, and the effects of institutional arrangements on land transactions. The dataset provides a valuable empirical foundation for advancing research in agricultural economics, rural development, and land use policy.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":" ","pages":""},"PeriodicalIF":6.9,"publicationDate":"2026-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147434918","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An ambient acoustic ice-fracturing dataset taken in shallow freshwater. 浅层淡水环境声压裂数据集。
IF 6.9 2区 综合性期刊 Q1 MULTIDISCIPLINARY SCIENCES Pub Date : 2026-03-11 DOI: 10.1038/s41597-026-06712-7
John Case, Andrew Barnard, Daniel Brown

This paper describes an acoustic dataset collected on a frozen shallow freshwater lake between February and March of 2024. This collection took place over one full week on Portage Lake in the Upper Peninsula of Michigan, USA. The first sub-dataset consists of ambient ice and environmental noises collected by an array of hydrophones, microphones and geophones placed below, above and on the ice respectively. The second sub-dataset consists of instrumented force hammer impacts at a series of locations on the the ice with the corresponding response at each acoustic sensor. All acoustic data were recorded at a sample rate fs = 51, 200 Hz. Corresponding local weather data is also provided. This dataset offer a rich look into the physics of ice cover, response to weather phenomena, and shallow water and ice cover waveguide acoustic propagation. The datasets consist of time series data from all sensors, array dimensions, array placement and hardware descriptions.

本文描述了2024年2月至3月在冰冻浅淡水湖上采集的声学数据集。这次收集在美国密歇根州上半岛的波蒂奇湖上进行了整整一周。第一个子数据集包括环境冰和环境噪声,这些噪声分别由一组水听器、麦克风和检波器收集,这些检波器分别放置在冰面的下方、上方和上方。第二个子数据集包括在冰面上一系列位置测量的力锤撞击,以及每个声学传感器的相应响应。所有的声学数据记录在采样率fs = 51,200 Hz。并提供相应的本地天气资料。该数据集提供了丰富的冰盖物理,对天气现象的响应,以及浅水和冰盖波导声传播。数据集包括来自所有传感器的时间序列数据、阵列尺寸、阵列位置和硬件描述。
{"title":"An ambient acoustic ice-fracturing dataset taken in shallow freshwater.","authors":"John Case, Andrew Barnard, Daniel Brown","doi":"10.1038/s41597-026-06712-7","DOIUrl":"https://doi.org/10.1038/s41597-026-06712-7","url":null,"abstract":"<p><p>This paper describes an acoustic dataset collected on a frozen shallow freshwater lake between February and March of 2024. This collection took place over one full week on Portage Lake in the Upper Peninsula of Michigan, USA. The first sub-dataset consists of ambient ice and environmental noises collected by an array of hydrophones, microphones and geophones placed below, above and on the ice respectively. The second sub-dataset consists of instrumented force hammer impacts at a series of locations on the the ice with the corresponding response at each acoustic sensor. All acoustic data were recorded at a sample rate f<sub>s</sub> = 51, 200 Hz. Corresponding local weather data is also provided. This dataset offer a rich look into the physics of ice cover, response to weather phenomena, and shallow water and ice cover waveguide acoustic propagation. The datasets consist of time series data from all sensors, array dimensions, array placement and hardware descriptions.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":" ","pages":""},"PeriodicalIF":6.9,"publicationDate":"2026-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147435005","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A hand biomechanics dataset of kinematics, kinetics, electromyography, and imaging in healthy adults. 健康成人手部运动学、动力学、肌电图和成像的生物力学数据集。
IF 6.9 2区 综合性期刊 Q1 MULTIDISCIPLINARY SCIENCES Pub Date : 2026-03-11 DOI: 10.1038/s41597-026-06939-4
Maximillian T Diaz, Alexis R Benoit, Kalyn M Kearney, Troy F Kelly, Erica M Lindbeck, Isaly Tappan, William S Bowers, Lavanya Durai, Justin B Nunag, Michael B Officer, Joel B Harley, Jennifer A Nichols

Developing musculoskeletal hand models requires a variety of experimental biomechanics data. However, collecting robust biomechanics hand data is a time intensive process leading to a lack of widely available datasets. To address this issue the biomechanics hand modeling database (BHaM) was made as a collection of experimental data to aid the development, testing, and validation of musculoskeletal models and simulations. BHaM includes two datasets: (1) a population dataset (n = 726 adults) describing hand strength (pinch and grip), self-reported hand function (Michigan Hand Questionnaire), and anthropometric measurements (from photographs), and (2) a biomechanics dataset (n = 30 adults) describing kinematics (marker-based motion capture), kinetics (isometric and isokinetic data), and electromyography (surface and fine wire) during 19 tasks across the elbow, wrist, and hand. A subset of the biomechanics dataset (n = 15 adults) also includes magnetic resonance imaging of the shoulder through wrist. Participants for both datasets were recruited to represent a diverse population of healthy adults, ranging from 18 to 91 years.

开发肌肉骨骼手模型需要多种实验生物力学数据。然而,收集健壮的生物力学手部数据是一个耗时的过程,导致缺乏广泛可用的数据集。为了解决这一问题,生物力学手建模数据库(BHaM)作为实验数据的集合来帮助开发、测试和验证肌肉骨骼模型和模拟。BHaM包括两个数据集:(1)人口数据集(n = 726名成年人),描述手的力量(握力和握力)、自我报告的手功能(密歇根手问卷)和人体测量(来自照片);(2)生物力学数据集(n = 30名成年人),描述在肘部、手腕和手的19项任务中的运动学(基于标记的运动捕捉)、动力学(等距和等距数据)和肌电图(表面和细线)。生物力学数据集的一个子集(n = 15名成年人)还包括肩部通过手腕的磁共振成像。这两个数据集的参与者被招募来代表不同的健康成年人群体,年龄从18岁到91岁。
{"title":"A hand biomechanics dataset of kinematics, kinetics, electromyography, and imaging in healthy adults.","authors":"Maximillian T Diaz, Alexis R Benoit, Kalyn M Kearney, Troy F Kelly, Erica M Lindbeck, Isaly Tappan, William S Bowers, Lavanya Durai, Justin B Nunag, Michael B Officer, Joel B Harley, Jennifer A Nichols","doi":"10.1038/s41597-026-06939-4","DOIUrl":"10.1038/s41597-026-06939-4","url":null,"abstract":"<p><p>Developing musculoskeletal hand models requires a variety of experimental biomechanics data. However, collecting robust biomechanics hand data is a time intensive process leading to a lack of widely available datasets. To address this issue the biomechanics hand modeling database (BHaM) was made as a collection of experimental data to aid the development, testing, and validation of musculoskeletal models and simulations. BHaM includes two datasets: (1) a population dataset (n = 726 adults) describing hand strength (pinch and grip), self-reported hand function (Michigan Hand Questionnaire), and anthropometric measurements (from photographs), and (2) a biomechanics dataset (n = 30 adults) describing kinematics (marker-based motion capture), kinetics (isometric and isokinetic data), and electromyography (surface and fine wire) during 19 tasks across the elbow, wrist, and hand. A subset of the biomechanics dataset (n = 15 adults) also includes magnetic resonance imaging of the shoulder through wrist. Participants for both datasets were recruited to represent a diverse population of healthy adults, ranging from 18 to 91 years.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":" ","pages":""},"PeriodicalIF":6.9,"publicationDate":"2026-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147435286","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Occurrence and environmental data for aquatic plants of Minnesota from 1999-2018. 1999-2018年明尼苏达州水生植物发生与环境数据
IF 6.9 2区 综合性期刊 Q1 MULTIDISCIPLINARY SCIENCES Pub Date : 2026-03-11 DOI: 10.1038/s41597-026-07027-3
Michael R Verhoeven, William L Bartodziej, Matthew S Berg, Simba Blood, Rachael Crabb, Eric Fieldseth, James A Johnson, Jimmy Marty, Steve McComas, Raymond M Newman, Meg Rattei, Jill B Sweet, Justin Townsend, Brian Vlach, Justin Valenty, Jerry P Spetzman, Susanna W Witkowski, Andrea Prichard, Wesley J Glisson, Daniel J Larkin

The aquatic flora of Minnesota's freshwater lakes have been extensively surveyed for purposes of resource assessment, research, and ecosystem management. Despite widespread use of a common method for vegetation sampling ("point-intercept surveys"), these records have existed to-date in disparate locations without unification. Here we present a first-of-its-kind dataset of point-level occurrences, relative abundances, and associated environmental data for macrophytes (freshwater plants) across Minnesota. The data encompass 3,194 surveys of 1,520 lakes and ponds performed over a 19-year timespan. A total of 367,382 points were sampled, across which 231 taxa were recorded. Macrophyte occurrence data and depth, as well as point-level relative-plant-abundance measures for a subset of surveys, were collated, cleaned, and joined to geospatial data and Secchi-depth measurements of water clarity, enabling light availability, a primary control on aquatic plant growth, to be estimated. The data are well-suited for ecological analyses across multiple spatial scales and can be used to address both fundamental and applied ecological questions.

为了资源评估、研究和生态系统管理的目的,对明尼苏达州淡水湖的水生植物群进行了广泛的调查。尽管广泛使用了一种常见的植被采样方法(“点截调查”),但迄今为止,这些记录存在于不同的地点,没有统一。在这里,我们提出了首个类似的数据集,包括明尼苏达州大型植物(淡水植物)的点水平发生率、相对丰度和相关环境数据。这些数据包括在19年的时间跨度内对1520个湖泊和池塘进行的3194次调查。共采集点367382个,记录类群231个。大型植物的发生数据和深度,以及调查子集的点水平相对植物丰度测量,被整理、清理,并与地理空间数据和水清晰度的Secchi-depth测量相结合,使光可用性(水生植物生长的主要控制因素)得以估计。这些数据非常适合用于跨空间尺度的生态分析,可用于解决基础和应用生态问题。
{"title":"Occurrence and environmental data for aquatic plants of Minnesota from 1999-2018.","authors":"Michael R Verhoeven, William L Bartodziej, Matthew S Berg, Simba Blood, Rachael Crabb, Eric Fieldseth, James A Johnson, Jimmy Marty, Steve McComas, Raymond M Newman, Meg Rattei, Jill B Sweet, Justin Townsend, Brian Vlach, Justin Valenty, Jerry P Spetzman, Susanna W Witkowski, Andrea Prichard, Wesley J Glisson, Daniel J Larkin","doi":"10.1038/s41597-026-07027-3","DOIUrl":"https://doi.org/10.1038/s41597-026-07027-3","url":null,"abstract":"<p><p>The aquatic flora of Minnesota's freshwater lakes have been extensively surveyed for purposes of resource assessment, research, and ecosystem management. Despite widespread use of a common method for vegetation sampling (\"point-intercept surveys\"), these records have existed to-date in disparate locations without unification. Here we present a first-of-its-kind dataset of point-level occurrences, relative abundances, and associated environmental data for macrophytes (freshwater plants) across Minnesota. The data encompass 3,194 surveys of 1,520 lakes and ponds performed over a 19-year timespan. A total of 367,382 points were sampled, across which 231 taxa were recorded. Macrophyte occurrence data and depth, as well as point-level relative-plant-abundance measures for a subset of surveys, were collated, cleaned, and joined to geospatial data and Secchi-depth measurements of water clarity, enabling light availability, a primary control on aquatic plant growth, to be estimated. The data are well-suited for ecological analyses across multiple spatial scales and can be used to address both fundamental and applied ecological questions.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":" ","pages":""},"PeriodicalIF":6.9,"publicationDate":"2026-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147434956","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Chromosomal-level genome assembly of Ichthyurus bourgeoisi Gestro using PacBio HiFi and Hi-C sequencing. PacBio - HiFi和Hi-C测序技术对小鳞鱼染色体水平基因组的组装。
IF 6.9 2区 综合性期刊 Q1 MULTIDISCIPLINARY SCIENCES Pub Date : 2026-03-11 DOI: 10.1038/s41597-026-07039-z
Yuxia Yang, Yiyang Zhen, Zheng Yang, Jiliang Wang, Jianxin Hua, Haoyu Liu

Ichthyurus bourgeoisi Gestro is a representative species of the tribe Ichthyurini within the beetle family Cantharidae. This tribe is particularly noteworthy because of its brachelytrous characteristics. However, the lack of high-quality genomic resources hinders our understanding of the evolution and ecological adaptions associated with this beetle group. In this study, we present a chromosome-level genome assembly for I. bourgeoisi constructed using a combination of PacBio HiFi and Hi-C sequencing data. The genome spans 664.72 Mb, with a scaffold N50 of 98.12 Mb, and is organized into seven pseudo-chromosomes, including a chromosome X validated through analyses of genome collinearity and sequencing depth. Repeat sequences account for 65.35% of the genome, and 13,386 protein-coding genes are identified. The high-quality genome assembly and annotation has been corroborated by multiple metrics, including genome size, reads mapping rate, and BUSCO completeness (98.6%). This comprehensive genomic resource provides a foundation for elucidating the ecological adaption of I. bourgeoisi and advancing our understanding of morphological evolution in Ichthyurini within Cantharidae.

小鳞鱼(Ichthyurus bourgeoisi Gestro)是斑蝥科鱼鳞鱼族的代表种。这个部落特别值得注意,因为它的短苞特征。然而,缺乏高质量的基因组资源阻碍了我们对这种甲虫群的进化和生态适应的理解。在这项研究中,我们提出了利用PacBio HiFi和Hi-C测序数据组合构建的I. bourgeoisi染色体水平基因组组装。该基因组全长664.72 Mb,骨架N50为98.12 Mb,由7条伪染色体组成,其中一条X染色体通过基因组共线性分析和测序深度验证。重复序列占基因组的65.35%,鉴定出13386个蛋白质编码基因。高质量的基因组组装和注释得到了多个指标的证实,包括基因组大小、reads定位率和BUSCO完整性(98.6%)。这一全面的基因组资源为阐明小鳞鱼的生态适应性和深入了解斑蝥科鱼尾鱼的形态进化提供了基础。
{"title":"Chromosomal-level genome assembly of Ichthyurus bourgeoisi Gestro using PacBio HiFi and Hi-C sequencing.","authors":"Yuxia Yang, Yiyang Zhen, Zheng Yang, Jiliang Wang, Jianxin Hua, Haoyu Liu","doi":"10.1038/s41597-026-07039-z","DOIUrl":"https://doi.org/10.1038/s41597-026-07039-z","url":null,"abstract":"<p><p>Ichthyurus bourgeoisi Gestro is a representative species of the tribe Ichthyurini within the beetle family Cantharidae. This tribe is particularly noteworthy because of its brachelytrous characteristics. However, the lack of high-quality genomic resources hinders our understanding of the evolution and ecological adaptions associated with this beetle group. In this study, we present a chromosome-level genome assembly for I. bourgeoisi constructed using a combination of PacBio HiFi and Hi-C sequencing data. The genome spans 664.72 Mb, with a scaffold N50 of 98.12 Mb, and is organized into seven pseudo-chromosomes, including a chromosome X validated through analyses of genome collinearity and sequencing depth. Repeat sequences account for 65.35% of the genome, and 13,386 protein-coding genes are identified. The high-quality genome assembly and annotation has been corroborated by multiple metrics, including genome size, reads mapping rate, and BUSCO completeness (98.6%). This comprehensive genomic resource provides a foundation for elucidating the ecological adaption of I. bourgeoisi and advancing our understanding of morphological evolution in Ichthyurini within Cantharidae.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":" ","pages":""},"PeriodicalIF":6.9,"publicationDate":"2026-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147434962","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A high-resolution spatiotemporal wildfire propagation dataset for the Mediterranean and Europe. 地中海和欧洲野火传播的高分辨率时空数据集。
IF 6.9 2区 综合性期刊 Q1 MULTIDISCIPLINARY SCIENCES Pub Date : 2026-03-11 DOI: 10.1038/s41597-026-06965-2
Simon Müller, Anja Hofmann-Böllinghaus, Zhimin Chen, Kristin Vogel, Philipp Benner

Wildfires are becoming more frequent and severe under the influence of climate change, posing increasing risks to ecosystems, human health, and infrastructure. Accurate spatiotemporal data on wildfire propagation is essential for advancing fire behavior modeling, improving management strategies, and mitigating future impacts. However, existing datasets with both high spatial and temporal resolution are rare, costly, and time-consuming to produce. To address this gap, we present FireSpread_MedEU, a dataset comprising 320 consecutive burned area maps from 103 wildfire events across the Mediterranean and Europe between 2017 and 2023. Burned areas were derived from high-resolution Planet optical satellite imagery (~3 m spatial, mostly daily temporal resolution) using a semi-automated workflow, followed by manual refinement to ensure highest accuracy. Each dataset entry is enriched with detailed metadata and a subjective quality assessment. With its high level of spatiotemporal precision, FireSpread_MedEU provides essential data for the development and validation of machine learning models or wildfire simulation models. It opens new research opportunities in wildfire behavior analysis, risk assessment, and predictive modeling.

在气候变化的影响下,野火变得越来越频繁和严重,对生态系统、人类健康和基础设施构成越来越大的风险。准确的野火传播时空数据对于推进火灾行为建模、改进管理策略和减轻未来影响至关重要。然而,现有的具有高时空分辨率的数据集很少,成本高且耗时长。为了解决这一差距,我们提出了FireSpread_MedEU数据集,该数据集包括2017年至2023年间地中海和欧洲103起野火事件的320个连续烧伤区域地图。燃烧区域是从高分辨率的行星光学卫星图像(约3米的空间,大部分是每日时间分辨率)中提取的,使用半自动化的工作流程,然后进行手动细化以确保最高的精度。每个数据集条目都包含详细的元数据和主观质量评估。fireread_medeu具有高水平的时空精度,为开发和验证机器学习模型或野火模拟模型提供了必要的数据。它为野火行为分析、风险评估和预测建模开辟了新的研究机会。
{"title":"A high-resolution spatiotemporal wildfire propagation dataset for the Mediterranean and Europe.","authors":"Simon Müller, Anja Hofmann-Böllinghaus, Zhimin Chen, Kristin Vogel, Philipp Benner","doi":"10.1038/s41597-026-06965-2","DOIUrl":"10.1038/s41597-026-06965-2","url":null,"abstract":"<p><p>Wildfires are becoming more frequent and severe under the influence of climate change, posing increasing risks to ecosystems, human health, and infrastructure. Accurate spatiotemporal data on wildfire propagation is essential for advancing fire behavior modeling, improving management strategies, and mitigating future impacts. However, existing datasets with both high spatial and temporal resolution are rare, costly, and time-consuming to produce. To address this gap, we present FireSpread_MedEU, a dataset comprising 320 consecutive burned area maps from 103 wildfire events across the Mediterranean and Europe between 2017 and 2023. Burned areas were derived from high-resolution Planet optical satellite imagery (~3 m spatial, mostly daily temporal resolution) using a semi-automated workflow, followed by manual refinement to ensure highest accuracy. Each dataset entry is enriched with detailed metadata and a subjective quality assessment. With its high level of spatiotemporal precision, FireSpread_MedEU provides essential data for the development and validation of machine learning models or wildfire simulation models. It opens new research opportunities in wildfire behavior analysis, risk assessment, and predictive modeling.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":" ","pages":""},"PeriodicalIF":6.9,"publicationDate":"2026-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12992773/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147435260","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A map of high-altitude wetlands in the world's major mountain regions. 世界主要山区的高海拔湿地地图。
IF 6.9 2区 综合性期刊 Q1 MULTIDISCIPLINARY SCIENCES Pub Date : 2026-03-11 DOI: 10.1038/s41597-026-07020-w
Rike Becker, Jan Kropáček, Anthony C Ross, Tom Gribbin, Fabian Drenkhan, Lilia Hernandez Sotelo, Marc Martinez Mendoza, Bethan Davies, Jeremy Ely, Wouter Buytaert

We present a first global high-resolution map (30 m x 30 m) of high-altitudinal wetlands in the world's major mountain regions, i.e. the Andes, Rocky Mountains, Alps and High Mountain Asia. To map these wetlands, we employed a supervised classification approach using a random forest machine learning model and a selected set of predictors including vegetation, topographic, and surface moisture features. The predictors were derived from freely available radar and optical satellite imagery (Sentinel-1 and Sentinel-2), SRTM elevation data, and the global ecoregion map RESOLVE. We identify a total area of >30,500 km2 of high-mountain wetlands. With this map we aim to enhance the understanding of wetland distribution in remote and often inaccessible mountain regions and enable a more reliable understanding of their role in the ecosystem functioning and water cycles of high mountain areas.

我们提出了第一个全球高分辨率地图(30米× 30米)的高海拔湿地在世界主要山区,即安第斯山脉,落基山脉,阿尔卑斯山和亚洲高山。为了绘制这些湿地的地图,我们采用了一种监督分类方法,使用随机森林机器学习模型和一组选定的预测因子,包括植被、地形和地表湿度特征。预测因子来源于可免费获得的雷达和光学卫星图像(Sentinel-1和Sentinel-2)、SRTM高程数据和全球生态区域图RESOLVE。我们确定了总面积为30,500 km2的高山湿地。通过这张地图,我们的目标是加强对偏远和经常难以到达的山区湿地分布的了解,并使它们在高山区生态系统功能和水循环中的作用得到更可靠的理解。
{"title":"A map of high-altitude wetlands in the world's major mountain regions.","authors":"Rike Becker, Jan Kropáček, Anthony C Ross, Tom Gribbin, Fabian Drenkhan, Lilia Hernandez Sotelo, Marc Martinez Mendoza, Bethan Davies, Jeremy Ely, Wouter Buytaert","doi":"10.1038/s41597-026-07020-w","DOIUrl":"https://doi.org/10.1038/s41597-026-07020-w","url":null,"abstract":"<p><p>We present a first global high-resolution map (30 m x 30 m) of high-altitudinal wetlands in the world's major mountain regions, i.e. the Andes, Rocky Mountains, Alps and High Mountain Asia. To map these wetlands, we employed a supervised classification approach using a random forest machine learning model and a selected set of predictors including vegetation, topographic, and surface moisture features. The predictors were derived from freely available radar and optical satellite imagery (Sentinel-1 and Sentinel-2), SRTM elevation data, and the global ecoregion map RESOLVE. We identify a total area of >30,500 km<sup>2</sup> of high-mountain wetlands. With this map we aim to enhance the understanding of wetland distribution in remote and often inaccessible mountain regions and enable a more reliable understanding of their role in the ecosystem functioning and water cycles of high mountain areas.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":" ","pages":""},"PeriodicalIF":6.9,"publicationDate":"2026-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147434927","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Bounding the costs of electric vehicle managed charging-supply curves for scenarios from 2025 to 2050. 限制电动汽车的成本管理2025年至2050年情景的充电供应曲线。
IF 6.9 2区 综合性期刊 Q1 MULTIDISCIPLINARY SCIENCES Pub Date : 2026-03-11 DOI: 10.1038/s41597-026-07008-6
Reiko Matsuda-Dunn, Elaine Hale, Ellie Estreich, Luke Lavin, Gabriel Konar-Steenberg

As electric vehicle (EV) adoption increases, the resulting EV battery charging will increase demand on the electric power grid. Through EV managed charging (EVMC) programs, charging can be shifted in time to support electric grid reliability and reduce electricity costs. EVMC can offer an alternative to additional supply-side generation, but the costs of EVMC implementation must be understood to evaluate the cost-benefits of EVMC. This paper presents bottom-up, forward-looking (from 2025 through 2050) estimates of the incremental costs associated with different EVMC dispatch mechanisms available to electric utilities. The costs of enabling EVMC for a range of customer participation levels are presented in the form of supply curves, which provide per-EV costs for a targeted level of participation. The largest drivers of cost variation are assumptions about future charging flexibility paradigms described in four scenarios. These supply curves can be used to quantify the expected costs of EVMC programs and enable comparison with supply-side or other demand flexibility alternatives.

随着电动汽车的普及,由此产生的电动汽车电池充电将增加对电网的需求。通过电动汽车管理充电(EVMC)计划,充电可以及时转移,以支持电网可靠性并降低电力成本。EVMC可以为额外的供方发电提供一种替代方案,但必须了解EVMC实施的成本,才能评估EVMC的成本效益。本文提出了自下而上的、前瞻性的(从2025年到2050年)对电力公司可用的不同EVMC调度机制相关的增量成本的估计。在一定的客户参与水平下实现EVMC的成本以供给曲线的形式呈现,该曲线提供了目标参与水平下的每辆电动汽车成本。成本变化的最大驱动因素是对四种情景中描述的未来收费灵活性范式的假设。这些供给曲线可以用来量化EVMC项目的预期成本,并与供应侧或其他需求灵活性替代方案进行比较。
{"title":"Bounding the costs of electric vehicle managed charging-supply curves for scenarios from 2025 to 2050.","authors":"Reiko Matsuda-Dunn, Elaine Hale, Ellie Estreich, Luke Lavin, Gabriel Konar-Steenberg","doi":"10.1038/s41597-026-07008-6","DOIUrl":"https://doi.org/10.1038/s41597-026-07008-6","url":null,"abstract":"<p><p>As electric vehicle (EV) adoption increases, the resulting EV battery charging will increase demand on the electric power grid. Through EV managed charging (EVMC) programs, charging can be shifted in time to support electric grid reliability and reduce electricity costs. EVMC can offer an alternative to additional supply-side generation, but the costs of EVMC implementation must be understood to evaluate the cost-benefits of EVMC. This paper presents bottom-up, forward-looking (from 2025 through 2050) estimates of the incremental costs associated with different EVMC dispatch mechanisms available to electric utilities. The costs of enabling EVMC for a range of customer participation levels are presented in the form of supply curves, which provide per-EV costs for a targeted level of participation. The largest drivers of cost variation are assumptions about future charging flexibility paradigms described in four scenarios. These supply curves can be used to quantify the expected costs of EVMC programs and enable comparison with supply-side or other demand flexibility alternatives.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":" ","pages":""},"PeriodicalIF":6.9,"publicationDate":"2026-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147434951","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Scientific Data
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1