
Scientific Data: Latest Articles

Comprehensive compilation and quality assessment of street-level urban air temperature measurements across European networks.
IF 6.9 Zone 2 Comprehensive Journals Q1 MULTIDISCIPLINARY SCIENCES Pub Date: 2026-02-14 DOI: 10.1038/s41597-026-06804-4
Setareh Amini, Adrian Huerta, Jörg Franke, Yuri Brugnara, Steven Caluwaerts, Julien Anet, Stevan Savić, Moritz Gubler, Gert-Jan Steeneveld, Lee Chapman, Fred Meier, Vincent Dubreuil, Andreas Christen, Matthias Zeeman, Branislava Lalić, Sebastian Schlögl, Jukka Käyhkö, AmirMasoud Azadfar, Stefan Brönnimann

This study provides a comprehensive dataset (FAIRUrbTemp) that addresses the lack of high-resolution urban air temperature data across Europe. It compiles sub-hourly street-level air temperature data from 811 low-cost to commercial sensors across several European cities and offers data in a quality-controlled, standardized format in sub-hourly, hourly, and daily resolutions. In addition, detailed metadata, as an important source of information in urban studies, is provided at network, station, and measurement levels. This pan-European dataset is rigorously quality-controlled using a serially automatic method applicable to diverse city-scale air temperature data, which identifies systematic and minor inconsistencies to enhance reliability. Expert-based validation shows that the QC reliably identifies problematic measurements, while its performance varies across urban and climatic settings due to local environmental and instrumental effects. To ensure transparency, the results of the quality control are provided to the user together with the original value in the dataset. The validated FAIRUrbTemp is a valuable resource for urban climate studies, with direct applications in validating microclimate models, assessing heat-health risks, and informing climate-adaptive urban planning.
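The idea of shipping quality-control results alongside the original values, and of deriving hourly data from sub-hourly readings, can be illustrated with a minimal sketch. The record layout, the flag convention (0 = passed, 1 = flagged) and the function name `hourly_means` are assumptions made for illustration; they are not the FAIRUrbTemp file format or the authors' QC method.

```python
from collections import defaultdict
from datetime import datetime
from statistics import mean

# Hypothetical records: (ISO timestamp, air temperature in degrees Celsius, QC flag).
# Assumed flag convention: 0 = passed QC, 1 = flagged. The original value is
# always kept, so a user can decide how to treat flagged readings.
readings = [
    ("2023-07-01T12:00:00", 24.0, 0),
    ("2023-07-01T12:10:00", 24.4, 0),
    ("2023-07-01T12:20:00", 39.9, 1),  # flagged, but not deleted
    ("2023-07-01T13:00:00", 25.0, 0),
    ("2023-07-01T13:30:00", 25.4, 0),
]


def hourly_means(records):
    """Average the readings that passed QC within each clock hour."""
    buckets = defaultdict(list)
    for stamp, value, flag in records:
        if flag == 0:
            hour = datetime.fromisoformat(stamp).replace(minute=0, second=0)
            buckets[hour].append(value)
    return {
        hour.isoformat(): round(mean(values), 2)
        for hour, values in sorted(buckets.items())
    }


hourly = hourly_means(readings)
print(hourly)
```

Keeping the flag next to the raw value, rather than dropping flagged rows, lets a downstream user apply a stricter or looser filter than the one sketched here.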

Citations: 0
Near telomere-to-telomere diploid genome assembly of Acrossocheilus wenchowensis.
IF 6.9 Zone 2 Comprehensive Journals Q1 MULTIDISCIPLINARY SCIENCES Pub Date: 2026-02-14 DOI: 10.1038/s41597-026-06752-z
Lingzhan Xue, Mingkun Luo, Haoyu Wang, Wenbin Zhu, Duhuang Chen, Gaoxiong Zeng, Mengxiang Liao, Ji Zhao, Bin Wu, Luohao Xu, Zaijie Dong

Acrossocheilus wenchowensis is a lukewarm-water fish found in southern Chinese mountain streams, valued for both ornamental and edible purposes. We assembled a near telomere-to-telomere (T2T) genome using HiFi, ONT, Hi-C and Illumina data. The assembly is approximately 870.69 Mb with a contig N50 of about 21.28 Mb. Among these, 14 chromosomes in Hap1 and 15 chromosomes in Hap2 have reached T2T levels. A total of 24,909 protein-coding genes were predicted in Hap1 and 24,496 in Hap2, with BUSCO scores of 97.4% and 97.6%, respectively. A conserved centromeric satellite sequence (262 bp) derived from an LTR transposon was identified. Comparative genomics showed that Acrossocheilus and Onychostoma diverged approximately 13.7 million years ago (Mya), while A. wenchowensis diverged from A. fasciatus about 5.25 Mya. Resequencing of four geographic populations of A. wenchowensis revealed distinct genetic structure in the LY group compared to the other populations based on SNP and InDel analysis. This genome provides a framework for diploid T2T studies in fish and supports further functional genomics research.

Citations: 0
A Defect Dataset for Electrode Coating Manufacturing.
IF 6.9 Zone 2 Comprehensive Journals Q1 MULTIDISCIPLINARY SCIENCES Pub Date: 2026-02-14 DOI: 10.1038/s41597-025-06419-1
Vignesh Sampath, Andrew S Lee, Samuel David Miller, Noah H Paulson, Yuepeng Zhang, Logan Ward

Electrodes are key components of many energy storage and energy conversion devices such as batteries and fuel cells. Defects in electrodes can significantly influence device performance and reliability and thus need to be monitored and eliminated during the electrode manufacturing process. Advancements in in-line metrology, computer vision, and machine learning have enabled the development of integrated hardware-software systems for automated defect detection and diagnostics. While several manufacturing domains have published defect datasets to support such efforts, no public datasets specific to electrode coating processes are available. To fill this gap and support research on defect detection for automated coating processes, we present CoatingVision, a comprehensive dataset of slot-die coating images with labeled defect types. This dataset supports a diverse range of image recognition tasks, including defect segmentation, defect detection, and multi-label classification. It includes high-resolution images with associated labels for common defects such as surface cracks, delamination cracks, pinholes, and unclassified defects. To facilitate benchmarking and reproducible research, CoatingVision is packaged with an open-source codebase that enables comparative evaluation of AI models and hyperparameter configurations. The dataset has been meticulously curated to ensure high quality and consistency, providing researchers with reliable data for training and evaluating computer vision models. With over 2,200 image samples acquired under various processing conditions, CoatingVision offers a robust foundation for developing automated defect detection systems. It promotes deeper insights into defect formation in coating manufacturing processes, which can be used to advance various coating-related applications including batteries and fuel cells.

Citations: 0
Global dataset on heat wave exposure due to the urban heat island effect.
IF 6.9 Zone 2 Comprehensive Journals Q1 MULTIDISCIPLINARY SCIENCES Pub Date: 2026-02-14 DOI: 10.1038/s41597-026-06877-1
Wenbo Yu, Jun Yang, Yuyu Zhou, Xiangming Xiao

Continuing global warming and urbanization have increased the frequency and severity of extreme heat events in cities. Therefore, understanding how the urban heat island (UHI) effect influences cities is essential for developing effective mitigation and prevention strategies. A 1-km resolution dataset was constructed to assess heat-wave exposure attributable to UHIs in urban human settlements worldwide from 2003 to 2020. An adaptive urban-rural threshold method was employed to delineate the spatial extent of UHI impacts, and a spatiotemporally fitted MODIS surface temperature dataset was used to address missing data caused by cloud contamination. This dataset explicitly separates the contributions of background climate, local landscape characteristics, and urbanization to heat wave exposure, providing a scientific basis for identifying key UHI mitigation areas and developing heat wave risk early warning models that account for UHI effects. The proposed methodology and dataset support synergistic decision-making for integrating urban climate adaptation with sustainable development, and the technical framework can be extended to studies of UHIs and heat wave exposure in other regions worldwide.

Citations: 0
Assembling a chromosome-level genome for the Microtus fortis using PacBio HiFi and Hi-C technologies.
IF 6.9 Zone 2 Comprehensive Journals Q1 MULTIDISCIPLINARY SCIENCES Pub Date: 2026-02-14 DOI: 10.1038/s41597-026-06813-3
Du Zhang, Qi Hu, Tianqiong He, Junkang Zhou, Yixin Wen, Qian Liu, Jing Zhang, Wenlin Zhi, Lingxuan Ouyang, Suisui Gao, Ruotong Guan, Zhijun Zhou

The reed vole (Microtus fortis) is an important rodent model for studying unique biological traits, such as its natural resistance to Schistosoma japonicum. To facilitate the genetic study of these phenotypes, we have produced the first high-quality, chromosome-level genome assembly for this species. The genome was assembled using PacBio HiFi long-read sequencing and scaffolded to the chromosome level with Hi-C data. The final 2.29 Gb assembly exhibits excellent continuity (contig N50 = 68.89 Mb; scaffold N50 = 91.23 Mb), with 97.7% of the sequence anchored into 26 pseudomolecules, consistent with the species' karyotype. Genome completeness was estimated at 96.3% via BUSCO analysis (glires_odb10). The annotation includes 23,678 protein-coding genes, with 97.5% assigned a putative function. This publicly available, high-quality genomic resource will be invaluable for future research, providing the necessary foundation to explore the genetic mechanisms behind the unique adaptations of M. fortis, including its innate immunity, digestive physiology, and disease models. The assembly will also serve as a key reference for comparative genomics, enriching our understanding of rodent evolution.
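The contig and scaffold N50 figures quoted above follow a standard definition: sort the sequences from longest to shortest and report the length at which the running total first reaches half of the assembly size. A minimal sketch of that calculation is shown below; the toy lengths are invented for illustration and are not taken from this assembly.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    N50 is the length L such that sequences of length >= L together
    cover at least half of the total assembled bases.
    """
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length
    raise ValueError("lengths must not be empty")


# Invented toy contig lengths (arbitrary units), total = 300.
toy_contigs = [80, 70, 50, 40, 30, 20, 10]
print(n50(toy_contigs))  # 80 + 70 = 150 reaches half of 300, so N50 = 70
```

The same function applies to scaffold lengths; only the input list changes.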

Citations: 0
A dataset of paired blood mRNA and microRNA sequencing across acute septic shock and recovery.
IF 6.9 Zone 2 Comprehensive Journals Q1 MULTIDISCIPLINARY SCIENCES Pub Date: 2026-02-14 DOI: 10.1038/s41597-026-06844-w
Krisztina Molnár, Katalin Maricza, Zsuzsanna Elek, Réka Kovács-Nagy, Ábel Fóthi, Zsófia Bánlaki, Eszter Losoncz, Bernadett Húri, János Kádas, Gergely Keszler, Zsolt Rónai

An adequate immune response is responsible for eradicating pathogens and reestablishing tissue homeostasis upon infection. However, in certain patients, immune processes become dysregulated, leading to sepsis, which often results in life-threatening organ dysfunction, and the progression to septic shock is associated with mortality rates of up to 70-80%. The objective of the present dataset is to facilitate the identification of transcriptomic signatures characteristic of septic shock and of the stable, non-critical condition. A total of six patients were included in the study, with blood samples collected at two stages of the disease: during septic shock and at the time of discharge from the intensive care unit. Following total RNA isolation, mRNA and microRNA levels were determined by NGS. The dataset's significance rests on two features: two samples from the same patient were analyzed, so that observed alterations in transcript levels can be related to the change in medical condition, and the analysis included both mRNAs and microRNAs, enabling a comprehensive gene expression and regulatory study.

Citations: 0
A chromosomal haplotype-resolved genome assembly of Cuphea hookeriana.
IF 6.9 Zone 2 Comprehensive Journals Q1 MULTIDISCIPLINARY SCIENCES Pub Date: 2026-02-13 DOI: 10.1038/s41597-026-06830-2
Cuihua Gu, Jie Wang, Guozhe Zhang, Dan Peng, Zhibin Li, Simei Ren, Zhiqiang Wu, Liyuan Yang

Cuphea hookeriana (Lythraceae) is an evergreen flowering shrub native to tropical and subtropical regions, serving as an ornamental plant in landscaping and as a raw material for industry and agriculture. Here, we assembled a haplotype-resolved chromosomal genome for C. hookeriana (2n = 3x = 24) based on HiFi and Hi-C sequencing datasets. The genome was assembled into 16 chromosomes with a total size of 498,761,099 bp, comprising 248,233,948 bp and 240,972,969 bp for haplotypes A and B, respectively. BUSCO evaluation indicated an assembly completeness of 97.7% (C value). Approximately 30,000 genes were annotated for each haplotype, achieving a BUSCO completeness score of 97%.

Citations: 0
The Citizens Survey 2022-23: a household-level dataset on Universal Health Coverage in India.
IF 6.9 Zone 2 Comprehensive Journals Q1 MULTIDISCIPLINARY SCIENCES Pub Date: 2026-02-13 DOI: 10.1038/s41597-026-06775-6
Anuska Kalita, Siddhesh Zadey, Sudheer Kumar Shukla, Shubhangi Bhadada, Sumit Kane, Dolon Roy, Mukund Kumar Chandan, Jashanjot Singh Mangat, Preeyati Chopra, Sarika Chaturvedi, Sonia Bhalotra, S V Subramanian, Vikram Patel

The pursuit of Universal Health Coverage (UHC) in India is particularly challenging given the country's vast population and pronounced socioeconomic disparities. Although extensive research addresses specific healthcare areas, contemporary data on citizens' healthcare access, quality, and preferences to inform UHC design are lacking. To bridge this gap, the Lancet Commission on a Citizen-Centred Health System for India conducted a Citizens Survey from November 2022 to April 2023, interviewing respondents in person in 50,000 randomly selected households across 125 districts in 29 Indian states and Union Territories. The survey comprised 141 questions covering healthcare utilization, experiences, costs, satisfaction, delivery preferences, insurance coverage, willingness to pay, health information behaviors, technology use, aspirational health norms, and electoral attitudes towards health. The survey had a high participation rate (98%) and a low non-response rate (9.5%). Of the households, 70% were rural; 56% of respondents were male, 79% were Hindu, and 39% identified as Scheduled Caste or Tribes. The data aim to inform citizen-centric reforms, advancing a UHC responsive to India's diverse population needs.

Citations: 0
A dataset of tourist mobility networks across China derived from online travel blogs.
IF 6.9 Zone 2 Multidisciplinary Journals Q1 MULTIDISCIPLINARY SCIENCES Pub Date: 2026-02-13 DOI: 10.1038/s41597-026-06780-9
Yunhao Zheng, Jinhua Wang, Yi Zhang, Naixia Mou, Yu Liu

Tourism now involves increasingly intense flows of people, making it imperative to explore tourism space through the lens of mobility. To examine nationwide tourist mobility, this study collected online travel blog data from Qunar.com, a leading travel services platform in China, and used them to construct tourist mobility networks across China. In these networks, attractions are represented as nodes, while tourist movements between them, derived from blog data, are represented as weighted and directed edges. To capture different travel contexts, the study also developed mobility networks categorized by departure season and travel partners. All networks are released in a simple, accessible format to support future research.
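
The network representation described in this abstract can be illustrated with a minimal sketch. It assumes each blog has already been reduced to an ordered list of attraction names; the function name `build_mobility_network`, the sample itineraries, and the rule of skipping consecutive repeats are hypothetical choices for illustration, not the authors' published pipeline.

```python
from collections import Counter
from typing import Iterable, Sequence


def build_mobility_network(itineraries: Iterable[Sequence[str]]) -> Counter:
    """Count directed moves between consecutive attractions.

    Each key is an (origin, destination) pair, i.e. a directed edge;
    each value is the number of itineraries containing that move,
    i.e. the edge weight.
    """
    edges: Counter = Counter()
    for stops in itineraries:
        # Pair every stop with the one visited immediately after it.
        for origin, destination in zip(stops, stops[1:]):
            if origin != destination:  # assumption: ignore consecutive repeats
                edges[(origin, destination)] += 1
    return edges


# Hypothetical itineraries, each standing in for one travel blog.
itineraries = [
    ["Forbidden City", "Summer Palace", "Great Wall"],
    ["Forbidden City", "Summer Palace"],
    ["Great Wall", "Forbidden City"],
]

network = build_mobility_network(itineraries)
for (origin, destination), weight in sorted(network.items()):
    print(f"{origin} -> {destination}: {weight}")
```

Splitting the input itineraries by season or by travel partner before calling the function would yield one network per category, which is one plausible way to obtain the categorized networks the abstract mentions.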

Citations: 0
A full-length mtDNA dataset for studying genetic variations across generations and complex family structures.
IF 6.9 Zone 2 Multidisciplinary Journals Q1 MULTIDISCIPLINARY SCIENCES Pub Date: 2026-02-13 DOI: 10.1038/s41597-026-06824-0
Yanan Liu, Qi Yang, Yujia Xuan, Jinyuan Zhao, Anqi Chen, Suhua Zhang

Mitochondrial DNA (mtDNA) mutations are critical to disease research, evolutionary studies, and lineage tracing but are challenging to analyze due to interference from nuclear mitochondrial sequences (NUMTs). Current high-throughput sequencing techniques rely on multiple primers or probes to amplify short mtDNA fragments, followed by alignment to a reference genome. However, this approach fails to mitigate NUMT interference, leading to ambiguous results. In this study, we present a nanopore-based third-generation sequencing (TGS) method that uses a single primer pair to amplify full-length mtDNA, effectively circumventing NUMT artifacts. Sequencing was carried out on the QITAN TECH QNome-3841hex platform, generating complete mtDNA coverage for 106 samples from eight distinct family pedigrees, including complex familial structures such as half-siblings and multi-generational households. The sequencing achieved 100% genome coverage with an average mapping rate of 99.96%, supporting comprehensive genome characterization. The resulting dataset offers valuable insights into mtDNA mutation detection, mitochondrial genetics, population genetics, ancestry tracing, and forensic identification, and may advance mtDNA sequencing technologies and intergenerational studies.

Citations: 0