Pub Date : 2026-02-14DOI: 10.1038/s41597-026-06804-4
Setareh Amini, Adrian Huerta, Jörg Franke, Yuri Brugnara, Steven Caluwaerts, Julien Anet, Stevan Savić, Moritz Gubler, Gert-Jan Steeneveld, Lee Chapman, Fred Meier, Vincent Dubreuil, Andreas Christen, Matthias Zeeman, Branislava Lalić, Sebastian Schlögl, Jukka Käyhkö, AmirMasoud Azadfar, Stefan Brönnimann
This study provides a comprehensive dataset (FAIRUrbTemp) that addresses the lack of high-resolution urban air temperature data across Europe. It compiles sub-hourly street-level air temperature data from 811 low-cost to commercial sensors across several European cities and offers data in a quality-controlled, standardized format in sub-hourly, hourly, and daily resolutions. In addition, detailed metadata, as an important source of information in urban studies, is provided at network, station, and measurement levels. This pan-European dataset is rigorously quality-controlled using a serially automatic method applicable to diverse city-scale air temperature data, which identifies systematic and minor inconsistencies to enhance reliability. Expert-based validation shows that the QC reliably identifies problematic measurements, while its performance varies across urban and climatic settings due to local environmental and instrumental effects. To ensure transparency, the results of the quality control are provided to the user together with the original value in the dataset. The validated FAIRUrbTemp is a valuable resource for urban climate studies, with direct applications in validating microclimate models, assessing heat-health risks, and informing climate-adaptive urban planning.
{"title":"Comprehensive compilation and quality assessment of street-level urban air temperature measurements across European networks.","authors":"Setareh Amini, Adrian Huerta, Jörg Franke, Yuri Brugnara, Steven Caluwaerts, Julien Anet, Stevan Savić, Moritz Gubler, Gert-Jan Steeneveld, Lee Chapman, Fred Meier, Vincent Dubreuil, Andreas Christen, Matthias Zeeman, Branislava Lalić, Sebastian Schlögl, Jukka Käyhkö, AmirMasoud Azadfar, Stefan Brönnimann","doi":"10.1038/s41597-026-06804-4","DOIUrl":"https://doi.org/10.1038/s41597-026-06804-4","url":null,"abstract":"<p><p>This study provides a comprehensive dataset (FAIRUrbTemp) that addresses the lack of high-resolution urban air temperature data across Europe. It compiles sub-hourly street-level air temperature data from 811 low-cost to commercial sensors across several European cities and offers data in a quality-controlled, standardized format in sub-hourly, hourly, and daily resolutions. In addition, detailed metadata, as an important source of information in urban studies, is provided at network, station, and measurement levels. This pan-European dataset is rigorously quality-controlled using a serially automatic method applicable to diverse city-scale air temperature data, which identifies systematic and minor inconsistencies to enhance reliability. Expert-based validation shows that the QC reliably identifies problematic measurements, while its performance varies across urban and climatic settings due to local environmental and instrumental effects. To ensure transparency, the results of the quality control are provided to the user together with the original value in the dataset. The validated FAIRUrbTemp is a valuable resource for urban climate studies, with direct applications in validating microclimate models, assessing heat-health risks, and informing climate-adaptive urban planning.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":" ","pages":""},"PeriodicalIF":6.9,"publicationDate":"2026-02-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146197943","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Acrossocheilus wenchowensis is a lukewarm-water fish found in southern Chinese mountain streams, valued for both ornamental and edible purposes. We assembled a near telomere-to-telomere (T2T) genome using HiFi, ONT, Hi-C and Illumina data. The assembly is approximately 870.69 Mb with a contig N50 of about 21.28 Mb. Among these, 14 chromosomes in Hap1 and 15 chromosomes in Hap2 have reached T2T levels. A total of 24,909 protein-coding genes were predicted in Hap1 and 24,496 in Hap2, with BUSCO scores of 97.4% and 97.6%, respectively. A conserved centromeric satellite sequence (262 bp) derived from an LTR transposon was identified. Comparative genomics showed that Acrossocheilus and Onychostoma diverged approximately 13.7 million years ago (Mya), while A. wenchowensis diverged from A. fasciatus about 5.25 Mya. Resequencing of four geographic populations of A. wenchowensis revealed distinct genetic structure in the LY group compared to the other populations based on SNP and InDel analysis. This genome provides a framework for diploid T2T studies in fish and supports further functional genomics research.
{"title":"Near telomere-to-telomere diploid genome assembly of Acrossocheilus wenchowensis.","authors":"Lingzhan Xue, Mingkun Luo, Haoyu Wang, Wenbin Zhu, Duhuang Chen, Gaoxiong Zeng, Mengxiang Liao, Ji Zhao, Bin Wu, Luohao Xu, Zaijie Dong","doi":"10.1038/s41597-026-06752-z","DOIUrl":"https://doi.org/10.1038/s41597-026-06752-z","url":null,"abstract":"<p><p>Acrossocheilus wenchowensis is a lukewarm-water fish found in southern Chinese mountain streams, valued for both ornamental and edible purposes. We assembled a near telomere-to-telomere (T2T) genome using HiFi, ONT, Hi-C and Illumina data. The assembly is approximately 870.69 Mb with a contig N50 of about 21.28 Mb. Among these, 14 chromosomes in Hap1 and 15 chromosomes in Hap2 have reached T2T levels. A total of 24,909 protein-coding genes were predicted in Hap1 and 24,496 in Hap2, with BUSCO scores of 97.4% and 97.6%, respectively. A conserved centromeric satellite sequence (262 bp) derived from an LTR transposon was identified. Comparative genomics showed that Acrossocheilus and Onychostoma diverged approximately 13.7 million years ago (Mya), while A. wenchowensis diverged from A. fasciatus about 5.25 Mya. Resequencing of four geographic populations of A. wenchowensis revealed distinct genetic structure in the LY group compared to the other populations based on SNP and InDel analysis. This genome provides a framework for diploid T2T studies in fish and supports further functional genomics research.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":" ","pages":""},"PeriodicalIF":6.9,"publicationDate":"2026-02-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146195460","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-02-14DOI: 10.1038/s41597-025-06419-1
Vignesh Sampath, Andrew S Lee, Samuel David Miller, Noah H Paulson, Yuepeng Zhang, Logan Ward
Electrode is a key component of many energy storage and energy conversion devices such as batteries and fuel cells. Defects in electrodes can significantly influence device performance and reliability and thus need to be monitored and eliminated during the electrode manufacturing process. Advancements in in-line metrology, computer vision, and machine learning have enabled the development of integrated hardware-software systems for automated defect detection and diagnostics. While several manufacturing domains have published defect datasets to support such efforts, publicly available datasets specific to electrode coating processes are not available. To fill this gap and support research on defect detection for automated coating processes, we present CoatingVision, a comprehensive dataset of slot-die coating images with labeled defect types. This dataset supports a diverse range of image recognition tasks, including defect segmentation, defect detection, and multi-label classification. It includes high-resolution images with associated labels for common defects such as surface cracks, delamination cracks, pinholes, and unclassified defects. To facilitate benchmarking and reproducible research, CoatingVision is packaged with an open-source codebase that enables comparative evaluation of AI models and hyperparameter configurations. The dataset has been meticulously curated to ensure high quality and consistency, providing researchers with reliable data for training and evaluating computer vision models. With over 2,200 image samples under various processing conditions, CoatingVision offers a robust foundation for developing automated defect detection systems. It promotes deeper insights into defect formation in coating manufacturing processes, which can be used to advance various coating-related applications including batteries and fuel cells.
{"title":"A Defect Dataset for Electrode Coating Manufacturing.","authors":"Vignesh Sampath, Andrew S Lee, Samuel David Miller, Noah H Paulson, Yuepeng Zhang, Logan Ward","doi":"10.1038/s41597-025-06419-1","DOIUrl":"https://doi.org/10.1038/s41597-025-06419-1","url":null,"abstract":"<p><p>Electrode is a key component of many energy storage and energy conversion devices such as batteries and fuel cells. Defects in electrodes can significantly influence device performance and reliability and thus need to be monitored and eliminated during the electrode manufacturing process. Advancements in in-line metrology, computer vision, and machine learning have enabled the development of integrated hardware-software systems for automated defect detection and diagnostics. While several manufacturing domains have published defect datasets to support such efforts, publicly available datasets specific to electrode coating processes are not available. To fill this gap and support research on defect detection for automated coating processes, we present CoatingVision, a comprehensive dataset of slot-die coating images with labeled defect types. This dataset supports a diverse range of image recognition tasks, including defect segmentation, defect detection, and multi-label classification. It includes high-resolution images with associated labels for common defects such as surface cracks, delamination cracks, pinholes, and unclassified defects. To facilitate benchmarking and reproducible research, CoatingVision is packaged with an open-source codebase that enables comparative evaluation of AI models and hyperparameter configurations. The dataset has been meticulously curated to ensure high quality and consistency, providing researchers with reliable data for training and evaluating computer vision models. With over 2,200 image samples under various processing conditions, CoatingVision offers a robust foundation for developing automated defect detection systems. It promotes deeper insights into defect formation in coating manufacturing processes, which can be used to advance various coating-related applications including batteries and fuel cells.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":" ","pages":""},"PeriodicalIF":6.9,"publicationDate":"2026-02-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146197923","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-02-14DOI: 10.1038/s41597-026-06877-1
Wenbo Yu, Jun Yang, Yuyu Zhou, Xiangming Xiao
Continuing global warming and urbanization have increased the frequency and severity of extreme heat events in cities. Therefore, understanding how the urban heat island (UHI) effect influences cities is essential for developing effective mitigation and prevention strategies. A 1-km resolution dataset was constructed to assess heat-wave exposure attributable to UHIs in urban human settlements worldwide from 2003 to 2020. An adaptive urban-rural threshold method was employed to delineate the spatial extent of UHI impacts, and a spatiotemporally fitted MODIS surface temperature dataset was used to address missing data caused by cloud contamination. This dataset explicitly separates the contributions of background climate, local landscape characteristics, and urbanization to heat wave exposure, providing a scientific basis for identifying key UHI mitigation areas and developing heat wave risk early warning models that account for UHI effects. The proposed methodology and dataset support synergistic decision-making for integrating urban climate adaptation with sustainable development, and the technical framework can be extended to studies of UHIs and heat wave exposure in other regions worldwide.
{"title":"Global dataset on heat wave exposure due to the urban heat island effect.","authors":"Wenbo Yu, Jun Yang, Yuyu Zhou, Xiangming Xiao","doi":"10.1038/s41597-026-06877-1","DOIUrl":"https://doi.org/10.1038/s41597-026-06877-1","url":null,"abstract":"<p><p>Continuing global warming and urbanization have increased the frequency and severity of extreme heat events in cities. Therefore, understanding how the urban heat island (UHI) effect influences cities is essential for developing effective mitigation and prevention strategies. A 1-km resolution dataset was constructed to assess heat-wave exposure attributable to UHIs in urban human settlements worldwide from 2003 to 2020. An adaptive urban-rural threshold method was employed to delineate the spatial extent of UHI impacts, and a spatiotemporally fitted MODIS surface temperature dataset was used to address missing data caused by cloud contamination. This dataset explicitly separates the contributions of background climate, local landscape characteristics, and urbanization to heat wave exposure, providing a scientific basis for identifying key UHI mitigation areas and developing heat wave risk early warning models that account for UHI effects. The proposed methodology and dataset support synergistic decision-making for integrating urban climate adaptation with sustainable development, and the technical framework can be extended to studies of UHIs and heat wave exposure in other regions worldwide.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":" ","pages":""},"PeriodicalIF":6.9,"publicationDate":"2026-02-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146197991","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The reed vole (Microtus fortis) is an important rodent model for studying unique biological traits, such as its natural resistance to Schistosoma japonicum. To facilitate the genetic study of these phenotypes, we have produced the first high-quality, chromosome-level genome assembly for this species. The genome was assembled using PacBio HiFi long-read sequencing and scaffolded to the chromosome level with Hi-C data. The final 2.29 Gb assembly exhibits excellent continuity (contig N50 = 68.89 Mb; scaffold N50 = 91.23 Mb), with 97.7% of the sequence anchored into 26 pseudomolecules, consistent with the species' karyotype. Genome completeness was estimated at 96.3% via BUSCO analysis (glires_odb10). The annotation includes 23,678 protein-coding genes, with 97.5% assigned a putative function. This publicly available, high-quality genomic resource will be invaluable for future research, providing the necessary foundation to explore the genetic mechanisms behind the unique adaptations of M. fortis, including its innate immunity, digestive physiology, and disease models. The assembly will also serve as a key reference for comparative genomics, enriching our understanding of rodent evolution.
{"title":"Assembling a chromosome-level genome for the Microtus fortis using PacBio HiFi and Hi-C technologies.","authors":"Du Zhang, Qi Hu, Tianqiong He, Junkang Zhou, Yixin Wen, Qian Liu, Jing Zhang, Wenlin Zhi, Lingxuan Ouyang, Suisui Gao, Ruotong Guan, Zhijun Zhou","doi":"10.1038/s41597-026-06813-3","DOIUrl":"https://doi.org/10.1038/s41597-026-06813-3","url":null,"abstract":"<p><p>The reed vole (Microtus fortis) is an important rodent model for studying unique biological traits, such as its natural resistance to Schistosoma japonicum. To facilitate the genetic study of these phenotypes, we have produced the first high-quality, chromosome-level genome assembly for this species. The genome was assembled using PacBio HiFi long-read sequencing and scaffolded to the chromosome level with Hi-C data. The final 2.29 Gb assembly exhibits excellent continuity (contig N50 = 68.89 Mb; scaffold N50 = 91.23 Mb), with 97.7% of the sequence anchored into 26 pseudomolecules, consistent with the species' karyotype. Genome completeness was estimated at 96.3% via BUSCO analysis (glires_odb10). The annotation includes 23,678 protein-coding genes, with 97.5% assigned a putative function. This publicly available, high-quality genomic resource will be invaluable for future research, providing the necessary foundation to explore the genetic mechanisms behind the unique adaptations of M. fortis, including its innate immunity, digestive physiology, and disease models. The assembly will also serve as a key reference for comparative genomics, enriching our understanding of rodent evolution.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":" ","pages":""},"PeriodicalIF":6.9,"publicationDate":"2026-02-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146195484","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-02-14DOI: 10.1038/s41597-026-06844-w
Krisztina Molnár, Katalin Maricza, Zsuzsanna Elek, Réka Kovács-Nagy, Ábel Fóthi, Zsófia Bánlaki, Eszter Losoncz, Bernadett Húri, János Kádas, Gergely Keszler, Zsolt Rónai
An adequate immune response is responsible for eradicating pathogens and reestablishing tissue homeostasis upon infection. However, in certain patients, immune processes become dysregulated, leading to sepsis which often results in life-threatening organ dysfunction, and the progression to septic shock is associated with mortality rates of up to 70-80%. The objective of the present data set is to facilitate the identification of transcriptomic signatures characteristic of septic shock and the stable, out of critical condition. A total of six patients were included in the study, with blood samples collected at two different stages - septic shock, at the time of discharge from the intensive care unit - of the disease. Following total RNA isolation, mRNA and microRNA levels were determined by NGS. The dataset's significance is based on the fact that two samples of the same patient were analyzed, ensuring that any observed alteration in transcript levels is related to the change in medical condition, and the analysis included both mRNAs and microRNAs enabling a comprehensive gene expression and regulatory study.
{"title":"A dataset of paired blood mRNA and microRNA sequencing across acute septic shock and recovery.","authors":"Krisztina Molnár, Katalin Maricza, Zsuzsanna Elek, Réka Kovács-Nagy, Ábel Fóthi, Zsófia Bánlaki, Eszter Losoncz, Bernadett Húri, János Kádas, Gergely Keszler, Zsolt Rónai","doi":"10.1038/s41597-026-06844-w","DOIUrl":"https://doi.org/10.1038/s41597-026-06844-w","url":null,"abstract":"<p><p>An adequate immune response is responsible for eradicating pathogens and reestablishing tissue homeostasis upon infection. However, in certain patients, immune processes become dysregulated, leading to sepsis which often results in life-threatening organ dysfunction, and the progression to septic shock is associated with mortality rates of up to 70-80%. The objective of the present data set is to facilitate the identification of transcriptomic signatures characteristic of septic shock and the stable, out of critical condition. A total of six patients were included in the study, with blood samples collected at two different stages - septic shock, at the time of discharge from the intensive care unit - of the disease. Following total RNA isolation, mRNA and microRNA levels were determined by NGS. The dataset's significance is based on the fact that two samples of the same patient were analyzed, ensuring that any observed alteration in transcript levels is related to the change in medical condition, and the analysis included both mRNAs and microRNAs enabling a comprehensive gene expression and regulatory study.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":" ","pages":""},"PeriodicalIF":6.9,"publicationDate":"2026-02-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146197898","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-02-13DOI: 10.1038/s41597-026-06830-2
Cuihua Gu, Jie Wang, Guozhe Zhang, Dan Peng, Zhibin Li, Simei Ren, Zhiqiang Wu, Liyuan Yang
Cuphea hookeriana (Lythraceae) is an evergreen flowering shrub native to tropical and subtropical regions, serving as an ornamental plant in landscaping and as a raw material for industry and agriculture. Here, we assembled a haplotype-resolved chromosomal genome for C. hookeriana (2n = 3x = 24) based on HiFi and Hi-C sequencing datasets. The genome was assembled into 16 chromosomes with a total size of 498,761,099 bp, comprising 248,233,948 bp and 240,972,969 bp for haplotypes A and B, respectively. BUSCO evaluation indicated an assembly completeness of 97.7% C value. Approximately 30,000 genes were annotated for each haplotype, achieving a BUSCO complete score of 97%.
{"title":"A chromosomal haplotype-resolved genome assembly of Cuphea hookeriana.","authors":"Cuihua Gu, Jie Wang, Guozhe Zhang, Dan Peng, Zhibin Li, Simei Ren, Zhiqiang Wu, Liyuan Yang","doi":"10.1038/s41597-026-06830-2","DOIUrl":"https://doi.org/10.1038/s41597-026-06830-2","url":null,"abstract":"<p><p>Cuphea hookeriana (Lythraceae) is an evergreen flowering shrub native to tropical and subtropical regions, serving as an ornamental plant in landscaping and as a raw material for industry and agriculture. Here, we assembled a haplotype-resolved chromosomal genome for C. hookeriana (2n = 3x = 24) based on HiFi and Hi-C sequencing datasets. The genome was assembled into 16 chromosomes with a total size of 498,761,099 bp, comprising 248,233,948 bp and 240,972,969 bp for haplotypes A and B, respectively. BUSCO evaluation indicated an assembly completeness of 97.7% C value. Approximately 30,000 genes were annotated for each haplotype, achieving a BUSCO complete score of 97%.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":" ","pages":""},"PeriodicalIF":6.9,"publicationDate":"2026-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146195346","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The pursuit of Universal Health Coverage (UHC) in India is particularly challenging given the country's vast population and pronounced socioeconomic disparities. Although extensive research addresses specific healthcare areas, contemporary data on citizens' healthcare access, quality, and preferences to inform UHC design are lacking. To bridge this gap, the Lancet Commission on a Citizen-Centred Health System for India conducted a Citizens Survey from November 2022 to April 2023, interviewing respondents in person in 50,000 randomly selected households across 125 districts in 29 Indian states and Union Territories. The survey comprised 141 questions covering healthcare utilization, experiences, costs, satisfaction, delivery preferences, insurance coverage, willingness to pay, health information behaviors, technology use, aspirational health norms, and electoral attitudes towards health. The survey had a high participation rate (98%) and a low non-response rate (9.5%), 70% of households were rural, 56% of respondents were male, 79% were Hindu, and 39% identified as Scheduled Caste or Tribes. The data aim to inform citizen-centric reforms, advancing a UHC responsive to India's diverse population needs.
{"title":"The Citizens Survey 2022-23: a household-level dataset on Universal Health Coverage in India.","authors":"Anuska Kalita, Siddhesh Zadey, Sudheer Kumar Shukla, Shubhangi Bhadada, Sumit Kane, Dolon Roy, Mukund Kumar Chandan, Jashanjot Singh Mangat, Preeyati Chopra, Sarika Chaturvedi, Sonia Bhalotra, S V Subramanian, Vikram Patel","doi":"10.1038/s41597-026-06775-6","DOIUrl":"https://doi.org/10.1038/s41597-026-06775-6","url":null,"abstract":"<p><p>The pursuit of Universal Health Coverage (UHC) in India is particularly challenging given the country's vast population and pronounced socioeconomic disparities. Although extensive research addresses specific healthcare areas, contemporary data on citizens' healthcare access, quality, and preferences to inform UHC design are lacking. To bridge this gap, the Lancet Commission on a Citizen-Centred Health System for India conducted a Citizens Survey from November 2022 to April 2023, interviewing respondents in person in 50,000 randomly selected households across 125 districts in 29 Indian states and Union Territories. The survey comprised 141 questions covering healthcare utilization, experiences, costs, satisfaction, delivery preferences, insurance coverage, willingness to pay, health information behaviors, technology use, aspirational health norms, and electoral attitudes towards health. The survey had a high participation rate (98%) and a low non-response rate (9.5%), 70% of households were rural, 56% of respondents were male, 79% were Hindu, and 39% identified as Scheduled Caste or Tribes. The data aim to inform citizen-centric reforms, advancing a UHC responsive to India's diverse population needs.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":" ","pages":""},"PeriodicalIF":6.9,"publicationDate":"2026-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146195425","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-02-13DOI: 10.1038/s41597-026-06780-9
Yunhao Zheng, Jinhua Wang, Yi Zhang, Naixia Mou, Yu Liu
Nowadays, tourism practices face increasingly intensified flows of people, making it imperative to explore the tourism space through the lens of mobility. To examine nationwide tourist mobility, this study collected online travel blog data from Qunar.com, a leading travel services platform in China, to construct tourist mobility networks across China. In these networks, attractions are represented as nodes, while tourist movements between them, derived from blog data, are represented as weighted and directed edges. To capture different travel contexts, the study also develops mobility networks categorized by departure season and travel partners. All networks are released in a simple, accessible format to support future research.
{"title":"A dataset of tourist mobility networks across China derived from online travel blogs.","authors":"Yunhao Zheng, Jinhua Wang, Yi Zhang, Naixia Mou, Yu Liu","doi":"10.1038/s41597-026-06780-9","DOIUrl":"https://doi.org/10.1038/s41597-026-06780-9","url":null,"abstract":"<p><p>Nowadays, tourism practices face increasingly intensified flows of people, making it imperative to explore the tourism space through the lens of mobility. To examine nationwide tourist mobility, this study collected online travel blog data from Qunar.com, a leading travel services platform in China, to construct tourist mobility networks across China. In these networks, attractions are represented as nodes, while tourist movements between them, derived from blog data, are represented as weighted and directed edges. To capture different travel contexts, the study also develops mobility networks categorized by departure season and travel partners. All networks are released in a simple, accessible format to support future research.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":" ","pages":""},"PeriodicalIF":6.9,"publicationDate":"2026-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146195339","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Mitochondrial DNA (mtDNA) mutations are critical to disease research, evolutionary studies, and lineage tracing but are challenging to analyze due to interference from nuclear mitochondrial sequences (NUMTs). Current high-throughput sequencing techniques rely on multiple primers or probes to amplify short mtDNA fragments, followed by alignment to a reference genome. However, this approach fails to mitigate NUMTs interference, leading to ambiguous results. In this study, we presented a nanopore-based third-generation sequencing (TGS) method using a single primer pair to amplify full-length mtDNA, effectively circumventing NUMTs artifacts. Sequencing was carried out on the QITAN TECH QNome-3841hex platform, generating complete mtDNA coverage for 106 samples from eight distinct family pedigrees, including complex familial structures such as half-siblings and multi-generational households. The sequencing achieved 100% genome coverage with an average mapping rate of 99.96%, supporting comprehensive genome characterization. The resulting dataset offers valuable insights into mtDNA mutation detection, mitochondrial genetics, population genetics, ancestry tracing, and forensic identification, and may advance mtDNA sequencing technologies and intergenerational studies.
{"title":"A full-length mtDNA dataset for studying genetic variations across generations and complex family structures.","authors":"Yanan Liu, Qi Yang, Yujia Xuan, Jinyuan Zhao, Anqi Chen, Suhua Zhang","doi":"10.1038/s41597-026-06824-0","DOIUrl":"https://doi.org/10.1038/s41597-026-06824-0","url":null,"abstract":"<p><p>Mitochondrial DNA (mtDNA) mutations are critical to disease research, evolutionary studies, and lineage tracing but are challenging to analyze due to interference from nuclear mitochondrial sequences (NUMTs). Current high-throughput sequencing techniques rely on multiple primers or probes to amplify short mtDNA fragments, followed by alignment to a reference genome. However, this approach fails to mitigate NUMTs interference, leading to ambiguous results. In this study, we presented a nanopore-based third-generation sequencing (TGS) method using a single primer pair to amplify full-length mtDNA, effectively circumventing NUMTs artifacts. Sequencing was carried out on the QITAN TECH QNome-3841hex platform, generating complete mtDNA coverage for 106 samples from eight distinct family pedigrees, including complex familial structures such as half-siblings and multi-generational households. The sequencing achieved 100% genome coverage with an average mapping rate of 99.96%, supporting comprehensive genome characterization. The resulting dataset offers valuable insights into mtDNA mutation detection, mitochondrial genetics, population genetics, ancestry tracing, and forensic identification, and may advance mtDNA sequencing technologies and intergenerational studies.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":" ","pages":""},"PeriodicalIF":6.9,"publicationDate":"2026-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146195327","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}