Pub Date : 2026-01-14DOI: 10.1038/s41597-025-06477-5
Vera Haponava, Catriona Pickard, Ricardo Fernandes
The North-Eastern Europe and Northern Asia open-access dataset (NEENA) is a compilation of over 18,700 isotopic measurements (δ13C, δ15N, δ34S, δ18O, 87Sr/86Sr), predominantly from archaeological human, animal, and plant samples originating from more than 750 sites ranging geographically from the Baltic and Eastern Europe to North-Central Asia and dating between 70,000 years BP and modern times. For each isotope record included in the dataset, information relating to the taxonomic categorisation of the sampled material (e.g., animal and plant species or genus names), the sample type (e.g., bone, dentine, enamel) and contextual, chronological, provenance (i.e., site location and country), and laboratory details are provided where available from original publications. The NEENA dataset can be used to conduct comparative studies of palaeodiet, spatial mobility, paleo-environmental conditions, organic remains preservation, and radiocarbon reservoir effects. NEENA is available in an open-access format via the Pandora data platform.
{"title":"The North-Eastern Europe and Northern Asia isotopic dataset of bioarchaeological samples (NEENA).","authors":"Vera Haponava, Catriona Pickard, Ricardo Fernandes","doi":"10.1038/s41597-025-06477-5","DOIUrl":"https://doi.org/10.1038/s41597-025-06477-5","url":null,"abstract":"<p><p>The North-Eastern Europe and Northern Asia open-access dataset (NEENA) is a compilation of over 18,700 isotopic measurements (δ<sup>13</sup>C, δ<sup>15</sup>N, δ<sup>34</sup>S, δ<sup>18</sup>O, <sup>87</sup>Sr/<sup>86</sup>Sr), predominantly from archaeological human, animal, and plant samples originating from more than 750 sites ranging geographically from the Baltic and Eastern Europe to North-Central Asia and dating between 70,000 years BP and modern times. For each isotope record included in the dataset, information relating to the taxonomic categorisation of the sampled material (e.g., animal and plant species or genus names), the sample type (e.g., bone, dentine, enamel) and contextual, chronological, provenance (i.e., site location and country), and laboratory details are provided where available from original publications. The NEENA dataset can be used to conduct comparative studies of palaeodiet, spatial mobility, paleo-environmental conditions, organic remains preservation, and radiocarbon reservoir effects. NEENA is available in an open-access format via the Pandora data platform.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":" ","pages":""},"PeriodicalIF":6.9,"publicationDate":"2026-01-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145985376","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-01-14DOI: 10.1038/s41597-025-06443-1
Na Jia, Kai Chen, Cheng Liu, NanChao Wang
We developed and released an image dataset for surface defect detection on waterborne painted wood products. The dataset comprises 13400 high-resolution images capturing four defect types: scratches, cracks, bubbles, and holes. These include 3645 bubble defects, 3498 scratch defects, 3256 crack defects, and 3001 hole defects. Data were acquired in an operational production facility in Jiangshan, China, using specialized industrial cameras. Professional annotators performed rigorous labeling to ensure accuracy. This dataset provides critical data support for deploying deep learning models in real-world industrial assembly lines. Researchers may leverage this dataset to develop automated machine learning solutions for multi-class defect detection. Such techniques enable timely defect detection and remediation, ensuring finish integrity and final product quality.
{"title":"A Public Image Dataset for Surface Defect Detection of Water-Based Coated Wood Products.","authors":"Na Jia, Kai Chen, Cheng Liu, NanChao Wang","doi":"10.1038/s41597-025-06443-1","DOIUrl":"https://doi.org/10.1038/s41597-025-06443-1","url":null,"abstract":"<p><p>We developed and released an image dataset for surface defect detection on waterborne painted wood products. The dataset comprises 13400 high-resolution images capturing four defect types: scratches, cracks, bubbles, and holes. These include 3645 bubble defects, 3498 scratch defects, 3256 crack defects, and 3001 hole defects. Data were acquired in an operational production facility in Jiangshan, China, using specialized industrial cameras. Professional annotators performed rigorous labeling to ensure accuracy. This dataset provides critical data support for deploying deep learning models in real-world industrial assembly lines. Researchers may leverage this dataset to develop automated machine learning solutions for multi-class defect detection. Such techniques enable timely defect detection and remediation, ensuring finish integrity and final product quality.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":" ","pages":""},"PeriodicalIF":6.9,"publicationDate":"2026-01-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145985190","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-01-14DOI: 10.1038/s41597-026-06600-0
Tao Bao, Xiyan Xu, Gensuo Jia, Xingru Zhu, William J Riley, Yuanhe Yang
Field observations provide direct evidence of how does carbon cycling in permafrost ecosystems respond to climate change. This study provides a comprehensive dataset on the impact of warming on carbon cycling and greenhouse gas (GHG) fluxes in permafrost ecosystems. The dataset is extracted and integrated from 132 peer-reviewed studies with 1430 paired observations across eight major permafrost ecosystems, including Arctic and subarctic tundra and wetland, and alpine meadow, steppe, tundra and wetland. This dataset includes 17 variables from experiments conducted during the growing season, covering the plant and soil carbon pools, soil nitrogen pool, and GHG (i.e., CO2, CH4, and N2O) fluxes, among others. Background information on site climate conditions, vegetation and soil characteristics, and details of the warming experiments, including timing, methods, and warming magnitude, are also contained in the dataset. This dataset facilitates a comprehensive understanding of the impact of warming on carbon cycling and GHG fluxes in permafrost ecosystems, and provides supports for meta-analyses and literature reviews, remote sensing data validation, and land model development and parameterization.
{"title":"Dataset about Warming Effects on Carbon Cycling and Greenhouse Gas Fluxes in Permafrost Ecosystems.","authors":"Tao Bao, Xiyan Xu, Gensuo Jia, Xingru Zhu, William J Riley, Yuanhe Yang","doi":"10.1038/s41597-026-06600-0","DOIUrl":"https://doi.org/10.1038/s41597-026-06600-0","url":null,"abstract":"<p><p>Field observations provide direct evidence of how does carbon cycling in permafrost ecosystems respond to climate change. This study provides a comprehensive dataset on the impact of warming on carbon cycling and greenhouse gas (GHG) fluxes in permafrost ecosystems. The dataset is extracted and integrated from 132 peer-reviewed studies with 1430 paired observations across eight major permafrost ecosystems, including Arctic and subarctic tundra and wetland, and alpine meadow, steppe, tundra and wetland. This dataset includes 17 variables from experiments conducted during the growing season, covering the plant and soil carbon pools, soil nitrogen pool, and GHG (i.e., CO<sub>2</sub>, CH<sub>4</sub>, and N<sub>2</sub>O) fluxes, among others. Background information on site climate conditions, vegetation and soil characteristics, and details of the warming experiments, including timing, methods, and warming magnitude, are also contained in the dataset. This dataset facilitates a comprehensive understanding of the impact of warming on carbon cycling and GHG fluxes in permafrost ecosystems, and provides supports for meta-analyses and literature reviews, remote sensing data validation, and land model development and parameterization.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":" ","pages":""},"PeriodicalIF":6.9,"publicationDate":"2026-01-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145985251","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-01-14DOI: 10.1038/s41597-026-06558-z
Anthony Huffman, Feng-Yu Yeh, Junguk Hur, Jie Zheng, Anna Maria Masci, Guanming Wu, Cui Tao, Brian Athey, Yongqun He
With the increasing volume of biomedical experimental data, standardizing, sharing, and integrating heterogeneous experimental data across domains has become a major challenge. To address this challenge, we have developed an ontology-supported Study-Experiment-Assay (SEA) common data model (CDM), which includes 10 core and 3 auxiliary classes based on object-oriented modeling. SEA CDM uses interoperable ontologies for data standardization and knowledge inference. Building on the SEA CDM, we developed the Ontology-based SEA Network (OSEAN) relational database and knowledge graph, along with a set of ETL (Extract, Transform, Load) and query tools, and further applied them to represent 1,278 immune studies with over two million samples from three resources: VIGET, ImmPort, and CELLxGENE. Using simple, robust queries and analyses, our research identified multiple scientific insights into sex-specific immune responses, such as neutrophil degranulation and TNF binding to physiological receptors, following live attenuated and trivalent inactivated influenza vaccination. The novel SEA CDM system lays a foundation for establishing an integrative biodata ecosystem across biological and biomedical domains.
{"title":"SEA CDM: Study-Experiment-Assay Common Data Model and Databases for Cross-Domain Data Integration and Analysis.","authors":"Anthony Huffman, Feng-Yu Yeh, Junguk Hur, Jie Zheng, Anna Maria Masci, Guanming Wu, Cui Tao, Brian Athey, Yongqun He","doi":"10.1038/s41597-026-06558-z","DOIUrl":"10.1038/s41597-026-06558-z","url":null,"abstract":"<p><p>With the increasing volume of biomedical experimental data, standardizing, sharing, and integrating heterogeneous experimental data across domains has become a major challenge. To address this challenge, we have developed an ontology-supported Study-Experiment-Assay (SEA) common data model (CDM), which includes 10 core and 3 auxiliary classes based on object-oriented modeling. SEA CDM uses interoperable ontologies for data standardization and knowledge inference. Building on the SEA CDM, we developed the Ontology-based SEA Network (OSEAN) relational database and knowledge graph, along with a set of ETL (Extract, Transform, Load) and query tools, and further applied them to represent 1,278 immune studies with over two million samples from three resources: VIGET, ImmPort, and CELLxGENE. Using simple, robust queries and analyses, our research identified multiple scientific insights into sex-specific immune responses, such as neutrophil degranulation and TNF binding to physiological receptors, following live attenuated and trivalent inactivated influenza vaccination. The novel SEA CDM system lays a foundation for establishing an integrative biodata ecosystem across biological and biomedical domains.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":" ","pages":""},"PeriodicalIF":6.9,"publicationDate":"2026-01-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145985384","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-01-13DOI: 10.1038/s41597-025-06530-3
Robin Ramstad, Jasper Halekas, Laila Andersson, David Brain, Jared Espley, David Mitchell, James McFadden, Alexandros Chapasis, Yaxue Dong, Nadia Gonzales, Shannon Curry
Solar wind measurements by the Mars Atmosphere and Volatile EvolutioN (MAVEN) mission provide samples of the heliosphere at 1.38-1.67 AU, and of the upstream conditions that drive numerous processes in the near-Mars plasma environment. We reduce ion measurements from MAVEN's Solar Wind Ion Analyzer (SWIA), using contextual magnetic field measurements, to 13 independent macroscopic plasma parameters by fitting a convolution of SWIA's 3-dimensional response function and a superposition of phase-space bi-kappa distribution functions to each measured distribution using an iterative Poisson optimization scheme. This ensemble of parameters represents the solar wind H+ core, H+ beam, and He2+ (alpha) populations, effectively separating each population's contribution to any measured distribution. Sporadic plasma frequency measurements from MAVEN's Langmuir Probe and Waves (LPW) instrument are used to calibrate the SWIA measurements such that ion charge densities match LPW-derived electron charge densities. The resulting dataset is effectively ground-truthed, largely corrected for instrumental particularities, and provides a rich timeline of solar wind properties at Mars, including composition, velocities, temperature anisotropies, differential drifts, and degree of thermalization.
{"title":"SWFITEM: Solar Wind Fitting for Investigations of Thermodynamics and Energetics at Mars - A MAVEN dataset.","authors":"Robin Ramstad, Jasper Halekas, Laila Andersson, David Brain, Jared Espley, David Mitchell, James McFadden, Alexandros Chapasis, Yaxue Dong, Nadia Gonzales, Shannon Curry","doi":"10.1038/s41597-025-06530-3","DOIUrl":"https://doi.org/10.1038/s41597-025-06530-3","url":null,"abstract":"<p><p>Solar wind measurements by the Mars Atmosphere and Volatile EvolutioN (MAVEN) mission provide samples of the heliosphere at 1.38-1.67 AU, and of the upstream conditions that drive numerous processes in the near-Mars plasma environment. We reduce ion measurements from MAVEN's Solar Wind Ion Analyzer (SWIA), using contextual magnetic field measurements, to 13 independent macroscopic plasma parameters by fitting a convolution of SWIA's 3-dimensional response function and a superposition of phase-space bi-kappa distribution functions to each measured distribution using an iterative Poisson optimization scheme. This ensemble of parameters represents the solar wind H<sup>+</sup> core, H<sup>+</sup> beam, and He<sup>2+</sup> (alpha) populations, effectively separating each population's contribution to any measured distribution. Sporadic plasma frequency measurements from MAVEN's Langmuir Probe and Waves (LPW) instrument are used to calibrate the SWIA measurements such that ion charge densities match LPW-derived electron charge densities. The resulting dataset is effectively ground-truthed, largely corrected for instrumental particularities, and provides a rich timeline of solar wind properties at Mars, including composition, velocities, temperature anisotropies, differential drifts, and degree of thermalization.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":" ","pages":""},"PeriodicalIF":6.9,"publicationDate":"2026-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145960061","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The Ontology of Adverse Events (OAE) was launched in 2011 to define, standardize and integrate various adverse events (AEs) arising after medical interventions. The terminological framework of OAE has undergone consistent expansion since its inception, driven by its successful implementation in numerous AE investigations. In this paper, we document substantial ontological extensions addressing patient anatomic regions and clinical manifestations, encompassing symptoms, physical signs, and pathological processes. Current statistical analysis reveals that OAE has 10,829 formally defined terms with unique identifiers. Compared to the 3,088 ontology terms included in the last OAE publication in 2014, 7,741 new terms have been added to OAE, which represents significant progress of the ontology in clinical granularity and domain coverage. The OAE framework enables structured representation of critical determinants influencing clinical outcomes, including but not limited to administration routes, dosage parameters, and demographic variables such as patient age. Through its standardized semantic architecture, OAE provides an integrative platform for cross-disciplinary analysis of AE patterns, etiological factors, and outcome trajectories in clinical interventions.
{"title":"The Ontology of Adverse Events in 2025.","authors":"Chenchen Pan, Qiuyue Yang, Xue Zhang, Shuyue Luo, Yongqun He, Jiangan Xie","doi":"10.1038/s41597-026-06584-x","DOIUrl":"https://doi.org/10.1038/s41597-026-06584-x","url":null,"abstract":"<p><p>The Ontology of Adverse Events (OAE) was launched in 2011 to define, standardize and integrate various adverse events (AEs) arising after medical interventions. The terminological framework of OAE has undergone consistent expansion since its inception, driven by its successful implementation in numerous AE investigations. In this paper, we document substantial ontological extensions addressing patient anatomic regions and clinical manifestations, encompassing symptoms, physical signs, and pathological processes. Current statistical analysis reveals that OAE has 10,829 formally defined terms with unique identifiers. Compared to the 3,088 ontology terms included in the last OAE publication in 2014, 7,741 new terms have been added to OAE, which represents significant progress of the ontology in clinical granularity and domain coverage. The OAE framework enables structured representation of critical determinants influencing clinical outcomes, including but not limited to administration routes, dosage parameters, and demographic variables such as patient age. Through its standardized semantic architecture, OAE provides an integrative platform for cross-disciplinary analysis of AE patterns, etiological factors, and outcome trajectories in clinical interventions.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":" ","pages":""},"PeriodicalIF":6.9,"publicationDate":"2026-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145960096","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-01-13DOI: 10.1038/s41597-025-06435-1
Hammaad Adam, Tom Pollard, Vinith Suriyakumar, Benjamin Moody, Jan Niklas Adams, Jennifer Erickson, Greg Segal, Matthew Wadsworth, Ashia Wilson, Marzyeh Ghassemi
Organ transplantation is a life-saving procedure for patients with advanced diseases. However, the demand for transplants far exceeds the supply of donated organs, and there are currently over 100,000 people waiting for a transplant in the United States. The lives of these patients depend on the efficacy of organ procurement organizations (OPOs), which coordinate the recovery of organs from deceased donors. However, many studies have found high variation in performance amongst OPOs. Coordinating data collection and analysis across OPOs is a crucial first step in closing performance gaps and achieving more effective organ donation. In 2021, the Federation of American Scientists announced a collaboration in which six OPOs committed to an unprecedented level of data sharing. This paper marks the release of ORCHID, this collaboration's first public dataset. ORCHID comprises detailed information on referrals for donation, procurement outcomes, and process data from the participating OPOs. Our goal in releasing this data is to promote research that leads to better services for organ donors, donor families, and patients waiting for transplants.
{"title":"Organ retrieval and collection of health information for donation: The ORCHID dataset.","authors":"Hammaad Adam, Tom Pollard, Vinith Suriyakumar, Benjamin Moody, Jan Niklas Adams, Jennifer Erickson, Greg Segal, Matthew Wadsworth, Ashia Wilson, Marzyeh Ghassemi","doi":"10.1038/s41597-025-06435-1","DOIUrl":"https://doi.org/10.1038/s41597-025-06435-1","url":null,"abstract":"<p><p>Organ transplantation is a life-saving procedure for patients with advanced diseases. However, the demand for transplants far exceeds the supply of donated organs, and there are currently over 100,000 people waiting for a transplant in the United States. The lives of these patients depend on the efficacy of organ procurement organizations (OPOs), which coordinate the recovery of organs from deceased donors. However, many studies have found high variation in performance amongst OPOs. Coordinating data collection and analysis across OPOs is a crucial first step in closing performance gaps and achieving more effective organ donation. In 2021, the Federation of American Scientists announced a collaboration in which six OPOs committed to an unprecedented level of data sharing. This paper marks the release of ORCHID, this collaboration's first public dataset. ORCHID comprises detailed information on referrals for donation, procurement outcomes, and process data from the participating OPOs. Our goal in releasing this data is to promote research that leads to better services for organ donors, donor families, and patients waiting for transplants.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":" ","pages":""},"PeriodicalIF":6.9,"publicationDate":"2026-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145966883","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Public transport operation network data serves as the foundation for urban transportation research and sustainable development planning. China operates the world's largest public transport system, where buses and metros constitute primary urban mobility modes. Despite their critical importance, comprehensive integrated bus-metro operation datasets remain lacking at the national scale. This study presents the China Public Transport Operation Network Dataset (CPTOND-2025), systematically integrating bus networks from 350 cities and metro systems from 46 cities across mainland China, Hong Kong, Macao, and Taiwan regions. Based on June 2025 data collection using methodologies integrate professional platforms and commercial APIs, the dataset encompasses approximately 3,408,000 kilometers of operational routes (bus: ~3,375,000 km; metro: ~33,000 km). Key attributes including operating hours, fares, and operating companies are recorded with bilingual support. All data utilize standardized Shapefile format in WGS-84 coordinate system with 5.08-meter average spatial accuracy. This standardized, comprehensive, open-access dataset supports diverse applications including operation efficiency assessment, network analysis, accessibility evaluation, and Transit-Oriented Development (TOD) studies, facilitating transport management decisions and international research.
{"title":"China Public Transport Operation Network Dataset (CPTOND-2025):National-Scale Bus-Metro Vector Dataset.","authors":"Liang Wang, He Wei, Yu Guan, Libin Ouyang, DanDan Xu, Xuehua Han, Min Zhang, Meng Chen, Daosheng Sun, Daqing Gong, Zhenji Zhang, Xinghua Zhang, Xiaodong Zhang","doi":"10.1038/s41597-025-06505-4","DOIUrl":"https://doi.org/10.1038/s41597-025-06505-4","url":null,"abstract":"<p><p>Public transport operation network data serves as the foundation for urban transportation research and sustainable development planning. China operates the world's largest public transport system, where buses and metros constitute primary urban mobility modes. Despite their critical importance, comprehensive integrated bus-metro operation datasets remain lacking at the national scale. This study presents the China Public Transport Operation Network Dataset (CPTOND-2025), systematically integrating bus networks from 350 cities and metro systems from 46 cities across mainland China, Hong Kong, Macao, and Taiwan regions. Based on June 2025 data collection using methodologies integrate professional platforms and commercial APIs, the dataset encompasses approximately 3,408,000 kilometers of operational routes (bus: ~3,375,000 km; metro: ~33,000 km). Key attributes including operating hours, fares, and operating companies are recorded with bilingual support. All data utilize standardized Shapefile format in WGS-84 coordinate system with 5.08-meter average spatial accuracy. This standardized, comprehensive, open-access dataset supports diverse applications including operation efficiency assessment, network analysis, accessibility evaluation, and Transit-Oriented Development (TOD) studies, facilitating transport management decisions and international research.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":" ","pages":""},"PeriodicalIF":6.9,"publicationDate":"2026-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145966888","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-01-13DOI: 10.1038/s41597-026-06572-1
Mara D Maeke, Christiane Hassenrück, Polette Aguilar-Muñoz, Camila Aravena, Christian Burmeister, Olivier Crispi, Papa Oumar Djibril Diallo, Camila Fernández, Maëlann Gouriou, Alizée Jamont, Emile Laymand, Barbara Marie, Verónica Molina, Eva Ortega-Retuerta, Sophie Rabouille, Mazharul Islam Sajeeb, Maria Sierks, Martha Stevens, Robin Turon, Valentina Valdés-Castro, Sara Beier
Coastal marine environments, such as lagoons, fjords or estuaries, experience pronounced environmental variability, with fluctuations in salinity, temperature and nutrient levels shaping microbial community structure and function. These gradients result in diverse habitats, which may harbour taxonomic and genetic novelty with biogeochemical and biotechnological relevance. To explore microbial diversity and functional potential across these dynamic ecosystems, we sampled 26 sites along the coasts of France and Chile, including lagoons, estuaries, fjords, harbours, as well as coastal and offshore marine sites. Surface waters were collected from all sites, with deeper layers included at three sites. Monthly sampling at six sites in France enabled the assessment of seasonal dynamics. In total, 116 samples were processed for both metabarcoding and metagenomic sequencing yielding over 53,000 amplicon sequence variants (ASVs) and 1,372 metagenome-assembled genomes (MAGs). This dataset further includes a comprehensive gene catalogue and environmental variables such as salinity, temperature, nutrient concentrations, productivity, as well as oxygen consumption metrics collected across the different ecosystems.
{"title":"Metabarcoding and metagenomic data across aquatic environmental gradients along the coasts of France and Chile.","authors":"Mara D Maeke, Christiane Hassenrück, Polette Aguilar-Muñoz, Camila Aravena, Christian Burmeister, Olivier Crispi, Papa Oumar Djibril Diallo, Camila Fernández, Maëlann Gouriou, Alizée Jamont, Emile Laymand, Barbara Marie, Verónica Molina, Eva Ortega-Retuerta, Sophie Rabouille, Mazharul Islam Sajeeb, Maria Sierks, Martha Stevens, Robin Turon, Valentina Valdés-Castro, Sara Beier","doi":"10.1038/s41597-026-06572-1","DOIUrl":"10.1038/s41597-026-06572-1","url":null,"abstract":"<p><p>Coastal marine environments, such as lagoons, fjords or estuaries, experience pronounced environmental variability, with fluctuations in salinity, temperature and nutrient levels shaping microbial community structure and function. These gradients result in diverse habitats, which may harbour taxonomic and genetic novelty with biogeochemical and biotechnological relevance. To explore microbial diversity and functional potential across these dynamic ecosystems, we sampled 26 sites along the coasts of France and Chile, including lagoons, estuaries, fjords, harbours, as well as coastal and offshore marine sites. Surface waters were collected from all sites, with deeper layers included at three sites. Monthly sampling at six sites in France enabled the assessment of seasonal dynamics. In total, 116 samples were processed for both metabarcoding and metagenomic sequencing yielding over 53,000 amplicon sequence variants (ASVs) and 1,372 metagenome-assembled genomes (MAGs). This dataset further includes a comprehensive gene catalogue and environmental variables such as salinity, temperature, nutrient concentrations, productivity, as well as oxygen consumption metrics collected across the different ecosystems.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":" ","pages":"29"},"PeriodicalIF":6.9,"publicationDate":"2026-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12804801/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145966895","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-01-13DOI: 10.1038/s41597-025-06533-0
Silvia C Hernández, Marina Ainciburu, Laura Sudupe, Nuria Planell, María López-Moreno, Amaia Vilas-Zornoza, Luis Diaz-Martinez, Jorge Cobos-Figueroa, Juan P Romero, Sarai Sarvide, Patxi San Martin-Uriz, Ana López-Pérez, Gloria Abizanda, Purificación Ripalda-Cemboráin, Emma Muinos-López, Vincenzo Lagani, Jesper Tegner, Ming Wu, Stefan Janssens, José M Pérez-Pomares, Felipe Prósper, Adrián Ruiz-Villalba, David Gómez-Cabrero
Cardiac fibroblasts (CFs) are key mediators of heart repair following myocardial infarction (MI). A specific CF subpopulation, termed Reparative Cardiac Fibroblasts (RCFs), has been shown to orchestrate scar formation and prevent ventricular rupture after MI. However, the timing of RCF appearance and the molecular events underlying this transition remain largely undefined. Here, we present a multi-modal dataset capturing the transcriptional dynamics of CFs during the early phase post-MI. Our integrative dataset combines bulk RNA sequencing, RNAscope in situ hybridization, and spatial transcriptomics to anatomically and temporally map the gene expression changes associated with the transition into RCFs. The dataset provides resources to characterize the distinct molecular programs that guide the emergence of RCFs from Periostin (Postn)+ activated CFs. This dataset provides a valuable resource for investigating CF heterogeneity and reparative pathways following MI. All raw and processed data, along with detailed metadata and annotations, are made available to facilitate reuse by the cardiovascular and single-cell biology communities.
{"title":"Single-cell and spatial transcriptomic profiling of cardiac fibroblasts following myocardial infarction.","authors":"Silvia C Hernández, Marina Ainciburu, Laura Sudupe, Nuria Planell, María López-Moreno, Amaia Vilas-Zornoza, Luis Diaz-Martinez, Jorge Cobos-Figueroa, Juan P Romero, Sarai Sarvide, Patxi San Martin-Uriz, Ana López-Pérez, Gloria Abizanda, Purificación Ripalda-Cemboráin, Emma Muinos-López, Vincenzo Lagani, Jesper Tegner, Ming Wu, Stefan Janssens, José M Pérez-Pomares, Felipe Prósper, Adrián Ruiz-Villalba, David Gómez-Cabrero","doi":"10.1038/s41597-025-06533-0","DOIUrl":"https://doi.org/10.1038/s41597-025-06533-0","url":null,"abstract":"<p><p>Cardiac fibroblasts (CFs) are key mediators of heart repair following myocardial infarction (MI). A specific CF subpopulation, termed Reparative Cardiac Fibroblasts (RCFs), has been shown to orchestrate scar formation and prevent ventricular rupture after MI. However, the timing of RCF appearance and the molecular events underlying this transition remain largely undefined. Here, we present a multi-modal dataset capturing the transcriptional dynamics of CFs during the early phase post-MI. Our integrative dataset combines bulk RNA sequencing, RNAscope in situ hybridization, and spatial transcriptomics to anatomically and temporally map the gene expression changes associated with the transition into RCFs. The dataset provides resources to characterize the distinct molecular programs that guide the emergence of RCFs from Periostin (Postn)<sup>+</sup> activated CFs. This dataset provides a valuable resource for investigating CF heterogeneity and reparative pathways following MI. All raw and processed data, along with detailed metadata and annotations, are made available to facilitate reuse by the cardiovascular and single-cell biology communities.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":" ","pages":""},"PeriodicalIF":6.9,"publicationDate":"2026-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145960007","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}