Scientific Data: Latest Articles

Near complete assembly of Pyricularia penniseti infecting Cenchrus grass identified its eight core chromosomes.
IF 5.8, Zone 2 (comprehensive journals), Q1 MULTIDISCIPLINARY SCIENCES, Pub Date: 2024-10-31, DOI: 10.1038/s41597-024-04035-z
Yuyong Li, Xianjun Wang, Jianqiang Huang, Zhenyu Fang, Xiwen Lian, Guodong Lu, Guifang Lin, Zonghua Wang, Baohua Wang, Xiuxiu Li, Huakun Zheng

Fungi from the Pyricularia genus cause blast disease in many economically important crops and grasses, such as wheat, rice, and Cenchrus grass JUJUNCAO. Structural variation associated with the gain and loss of effectors contributes largely to the adaptive evolution of this fungus towards diverse host plants. A telomere-to-telomere genome assembly would facilitate the identification of genome-wide structural variations through comparative genomics. Here, we report a telomere-to-telomere, near-complete genome assembly of a Pyricularia penniseti isolate JC-1 infecting JUJUNCAO. The assembly consists of eight core chromosomes and two supernumerary chromosomes, named mini1 and mini2, spanning 42.1 Mb. We annotated 12,156 protein-coding genes and identified 4.54% of the genome as repetitive sequences. The two supernumerary chromosomes contained fewer genes and more repetitive sequences than the core chromosomes. Our genome and results provide valuable resources for future studies of genome evolution, structural variation, and host adaptation in Pyricularia fungi.

Scientific Data 11(1): 1186. PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11528102/pdf/
Citations: 0
Publisher Correction: Observing the Central Arctic Atmosphere and Surface with University of Colorado uncrewed aircraft systems.
IF 5.8, Zone 2 (comprehensive journals), Q1 MULTIDISCIPLINARY SCIENCES, Pub Date: 2024-10-31, DOI: 10.1038/s41597-024-03954-1
Gijs de Boer, Radiance Calmer, Gina Jozef, John J Cassano, Jonathan Hamilton, Dale Lawrence, Steven Borenstein, Abhiram Doddi, Christopher Cox, Julia Schmale, Andreas Preußer, Brian Argrow
Scientific Data 11(1): 1188. PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11528026/pdf/
Citations: 0
Data collation for climate-cooling gas dimethylsulfide in Antarctic snow, sea ice and underlying seawater.
IF 5.8, Zone 2 (comprehensive journals), Q1 MULTIDISCIPLINARY SCIENCES, Pub Date: 2024-10-31, DOI: 10.1038/s41597-024-04038-w
Gabrielle Burke, Pat Wongpan, Delphine Lannuzel, Hakase Hayashida

Dimethylsulfide (DMS) is a climatically active volatile sulfur compound found in Earth's oceans and atmosphere that plays an important role in cloud formation. DMS originates from its precursor dimethylsulfoniopropionate (DMSP), which is produced by several classes of phytoplankton. Concentrations of DMS and DMSP in Antarctic sea ice, snow and underlying seawater are not well documented and there is currently no dataset available to find the existing data. The purpose of this project was to compile historical measurements into a publicly available dataset. A total of 220 samples collected since 1992 were compiled using the Antarctic Sea ice Processes and Climate program template, in accordance with the existing datasets for chlorophyll-a, macronutrients, and dissolved iron. Analyses performed on the completed DMS dataset showed that the spatial and temporal coverages are limited; there are barely any measurements in autumn and winter, nor in the Amundsen or Ross seas. These findings provide a basis for future sampling efforts in the Antarctic region.

Scientific Data 11(1): 1185. PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11528020/pdf/
Citations: 0
Chromosome-scale genome assembly of the mangrove climber species Dalbergia candenatensis.
IF 5.8, Zone 2 (comprehensive journals), Q1 MULTIDISCIPLINARY SCIENCES, Pub Date: 2024-10-31, DOI: 10.1038/s41597-024-04032-2
Miaomiao Shi, Yu Zhang, Huiwen Huang, Shiran Gu, Xiangping Wang, Shijin Li, Zhongtao Zhao, Tieyao Tu

Comprising trees, climbers, and herbs found exclusively in intertidal environments, mangrove forests are among the most extreme and vulnerable ecosystems on our planet and have long been of great interest to biologists and ecologists. Here, we assembled, for the first time, the chromosome-scale genome of a climbing mangrove plant, Dalbergia candenatensis. The assembled genome size is approximately 474.55 Mb, with a scaffold N50 of 48.1 Mb, a complete BUSCO score of 98.4%, and a high LTR Assembly Index value of 21. The genome contained 283.46 Mb (59.74%) of repetitive sequences, and 29,554 protein-coding genes were predicted, of which 87.54% were functionally annotated against five databases. The high-quality genome assembly and annotation presented herein provide a valuable genomic resource that will expedite genomic and evolutionary studies of mangrove plants and facilitate the elucidation of the molecular mechanisms underlying their salt and waterlogging tolerance.
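
The scaffold N50 quoted in the abstract is a standard contiguity statistic: the length of the shortest scaffold in the smallest set of longest scaffolds that together cover at least half of the assembly. The sketch below illustrates only the definition. The function name and the toy lengths are hypothetical and are not taken from the paper or its data.

```python
def n50(lengths):
    """Return the N50 of a collection of scaffold (or contig) lengths."""
    ordered = sorted(lengths, reverse=True)  # longest first
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        # The first scaffold that pushes the running sum to at least
        # half of the total assembly length defines the N50.
        if running >= half_total:
            return length
    raise ValueError("lengths must be non-empty")


# Hypothetical toy assembly (arbitrary units), not real data.
toy_scaffolds = [50, 40, 30, 20, 10]
print(n50(toy_scaffolds))
```

With these toy values the total is 150, so the running sum passes the halfway mark of 75 at the second scaffold.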

Scientific Data 11(1): 1187. PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11528007/pdf/
Citations: 0
Genome guided, organ-specific transcriptome assembly of the European flounder (P. flesus) from the Baltic Sea.
IF 5.8, Zone 2 (comprehensive journals), Q1 MULTIDISCIPLINARY SCIENCES, Pub Date: 2024-10-30, DOI: 10.1038/s41597-024-04004-6
Konrad Pomianowski, Ewa Kulczykowska, Artur Burzyński

Although the European flounder is frequently used in research and has economic importance, there is still a lack of comprehensive transcriptome data for this species. In the present research we show RNA-Seq data from ten selected organs of a female P. flesus inhabiting the brackish waters of the Gulf of Gdańsk (southern Baltic Sea). High-throughput Next Generation Sequencing technology (NovaSeq 6000) was used to generate 500 M sequencing reads. These were mapped against the European flounder reference genome, and reads extracted from the mapping were assembled, producing 61k reliable contigs. Gene ontology (GO) terms were assigned to the majority of annotated contigs/unigenes based on the results of searches of the PFAM, PANTHER, UniProt and InterPro protein databases. BUSCO statistics for the eukaryota, metazoa, vertebrata and actinopterygii databases showed that the reported transcriptome represents a high level of completeness. The data set can be used as a tool in the design of experiments in various research fields, including biology, aquaculture and toxicology.

Scientific Data 11(1): 1184. PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11525550/pdf/
Citations: 0
WEMAC: Women and Emotion Multi-modal Affective Computing dataset.
IF 5.8, Zone 2 (comprehensive journals), Q1 MULTIDISCIPLINARY SCIENCES, Pub Date: 2024-10-30, DOI: 10.1038/s41597-024-04002-8
Jose A Miranda Calero, Laura Gutiérrez-Martín, Esther Rituerto-González, Elena Romero-Perales, Jose M Lanza-Gutiérrez, Carmen Peláez-Moreno, Celia López-Ongil

WEMAC is a unique open multi-modal dataset that comprises physiological, speech, and self-reported emotional data records of 100 women, targeting Gender-based Violence detection. Emotions were elicited through visualizing a validated video set using an immersive virtual reality headset. The physiological signals captured during the experiment include blood volume pulse, galvanic skin response, and skin temperature. The speech was acquired right after the stimuli visualization to capture the final traces of the perceived emotion. Subjects were asked to annotate among 12 categorical emotions, several dimensional emotions with a modified version of the Self-Assessment Manikin, and liking and familiarity labels. The technical validation proves that all the targeted categorical emotions show a strong statistically significant positive correlation with their corresponding reported ones. That means that the videos elicit the desired emotions in the users in most cases. Specifically, a negative correlation is found when comparing fear and not-fear emotions, indicating that this is a well-portrayed emotional dimension, a specific, though not exclusive, purpose of WEMAC towards detecting gender violence.

Scientific Data 11(1): 1182. PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11525988/pdf/
Citations: 0
A dataset of drone-captured, segmented images for oil spill detection in port environments.
IF 5.8, Zone 2 (comprehensive journals), Q1 MULTIDISCIPLINARY SCIENCES, Pub Date: 2024-10-30, DOI: 10.1038/s41597-024-03993-8
Thomas De Kerf, Seppe Sels, Svetlana Samsonova, Steve Vanlanduit

The high incidence of oil spills in port areas poses a serious threat to the environment, prompting the need for efficient detection mechanisms. Utilizing automated drones for this purpose can significantly improve the speed and accuracy of oil spill detection. Such advancements not only expedite cleanup operations, reducing environmental harm, but also enhance polluter accountability, potentially deterring future incidents. Currently, there is a scarcity of datasets employing RGB images for oil spill detection in maritime settings. This paper presents a unique, annotated dataset aimed at addressing this gap, leveraging a neural network for analysis on both desktop and edge computing platforms. The dataset, captured via drone, comprises 1268 images categorized into oil, water, and other. A convolutional neural network trained using a U-Net model architecture achieved an F1 score of 0.71 for oil detection. This underscores the dataset's practicality for real-world applications, offering crucial resources for environmental conservation in port environments.
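
For readers unfamiliar with the metric, the F1 score reported for the oil class is the harmonic mean of precision and recall. The following is a minimal sketch of that definition only. The function name and the counts in the usage line are hypothetical and do not come from the paper, whose evaluation protocol is not described in the abstract.

```python
def f1_score(true_pos, false_pos, false_neg):
    """Harmonic mean of precision and recall from raw counts."""
    if true_pos == 0:
        # Convention chosen for this sketch: no true positives gives F1 = 0.
        return 0.0
    precision = true_pos / (true_pos + false_pos)
    recall = true_pos / (true_pos + false_neg)
    return 2 * precision * recall / (precision + recall)


# Hypothetical counts for one class (e.g. pixels labelled "oil").
print(round(f1_score(true_pos=71, false_pos=29, false_neg=29), 2))
```

Because F1 ignores true negatives, it is less flattered than plain accuracy when one class (such as water) dominates the image.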

Scientific Data 11(1): 1180. PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11525993/pdf/
Citations: 0
FARFUM-RoP, A dataset for computer-aided detection of Retinopathy of Prematurity.
IF 5.8, Zone 2 (comprehensive journals), Q1 MULTIDISCIPLINARY SCIENCES, Pub Date: 2024-10-30, DOI: 10.1038/s41597-024-03897-7
Morteza Akbari, Hamid-Reza Pourreza, Elias Khalili Pour, Afsar Dastjani Farahani, Fatemeh Bazvand, Nazanin Ebrahimiadib, Marjan Imani Fooladi, Fereshteh Ramazani K

Retinopathy of Prematurity (ROP) is a critical eye disorder affecting premature infants, characterized by abnormal blood vessel development in the retina. Plus Disease, indicating severe ROP progression, plays a pivotal role in diagnosis. Recent Artificial Intelligence (AI) systems have matched or surpassed human experts in ROP detection, especially for Plus Disease. However, the success of AI systems depends on high-quality datasets, emphasizing the need for collaboration and data sharing among researchers. To address this challenge, the paper introduces a new public dataset, FARFUM-RoP (Farabi and Ferdowsi University of Mashhad's ROP dataset), comprising 1533 ROP fundus images from 68 patients, annotated independently by five experienced pediatric ophthalmologists as "Normal," "Pre-Plus," or "Plus." Ethical principles and consent were meticulously followed during data collection. The paper presents the dataset structure, patient details, and expert labels.
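
Since each image carries five independent expert labels, a downstream user has to decide how to consolidate them. One common option is a simple majority vote, sketched below. This is an assumption for illustration, not a procedure stated in the abstract. The function name and the example votes are hypothetical.

```python
from collections import Counter


def consensus_label(labels):
    """Return the most frequent label, or None when the top count is tied."""
    if not labels:
        raise ValueError("labels must be non-empty")
    ranked = Counter(labels).most_common()
    top_label, top_count = ranked[0]
    # A tie for first place means there is no clear majority choice.
    if len(ranked) > 1 and ranked[1][1] == top_count:
        return None
    return top_label


# Hypothetical votes from five graders for a single image.
votes = ["Plus", "Plus", "Pre-Plus", "Normal", "Plus"]
print(consensus_label(votes))
```

Returning None on ties keeps ambiguous images visible, so they can be reviewed or excluded rather than silently assigned a class.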

Scientific Data 11(1): 1176. PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11525552/pdf/
Citations: 0
Ancient Yi Script Handwriting Sample Repository.
IF 5.8, Zone 2 (comprehensive journals), Q1 MULTIDISCIPLINARY SCIENCES, Pub Date: 2024-10-30, DOI: 10.1038/s41597-024-03918-5
Xiaojuan Liu, Xu Han, Shanxiong Chen, Weijia Dai, Qiuyue Ruan

The ancient Yi script has been used for over 8000 years and ranks alongside oracle bone script, Sumerian, Egyptian, Mayan, and Harappan as one of the six ancient scripts of the world. In this article, we collected 2922 handwritten single-character samples of commonly used ancient Yi characters. Each character was written by 310 people, for a total of 427,939 valid characters. We also completed continuous handwritten text sampling, written by 250 people with 5 texts per person, covering topics such as Yi astronomy, geography, rituals, and agriculture. During data collection, we proposed an automatic sampling method for ancient Yi script and completed the automatic cutting and labeling of handwritten samples. Furthermore, we tested the recognition performance of the sorted data set under different deep learning network models. The results show that ancient Yi script has diverse shape structures and rich writing styles, and that the data set can serve as a benchmark in related fields such as handwritten text recognition and handwritten text generation.

Scientific Data 11(1): 1183. PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11526026/pdf/
Citations: 0
NPKGRIDS: a global georeferenced dataset of N, P2O5, and K2O fertilizer application rates for 173 crops.
IF 5.8, Zone 2 (comprehensive journals), Q1 MULTIDISCIPLINARY SCIENCES, Pub Date: 2024-10-30, DOI: 10.1038/s41597-024-04030-4
Thu Ha Nguyen, Fiona H M Tang, Giulia Conchedda, Leon Casse, Griffiths Obli-Laryea, Francesco N Tubiello, Federico Maggi

We introduce NPKGRIDS, a new geospatial dataset that provides, for the first time, application rates for all three main plant nutrients, nitrogen (N), phosphorus (P, expressed as phosphorus pentoxide, P2O5), and potassium (K, expressed as potassium oxide, K2O), across 173 crops as of 2020, at a geospatial resolution of 0.05° (approximately 5.6 km at the equator). NPKGRIDS was developed with a data fusion approach that integrates crop mask information with eight published datasets of fertilizer application rates, compiled from either georeferenced data or national and subnational statistics. The total applied masses of N, P2O5, and K2O were benchmarked against country-level information from FAO and the International Fertilizers Association (IFA) and validated against data available from National Statistical Offices (NSOs). NPKGRIDS can be used in global modelling and in decision and policy making to help maximize crop yields while reducing environmental impacts.
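
The abstract's figure of roughly 5.6 km for a 0.05° cell at the equator can be checked with a short calculation. The sketch below is illustrative only and is not part of the NPKGRIDS dataset or its tooling: the function name `cell_width_km` is hypothetical, the Earth radius is the standard WGS84 equatorial value rather than anything stated in the abstract, and the spherical approximation is an assumption made for simplicity.

```python
import math

# WGS84 equatorial radius in km (a standard constant, not taken from the abstract).
EQUATORIAL_RADIUS_KM = 6378.137


def cell_width_km(resolution_deg: float, latitude_deg: float = 0.0) -> float:
    """East-west width of a grid cell, using a spherical approximation.

    Hypothetical helper for illustration; it is not an NPKGRIDS function.
    """
    # Length of one degree of longitude along the equator.
    km_per_degree = 2 * math.pi * EQUATORIAL_RADIUS_KM / 360.0
    # Meridians converge toward the poles, so the width shrinks with cos(latitude).
    return resolution_deg * km_per_degree * math.cos(math.radians(latitude_deg))


equator = cell_width_km(0.05)
mid_lat = cell_width_km(0.05, latitude_deg=45.0)
print(f"{equator:.2f} km at the equator")        # 5.57 km, i.e. about 5.6 km
print(f"{mid_lat:.2f} km at 45 degrees latitude")  # 3.94 km
```

Under these assumptions the equatorial value rounds to the 5.6 km quoted in the abstract, and the second line shows why the qualifier "at the equator" matters: the east-west extent of the same 0.05° cell is noticeably smaller at higher latitudes.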

Scientific Data, vol. 11, no. 1, article 1179. Published 2024-10-30. PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11526156/pdf/
Citations: 0