首页 > 最新文献

Data in Brief最新文献

英文 中文
Soil and crop data from a long-term organic fertilization trial in Sub-Sahelian market gardening 萨赫勒以南地区市场园艺长期有机施肥试验的土壤和作物数据
IF 1.4 Q3 MULTIDISCIPLINARY SCIENCES Pub Date : 2026-01-08 DOI: 10.1016/j.dib.2026.112456
Marie-Liesse Vermeire , Pathé Basse , Samuel Legros , Falilou Diallo , Anne Desnues , Frédéric Feder
Recycling the growing stock of organic waste products (OWP) from cities, factories, and farms is a key challenge for sustainable agriculture. However, it must be done with awareness of performances but also potential long-term environmental and health risks. In this context, the SOERE PRO observatory was established ("Systèmes d'Observation et d'Expérimentation pour la Recherche en Environnement - Produits Résiduaires Organiques'', a label granted by the French National Research Alliance for the Environment (AllEnvi) to recognize high-quality research infrastructures, which translates to "Long-term Observation and Experimentation Systems for Environmental Research - Organic Waste Products''), including the trial in Sangalkam, in the Dakar region of Senegal, where these data are collected. Since 2016, four fertilizer types - one mineral (synthetic) and three organic - have been applied annually to three successive vegetable crops (tomato, lettuce, carrot). The dataset currently covers the period 2016 - 2025, with data collection ongoing and new data to be added in the future. Manual weeding and hoeing is carried out regularly for each crop, no pesticides are used for crop protection on the trial. A comprehensive, multi-variable dataset is consistently documented, including soil physico-chemical parameters measured annually at three depths, organic waste product characterization, crop yield and quality parameters, and detailed management activities, making it particularly suitable for process-based modelling and long-term impact assessment. The originality of this dataset lies in its long duration, the diversity of organic and mineral fertilization strategies, the inclusion of multiple vegetable crops per year, and its location under Sub-Sahelian conditions, a context for which long-term agronomic datasets remain scarce. All soil, OWP and vegetables samples are stored in a sample bank in Dakar, and available for additional analyses. The objective of this dataset is to provide long-term, integrated information on crop productivity, crop quality, and soil responses to repeated organic and mineral fertilization in a Sub-Sahelian market-gardening system. The dataset is publicly available through a Dataverse repository for free (re)use in meta-analyses, process-based modelling, and environmental studies, notably to improve understanding of nutrient cycling, contaminant dynamics, soil biodiversity, and long-term soil functioning in Sub-Sahelian agroecosystems, and to support sustainable land management and food security in Southern countries under future climate change.
从城市、工厂和农场中回收越来越多的有机废物(OWP)是可持续农业的一个关键挑战。然而,在进行这项工作时,不仅要意识到业绩,还要注意潜在的长期环境和健康风险。在这方面,建立了SOERE PRO观测站(“环境研究的观察和实验系统-有机产品”,这是法国国家环境研究联盟(AllEnvi)授予的一个标签,以承认高质量的研究基础设施,其翻译为“环境研究的长期观察和实验系统-有机废物”),包括在桑卡尔卡姆的试验。在收集这些数据的塞内加尔达喀尔地区。自2016年以来,四种肥料——一种矿物(合成)和三种有机——每年连续施用于三种蔬菜作物(番茄、生菜、胡萝卜)。该数据集目前涵盖2016 - 2025年期间,数据收集正在进行中,未来将添加新数据。每个作物定期进行人工除草和锄地,试验中不使用农药进行作物保护。一个全面的、多变量的数据集被一致地记录下来,包括每年在三个深度测量的土壤物理化学参数、有机废物特性、作物产量和质量参数,以及详细的管理活动,使其特别适合基于过程的建模和长期影响评估。该数据集的独创性在于其持续时间长,有机和矿物施肥策略的多样性,每年包括多种蔬菜作物,以及其在萨赫勒以南条件下的位置,这是一个长期农艺数据集仍然稀缺的背景。所有土壤、土壤磷和蔬菜样本都储存在达喀尔的一个样本库中,供进一步分析使用。该数据集的目的是提供关于萨赫勒以南市场园艺系统中作物生产力、作物质量和土壤对重复施用有机和矿物肥料的反应的长期综合信息。该数据集可通过Dataverse存储库公开提供,供元分析、基于过程的建模和环境研究免费(重复)使用,特别是用于提高对萨赫勒以南农业生态系统中养分循环、污染物动态、土壤生物多样性和长期土壤功能的理解,并支持南方国家在未来气候变化下的可持续土地管理和粮食安全。
{"title":"Soil and crop data from a long-term organic fertilization trial in Sub-Sahelian market gardening","authors":"Marie-Liesse Vermeire ,&nbsp;Pathé Basse ,&nbsp;Samuel Legros ,&nbsp;Falilou Diallo ,&nbsp;Anne Desnues ,&nbsp;Frédéric Feder","doi":"10.1016/j.dib.2026.112456","DOIUrl":"10.1016/j.dib.2026.112456","url":null,"abstract":"<div><div>Recycling the growing stock of organic waste products (OWP) from cities, factories, and farms is a key challenge for sustainable agriculture. However, it must be done with awareness of performances but also potential long-term environmental and health risks. In this context, the SOERE PRO observatory was established (\"Systèmes d'Observation et d'Expérimentation pour la Recherche en Environnement - Produits Résiduaires Organiques'', a label granted by the French National Research Alliance for the Environment (AllEnvi) to recognize high-quality research infrastructures, which translates to \"Long-term Observation and Experimentation Systems for Environmental Research - Organic Waste Products''), including the trial in Sangalkam, in the Dakar region of Senegal, where these data are collected. Since 2016, four fertilizer types - one mineral (synthetic) and three organic - have been applied annually to three successive vegetable crops (tomato, lettuce, carrot). The dataset currently covers the period 2016 - 2025, with data collection ongoing and new data to be added in the future. Manual weeding and hoeing is carried out regularly for each crop, no pesticides are used for crop protection on the trial. A comprehensive, multi-variable dataset is consistently documented, including soil physico-chemical parameters measured annually at three depths, organic waste product characterization, crop yield and quality parameters, and detailed management activities, making it particularly suitable for process-based modelling and long-term impact assessment. The originality of this dataset lies in its long duration, the diversity of organic and mineral fertilization strategies, the inclusion of multiple vegetable crops per year, and its location under Sub-Sahelian conditions, a context for which long-term agronomic datasets remain scarce. All soil, OWP and vegetables samples are stored in a sample bank in Dakar, and available for additional analyses. The objective of this dataset is to provide long-term, integrated information on crop productivity, crop quality, and soil responses to repeated organic and mineral fertilization in a Sub-Sahelian market-gardening system. The dataset is publicly available through a Dataverse repository for free (re)use in meta-analyses, process-based modelling, and environmental studies, notably to improve understanding of nutrient cycling, contaminant dynamics, soil biodiversity, and long-term soil functioning in Sub-Sahelian agroecosystems, and to support sustainable land management and food security in Southern countries under future climate change.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"65 ","pages":"Article 112456"},"PeriodicalIF":1.4,"publicationDate":"2026-01-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145976671","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Dataset of RGB-D images of object collections from multiple viewpoints with aligned high-resolution 3D models of objects 多视点物体集合的RGB-D图像数据集,具有对齐的高分辨率物体3D模型
IF 1.4 Q3 MULTIDISCIPLINARY SCIENCES Pub Date : 2026-01-08 DOI: 10.1016/j.dib.2026.112450
Xinchao Song , Mingjun Li , Sean Banerjee , Natasha Kholgade Banerjee
We present the HILO dataset consisting of high-resolution 3D scanned models for 253 common-use objects and 32,256 multi-viewpoint RGB-D images with typically low-resolution data for 144 tabletop scenes consisting of collections of random sets of 10 objects drawn from the set of 253 objects. The dataset provides the 6 degree of freedom (6DOF) pose for all objects found in each of the 32,256 RGB-D images, obtained by performing precise 3D alignment of the 3D models to the RGB-D images. The dataset also contains metadata on object mass, short text descriptor, binning into everyday use classes, and aspect ratio and function categories, intrinsic parameter information for RGB-D sensors used in capture, and transformations between camera poses. Object 3D models in the dataset were acquired by scanning using a tabletop 3D scanner, and were manually inspected, cleaned, repaired, and exported as original ultra high-resolution at ∼1M vertices and simplified high-resolution meshes at ∼10k vertices. To capture the multi-view RGB-D images, we established an in-house testbed consisting of a turntable and two robotic manipulators to respectively cover azimuth angles and elevation angles, and span a hemisphere. Images were captured using two Microsoft Azure Kinect sensors mounted at the wrists of the robot, one per robot. We captured images over two distances forming hemispherical shells. We used in-house software written in python to control the turntable movement, robot motion, and image capture, as well as to perform camera calibration, processing to generate registered images and foreground masks, manual precise alignment of object models to images, and post-capture correction of misalignments in camera transformation parameters. The dataset provides value in enabling training and evaluation of algorithms for several tasks in computer vision, artificial intelligence (AI), and robotics such as object completion, recognition, segmentation, high-resolution structure generation, robotic grasp planning, and recognition of human-preferred grasp locations for human-robot collaboration.
我们展示了HILO数据集,包括253个常用对象的高分辨率3D扫描模型和32,256个多视点RGB-D图像,以及144个桌面场景的典型低分辨率数据,这些场景由从253个对象集中抽取的10个对象的随机集合组成。该数据集通过对3D模型与RGB-D图像进行精确的3D对齐,为32,256张RGB-D图像中的所有物体提供了6个自由度(6DOF)的姿态。该数据集还包含关于物体质量的元数据,短文本描述符,分成日常使用类,宽高比和功能类别,用于捕获的RGB-D传感器的内在参数信息,以及相机姿势之间的转换。数据集中的对象3D模型是通过使用桌面3D扫描仪扫描获得的,然后进行人工检查、清洗、修复,并在~ 1M顶点处导出为原始的超高分辨率网格,在~ 10k顶点处导出为简化的高分辨率网格。为了捕获多视角RGB-D图像,我们建立了一个内部测试平台,该平台由一个转台和两个机器人操作台组成,分别覆盖方位角和仰角,并跨越一个半球。图像是通过安装在机器人手腕上的两个微软Azure Kinect传感器捕获的,每个机器人一个。我们在两个距离上拍摄了形成半球形壳的图像。我们使用python编写的内部软件来控制转台运动,机器人运动和图像捕获,以及执行相机校准,处理以生成配准图像和前景蒙版,手动精确对齐对象模型到图像,以及捕获后相机变换参数的不校准校正。该数据集为计算机视觉,人工智能(AI)和机器人技术中的几个任务的算法训练和评估提供了价值,例如对象补全,识别,分割,高分辨率结构生成,机器人抓取规划以及识别人类首选的抓取位置以进行人机协作。
{"title":"Dataset of RGB-D images of object collections from multiple viewpoints with aligned high-resolution 3D models of objects","authors":"Xinchao Song ,&nbsp;Mingjun Li ,&nbsp;Sean Banerjee ,&nbsp;Natasha Kholgade Banerjee","doi":"10.1016/j.dib.2026.112450","DOIUrl":"10.1016/j.dib.2026.112450","url":null,"abstract":"<div><div>We present the HILO dataset consisting of high-resolution 3D scanned models for 253 common-use objects and 32,256 multi-viewpoint RGB-D images with typically low-resolution data for 144 tabletop scenes consisting of collections of random sets of 10 objects drawn from the set of 253 objects. The dataset provides the 6 degree of freedom (6DOF) pose for all objects found in each of the 32,256 RGB-D images, obtained by performing precise 3D alignment of the 3D models to the RGB-D images. The dataset also contains metadata on object mass, short text descriptor, binning into everyday use classes, and aspect ratio and function categories, intrinsic parameter information for RGB-D sensors used in capture, and transformations between camera poses. Object 3D models in the dataset were acquired by scanning using a tabletop 3D scanner, and were manually inspected, cleaned, repaired, and exported as original ultra high-resolution at ∼1M vertices and simplified high-resolution meshes at ∼10k vertices. To capture the multi-view RGB-D images, we established an in-house testbed consisting of a turntable and two robotic manipulators to respectively cover azimuth angles and elevation angles, and span a hemisphere. Images were captured using two Microsoft Azure Kinect sensors mounted at the wrists of the robot, one per robot. We captured images over two distances forming hemispherical shells. We used in-house software written in python to control the turntable movement, robot motion, and image capture, as well as to perform camera calibration, processing to generate registered images and foreground masks, manual precise alignment of object models to images, and post-capture correction of misalignments in camera transformation parameters. The dataset provides value in enabling training and evaluation of algorithms for several tasks in computer vision, artificial intelligence (AI), and robotics such as object completion, recognition, segmentation, high-resolution structure generation, robotic grasp planning, and recognition of human-preferred grasp locations for human-robot collaboration.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"65 ","pages":"Article 112450"},"PeriodicalIF":1.4,"publicationDate":"2026-01-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145976672","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Comprehensive and spatially detailed passenger vehicle and truck traffic volume data for the United States estimated by machine learning. 通过机器学习估计的美国全面和空间详细的乘用车和卡车交通量数据。
IF 1.4 Q3 MULTIDISCIPLINARY SCIENCES Pub Date : 2026-01-07 eCollection Date: 2026-02-01 DOI: 10.1016/j.dib.2026.112451
Brittany Antonczak, Meg Fay, Aviral Chawla, Gregory Rowangould

The Highway Performance Monitoring System, managed by the Federal Highway Administration, provides data on average annual daily traffic volume across roadways in the United States, but it has limited representation of medium- and heavy-duty vehicle traffic on lower-volume roadways that are not part of the national highway system. This gap limits research and policy analysis on the community impacts of truck traffic, especially concerning air quality and public health. To address this, we use Random Forest Regression to estimate medium- and heavy-duty vehicle traffic volumes on network links where these data are missing. The result is a comprehensive vehicle traffic dataset that covers 85.2% of public roadways in the United States. From these data, we also calculate traffic density values for each census block and vehicle class that can serve as a high-resolution surrogate for traffic-related air pollution exposure in public health studies and policy analysis. Our high-resolution spatial data products are rigorously validated and provide a more complete representation of truck traffic than any existing publicly available dataset. These datasets are valuable for transportation planning, public health research, and policy decisions aimed at understanding and mitigating the effects of truck traffic on communities that are disproportionately exposed to air pollution from vehicle traffic.

美国联邦公路管理局(Federal Highway Administration)管理的公路运行状况监测系统(Highway Performance Monitoring System)提供了美国公路平均每日交通量的数据,但它对不属于国家公路系统的低交通量公路上的中型和重型车辆的交通情况的反映有限。这一差距限制了对卡车交通对社区影响的研究和政策分析,特别是对空气质量和公共卫生的影响。为了解决这个问题,我们使用随机森林回归来估计这些数据缺失的网络链路上的中型和重型车辆交通量。其结果是一个全面的车辆交通数据集,覆盖了美国85.2%的公共道路。从这些数据中,我们还计算了每个普查街区和车辆类别的交通密度值,这些值可以作为公共卫生研究和政策分析中交通相关空气污染暴露的高分辨率替代值。我们的高分辨率空间数据产品经过严格验证,比任何现有的公开数据集都能提供更完整的卡车交通表现。这些数据集对于交通规划、公共卫生研究和政策决策具有重要价值,这些决策旨在了解和减轻卡车交通对那些不成比例地暴露于车辆交通空气污染的社区的影响。
{"title":"Comprehensive and spatially detailed passenger vehicle and truck traffic volume data for the United States estimated by machine learning.","authors":"Brittany Antonczak, Meg Fay, Aviral Chawla, Gregory Rowangould","doi":"10.1016/j.dib.2026.112451","DOIUrl":"10.1016/j.dib.2026.112451","url":null,"abstract":"<p><p>The Highway Performance Monitoring System, managed by the Federal Highway Administration, provides data on average annual daily traffic volume across roadways in the United States, but it has limited representation of medium- and heavy-duty vehicle traffic on lower-volume roadways that are not part of the national highway system. This gap limits research and policy analysis on the community impacts of truck traffic, especially concerning air quality and public health. To address this, we use Random Forest Regression to estimate medium- and heavy-duty vehicle traffic volumes on network links where these data are missing. The result is a comprehensive vehicle traffic dataset that covers 85.2% of public roadways in the United States. From these data, we also calculate traffic density values for each census block and vehicle class that can serve as a high-resolution surrogate for traffic-related air pollution exposure in public health studies and policy analysis. Our high-resolution spatial data products are rigorously validated and provide a more complete representation of truck traffic than any existing publicly available dataset. These datasets are valuable for transportation planning, public health research, and policy decisions aimed at understanding and mitigating the effects of truck traffic on communities that are disproportionately exposed to air pollution from vehicle traffic.</p>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"64 ","pages":"112451"},"PeriodicalIF":1.4,"publicationDate":"2026-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12855594/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146104187","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Effects of raw and thermally processed spent coffee grounds on Miscanthus × giganteus plantation: Data description 生咖啡渣和热处理咖啡渣对芒草种植园的影响:数据描述
IF 1.4 Q3 MULTIDISCIPLINARY SCIENCES Pub Date : 2026-01-07 DOI: 10.1016/j.dib.2025.112432
Nicole Nawrot , Jacek Kluska
Cultivating Miscanthus × giganteus (M × g) energy crop on marginal soil supports phytoattenuation and provides high-energy biomass for biofuel production. Improving nutrient-poor soil with low-cost recovered organic amendments, such as spent coffee grounds (SCG) and SCG-derived biochar (BC) offers sustainable benefits. This data article presents the findings from a medium-term greenhouse experiment at the Gdansk University of Technology assessing M × g cultivation on marginal soil with SCG and BC amendments into soil. In a pot-scale experiment the medium term-effect on M × g biomass growth, photosynthesis parameters, root tissues development, as well as final elemental composition was examined. Soil pH and elemental composition were also determined. As global coffee consumption increases, large quantities of SCG are generated and often landfilled. Their beneficial reuse aligns with circular economy principles and Sustainable Development Goals (SDGs 7 and 13), providing both a short-term nutrient source and a means of improving soil quality and resilience. The article compiles five datasets detailing: (1) M × g growth parameters, tissue development, and photosynthetic indices, (2) nutrient and caffeine leaching behaviour; and (3) elemental composition of plants and soils following exposure. These datasets, available in the Bridge of Knowledge Gdansk University of Technology repository, provide a resource for environmental researchers, soil and plant scientists, biochar specialists, and decisionmakers working to restore marginal soil usability. This study promotes sustainable land management by demonstrating how organic wastes and biochar can be combined to improve crop performance, sequester carbon, and reduce nutrient losses while minimizing external fertilizer inputs.
在边缘土壤上种植芒草(M × g)能源作物支持植物衰减,并为生物燃料生产提供高能生物质。利用低成本的回收有机改良剂,如废咖啡渣(SCG)和源自SCG的生物炭(BC)改善营养贫乏的土壤,提供了可持续的效益。这篇数据文章介绍了格但斯克工业大学中期温室试验的结果,该试验评估了在边缘土壤上使用SCG和BC改良的M × g种植。在盆栽试验中,考察了中期对M × g生物量生长、光合参数、根组织发育和最终元素组成的影响。测定了土壤pH值和元素组成。随着全球咖啡消费量的增加,产生了大量的SCG,并经常被填埋。它们的有益再利用符合循环经济原则和可持续发展目标(可持续发展目标7和13),既提供短期营养来源,又提供改善土壤质量和恢复力的手段。本文编制了五个数据集,详细说明:(1)M × g生长参数、组织发育和光合指标;(2)养分和咖啡因浸出行为;(3)暴露后植物和土壤的元素组成。这些数据集可在格但斯克科技大学知识库的知识之桥中获得,为环境研究人员、土壤和植物科学家、生物炭专家以及致力于恢复边际土壤可用性的决策者提供了资源。这项研究通过展示有机废物和生物炭如何结合起来提高作物性能、固碳和减少养分损失,同时最大限度地减少外部肥料投入,促进了可持续土地管理。
{"title":"Effects of raw and thermally processed spent coffee grounds on Miscanthus × giganteus plantation: Data description","authors":"Nicole Nawrot ,&nbsp;Jacek Kluska","doi":"10.1016/j.dib.2025.112432","DOIUrl":"10.1016/j.dib.2025.112432","url":null,"abstract":"<div><div>Cultivating <em>Miscanthus × giganteus</em> (<em>M</em> <em>×</em> <em>g</em>) energy crop on marginal soil supports phytoattenuation and provides high-energy biomass for biofuel production. Improving nutrient-poor soil with low-cost recovered organic amendments, such as spent coffee grounds (SCG) and SCG-derived biochar (BC) offers sustainable benefits. This data article presents the findings from a medium-term greenhouse experiment at the Gdansk University of Technology assessing <em>M</em> <em>×</em> <em>g</em> cultivation on marginal soil with SCG and BC amendments into soil. In a pot-scale experiment the medium term-effect on <em>M</em> <em>×</em> <em>g</em> biomass growth, photosynthesis parameters, root tissues development, as well as final elemental composition was examined. Soil pH and elemental composition were also determined. As global coffee consumption increases, large quantities of SCG are generated and often landfilled. Their beneficial reuse aligns with circular economy principles and Sustainable Development Goals (SDGs 7 and 13), providing both a short-term nutrient source and a means of improving soil quality and resilience. The article compiles five datasets detailing: (1) <em>M</em> <em>×</em> <em>g</em> growth parameters, tissue development, and photosynthetic indices, (2) nutrient and caffeine leaching behaviour; and (3) elemental composition of plants and soils following exposure. These datasets, available in the Bridge of Knowledge Gdansk University of Technology repository, provide a resource for environmental researchers, soil and plant scientists, biochar specialists, and decisionmakers working to restore marginal soil usability. This study promotes sustainable land management by demonstrating how organic wastes and biochar can be combined to improve crop performance, sequester carbon, and reduce nutrient losses while minimizing external fertilizer inputs.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"64 ","pages":"Article 112432"},"PeriodicalIF":1.4,"publicationDate":"2026-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145973351","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Participatory and multi-disciplinary science dataset and surveys for the assessment of the microbiological and behavioural factors influencing fresh fruits and vegetables' waste at home 参与和多学科的科学数据集和调查,以评估影响家中新鲜水果和蔬菜浪费的微生物和行为因素
IF 1.4 Q3 MULTIDISCIPLINARY SCIENCES Pub Date : 2026-01-07 DOI: 10.1016/j.dib.2025.112434
Camille Marchal , Damien Ballan , Sarra Azib , Morgane Innocent , Bertrand Urien , Annick Tamaro , Marine Le Gall-Ely , Emmanuel Coton , Adeline Picot , Jérôme Mounier , Louis Coroller , Patrick Gabriel
Fresh fruits and vegetables (FFV) represent the largest part of food waste at the consumer level. This waste directly results from FFV physiological and microbiological spoilage, itself intricately linked to behavioural factors such as consumer practices, including purchase, storage and hygiene practices, but also consumers’ perceptions towards spoilage. Based on a dual approach combining microbiological and behavioural sciences, we examined the link between FFV waste produced by 49 volunteering French households, measured using connected bins, the microbial ecology of their storage compartments, using culture-dependent and -independent approaches, and their consumer behaviour, cleaning and storage practices, through in-depth interviews and a dedicated survey. An exploratory qualitative survey carried out on 17 individuals followed by two quantitative data collections on 1048 and 815 representative French consumers enabled us to identify anti-FFV waste practices and to cluster consumers according to their anti-FFV waste behaviours. Spoilage dynamics of commonly consumed FFV, according to storage temperature, microbial contamination level and the presence or absence of surface wounds, were also performed in controlled conditions. This citizen-science-based dataset covers a wide array of microbiological and behavioural factors related to domestic FFV waste, as well as real measurements of waste volumes thanks to the innovative use of connected bins. Altogether, this data could provide interesting insights into more effective and accessible guidelines for FFV waste reduction at the consumer level, and thus to a potential reduction of global food waste and its related costs.
在消费者层面,新鲜水果和蔬菜(FFV)是食物浪费的最大部分。这种浪费直接来自FFV生理和微生物腐败,其本身与消费者行为(包括购买、储存和卫生习惯)以及消费者对腐败的看法等行为因素密切相关。基于微生物学和行为科学相结合的双重方法,我们通过深入访谈和专门调查,研究了49个法国志愿家庭产生的FFV废物之间的联系,使用连接的垃圾箱进行测量,他们的储存隔间的微生物生态,使用培养依赖和独立的方法,以及他们的消费者行为,清洁和储存实践。我们对17个人进行了探索性定性调查,随后对1048名和815名具有代表性的法国消费者进行了两次定量数据收集,使我们能够确定反ffv浪费行为,并根据消费者的反ffv浪费行为对消费者进行分类。在受控条件下,根据储存温度、微生物污染水平和有无表面伤口,研究了通常消耗的FFV的腐败动力学。这个以公民科学为基础的数据集涵盖了与家庭FFV废物有关的一系列微生物和行为因素,以及由于创新使用连接垃圾箱而对废物量的实际测量。总的来说,这些数据可以为在消费者层面上更有效和更容易获得的FFV废物减少准则提供有趣的见解,从而可能减少全球食物浪费及其相关成本。
{"title":"Participatory and multi-disciplinary science dataset and surveys for the assessment of the microbiological and behavioural factors influencing fresh fruits and vegetables' waste at home","authors":"Camille Marchal ,&nbsp;Damien Ballan ,&nbsp;Sarra Azib ,&nbsp;Morgane Innocent ,&nbsp;Bertrand Urien ,&nbsp;Annick Tamaro ,&nbsp;Marine Le Gall-Ely ,&nbsp;Emmanuel Coton ,&nbsp;Adeline Picot ,&nbsp;Jérôme Mounier ,&nbsp;Louis Coroller ,&nbsp;Patrick Gabriel","doi":"10.1016/j.dib.2025.112434","DOIUrl":"10.1016/j.dib.2025.112434","url":null,"abstract":"<div><div>Fresh fruits and vegetables (FFV) represent the largest part of food waste at the consumer level. This waste directly results from FFV physiological and microbiological spoilage, itself intricately linked to behavioural factors such as consumer practices, including purchase, storage and hygiene practices, but also consumers’ perceptions towards spoilage. Based on a dual approach combining microbiological and behavioural sciences, we examined the link between FFV waste produced by 49 volunteering French households, measured using connected bins, the microbial ecology of their storage compartments, using culture-dependent and -independent approaches, and their consumer behaviour, cleaning and storage practices, through in-depth interviews and a dedicated survey. An exploratory qualitative survey carried out on 17 individuals followed by two quantitative data collections on 1048 and 815 representative French consumers enabled us to identify anti-FFV waste practices and to cluster consumers according to their anti-FFV waste behaviours. Spoilage dynamics of commonly consumed FFV, according to storage temperature, microbial contamination level and the presence or absence of surface wounds, were also performed in controlled conditions. This citizen-science-based dataset covers a wide array of microbiological and behavioural factors related to domestic FFV waste, as well as real measurements of waste volumes thanks to the innovative use of connected bins. Altogether, this data could provide interesting insights into more effective and accessible guidelines for FFV waste reduction at the consumer level, and thus to a potential reduction of global food waste and its related costs.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"65 ","pages":"Article 112434"},"PeriodicalIF":1.4,"publicationDate":"2026-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145976666","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
UniEload: Electrical load dataset for energy forecasting applications at public universities in Bangladesh uniload:孟加拉国公立大学能源预测应用的电力负荷数据集
IF 1.4 Q3 MULTIDISCIPLINARY SCIENCES Pub Date : 2026-01-07 DOI: 10.1016/j.dib.2025.112444
Utshob Sutradhar , Priyankar Biswas , Sumon Hossain , A.T.M. Saiful Islam , Shuvo Dev
This paper presents a dataset on electrical power collected from a university campus in Bangladesh. It is meant to help research on energy forecasting in university settings. The dataset has hourly measurements of system voltage, three-phase currents (R, Y, B), and power factor (pf). These were recorded at the campus substation. Data were collected during different operational conditions, including academic periods and vacations. This provides insights into load behaviour, changes in power factor, and phase imbalance patterns in an educational setting. The dataset supports the creation and assessment of models for load forecasting, anomaly detection, and improving power efficiency. It was also combined with weather data to aid research on load forecasting that takes weather into account. The weather parameters include temperature, humidity, precipitation, wind speed, and solar radiation. All weather values match energy values and were gathered hourly and daily. This dataset is especially useful for researchers studying how artificial intelligence and machine learning can be applied in managing electrical energy. The dataset also includes notes about context, such as reduced load during national holidays. This improves its usefulness for studies that focus on events in forecasting. By making this dataset open access, it helps fill the gap in publicly available electrical load data from educational institutions in developing countries. This supports reproducible research and sustainable energy management on campus.
本文介绍了从孟加拉国一所大学校园收集的电力数据集。它的目的是帮助大学环境中的能源预测研究。该数据集每小时测量一次系统电压、三相电流(R、Y、B)和功率因数(pf)。这些都是在校园变电站录的。数据是在不同的操作条件下收集的,包括学习期间和假期。这提供了对负载行为、功率因数变化和教育环境中的相位不平衡模式的见解。该数据集支持创建和评估负荷预测、异常检测和提高电力效率的模型。它还与天气数据相结合,以帮助考虑天气因素的负荷预测研究。天气参数包括温度、湿度、降水、风速和太阳辐射。所有的天气值都与能量值相匹配,每小时和每天收集一次。该数据集对于研究如何将人工智能和机器学习应用于电能管理的研究人员特别有用。该数据集还包括有关上下文的注释,例如国家假日期间的负载减少。这提高了它对关注预测事件的研究的有用性。通过使该数据集开放获取,它有助于填补发展中国家教育机构公开可用的电力负荷数据的空白。这支持可重复研究和校园可持续能源管理。
{"title":"UniEload: Electrical load dataset for energy forecasting applications at public universities in Bangladesh","authors":"Utshob Sutradhar ,&nbsp;Priyankar Biswas ,&nbsp;Sumon Hossain ,&nbsp;A.T.M. Saiful Islam ,&nbsp;Shuvo Dev","doi":"10.1016/j.dib.2025.112444","DOIUrl":"10.1016/j.dib.2025.112444","url":null,"abstract":"<div><div>This paper presents a dataset on electrical power collected from a university campus in Bangladesh. It is meant to help research on energy forecasting in university settings. The dataset has hourly measurements of system voltage, three-phase currents (R, Y, B), and power factor (pf). These were recorded at the campus substation. Data were collected during different operational conditions, including academic periods and vacations. This provides insights into load behaviour, changes in power factor, and phase imbalance patterns in an educational setting. The dataset supports the creation and assessment of models for load forecasting, anomaly detection, and improving power efficiency. It was also combined with weather data to aid research on load forecasting that takes weather into account. The weather parameters include temperature, humidity, precipitation, wind speed, and solar radiation. All weather values match energy values and were gathered hourly and daily. This dataset is especially useful for researchers studying how artificial intelligence and machine learning can be applied in managing electrical energy. The dataset also includes notes about context, such as reduced load during national holidays. This improves its usefulness for studies that focus on events in forecasting. By making this dataset open access, it helps fill the gap in publicly available electrical load data from educational institutions in developing countries. This supports reproducible research and sustainable energy management on campus.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"65 ","pages":"Article 112444"},"PeriodicalIF":1.4,"publicationDate":"2026-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145976668","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A georeferenced dataset of archaeobotanical findings of Olea europaea and Vitis vinifera compiled from published records from Central Italy 根据意大利中部出版的记录汇编的欧洲油橄榄和葡萄的考古植物学发现的地理参考数据集
IF 1.4 Q3 MULTIDISCIPLINARY SCIENCES Pub Date : 2026-01-07 DOI: 10.1016/j.dib.2025.112443
Claudia Moricca , Erasmo Di Fonso , Rachele Nicolini , Laura Sadori
Here we present a coherent, georeferenced and chronologically qualified corpus of fossil plant remains compiled from published archaeobotanical records from archaeological sites from Central Italy, focused on Olea europaea (olive) and Vitis vinifera (grape). The dataset is entirely based on secondary data and does not include newly generated primary archaeobotanical analyses. The dataset integrates site, context and all relevant archaeobotanical occurrences within a coherent relational and spatial model. The corpus was initiated through a structured bibliographic survey aided by the BRAIN database. Exclusively published literature was consulted, allowing to model archaeological sites and link them to excavation contexts and individual archaeobotanical occurrences (defined as the combination of a taxon and the specific plant part recovered, e.g., fruit, seed, rachis). The geodatabase was implemented using QGIS, with a local backend in GeoPackage, then migrated to PostgreSQL/PostGIS to support complex spatial/relational queries and future online outputs. All entities have a defined spatial placement accompanied by explicit quality-control parameters documenting positional uncertainty, source type and authority, as derived from the original published sources, ensuring transparent assessment of locational reliability. To enrich taxonomic information, an automated open thesaurus was built from CC BY/CC BY-SA resources (Floritaly, Acta Plantarum, and Wikimedia projects). The workflow employs REST-style access (or form-equivalent submissions), conservative rate-limiting, randomized waits, retries, and checkpoints; provenance and attribution (including noted transformations) are preserved. A standardized chronological table harmonizes relative cultural phases using ICCD nomenclature, with controlled fallbacks to Perio.do or peer-reviewed literature; a self-referential hierarchy (parent_id) ensures inheritance from sub-phase to broader period. Crucially, the use of open licenses, stable identifiers and cross-references makes the dataset interoperable and interlinked with the source ecosystems from which the secondary archaeobotanical data were extracted: records can resolve back to Floritaly and Acta Plantarum, and our forthcoming web portal can expose these connections for bidirectional navigation, automated updating and external reuse. The result is an interoperable, verifiable resource suitable for spatial and temporal analyses of plant remains based on aggregated and standardized published archaeobotanical data, while remaining legally reusable under the original licenses.
在这里,我们提出了一个连贯的、地理参考的、年代合格的植物化石语料库,这些语料库是从意大利中部考古遗址发表的考古植物学记录中汇编而来的,重点是橄榄(Olea europaea)和葡萄(vinifera)。该数据集完全基于二手数据,不包括新生成的原始考古植物学分析。该数据集将地点、背景和所有相关的考古植物事件整合在一个连贯的关系和空间模型中。语料库是在BRAIN数据库的帮助下通过结构化的书目调查发起的。我们参考了专门发表的文献,从而建立了考古遗址的模型,并将其与挖掘背景和个别考古植物事件(定义为分类群与恢复的特定植物部分的结合,例如果实、种子、轴)联系起来。地理数据库是使用QGIS实现的,在GeoPackage中有一个本地后端,然后迁移到PostgreSQL/PostGIS以支持复杂的空间/关系查询和未来的在线输出。所有实体都有明确的空间位置,并附有明确的质量控制参数,记录位置不确定性、来源类型和权威,这些参数来自原始公布的来源,确保对位置可靠性的透明评估。为了丰富分类信息,从CC BY/CC BY- sa资源(Floritaly、Acta Plantarum和Wikimedia项目)构建了一个自动开放的词库。工作流采用rest风格的访问(或表单等效提交)、保守的速率限制、随机等待、重试和检查点;保存了出处和归属(包括注意到的转换)。标准化的时间顺序表使用ICCD命名法协调相对的文化阶段,并有控制地回退到佩里奥。或同行评议的文献;自引用的层次结构(parent_id)确保从子阶段继承到更广泛的时期。至关重要的是,开放许可、稳定标识符和交叉引用的使用使数据集可互操作,并与提取次要考古植物数据的源生态系统相互关联:记录可以解析回Floritaly和Acta Plantarum,我们即将推出的门户网站可以公开这些连接,以进行双向导航、自动更新和外部重用。其结果是一个可互操作的、可验证的资源,适用于基于聚合和标准化出版的考古数据的植物遗骸的空间和时间分析,同时在原始许可下仍然合法可重复使用。
{"title":"A georeferenced dataset of archaeobotanical findings of Olea europaea and Vitis vinifera compiled from published records from Central Italy","authors":"Claudia Moricca ,&nbsp;Erasmo Di Fonso ,&nbsp;Rachele Nicolini ,&nbsp;Laura Sadori","doi":"10.1016/j.dib.2025.112443","DOIUrl":"10.1016/j.dib.2025.112443","url":null,"abstract":"<div><div>Here we present a coherent, georeferenced and chronologically qualified corpus of fossil plant remains compiled from published archaeobotanical records from archaeological sites from Central Italy, focused on <em>Olea europaea</em> (olive) and <em>Vitis vinifera</em> (grape). The dataset is entirely based on secondary data and does not include newly generated primary archaeobotanical analyses. The dataset integrates site, context and all relevant archaeobotanical occurrences within a coherent relational and spatial model. The corpus was initiated through a structured bibliographic survey aided by the BRAIN database. Exclusively published literature was consulted, allowing to model archaeological sites and link them to excavation contexts and individual archaeobotanical occurrences (defined as the combination of a taxon and the specific plant part recovered, e.g., fruit, seed, rachis). The geodatabase was implemented using QGIS, with a local backend in GeoPackage, then migrated to PostgreSQL/PostGIS to support complex spatial/relational queries and future online outputs. All entities have a defined spatial placement accompanied by explicit quality-control parameters documenting positional uncertainty, source type and authority, as derived from the original published sources, ensuring transparent assessment of locational reliability. To enrich taxonomic information, an automated open thesaurus was built from CC BY/CC BY-SA resources (Floritaly, Acta Plantarum, and Wikimedia projects). The workflow employs REST-style access (or form-equivalent submissions), conservative rate-limiting, randomized waits, retries, and checkpoints; provenance and attribution (including noted transformations) are preserved. A standardized chronological table harmonizes relative cultural phases using ICCD nomenclature, with controlled fallbacks to Perio.do or peer-reviewed literature; a self-referential hierarchy (parent_id) ensures inheritance from sub-phase to broader period. Crucially, the use of open licenses, stable identifiers and cross-references makes the dataset interoperable and interlinked with the source ecosystems from which the secondary archaeobotanical data were extracted: records can resolve back to Floritaly and <em>Acta Plantarum</em>, and our forthcoming web portal can expose these connections for bidirectional navigation, automated updating and external reuse. The result is an interoperable, verifiable resource suitable for spatial and temporal analyses of plant remains based on aggregated and standardized published archaeobotanical data, while remaining legally reusable under the original licenses.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"64 ","pages":"Article 112443"},"PeriodicalIF":1.4,"publicationDate":"2026-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145973353","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Transcriptomic dataset of RAW264.7 murine macrophages pretreated with 9-methoxycanthin-6-one under poly(I:C)-TLR3 stimulation poly(I:C)-TLR3刺激下9-甲氧基canthin-6-one预处理RAW264.7小鼠巨噬细胞的转录组学数据集
IF 1.4 Q3 MULTIDISCIPLINARY SCIENCES Pub Date : 2026-01-07 DOI: 10.1016/j.dib.2025.112445
Trang Thu Tran , Huyen Minh Thi Ta , Duc Hoang Le , Duong Huy Nguyen , Nam Trung Nguyen
This dataset presents RNA sequencing (RNA-seq) data from RAW264.7 murine macrophages pretreated with 9-methoxycanthin-6-one, a canthin-6-one–type alkaloid isolated from Eurycoma longifolia Jack, and subsequently stimulated with polyinosinic:polycytidylic acid [poly(I:C)], a synthetic double-stranded RNA analog that activates TLR3-mediated antiviral signaling. RAW264.7 cells were pretreated with 9-methoxycanthin-6-one (30 µM) for 30 min and then exposed to poly(I:C) (20 µg/mL) for 6 h. Total RNA was extracted, quality-checked, and sequenced on the Illumina platform to generate paired-end reads. Differential expression analysis and functional annotation were performed to profile genes responsive to 9-methoxycanthin-6-one treatment under poly(I:C) stimulation. The dataset includes normalized expression matrices, lists of upregulated and downregulated genes, and pathway enrichment outputs in standard formats. These data provide a reference resource for understanding the transcriptomic responses of macrophages to natural alkaloid treatment during viral-mimetic immune activation. The dataset can be reused to compare host antiviral transcriptional responses across TLR3-related pathways, evaluate macrophage activation markers, or integrate with other E. longifolia bioactive compounds.
该数据集展示了RAW264.7小鼠巨噬细胞的RNA测序(RNA-seq)数据,这些巨噬细胞用9-甲氧基canthin-6- 1(一种从长叶Eurycoma longifolia Jack中分离的canthin-6- 1型生物碱)预处理,随后用多肌苷:多胞酸[poly(I:C)]刺激,多肌苷:多胞酸是一种合成的双链RNA类似物,可激活tlr3介导的抗病毒信号。RAW264.7细胞用9-甲氧基cantin -6-one(30µM)预处理30 min,然后暴露于poly(I:C)(20µg/mL)中6 h。提取总RNA,进行质量检查,并在Illumina平台上测序,生成对端reads。在poly(I:C)刺激下,对9-甲氧基canthin-6-one响应的基因进行了差异表达分析和功能注释。该数据集包括标准化表达矩阵,上调和下调基因列表,以及标准格式的途径富集输出。这些数据为了解巨噬细胞在模拟病毒免疫激活过程中对天然生物碱处理的转录组反应提供了参考资源。该数据集可用于比较tlr3相关途径的宿主抗病毒转录反应,评估巨噬细胞激活标记物,或与其他长叶莲子生物活性化合物整合。
{"title":"Transcriptomic dataset of RAW264.7 murine macrophages pretreated with 9-methoxycanthin-6-one under poly(I:C)-TLR3 stimulation","authors":"Trang Thu Tran ,&nbsp;Huyen Minh Thi Ta ,&nbsp;Duc Hoang Le ,&nbsp;Duong Huy Nguyen ,&nbsp;Nam Trung Nguyen","doi":"10.1016/j.dib.2025.112445","DOIUrl":"10.1016/j.dib.2025.112445","url":null,"abstract":"<div><div>This dataset presents RNA sequencing (RNA-seq) data from RAW264.7 murine macrophages pretreated with 9-methoxycanthin-6-one, a canthin-6-one–type alkaloid isolated from <em>Eurycoma longifolia</em> Jack, and subsequently stimulated with polyinosinic:polycytidylic acid [poly(I:C)], a synthetic double-stranded RNA analog that activates TLR3-mediated antiviral signaling. RAW264.7 cells were pretreated with 9-methoxycanthin-6-one (30 µM) for 30 min and then exposed to poly(I:C) (20 µg/mL) for 6 h. Total RNA was extracted, quality-checked, and sequenced on the Illumina platform to generate paired-end reads. Differential expression analysis and functional annotation were performed to profile genes responsive to 9-methoxycanthin-6-one treatment under poly(I:C) stimulation. The dataset includes normalized expression matrices, lists of upregulated and downregulated genes, and pathway enrichment outputs in standard formats. These data provide a reference resource for understanding the transcriptomic responses of macrophages to natural alkaloid treatment during viral-mimetic immune activation. The dataset can be reused to compare host antiviral transcriptional responses across TLR3-related pathways, evaluate macrophage activation markers, or integrate with other <em>E. longifolia</em> bioactive compounds.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"65 ","pages":"Article 112445"},"PeriodicalIF":1.4,"publicationDate":"2026-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145976665","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Creep reference data of single-crystal Ni-based superalloy CMSX-6 单晶镍基高温合金CMSX-6蠕变参考数据
IF 1.4 Q3 MULTIDISCIPLINARY SCIENCES Pub Date : 2026-01-07 DOI: 10.1016/j.dib.2025.112436
Luis Ávila Calderón , Sina Schriever , Ying Han , Jürgen Olbricht , Pedro Dolabella Portella , Birgit Skrotzki
The article presents creep data for the single-crystal, [001]-oriented nickel-based superalloy CMSX-6, tested at a temperature of 980 °C under initial stresses ranging from 140 MPa to 230 MPa. The constant-load creep experiments were performed in accordance with DIN EN ISO 204:2019–4 standard within an ISO 17025 accredited laboratory. A total of 12 datasets are included, each of which includes the percentage creep extension as a function of time. The data series and associated metadata were systematically documented using a data schema specifically developed for creep data of single-crystal Ni-based superalloys. This dataset serves multiple purposes: it can be used to compare with one's own creep test results on similar materials, to verify testing setups (e.g., by replicating tests on the same or comparable materials), to calibrate and validate creep models, and to support alloy development efforts.
本文介绍了单晶,[001]取向镍基高温合金CMSX-6的蠕变数据,测试温度为980°C,初始应力范围为140 MPa至230 MPa。恒载蠕变实验在ISO 17025认可的实验室中按照DIN EN ISO 204:2019-4标准进行。总共包括12个数据集,每个数据集都包含蠕变扩展百分比作为时间的函数。使用专门为单晶镍基高温合金蠕变数据开发的数据模式,系统地记录了数据系列和相关元数据。该数据集具有多种用途:它可用于与类似材料的蠕变测试结果进行比较,验证测试设置(例如,通过在相同或可比材料上重复测试),校准和验证蠕变模型,并支持合金开发工作。
{"title":"Creep reference data of single-crystal Ni-based superalloy CMSX-6","authors":"Luis Ávila Calderón ,&nbsp;Sina Schriever ,&nbsp;Ying Han ,&nbsp;Jürgen Olbricht ,&nbsp;Pedro Dolabella Portella ,&nbsp;Birgit Skrotzki","doi":"10.1016/j.dib.2025.112436","DOIUrl":"10.1016/j.dib.2025.112436","url":null,"abstract":"<div><div>The article presents creep data for the single-crystal, [001]-oriented nickel-based superalloy CMSX-6, tested at a temperature of 980 °C under initial stresses ranging from 140 MPa to 230 MPa. The constant-load creep experiments were performed in accordance with DIN EN ISO 204:2019–4 standard within an ISO 17025 accredited laboratory. A total of 12 datasets are included, each of which includes the percentage creep extension as a function of time. The data series and associated metadata were systematically documented using a data schema specifically developed for creep data of single-crystal Ni-based superalloys. This dataset serves multiple purposes: it can be used to compare with one's own creep test results on similar materials, to verify testing setups (e.g., by replicating tests on the same or comparable materials), to calibrate and validate creep models, and to support alloy development efforts.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"65 ","pages":"Article 112436"},"PeriodicalIF":1.4,"publicationDate":"2026-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146036500","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ATDD: Multi-lingual dataset for auto-tune detection in music recordings ATDD:多语言数据集,用于音乐录音中的自动调音检测
IF 1.4 Q3 MULTIDISCIPLINARY SCIENCES Pub Date : 2026-01-07 DOI: 10.1016/j.dib.2025.112446
Mahyar Gohari , Paolo Bestagini , Sergio Benini , Nicola Adami
This study introduces a novel multilingual dataset designed to distinguish auto-tuned musical compositions from authentic recordings, addressing a significant gap in existing resources. The dataset encompasses songs in English, Mandarin, and Japanese, ensuring a diverse representation of linguistic contexts. The data collection process began with aggregating diverse datasets from the Music Information Retrieval domain, incorporating tracks from the three specified languages to capture a wide range of musical styles and recording qualities. Each audio file was subsequently standardized into 10-second intervals with the sample rate of 16 kHz to facilitate manageable analysis. For the creation of auto-tuned samples, pitch correction was implemented using the probabilistic YIN (PYIN) algorithm for accurate pitch detection, followed by transposition via the pitch-synchronized overlap and add (PSOLA) technique. To emulate realistic auto-tuning scenarios, pitch correction was randomly applied to portions of each 10-second segment, ensuring variability and realism in the dataset, which makes it suitable for training robust detection models. Additionally, time-domain labels indicating the exact locations of pitch correction within each segment were generated, providing precise annotations crucial for developing accurate detection algorithms. The resulting multilingual dataset comprises a comprehensive collection of both auto-tuned and authentic musical segments across English, Mandarin, and Japanese languages, each annotated with detailed information about pitch correction applications. This rich annotation allows for nuanced analysis and supports various research applications, while the dataset's structure and thorough documentation of its creation process make it a valuable resource for researchers in music analysis, machine learning, and audio signal processing.
本研究介绍了一种新的多语言数据集,旨在区分自动调谐的音乐作品和真实的录音,解决了现有资源的重大差距。该数据集包含英语、普通话和日语歌曲,确保语言上下文的多样化表示。数据收集过程从汇总来自音乐信息检索领域的不同数据集开始,结合来自三种指定语言的曲目,以捕获广泛的音乐风格和录音质量。每个音频文件随后被标准化为10秒间隔,采样率为16 kHz,以方便管理分析。对于自动调谐样本的创建,使用概率YIN (PYIN)算法实现音调校正以进行准确的音调检测,然后通过音调同步重叠和添加(PSOLA)技术进行换位。为了模拟真实的自动调谐场景,每个10秒片段的部分随机应用音调校正,确保数据集的可变性和真实感,这使得它适合训练鲁棒检测模型。此外,还生成了时域标签,指示每个段内音高校正的确切位置,为开发准确的检测算法提供了精确的注释。由此产生的多语言数据集包括英语、普通话和日语的自动调谐和真实音乐片段的综合集合,每个片段都附有有关音高校正应用程序的详细信息。这种丰富的注释允许细致入微的分析,并支持各种研究应用程序,而数据集的结构和其创建过程的彻底文档使其成为音乐分析,机器学习和音频信号处理研究人员的宝贵资源。
{"title":"ATDD: Multi-lingual dataset for auto-tune detection in music recordings","authors":"Mahyar Gohari ,&nbsp;Paolo Bestagini ,&nbsp;Sergio Benini ,&nbsp;Nicola Adami","doi":"10.1016/j.dib.2025.112446","DOIUrl":"10.1016/j.dib.2025.112446","url":null,"abstract":"<div><div>This study introduces a novel multilingual dataset designed to distinguish auto-tuned musical compositions from authentic recordings, addressing a significant gap in existing resources. The dataset encompasses songs in English, Mandarin, and Japanese, ensuring a diverse representation of linguistic contexts. The data collection process began with aggregating diverse datasets from the Music Information Retrieval domain, incorporating tracks from the three specified languages to capture a wide range of musical styles and recording qualities. Each audio file was subsequently standardized into 10-second intervals with the sample rate of 16 kHz to facilitate manageable analysis. For the creation of auto-tuned samples, pitch correction was implemented using the probabilistic YIN (PYIN) algorithm for accurate pitch detection, followed by transposition via the pitch-synchronized overlap and add (PSOLA) technique. To emulate realistic auto-tuning scenarios, pitch correction was randomly applied to portions of each 10-second segment, ensuring variability and realism in the dataset, which makes it suitable for training robust detection models. Additionally, time-domain labels indicating the exact locations of pitch correction within each segment were generated, providing precise annotations crucial for developing accurate detection algorithms. The resulting multilingual dataset comprises a comprehensive collection of both auto-tuned and authentic musical segments across English, Mandarin, and Japanese languages, each annotated with detailed information about pitch correction applications. This rich annotation allows for nuanced analysis and supports various research applications, while the dataset's structure and thorough documentation of its creation process make it a valuable resource for researchers in music analysis, machine learning, and audio signal processing.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"65 ","pages":"Article 112446"},"PeriodicalIF":1.4,"publicationDate":"2026-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146036168","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Data in Brief
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1