首页 > 最新文献

Data in Brief最新文献

英文 中文
A human fecal metaproteomic dataset from celiac disease patients on gluten-free diet with or without poly-autoimmunity 来自无麸质饮食的乳糜泻患者的粪便蛋白质组学数据集,有或没有多重自身免疫
IF 1.4 Q3 MULTIDISCIPLINARY SCIENCES Pub Date : 2026-04-01 Epub Date: 2026-01-22 DOI: 10.1016/j.dib.2026.112501
Marcello Abbondio , Alessandro Tanca , Rosangela Sau , Giovanna Pira , Alessandra Errigo , Roberto Manetti , Giovanni Mario Pes , Stefano Bibbò , Maria Pina Dore , Sergio Uzzau
This dataset provides the fecal metaproteome profiles of 28 celiac disease patients on a gluten-free diet, distinguished by the presence or absence of co-occurring autoimmune conditions. The resource includes raw liquid chromatography-tandem mass spectrometry (LC-MS/MS) files, database search results, protein/peptide identification outputs, and taxonomic/functional annotation outputs, along with comprehensive anthropometric, clinical, and dietary metadata for each patient. The identified proteins originate from microbial, human, and plant sources, consistent with the multi-database search strategy used. This collection is designed for reuse in meta-analyses and integrative studies exploring functional changes in the gut microbiome related to auto-immune status and dietary variables. The complete dataset is available via the ProteomeXchange Consortium with the identifier PXD069517.
该数据集提供了28例无麸质饮食的乳糜泻患者的粪便元蛋白质组谱,通过存在或不存在共同发生的自身免疫性疾病来区分。该资源包括原始的液相色谱-串联质谱(LC-MS/MS)文件、数据库搜索结果、蛋白质/肽鉴定输出和分类/功能注释输出,以及每个患者的综合人体测量学、临床和饮食元数据。鉴定的蛋白质来源于微生物、人类和植物,与使用的多数据库搜索策略一致。该收集旨在用于荟萃分析和综合研究,探索与自身免疫状态和饮食变量相关的肠道微生物组的功能变化。完整的数据集可通过ProteomeXchange Consortium获得,标识符为PXD069517。
{"title":"A human fecal metaproteomic dataset from celiac disease patients on gluten-free diet with or without poly-autoimmunity","authors":"Marcello Abbondio ,&nbsp;Alessandro Tanca ,&nbsp;Rosangela Sau ,&nbsp;Giovanna Pira ,&nbsp;Alessandra Errigo ,&nbsp;Roberto Manetti ,&nbsp;Giovanni Mario Pes ,&nbsp;Stefano Bibbò ,&nbsp;Maria Pina Dore ,&nbsp;Sergio Uzzau","doi":"10.1016/j.dib.2026.112501","DOIUrl":"10.1016/j.dib.2026.112501","url":null,"abstract":"<div><div>This dataset provides the fecal metaproteome profiles of 28 celiac disease patients on a gluten-free diet, distinguished by the presence or absence of co-occurring autoimmune conditions. The resource includes raw liquid chromatography-tandem mass spectrometry (LC-MS/MS) files, database search results, protein/peptide identification outputs, and taxonomic/functional annotation outputs, along with comprehensive anthropometric, clinical, and dietary metadata for each patient. The identified proteins originate from microbial, human, and plant sources, consistent with the multi-database search strategy used. This collection is designed for reuse in meta-analyses and integrative studies exploring functional changes in the gut microbiome related to auto-immune status and dietary variables. The complete dataset is available via the ProteomeXchange Consortium with the identifier PXD069517.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"65 ","pages":"Article 112501"},"PeriodicalIF":1.4,"publicationDate":"2026-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146075123","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
VisioDECT: A robust dataset for aerial and scenario based multi-drone detection, identification, and neutralization VisioDECT:一个强大的数据集,用于空中和基于场景的多无人机检测、识别和中和
IF 1.4 Q3 MULTIDISCIPLINARY SCIENCES Pub Date : 2026-04-01 Epub Date: 2026-01-08 DOI: 10.1016/j.dib.2026.112448
Simeon Okechukwu Ajakwe , Vivian Ukamaka Ihekoronye , Golam Mohtasin , Rubina Akter , Jae Min Lee , Dong Seong Kim
The rapid proliferation of unmanned aerial vehicles (UAVs) for logistics, surveillance, and civilian applications continues to pose significant challenges to airspace security, particularly through unauthorized or malicious deployments. Existing UAV datasets are limited in scope, often focusing on single-drone scenarios, synthetic imagery, or restricted environmental conditions, thereby constraining the development of robust counter-UAV systems. To bridge these gaps, we present vision-based drone detection dataset named as VisioDECT, a comprehensive and scenario-rich dataset for multi-drone detection, identification, and neutralization. The dataset comprises 20,924 annotated images and labels from six UAV models (Anafi-Extended, DJI FPV, DJI Phantom, EFT-E410S, Mavic Air 2, and Mavic 2 Enterprise), captured across three distinct scenarios (sunny, cloudy, and evening) at varying altitudes (30–100 m) and distances. Importantly, all UAVs included in this dataset are rotary-wing (multirotor) platforms, which dominate low-altitude airspace and are the most commonly encountered in real-world surveillance and counter-UAV scenarios. Data were collected over 20 months from more than 12 locations in South Korea, ensuring diversity in illumination, weather, and background complexity. Each sample is provided in three standard formats (.txt, .xml, .csv), with detailed metadata and quality-verified annotations for detection and classification tasks. Illustrative benchmark evaluations using state-of-the-art detection models (e.g., DRONET, YOLO variants) are included solely to validate the quality and practical usability of the dataset for real-time drone defense research. VisioDECT provides a standardized, reproducible, and scalable resource that enables benchmarking, model training, and evaluation for airspace surveillance, UAV traffic management, and national security applications.
用于后勤、监视和民用应用的无人机(uav)的快速扩散继续对空域安全构成重大挑战,特别是通过未经授权或恶意部署。现有的无人机数据集在范围上是有限的,通常聚焦于单无人机场景、合成图像或受限的环境条件,从而限制了鲁棒反无人机系统的发展。为了弥补这些差距,我们提出了基于视觉的无人机检测数据集VisioDECT,这是一个全面的、场景丰富的多无人机检测、识别和中和数据集。该数据集包括来自六种无人机型号(Anafi-Extended, DJI FPV, DJI Phantom, EFT-E410S, Mavic Air 2和Mavic 2 Enterprise)的20,924张注释图像和标签,在不同高度(30-100米)和距离的三种不同场景(晴天,多云和傍晚)中捕获。重要的是,该数据集中包含的所有无人机都是旋翼(多旋翼)平台,它们主导着低空空域,在现实世界的监视和反无人机场景中最常见。数据在20个月内从韩国超过12个地点收集,确保了照明、天气和背景复杂性的多样性。每个示例都以三种标准格式(.txt、.xml和.csv)提供,并提供详细的元数据和用于检测和分类任务的经过质量验证的注释。使用最先进的检测模型(例如,DRONET, YOLO变体)进行说明性基准评估,仅用于验证实时无人机防御研究数据集的质量和实际可用性。VisioDECT提供了标准化、可复制和可扩展的资源,可以为空域监视、无人机交通管理和国家安全应用提供基准测试、模型训练和评估。
{"title":"VisioDECT: A robust dataset for aerial and scenario based multi-drone detection, identification, and neutralization","authors":"Simeon Okechukwu Ajakwe ,&nbsp;Vivian Ukamaka Ihekoronye ,&nbsp;Golam Mohtasin ,&nbsp;Rubina Akter ,&nbsp;Jae Min Lee ,&nbsp;Dong Seong Kim","doi":"10.1016/j.dib.2026.112448","DOIUrl":"10.1016/j.dib.2026.112448","url":null,"abstract":"<div><div>The rapid proliferation of unmanned aerial vehicles (UAVs) for logistics, surveillance, and civilian applications continues to pose significant challenges to airspace security, particularly through unauthorized or malicious deployments. Existing UAV datasets are limited in scope, often focusing on single-drone scenarios, synthetic imagery, or restricted environmental conditions, thereby constraining the development of robust counter-UAV systems. To bridge these gaps, we present vision-based drone detection dataset named as <strong>VisioDECT</strong>, a comprehensive and scenario-rich dataset for multi-drone detection, identification, and neutralization. The dataset comprises 20,924 annotated images and labels from six UAV models (Anafi-Extended, DJI FPV, DJI Phantom, EFT-E410S, Mavic Air 2, and Mavic 2 Enterprise), captured across three distinct scenarios (sunny, cloudy, and evening) at varying altitudes (30–100 m) and distances. Importantly, all UAVs included in this dataset are rotary-wing (multirotor) platforms, which dominate low-altitude airspace and are the most commonly encountered in real-world surveillance and counter-UAV scenarios. Data were collected over 20 months from more than 12 locations in South Korea, ensuring diversity in illumination, weather, and background complexity. Each sample is provided in three standard formats (.txt, .xml, .csv), with detailed metadata and quality-verified annotations for detection and classification tasks. Illustrative benchmark evaluations using state-of-the-art detection models (e.g., DRONET, YOLO variants) are included solely to validate the quality and practical usability of the dataset for real-time drone defense research. VisioDECT provides a standardized, reproducible, and scalable resource that enables benchmarking, model training, and evaluation for airspace surveillance, UAV traffic management, and national security applications.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"65 ","pages":"Article 112448"},"PeriodicalIF":1.4,"publicationDate":"2026-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146075114","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Guarding against malicious biased threats (GAMBiT) datasets: Revealing cognitive bias in human-subjects red-team cyber range operations 防范恶意偏见威胁(GAMBiT)数据集:揭示人类受试者红队网络范围行动中的认知偏见
IF 1.4 Q3 MULTIDISCIPLINARY SCIENCES Pub Date : 2026-04-01 Epub Date: 2026-01-18 DOI: 10.1016/j.dib.2026.112476
Brandon Beltz , Jim Doty , Yvonne Fonken , Nikolos Gurney , Brett Israelsen , Nathan Lau , Stacy Marsella , Rachelle Thomas , Stoney Trent , Peggy Wu , Ya-Ting Yang , Quanyan Zhu
We present datasets from three large-scale human-subject experiments involving red-team hacking in a cyber range in the Guarding Against Malicious Biased Threats (GAMBiT) project. Across Experiments 1-3 (July 2024-March 2025), 19-20 skilled attackers per experiment conducted two 8-hour days of self-paced operations in a simulated enterprise network (SimSpace Cyber Force Platform) while collecting multi-modal data: self-reports (background, demographics, psychometrics), operational notes, terminal histories, key logs, network packet captures (PCAP), and NIDS alerts (Suricata). Each participant began from a standardized Kali Linux VM and pursued realistic objectives (e.g., target discovery and data exfiltration) under controlled constraints. Derivative curated logs and labels are included. The combined data release supports research on attacker behavior modeling, bias-aware analytics, and method benchmarking. Data are available via IEEE DataPort entries for Experiments 1-3.
我们展示了在防范恶意偏见威胁(GAMBiT)项目中涉及红队黑客在网络范围内的三个大规模人类受试者实验的数据集。在实验1-3期间(2024年7月至2025年3月),每个实验有19-20名熟练的攻击者在模拟企业网络(SimSpace Cyber Force Platform)中进行了两个8小时的自定义操作,同时收集了多模态数据:自我报告(背景,人口统计,心理测量),操作笔记,终端历史,关键日志,网络数据包捕获(PCAP)和NIDS警报(Suricata)。每个参与者都从一个标准化的Kali Linux VM开始,在受控的约束下追求现实的目标(例如,目标发现和数据泄露)。衍生策划日志和标签包括在内。合并的数据发布支持攻击者行为建模、偏见感知分析和方法基准测试的研究。实验1-3的数据可通过IEEE数据端口条目获得。
{"title":"Guarding against malicious biased threats (GAMBiT) datasets: Revealing cognitive bias in human-subjects red-team cyber range operations","authors":"Brandon Beltz ,&nbsp;Jim Doty ,&nbsp;Yvonne Fonken ,&nbsp;Nikolos Gurney ,&nbsp;Brett Israelsen ,&nbsp;Nathan Lau ,&nbsp;Stacy Marsella ,&nbsp;Rachelle Thomas ,&nbsp;Stoney Trent ,&nbsp;Peggy Wu ,&nbsp;Ya-Ting Yang ,&nbsp;Quanyan Zhu","doi":"10.1016/j.dib.2026.112476","DOIUrl":"10.1016/j.dib.2026.112476","url":null,"abstract":"<div><div>We present datasets from three large-scale human-subject experiments involving red-team hacking in a cyber range in the Guarding Against Malicious Biased Threats (GAMBiT) project. Across Experiments 1-3 (July 2024-March 2025), 19-20 skilled attackers per experiment conducted two 8-hour days of self-paced operations in a simulated enterprise network (SimSpace Cyber Force Platform) while collecting multi-modal data: self-reports (background, demographics, psychometrics), operational notes, terminal histories, key logs, network packet captures (PCAP), and NIDS alerts (Suricata). Each participant began from a standardized Kali Linux VM and pursued realistic objectives (e.g., target discovery and data exfiltration) under controlled constraints. Derivative curated logs and labels are included. The combined data release supports research on attacker behavior modeling, bias-aware analytics, and method benchmarking. Data are available via IEEE DataPort entries for Experiments 1-3.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"65 ","pages":"Article 112476"},"PeriodicalIF":1.4,"publicationDate":"2026-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146075157","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Monitoring ecosystem functions in mountain catchments of chilean patagonia: A cluster-based dataset 智利巴塔哥尼亚山区集水区生态系统功能监测:基于集群的数据集
IF 1.4 Q3 MULTIDISCIPLINARY SCIENCES Pub Date : 2026-04-01 Epub Date: 2026-01-21 DOI: 10.1016/j.dib.2026.112481
Paulo Moreno-Meynard
This dataset documents the spatially explicit quantification of multiple ecosystem functions across 12 mountain headwater catchments in the Aysén Region of Chilean Patagonia. Designed to capture landscape variability, the observational framework employs a paired-catchment approach, comparing basins with different degrees of anthropogenic disturbance across two forest types: deciduous and evergreen. Each catchment is treated as an integrated landscape unit, with cluster-based field measurements capturing fine-scale variation in vegetation structure, biomass, soil conditions, and species richness.
The field inventory integrates and adapts methodologies from several national and international forest monitoring frameworks. Its core structure is based on Chile’s Continuous National Forest Inventory, but also incorporates sampling concepts and measurement protocols inspired by the Swiss National Forest Inventory (LFI), the U.S. Forest Inventory and Analysis (FIA) program, and long-term ecological monitoring plots used in New Zealand. This hybrid design ensures multidimensional assessment of ecosystem functions while enhancing cross-regional comparability.
The sampling design addresses ecosystem functions across four service categories: provisioning (sawlog and firewood volume), regulating (carbon stocks in trees, shrubs, and deadwood, and decadal sequestration rates), supporting (soil formation and erosion proxies, plus nutrient concentrations), and biodiversity maintenance (vascular plant and epiphyte).
This dataset supports ecological synthesis, spatial modeling, and integration into broader assessments of ecosystem services and land-use impacts under changing environmental conditions.
该数据集记录了智利巴塔哥尼亚ayssamn地区12个山区水源集水区多种生态系统功能的空间明确量化。为了捕捉景观变化,该观测框架采用了配对集水区方法,比较了两种森林类型(落叶森林和常绿森林)中不同程度人为干扰的流域。每个集水区都被视为一个完整的景观单元,通过基于集群的实地测量捕获植被结构、生物量、土壤条件和物种丰富度的细微变化。实地清查综合并调整了若干国家和国际森林监测框架的方法。其核心结构以智利的连续国家森林清查为基础,但也结合了瑞士国家森林清查(LFI)、美国森林清查和分析(FIA)计划以及新西兰使用的长期生态监测地块所启发的抽样概念和测量方案。这种混合设计确保了生态系统功能的多维评估,同时增强了跨区域的可比性。采样设计涉及四个服务类别的生态系统功能:供应(锯材和木柴量),调节(树木、灌木和枯木的碳储量,以及年代际封存率),支持(土壤形成和侵蚀代理,以及养分浓度),以及维持生物多样性(维管植物和附生植物)。该数据集支持生态综合、空间建模,并整合到不断变化的环境条件下更广泛的生态系统服务和土地利用影响评估中。
{"title":"Monitoring ecosystem functions in mountain catchments of chilean patagonia: A cluster-based dataset","authors":"Paulo Moreno-Meynard","doi":"10.1016/j.dib.2026.112481","DOIUrl":"10.1016/j.dib.2026.112481","url":null,"abstract":"<div><div>This dataset documents the spatially explicit quantification of multiple ecosystem functions across 12 mountain headwater catchments in the Aysén Region of Chilean Patagonia. Designed to capture landscape variability, the observational framework employs a paired-catchment approach, comparing basins with different degrees of anthropogenic disturbance across two forest types: deciduous and evergreen. Each catchment is treated as an integrated landscape unit, with cluster-based field measurements capturing fine-scale variation in vegetation structure, biomass, soil conditions, and species richness.</div><div>The field inventory integrates and adapts methodologies from several national and international forest monitoring frameworks. Its core structure is based on Chile’s Continuous National Forest Inventory, but also incorporates sampling concepts and measurement protocols inspired by the Swiss National Forest Inventory (LFI), the U.S. Forest Inventory and Analysis (FIA) program, and long-term ecological monitoring plots used in New Zealand. This hybrid design ensures multidimensional assessment of ecosystem functions while enhancing cross-regional comparability.</div><div>The sampling design addresses ecosystem functions across four service categories: provisioning (sawlog and firewood volume), regulating (carbon stocks in trees, shrubs, and deadwood, and decadal sequestration rates), supporting (soil formation and erosion proxies, plus nutrient concentrations), and biodiversity maintenance (vascular plant and epiphyte).</div><div>This dataset supports ecological synthesis, spatial modeling, and integration into broader assessments of ecosystem services and land-use impacts under changing environmental conditions.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"65 ","pages":"Article 112481"},"PeriodicalIF":1.4,"publicationDate":"2026-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146075162","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A dataset of sugarcane crop yield, production environment, meteorological records, and satellite images of commercial fields in the northeast of São Paulo State, Brazil 巴西<s:1>圣保罗州东北部商业农田的甘蔗作物产量、生产环境、气象记录和卫星图像数据集
IF 1.4 Q3 MULTIDISCIPLINARY SCIENCES Pub Date : 2026-04-01 Epub Date: 2026-02-06 DOI: 10.1016/j.dib.2026.112549
Luiz Antonio Falaguasta Barbosa, Hernani Mazier Junior, Ivan Rizzo Guilherme, Daniel Carlos Guimarães Pedronette
Brazil is the world’s largest producer of sugarcane (Saccharum officinarum), accounting for approximately 40% of global production, with the state of São Paulo responsible for more than half of the national output due to its high level of mechanization. Despite its economic importance, publicly available datasets integrating information on sugarcane yield and production environment remain scarce.
This is the first freely available dataset comprising crop yield, meteorological, and production environment data with a large number of observations derived from multiple plots, harvest cycles, and time steps, and that identifies the exact locations of 12 commercial fields in the northeast of São Paulo State, Brazil. It is combined with images downloaded from the Sentinel-2 satellite, based on plot shapefiles, and with other meteorological data at the exact locations and during the same periods of sugarcane cultivation.
Crop yield and production environment data were shared by a sugar and alcohol plant operating in the region, collected at farms in the northeast of São Paulo State, Brazil, with measurements taken at the plot level across two plots per farm, across six farms. The data correspond to different numbers of harvests per plot. Between the plant and harvest dates, complementary data were generated by downloading Sentinel-2 RGB bands as single-band images and combining them into a single image. The exact process is applied using a meteorological dataset, selecting the closest meteorological station to obtain data for the same days between the plant and harvest dates.
Given the unavailability of integrated sugarcane datasets, this resource provides a valuable foundation for studies on crop yield prediction, analysis of production environments, and the development and evaluation of data-driven models in precision agriculture.
巴西是世界上最大的甘蔗(Saccharum officinarum)生产国,约占全球产量的40%,由于机械化水平高,圣保罗州的产量占全国产量的一半以上。尽管甘蔗具有重要的经济意义,但整合甘蔗产量和生产环境信息的公开数据集仍然很少。这是第一个免费提供的数据集,包括作物产量、气象和生产环境数据,以及从多个地块、收获周期和时间步长获得的大量观测数据,并确定了巴西圣保罗州东北部12个商业地块的确切位置。它结合了从哨兵2号卫星下载的基于地块形状文件的图像,以及在甘蔗种植的确切地点和同一时期的其他气象数据。在该地区运营的一家糖和酒精工厂共享了作物产量和生产环境数据,这些数据是从巴西圣保罗州东北部的农场收集的,并在六个农场的每个农场的两个地块上进行了地块水平的测量。这些数据对应于每块土地的不同收成数。在种植日期和收获日期之间,通过下载Sentinel-2 RGB波段作为单波段图像并将其合并为单个图像来生成补充数据。使用气象数据集应用精确的过程,选择最近的气象站来获取种植日期和收获日期之间同一天的数据。考虑到甘蔗综合数据集的缺乏,该资源为作物产量预测、生产环境分析以及数据驱动模型的开发和评估提供了有价值的基础。
{"title":"A dataset of sugarcane crop yield, production environment, meteorological records, and satellite images of commercial fields in the northeast of São Paulo State, Brazil","authors":"Luiz Antonio Falaguasta Barbosa,&nbsp;Hernani Mazier Junior,&nbsp;Ivan Rizzo Guilherme,&nbsp;Daniel Carlos Guimarães Pedronette","doi":"10.1016/j.dib.2026.112549","DOIUrl":"10.1016/j.dib.2026.112549","url":null,"abstract":"<div><div>Brazil is the world’s largest producer of sugarcane (<em>Saccharum officinarum</em>), accounting for approximately 40% of global production, with the state of São Paulo responsible for more than half of the national output due to its high level of mechanization. Despite its economic importance, publicly available datasets integrating information on sugarcane yield and production environment remain scarce.</div><div>This is the first freely available dataset comprising crop yield, meteorological, and production environment data with a large number of observations derived from multiple plots, harvest cycles, and time steps, and that identifies the exact locations of 12 commercial fields in the northeast of São Paulo State, Brazil. It is combined with images downloaded from the Sentinel-2 satellite, based on plot shapefiles, and with other meteorological data at the exact locations and during the same periods of sugarcane cultivation.</div><div>Crop yield and production environment data were shared by a sugar and alcohol plant operating in the region, collected at farms in the northeast of São Paulo State, Brazil, with measurements taken at the plot level across two plots per farm, across six farms. The data correspond to different numbers of harvests per plot. Between the plant and harvest dates, complementary data were generated by downloading Sentinel-2 RGB bands as single-band images and combining them into a single image. The exact process is applied using a meteorological dataset, selecting the closest meteorological station to obtain data for the same days between the plant and harvest dates.</div><div>Given the unavailability of integrated sugarcane datasets, this resource provides a valuable foundation for studies on crop yield prediction, analysis of production environments, and the development and evaluation of data-driven models in precision agriculture.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"65 ","pages":"Article 112549"},"PeriodicalIF":1.4,"publicationDate":"2026-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146184990","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Introducing OpenTextile-NIR: Near-infrared hyperspectral imaging and photography dataset for optical identification of textiles 介绍opentexile - nir:用于纺织品光学识别的近红外高光谱成像和摄影数据集
IF 1.4 Q3 MULTIDISCIPLINARY SCIENCES Pub Date : 2026-04-01 Epub Date: 2026-02-09 DOI: 10.1016/j.dib.2026.112559
Tuomas Sormunen , Ella Mahlamäki , Satu-Marja Mäkelä , Mikko Mäkelä
This dataset presents the first open-access collection of near-infrared hyperspectral imaging (NIR-HSI) data for the optical identification of textiles, with a focus on supporting research in sensor-based textile sorting and recycling. The dataset comprises hyperspectral images, RGB photographs, and detailed metadata, including fibre composition and colour, for 71 post-industrial textile samples, collected in Finland. Over 11 million spectra are included in the hyperspectral images, with more than 6 million annotated, providing a robust foundation for machine learning and data analysis. In addition, we provide a single representative NIR spectra and RGB value for each sample in order to accommodate classic spectroscopic analysis.
Used garments were sourced from a partner company specializing in end-of-life textile management, with ground truth information on fibre composition obtained from suppliers. Small pieces of each garment were measured using Specim SWIR 3 hyperspectral camera and photographed with high-resolution mobile phone camera (Samsung Galaxy A52). The dataset is organized into folders containing raw and processed data, including ENVI-format hyperspectral images, RGB images, as well as CSV files with mean spectra, mean RGB values, and sample metadata. An example Python script is provided to facilitate data access and processing.
Potential reuse scenarios include classification of textiles by material or colour, prediction of natural fibre content, image segmentation, algorithm development for spectral classification, and use as a reference spectral library. The dataset’s comprehensive structure and open availability address the limitations of previous research, which often relied on small or non-public datasets, and is intended to accelerate advances in optical identification technologies for textile recycling.
该数据集展示了第一个开放获取的近红外高光谱成像(NIR-HSI)数据集,用于纺织品的光学识别,重点是支持基于传感器的纺织品分类和回收研究。该数据集包括高光谱图像、RGB照片和详细的元数据,包括在芬兰收集的71个后工业纺织品样品的纤维成分和颜色。高光谱图像中包含超过1100万个光谱,其中超过600万个有注释,为机器学习和数据分析提供了坚实的基础。此外,我们为每个样品提供了一个具有代表性的近红外光谱和RGB值,以适应经典的光谱分析。二手服装是从一家专门从事报废纺织品管理的合作伙伴公司采购的,并从供应商那里获得了纤维成分的真实信息。使用specm SWIR 3高光谱相机测量每件衣服的小片,并使用高分辨率手机相机(三星Galaxy A52)拍摄。数据集被组织成包含原始和处理数据的文件夹,包括envi格式的高光谱图像、RGB图像以及具有平均光谱、平均RGB值和样本元数据的CSV文件。提供了一个示例Python脚本来促进数据访问和处理。潜在的再利用方案包括按材料或颜色对纺织品进行分类、预测天然纤维含量、图像分割、光谱分类算法开发以及用作参考光谱库。该数据集的全面结构和开放可用性解决了以往研究的局限性,这些研究通常依赖于小型或非公共数据集,旨在加速纺织品回收光学识别技术的进步。
{"title":"Introducing OpenTextile-NIR: Near-infrared hyperspectral imaging and photography dataset for optical identification of textiles","authors":"Tuomas Sormunen ,&nbsp;Ella Mahlamäki ,&nbsp;Satu-Marja Mäkelä ,&nbsp;Mikko Mäkelä","doi":"10.1016/j.dib.2026.112559","DOIUrl":"10.1016/j.dib.2026.112559","url":null,"abstract":"<div><div>This dataset presents the first open-access collection of near-infrared hyperspectral imaging (NIR-HSI) data for the optical identification of textiles, with a focus on supporting research in sensor-based textile sorting and recycling. The dataset comprises hyperspectral images, RGB photographs, and detailed metadata, including fibre composition and colour, for 71 post-industrial textile samples, collected in Finland. Over 11 million spectra are included in the hyperspectral images, with more than 6 million annotated, providing a robust foundation for machine learning and data analysis. In addition, we provide a single representative NIR spectra and RGB value for each sample in order to accommodate classic spectroscopic analysis.</div><div>Used garments were sourced from a partner company specializing in end-of-life textile management, with ground truth information on fibre composition obtained from suppliers. Small pieces of each garment were measured using Specim SWIR 3 hyperspectral camera and photographed with high-resolution mobile phone camera (Samsung Galaxy A52). The dataset is organized into folders containing raw and processed data, including ENVI-format hyperspectral images, RGB images, as well as CSV files with mean spectra, mean RGB values, and sample metadata. An example Python script is provided to facilitate data access and processing.</div><div>Potential reuse scenarios include classification of textiles by material or colour, prediction of natural fibre content, image segmentation, algorithm development for spectral classification, and use as a reference spectral library. The dataset’s comprehensive structure and open availability address the limitations of previous research, which often relied on small or non-public datasets, and is intended to accelerate advances in optical identification technologies for textile recycling.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"65 ","pages":"Article 112559"},"PeriodicalIF":1.4,"publicationDate":"2026-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146185074","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
RNA-seq data of healthy and fungal infected Tachypleus tridentatus embryos 健康和真菌感染的三叉鱼胚胎RNA-seq数据
IF 1.4 Q3 MULTIDISCIPLINARY SCIENCES Pub Date : 2026-04-01 Epub Date: 2026-01-27 DOI: 10.1016/j.dib.2026.112510
Yunfan Huang , Ying Qiao , Ruifang Chen , Lanfang Dong , Haijuan Liu , Theerakamol Pengsakul , Xiaowan Ma
The “living fossil” Tachypleus tridentatus holds significant medical and economic value but is currently experiencing a severe decline in germplasm resources. The extended incubation period of T. tridentatus eggs make them susceptible to invasion by pathogenic microorganisms, with fungal infections posing a major threat to embryonic development. However, the molecular immune mechanisms underlying embryonic immunity in T. tridentatus remain poorly understood. We collected T. tridentatus embryos at stages 18–20 that were naturally infected with Aspergillus candidus under aquaculture conditions, and conducted RNA sequencing to analyze transcriptomic response to the fungal infection.
“活化石”三叉戟鱼具有重要的医学和经济价值,但目前种质资源严重减少。三叉剑齿虎卵的孵育期较长,容易受到病原微生物的侵袭,真菌感染对胚胎发育构成重大威胁。然而,分子免疫机制背后的胚胎免疫三叉齿鼠仍然知之甚少。我们收集了在养殖条件下自然感染念珠曲霉的18-20期三叉鱼胚胎,并进行了RNA测序,分析了对真菌感染的转录组反应。
{"title":"RNA-seq data of healthy and fungal infected Tachypleus tridentatus embryos","authors":"Yunfan Huang ,&nbsp;Ying Qiao ,&nbsp;Ruifang Chen ,&nbsp;Lanfang Dong ,&nbsp;Haijuan Liu ,&nbsp;Theerakamol Pengsakul ,&nbsp;Xiaowan Ma","doi":"10.1016/j.dib.2026.112510","DOIUrl":"10.1016/j.dib.2026.112510","url":null,"abstract":"<div><div>The “living fossil” <em>Tachypleus tridentatus</em> holds significant medical and economic value but is currently experiencing a severe decline in germplasm resources. The extended incubation period of <em>T. tridentatus</em> eggs make them susceptible to invasion by pathogenic microorganisms, with fungal infections posing a major threat to embryonic development. However, the molecular immune mechanisms underlying embryonic immunity in <em>T. tridentatus</em> remain poorly understood. We collected <em>T. tridentatus</em> embryos at stages 18–20 that were naturally infected with <em>Aspergillus candidus</em> under aquaculture conditions, and conducted RNA sequencing to analyze transcriptomic response to the fungal infection.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"65 ","pages":"Article 112510"},"PeriodicalIF":1.4,"publicationDate":"2026-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146185145","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Metabarcoding data: Full-length 16S rRNA sequence of endophytic bacteria in the root of asymptomatic and blast-symptomatic rice plants (Oryza sativa, L.) 元条形码数据:无病和有病水稻(Oryza sativa, L.)根系内生细菌16S rRNA全长序列
IF 1.4 Q3 MULTIDISCIPLINARY SCIENCES Pub Date : 2026-04-01 Epub Date: 2026-01-27 DOI: 10.1016/j.dib.2026.112522
Yasir Sidiq , Triastuti Rahayu , Peni Indrayudha , Erma Musbita Tyastuti , Azmi Zaki Waliudin Althaf , Banuwati Kartika Sari
There is a sustained demand for biofertilizers to enhance crop productivity. Endophytic bacteria associated with disease-tolerant rice varieties offer significant potential as biofertilizers; however, the bacteriome diversity within these plants remains underexplored. This dataset presents full-length 16S metagenomic sequences of endophytic bacteria isolated from the roots of blast-infected and uninfected rice plants. Root samples were processed and subjected to surface sterilisation. Following total genomic DNA extraction, sequencing was performed using 16S ribosomal RNA primers via the high-throughput Oxford Nanopore Technologies platform. The raw sequence data were filtered for quality control using NanoFilt. Subsequently, the sequences were aligned against the National Center for Biotechnology Information (NCBI) 16S RefSeq database to identify the species of the endophytic root bacteria. The data associated with this project have been registered in the NCBI BioProject database under accession number PRJNA992961. The dataset comprises two distinct sample groups, each analysed in duplicate, with sequencing yields ranging from 17.7 to 20.3 Mb. Consequently, this dataset provides valuable insights regarding the comparative composition of endophytic bacteria inhabiting healthy roots versus those found in blast-infected rice. Characterizing this diversity, particularly within healthy rice plants, is essential for foundational research underpinning the future development of biofertilizers.
为提高作物产量,对生物肥料有持续的需求。与抗病水稻品种相关的内生细菌具有作为生物肥料的巨大潜力;然而,这些植物中的细菌群多样性仍未得到充分研究。该数据集展示了从稻瘟病感染和未感染水稻根系分离的内生细菌的全长16S宏基因组序列。根样品经过处理并进行表面消毒。提取总基因组DNA后,通过高通量Oxford Nanopore Technologies平台使用16S核糖体RNA引物进行测序。使用NanoFilt对原始序列数据进行过滤以进行质量控制。随后,将这些序列与美国国家生物技术信息中心(NCBI) 16S RefSeq数据库比对,以确定内生根菌的种类。与本项目相关的数据已在NCBI BioProject数据库中注册,注册号为PRJNA992961。该数据集包括两个不同的样本组,每个样本组重复分析,测序产量范围从17.7到20.3 Mb。因此,该数据集提供了关于健康根部内生细菌与稻瘟病感染水稻中发现的内生细菌的比较组成的宝贵见解。表征这种多样性,特别是在健康的水稻植物中,对于支撑生物肥料未来发展的基础研究至关重要。
{"title":"Metabarcoding data: Full-length 16S rRNA sequence of endophytic bacteria in the root of asymptomatic and blast-symptomatic rice plants (Oryza sativa, L.)","authors":"Yasir Sidiq ,&nbsp;Triastuti Rahayu ,&nbsp;Peni Indrayudha ,&nbsp;Erma Musbita Tyastuti ,&nbsp;Azmi Zaki Waliudin Althaf ,&nbsp;Banuwati Kartika Sari","doi":"10.1016/j.dib.2026.112522","DOIUrl":"10.1016/j.dib.2026.112522","url":null,"abstract":"<div><div>There is a sustained demand for biofertilizers to enhance crop productivity. Endophytic bacteria associated with disease-tolerant rice varieties offer significant potential as biofertilizers; however, the bacteriome diversity within these plants remains underexplored. This dataset presents full-length 16S metagenomic sequences of endophytic bacteria isolated from the roots of blast-infected and uninfected rice plants. Root samples were processed and subjected to surface sterilisation. Following total genomic DNA extraction, sequencing was performed using 16S ribosomal RNA primers via the high-throughput Oxford Nanopore Technologies platform. The raw sequence data were filtered for quality control using NanoFilt. Subsequently, the sequences were aligned against the National Center for Biotechnology Information (NCBI) 16S RefSeq database to identify the species of the endophytic root bacteria. The data associated with this project have been registered in the NCBI BioProject database under accession number PRJNA992961. The dataset comprises two distinct sample groups, each analysed in duplicate, with sequencing yields ranging from 17.7 to 20.3 Mb. Consequently, this dataset provides valuable insights regarding the comparative composition of endophytic bacteria inhabiting healthy roots versus those found in blast-infected rice. Characterizing this diversity, particularly within healthy rice plants, is essential for foundational research underpinning the future development of biofertilizers<em>.</em></div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"65 ","pages":"Article 112522"},"PeriodicalIF":1.4,"publicationDate":"2026-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146185146","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
UniEload: Electrical load dataset for energy forecasting applications at public universities in Bangladesh uniload:孟加拉国公立大学能源预测应用的电力负荷数据集
IF 1.4 Q3 MULTIDISCIPLINARY SCIENCES Pub Date : 2026-04-01 Epub Date: 2026-01-07 DOI: 10.1016/j.dib.2025.112444
Utshob Sutradhar , Priyankar Biswas , Sumon Hossain , A.T.M. Saiful Islam , Shuvo Dev
This paper presents a dataset on electrical power collected from a university campus in Bangladesh. It is meant to help research on energy forecasting in university settings. The dataset has hourly measurements of system voltage, three-phase currents (R, Y, B), and power factor (pf). These were recorded at the campus substation. Data were collected during different operational conditions, including academic periods and vacations. This provides insights into load behaviour, changes in power factor, and phase imbalance patterns in an educational setting. The dataset supports the creation and assessment of models for load forecasting, anomaly detection, and improving power efficiency. It was also combined with weather data to aid research on load forecasting that takes weather into account. The weather parameters include temperature, humidity, precipitation, wind speed, and solar radiation. All weather values match energy values and were gathered hourly and daily. This dataset is especially useful for researchers studying how artificial intelligence and machine learning can be applied in managing electrical energy. The dataset also includes notes about context, such as reduced load during national holidays. This improves its usefulness for studies that focus on events in forecasting. By making this dataset open access, it helps fill the gap in publicly available electrical load data from educational institutions in developing countries. This supports reproducible research and sustainable energy management on campus.
本文介绍了从孟加拉国一所大学校园收集的电力数据集。它的目的是帮助大学环境中的能源预测研究。该数据集每小时测量一次系统电压、三相电流(R、Y、B)和功率因数(pf)。这些都是在校园变电站录的。数据是在不同的操作条件下收集的,包括学习期间和假期。这提供了对负载行为、功率因数变化和教育环境中的相位不平衡模式的见解。该数据集支持创建和评估负荷预测、异常检测和提高电力效率的模型。它还与天气数据相结合,以帮助考虑天气因素的负荷预测研究。天气参数包括温度、湿度、降水、风速和太阳辐射。所有的天气值都与能量值相匹配,每小时和每天收集一次。该数据集对于研究如何将人工智能和机器学习应用于电能管理的研究人员特别有用。该数据集还包括有关上下文的注释,例如国家假日期间的负载减少。这提高了它对关注预测事件的研究的有用性。通过使该数据集开放获取,它有助于填补发展中国家教育机构公开可用的电力负荷数据的空白。这支持可重复研究和校园可持续能源管理。
{"title":"UniEload: Electrical load dataset for energy forecasting applications at public universities in Bangladesh","authors":"Utshob Sutradhar ,&nbsp;Priyankar Biswas ,&nbsp;Sumon Hossain ,&nbsp;A.T.M. Saiful Islam ,&nbsp;Shuvo Dev","doi":"10.1016/j.dib.2025.112444","DOIUrl":"10.1016/j.dib.2025.112444","url":null,"abstract":"<div><div>This paper presents a dataset on electrical power collected from a university campus in Bangladesh. It is meant to help research on energy forecasting in university settings. The dataset has hourly measurements of system voltage, three-phase currents (R, Y, B), and power factor (pf). These were recorded at the campus substation. Data were collected during different operational conditions, including academic periods and vacations. This provides insights into load behaviour, changes in power factor, and phase imbalance patterns in an educational setting. The dataset supports the creation and assessment of models for load forecasting, anomaly detection, and improving power efficiency. It was also combined with weather data to aid research on load forecasting that takes weather into account. The weather parameters include temperature, humidity, precipitation, wind speed, and solar radiation. All weather values match energy values and were gathered hourly and daily. This dataset is especially useful for researchers studying how artificial intelligence and machine learning can be applied in managing electrical energy. The dataset also includes notes about context, such as reduced load during national holidays. This improves its usefulness for studies that focus on events in forecasting. By making this dataset open access, it helps fill the gap in publicly available electrical load data from educational institutions in developing countries. This supports reproducible research and sustainable energy management on campus.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"65 ","pages":"Article 112444"},"PeriodicalIF":1.4,"publicationDate":"2026-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145976668","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Participatory and multi-disciplinary science dataset and surveys for the assessment of the microbiological and behavioural factors influencing fresh fruits and vegetables' waste at home 参与和多学科的科学数据集和调查,以评估影响家中新鲜水果和蔬菜浪费的微生物和行为因素
IF 1.4 Q3 MULTIDISCIPLINARY SCIENCES Pub Date : 2026-04-01 Epub Date: 2026-01-07 DOI: 10.1016/j.dib.2025.112434
Camille Marchal , Damien Ballan , Sarra Azib , Morgane Innocent , Bertrand Urien , Annick Tamaro , Marine Le Gall-Ely , Emmanuel Coton , Adeline Picot , Jérôme Mounier , Louis Coroller , Patrick Gabriel
Fresh fruits and vegetables (FFV) represent the largest part of food waste at the consumer level. This waste directly results from FFV physiological and microbiological spoilage, itself intricately linked to behavioural factors such as consumer practices, including purchase, storage and hygiene practices, but also consumers’ perceptions towards spoilage. Based on a dual approach combining microbiological and behavioural sciences, we examined the link between FFV waste produced by 49 volunteering French households, measured using connected bins, the microbial ecology of their storage compartments, using culture-dependent and -independent approaches, and their consumer behaviour, cleaning and storage practices, through in-depth interviews and a dedicated survey. An exploratory qualitative survey carried out on 17 individuals followed by two quantitative data collections on 1048 and 815 representative French consumers enabled us to identify anti-FFV waste practices and to cluster consumers according to their anti-FFV waste behaviours. Spoilage dynamics of commonly consumed FFV, according to storage temperature, microbial contamination level and the presence or absence of surface wounds, were also performed in controlled conditions. This citizen-science-based dataset covers a wide array of microbiological and behavioural factors related to domestic FFV waste, as well as real measurements of waste volumes thanks to the innovative use of connected bins. Altogether, this data could provide interesting insights into more effective and accessible guidelines for FFV waste reduction at the consumer level, and thus to a potential reduction of global food waste and its related costs.
在消费者层面,新鲜水果和蔬菜(FFV)是食物浪费的最大部分。这种浪费直接来自FFV生理和微生物腐败,其本身与消费者行为(包括购买、储存和卫生习惯)以及消费者对腐败的看法等行为因素密切相关。基于微生物学和行为科学相结合的双重方法,我们通过深入访谈和专门调查,研究了49个法国志愿家庭产生的FFV废物之间的联系,使用连接的垃圾箱进行测量,他们的储存隔间的微生物生态,使用培养依赖和独立的方法,以及他们的消费者行为,清洁和储存实践。我们对17个人进行了探索性定性调查,随后对1048名和815名具有代表性的法国消费者进行了两次定量数据收集,使我们能够确定反ffv浪费行为,并根据消费者的反ffv浪费行为对消费者进行分类。在受控条件下,根据储存温度、微生物污染水平和有无表面伤口,研究了通常消耗的FFV的腐败动力学。这个以公民科学为基础的数据集涵盖了与家庭FFV废物有关的一系列微生物和行为因素,以及由于创新使用连接垃圾箱而对废物量的实际测量。总的来说,这些数据可以为在消费者层面上更有效和更容易获得的FFV废物减少准则提供有趣的见解,从而可能减少全球食物浪费及其相关成本。
{"title":"Participatory and multi-disciplinary science dataset and surveys for the assessment of the microbiological and behavioural factors influencing fresh fruits and vegetables' waste at home","authors":"Camille Marchal ,&nbsp;Damien Ballan ,&nbsp;Sarra Azib ,&nbsp;Morgane Innocent ,&nbsp;Bertrand Urien ,&nbsp;Annick Tamaro ,&nbsp;Marine Le Gall-Ely ,&nbsp;Emmanuel Coton ,&nbsp;Adeline Picot ,&nbsp;Jérôme Mounier ,&nbsp;Louis Coroller ,&nbsp;Patrick Gabriel","doi":"10.1016/j.dib.2025.112434","DOIUrl":"10.1016/j.dib.2025.112434","url":null,"abstract":"<div><div>Fresh fruits and vegetables (FFV) represent the largest part of food waste at the consumer level. This waste directly results from FFV physiological and microbiological spoilage, itself intricately linked to behavioural factors such as consumer practices, including purchase, storage and hygiene practices, but also consumers’ perceptions towards spoilage. Based on a dual approach combining microbiological and behavioural sciences, we examined the link between FFV waste produced by 49 volunteering French households, measured using connected bins, the microbial ecology of their storage compartments, using culture-dependent and -independent approaches, and their consumer behaviour, cleaning and storage practices, through in-depth interviews and a dedicated survey. An exploratory qualitative survey carried out on 17 individuals followed by two quantitative data collections on 1048 and 815 representative French consumers enabled us to identify anti-FFV waste practices and to cluster consumers according to their anti-FFV waste behaviours. Spoilage dynamics of commonly consumed FFV, according to storage temperature, microbial contamination level and the presence or absence of surface wounds, were also performed in controlled conditions. This citizen-science-based dataset covers a wide array of microbiological and behavioural factors related to domestic FFV waste, as well as real measurements of waste volumes thanks to the innovative use of connected bins. Altogether, this data could provide interesting insights into more effective and accessible guidelines for FFV waste reduction at the consumer level, and thus to a potential reduction of global food waste and its related costs.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"65 ","pages":"Article 112434"},"PeriodicalIF":1.4,"publicationDate":"2026-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145976666","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Data in Brief
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1