首页 > 最新文献

Data in Brief最新文献

英文 中文
Deep-sea image dataset for organism detection 用于生物检测的深海图像数据集
IF 1.4 Q3 MULTIDISCIPLINARY SCIENCES Pub Date : 2026-01-10 DOI: 10.1016/j.dib.2026.112462
Takaki Nishio, Yuki Kawae
The conservation of marine resources and the mitigation of marine pollution require strengthened knowledge of marine biodiversity, particularly in the deep sea. Videos and images are valuable for documenting the distribution of deep-sea organisms, but manual processing is labor-intensive and variable, emphasizing the need for automated methods. To address this, the J-EDI Organism Detection Dataset (JODD) is introduced. This dataset comprises 8151 images and 15,621 bounding boxes annotated in the Common Objects in Context (COCO) format. The images were captured during deep-sea surveys conducted by the Japan Agency for Marine-Earth Science and Technology (JAMSTEC) between 1984 and 2021, using remotely operated vehicles (ROVs) and human-occupied vehicles (HOVs). All images were derived from publicly available videos in JAMSTEC’s E-library of Deep-sea Images (J-EDI). The dataset includes 20 object categories—19 biological groups and one machine category—providing a reusable resource for developing and benchmarking machine learning models for the automatic detection of deep-sea organisms.
养护海洋资源和减轻海洋污染需要加强对海洋生物多样性的认识,特别是对深海生物多样性的认识。视频和图像对于记录深海生物的分布是有价值的,但人工处理是劳动密集型的,而且是可变的,强调了自动化方法的必要性。为了解决这个问题,引入了J-EDI生物检测数据集(JODD)。该数据集包括8151张图像和15621个边界框,以Common Objects in Context (COCO)格式标注。这些图像是在1984年至2021年期间由日本海洋地球科学技术机构(JAMSTEC)使用远程操作车辆(rov)和载人车辆(hov)进行的深海调查中捕获的。所有图像均来自JAMSTEC的深海图像电子库(J-EDI)中的公开视频。该数据集包括20个对象类别- 19个生物类群和一个机器类别-为深海生物自动检测的机器学习模型的开发和基准测试提供了可重复使用的资源。
{"title":"Deep-sea image dataset for organism detection","authors":"Takaki Nishio,&nbsp;Yuki Kawae","doi":"10.1016/j.dib.2026.112462","DOIUrl":"10.1016/j.dib.2026.112462","url":null,"abstract":"<div><div>The conservation of marine resources and the mitigation of marine pollution require strengthened knowledge of marine biodiversity, particularly in the deep sea. Videos and images are valuable for documenting the distribution of deep-sea organisms, but manual processing is labor-intensive and variable, emphasizing the need for automated methods. To address this, the J-EDI Organism Detection Dataset (JODD) is introduced. This dataset comprises 8151 images and 15,621 bounding boxes annotated in the Common Objects in Context (COCO) format. The images were captured during deep-sea surveys conducted by the Japan Agency for Marine-Earth Science and Technology (JAMSTEC) between 1984 and 2021, using remotely operated vehicles (ROVs) and human-occupied vehicles (HOVs). All images were derived from publicly available videos in JAMSTEC’s E-library of Deep-sea Images (J-EDI). The dataset includes 20 object categories—19 biological groups and one machine category—providing a reusable resource for developing and benchmarking machine learning models for the automatic detection of deep-sea organisms.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"65 ","pages":"Article 112462"},"PeriodicalIF":1.4,"publicationDate":"2026-01-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145976670","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A clinical dataset on type-2 diabetes including demographic, anthropometric, and biochemical parameters from Bangladesh 来自孟加拉国的2型糖尿病临床数据集,包括人口统计学、人体测量学和生化参数
IF 1.4 Q3 MULTIDISCIPLINARY SCIENCES Pub Date : 2026-01-10 DOI: 10.1016/j.dib.2026.112457
Md. Younus Bhuiyan , Shahriar Siddique Ayon , Md. Ebrahim Hossain , Md. Saef Ullah Miah , Afjal H. Sarower , Fateha khanam Bappee
Type-2 diabetes is a major public health concern in Bangladesh, and this dataset provides 1065 curated patient records with demographic, anthropometric, and clinical variables relevant to its assessment. The data were collected during routine clinical visits and recorded by trained staff, with checks to ensure accuracy and completeness. It includes basic details like age, pregnancy count, body mass index, and skin-fold thickness; vital signs such as blood pressure; lab results related to blood sugar (fasting glucose and insulin); the Diabetes Pedigree Function; and a simple yes/no label for Type-2 diabetes. A few values are missing for diastolic blood pressure and skin-fold thickness, so users should handle these carefully. Since the data are cross-sectional and come from patients seeking care, there are more diabetic cases (840) than non-diabetic cases (225). The dataset is intended for reuse in method development (for example, machine-learning classifier training, feature-selection benchmarking, and oversampling/imputation research), for context-specific epidemiologic description and model validation in South Asian clinical settings, and as a teaching resource for reproducible biomedical-data workflows.
2型糖尿病是孟加拉国的一个主要公共卫生问题,该数据集提供了1065份精心整理的患者记录,其中包含与评估相关的人口统计学、人体测量学和临床变量。数据是在常规临床访问期间收集的,并由训练有素的工作人员记录,并检查以确保准确性和完整性。它包括基本细节,如年龄、怀孕数、体重指数和皮肤褶皱厚度;生命体征,如血压;与血糖相关的实验室结果(空腹血糖和胰岛素);糖尿病谱系功能;以及2型糖尿病简单的是/否标签。舒张压和皮肤褶皱厚度的一些值缺失,因此用户应小心处理这些值。由于数据是横断面的,并且来自寻求治疗的患者,因此糖尿病病例(840例)多于非糖尿病病例(225例)。该数据集旨在用于方法开发(例如,机器学习分类器训练,特征选择基准测试和过采样/归算研究),用于南亚临床环境中特定背景的流行病学描述和模型验证,以及作为可重复的生物医学数据工作流程的教学资源。
{"title":"A clinical dataset on type-2 diabetes including demographic, anthropometric, and biochemical parameters from Bangladesh","authors":"Md. Younus Bhuiyan ,&nbsp;Shahriar Siddique Ayon ,&nbsp;Md. Ebrahim Hossain ,&nbsp;Md. Saef Ullah Miah ,&nbsp;Afjal H. Sarower ,&nbsp;Fateha khanam Bappee","doi":"10.1016/j.dib.2026.112457","DOIUrl":"10.1016/j.dib.2026.112457","url":null,"abstract":"<div><div>Type-2 diabetes is a major public health concern in Bangladesh, and this dataset provides 1065 curated patient records with demographic, anthropometric, and clinical variables relevant to its assessment. The data were collected during routine clinical visits and recorded by trained staff, with checks to ensure accuracy and completeness. It includes basic details like age, pregnancy count, body mass index, and skin-fold thickness; vital signs such as blood pressure; lab results related to blood sugar (fasting glucose and insulin); the Diabetes Pedigree Function; and a simple yes/no label for Type-2 diabetes. A few values are missing for diastolic blood pressure and skin-fold thickness, so users should handle these carefully. Since the data are cross-sectional and come from patients seeking care, there are more diabetic cases (840) than non-diabetic cases (225). The dataset is intended for reuse in method development (for example, machine-learning classifier training, feature-selection benchmarking, and oversampling/imputation research), for context-specific epidemiologic description and model validation in South Asian clinical settings, and as a teaching resource for reproducible biomedical-data workflows.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"65 ","pages":"Article 112457"},"PeriodicalIF":1.4,"publicationDate":"2026-01-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145976669","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Uav-based multisensor framework for legal industrial Cannabis monitoring and open-access dataset development 基于无人机的多传感器框架,用于合法工业大麻监测和开放获取数据集开发
IF 1.4 Q3 MULTIDISCIPLINARY SCIENCES Pub Date : 2026-01-09 DOI: 10.1016/j.dib.2026.112463
Genta Rexha , Ina Papadhopulli , Aleksandër Biberaj , Elson Agastra , Enida Sheme , Elinda Meçe
Industrial hemp cultivation is expanding and requires reliable monitoring for legal compliance and agricultural management. This paper presents a standardized UAV-based multisensor framework designed for Cannabis sativa L. It integrates RGB, multispectral, and thermal imaging as core modules, with hyperspectral and LiDAR as optional extensions. The framework sets protocols for sensor integration, flight planning, field measurements, and annotation, ensuring datasets that meet EU altitude limits (≤120 m AGL). Multi-altitude and multi-time-of-day acquisitions are proposed to capture spatial and diurnal variability. These data improve model robustness for phenotyping, stress detection, and THC compliance verification. Potential applications include precision agriculture, breeding, regulatory monitoring, environmental assessment, and illicit crop detection. Open-access datasets generated through this framework will support reproducibility, machine learning development, and collaboration among researchers, farmers, and regulators.
工业大麻种植正在扩大,需要对法律合规和农业管理进行可靠的监测。本文提出了一种针对大麻的标准化无人机多传感器框架,该框架将RGB、多光谱和热成像作为核心模块,高光谱和激光雷达作为可选扩展模块。该框架为传感器集成、飞行计划、现场测量和注释设置协议,确保数据集符合欧盟高度限制(≤120米AGL)。提出了多海拔和多时段采集以捕获空间和日变化。这些数据提高了模型在表型、应力检测和THC依从性验证方面的稳健性。潜在的应用包括精准农业、育种、监管监测、环境评估和非法作物检测。通过该框架生成的开放获取数据集将支持可重复性、机器学习开发以及研究人员、农民和监管机构之间的合作。
{"title":"A Uav-based multisensor framework for legal industrial Cannabis monitoring and open-access dataset development","authors":"Genta Rexha ,&nbsp;Ina Papadhopulli ,&nbsp;Aleksandër Biberaj ,&nbsp;Elson Agastra ,&nbsp;Enida Sheme ,&nbsp;Elinda Meçe","doi":"10.1016/j.dib.2026.112463","DOIUrl":"10.1016/j.dib.2026.112463","url":null,"abstract":"<div><div>Industrial hemp cultivation is expanding and requires reliable monitoring for legal compliance and agricultural management. This paper presents a standardized UAV-based multisensor framework designed for Cannabis sativa L. It integrates RGB, multispectral, and thermal imaging as core modules, with hyperspectral and LiDAR as optional extensions. The framework sets protocols for sensor integration, flight planning, field measurements, and annotation, ensuring datasets that meet EU altitude limits (≤120 m AGL). Multi-altitude and multi-time-of-day acquisitions are proposed to capture spatial and diurnal variability. These data improve model robustness for phenotyping, stress detection, and THC compliance verification. Potential applications include precision agriculture, breeding, regulatory monitoring, environmental assessment, and illicit crop detection. Open-access datasets generated through this framework will support reproducibility, machine learning development, and collaboration among researchers, farmers, and regulators.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"65 ","pages":"Article 112463"},"PeriodicalIF":1.4,"publicationDate":"2026-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145976673","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Draft genome data analysis and pathogenicity profiling of Staphylococcus aureus strain IHS3A with antibiotic resistance genes isolated from a hospital in Jordan 从约旦一家医院分离的具有抗生素耐药基因的金黄色葡萄球菌IHS3A菌株基因组数据分析和致病性谱草稿
IF 1.4 Q3 MULTIDISCIPLINARY SCIENCES Pub Date : 2026-01-08 DOI: 10.1016/j.dib.2026.112453
Saqr Abushattal , Sulaiman M. Alnaimat , Nidal Odat , Mahmoud Abushattal
This dataset provides a comprehensive genomic and pathogenicity profiling of Staphylococcus aureus strain IHS3A, a methicillin-resistant (MRSA) clinical isolate obtained from a healthcare worker in a teaching hospital in Jordan, Middle East. Whole genome sequencing was performed using the Illumina NextSeq 2000 platform, followed by high-quality de novo assembly using SPAdes. The genome spans 2821,373 bp across 90 contigs, with a GC content of 32.78%, and demonstrates high-quality metrics, including 99.67% completeness and minimal contamination (0.08%). The genome analysis identified 2611 predicted protein-coding sequences. Multilocus sequence typing (MLST) assigned the isolate to ST10647, SCCmec typing revealed type IVc (2B), and spa typing identified t131. The dataset includes comprehensive annotations of key antimicrobial resistance genes, such as mecA (methicillin resistance), blaZ (penicillin resistance), and lmrS (macrolide efflux), as well as virulence factors related to adherence (e.g., atl, clfA), immune evasion (e.g., scn, adsA), secretion systems (e.g., esaA, esaB), and toxins (e.g., hla, lukF-PV, tsst). Secondary metabolite biosynthetic gene clusters, such as staphyloferrin B and staphylopine, were identified. The genome also encodes a diverse carbohydrate-active enzyme (CAZyme) profile. These genomic data are valuable for further research on MRSA evolution, resistance mechanisms, and virulence factors in Jordan and the Middle East. The genome data have been deposited in the NCBI database under the accession number JBPPGA000000000, with a direct URL to data: https://www.ncbi.nlm.nih.gov/nuccore/JBPPGA000000000.1. Bioproject: PRJNA1283614, Biosample: SAMN49700843.
该数据集提供了金黄色葡萄球菌菌株IHS3A的全面基因组和致病性分析,这是一种耐甲氧西林(MRSA)临床分离物,来自中东约旦一家教学医院的一名卫生保健工作者。使用Illumina NextSeq 2000平台进行全基因组测序,然后使用SPAdes进行高质量的从头组装。该基因组全长2821,373 bp,共90个contigs, GC含量为32.78%,具有高质量的指标,包括99.67%的完整性和最小污染(0.08%)。基因组分析鉴定出2611个预测蛋白编码序列。多位点序列分型(MLST)鉴定分离株为ST10647型,SCCmec分型鉴定为IVc型(2B), spa分型鉴定为t131型。该数据集包括关键抗菌素耐药基因的综合注释,如mecA(甲氧西林耐药)、blaZ(青霉素耐药)和lmrS(大环内酯外排),以及与粘附(如atl、clfA)、免疫逃避(如scn、adsA)、分泌系统(如esaA、esaB)和毒素(如hla、lukF-PV、tsst)相关的毒力因子。次生代谢物生物合成基因簇,如葡萄铁蛋白B和葡萄蛋白。基因组还编码多种碳水化合物活性酶(CAZyme)谱。这些基因组数据对进一步研究约旦和中东地区的MRSA进化、耐药机制和毒力因素具有重要价值。基因组数据已存入NCBI数据库,登录号为JBPPGA000000000,数据的直接URL为:https://www.ncbi.nlm.nih.gov/nuccore/JBPPGA000000000.1。生物工程:PRJNA1283614,生物样品:SAMN49700843。
{"title":"Draft genome data analysis and pathogenicity profiling of Staphylococcus aureus strain IHS3A with antibiotic resistance genes isolated from a hospital in Jordan","authors":"Saqr Abushattal ,&nbsp;Sulaiman M. Alnaimat ,&nbsp;Nidal Odat ,&nbsp;Mahmoud Abushattal","doi":"10.1016/j.dib.2026.112453","DOIUrl":"10.1016/j.dib.2026.112453","url":null,"abstract":"<div><div>This dataset provides a comprehensive genomic and pathogenicity profiling of <em>Staphylococcus aureus</em> strain IHS3A, a methicillin-resistant (MRSA) clinical isolate obtained from a healthcare worker in a teaching hospital in Jordan, Middle East. Whole genome sequencing was performed using the Illumina NextSeq 2000 platform, followed by high-quality de novo assembly using SPAdes. The genome spans 2821,373 bp across 90 contigs, with a GC content of 32.78%, and demonstrates high-quality metrics, including 99.67% completeness and minimal contamination (0.08%). The genome analysis identified 2611 predicted protein-coding sequences. Multilocus sequence typing (MLST) assigned the isolate to ST10647, SCC<em>mec</em> typing revealed type IVc (2B), and spa typing identified t131. The dataset includes comprehensive annotations of key antimicrobial resistance genes, such as <em>mecA</em> (methicillin resistance), <em>blaZ</em> (penicillin resistance), and <em>lmrS</em> (macrolide efflux), as well as virulence factors related to adherence (e.g., <em>atl, clfA</em>), immune evasion (e.g., <em>scn, adsA</em>), secretion systems (e.g., <em>esaA, esaB</em>), and toxins (e.g., <em>hla, lukF</em>-<em>PV, tsst</em>). Secondary metabolite biosynthetic gene clusters, such as staphyloferrin B and staphylopine, were identified. The genome also encodes a diverse carbohydrate-active enzyme (CAZyme) profile. These genomic data are valuable for further research on MRSA evolution, resistance mechanisms, and virulence factors in Jordan and the Middle East. The genome data have been deposited in the NCBI database under the accession number JBPPGA000000000, with a direct URL to data: <span><span>https://www.ncbi.nlm.nih.gov/nuccore/JBPPGA000000000.1</span><svg><path></path></svg></span>. Bioproject: PRJNA1283614, Biosample: SAMN49700843.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"64 ","pages":"Article 112453"},"PeriodicalIF":1.4,"publicationDate":"2026-01-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145973220","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Corn seed dataset based on hyperspectral and RGB images 基于高光谱和RGB图像的玉米种子数据集
IF 1.4 Q3 MULTIDISCIPLINARY SCIENCES Pub Date : 2026-01-08 DOI: 10.1016/j.dib.2026.112455
Chao LI , Chen Zhang , Wenbo Zhang , Chengzhen LV , Yaqiang Li , Yufen Wang
This study employed an HY-6010-S hyperspectral imaging system, covering a spectral range of 400–1000 nm, combined with an RGB industrial camera to acquire multimodal data. The dataset simulates phenotypic analysis scenarios of maize seeds under controlled laboratory conditions, with the ambient temperature maintained at 20–25°C. Comprehensive testing was conducted using 12 different maize varieties. Approximately 200 seed samples were collected per variety, resulting in a total sample size of about 2400, each subjected to hyperspectral and RGB image acquisition. Preprocessing steps included noise reduction, background removal, band selection, and modality alignment. To ensure the accuracy and reliability of the experimental data, HHIT software and Python were utilized for data processing. This dataset plays a significant role in seed variety classification, phenotypic analysis, precision agriculture, and machine learning applications.
本研究采用HY-6010-S高光谱成像系统,覆盖400-1000 nm光谱范围,结合RGB工业相机获取多模态数据。该数据集模拟了受控实验室条件下玉米种子的表型分析情景,环境温度保持在20-25℃。采用12个不同的玉米品种进行了综合试验。每个品种大约收集了200个种子样本,总样本量约为2400个,每个样本都进行了高光谱和RGB图像采集。预处理步骤包括降噪、背景去除、波段选择和模态对齐。为了保证实验数据的准确性和可靠性,使用HHIT软件和Python进行数据处理。该数据集在种子品种分类、表型分析、精准农业和机器学习应用中发挥着重要作用。
{"title":"Corn seed dataset based on hyperspectral and RGB images","authors":"Chao LI ,&nbsp;Chen Zhang ,&nbsp;Wenbo Zhang ,&nbsp;Chengzhen LV ,&nbsp;Yaqiang Li ,&nbsp;Yufen Wang","doi":"10.1016/j.dib.2026.112455","DOIUrl":"10.1016/j.dib.2026.112455","url":null,"abstract":"<div><div>This study employed an HY-6010-S hyperspectral imaging system, covering a spectral range of 400–1000 nm, combined with an RGB industrial camera to acquire multimodal data. The dataset simulates phenotypic analysis scenarios of maize seeds under controlled laboratory conditions, with the ambient temperature maintained at 20–25°C. Comprehensive testing was conducted using 12 different maize varieties. Approximately 200 seed samples were collected per variety, resulting in a total sample size of about 2400, each subjected to hyperspectral and RGB image acquisition. Preprocessing steps included noise reduction, background removal, band selection, and modality alignment. To ensure the accuracy and reliability of the experimental data, HHIT software and Python were utilized for data processing. This dataset plays a significant role in seed variety classification, phenotypic analysis, precision agriculture, and machine learning applications.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"65 ","pages":"Article 112455"},"PeriodicalIF":1.4,"publicationDate":"2026-01-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145976664","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Paired clinical 12 lead and apple watch electrocardiogram data repository from childhood cancer survivors authors 来自儿童癌症幸存者作者的配对临床铅和苹果手表心电图数据库
IF 1.4 Q3 MULTIDISCIPLINARY SCIENCES Pub Date : 2026-01-08 DOI: 10.1016/j.dib.2026.112452
Oguz Akbilgic , Ibrahim Karabayir , Luke Patterson , Stephanie B. Dixon , Daniel A. Mulrooney , Kirsten K. Ness , Melissa M. Hudson
Childhood cancer survivors (CCS), exposed to prior cardiotoxic treatments such as anthracyclines and chest radiation, are at lifelong risk of cardiovascular complications. Current guidelines recommend periodic echocardiographic surveillance, but adherence rates are as low as 41%. This dataset provides paired same-day 12-lead clinical electrocardiograms (ECG) and single-lead wearable ECG recordings from the Apple Watch, collected from adult CCS participating in the St. Jude Lifetime Cohort Study (SJLIFE). The availability of paired wearable and clinical ECGs enables the development and validation of remote AI-based cardiac screening tools, potentially leading to more precise long-term cardiovascular surveillance in this population. Using this dataset, researchers can assess whether an AI model developed using clinical ECG can be repeat when using ECG from an Apple Watch.
儿童癌症幸存者(CCS)先前暴露于蒽环类药物和胸部放射等心脏毒性治疗,终生面临心血管并发症的风险。目前的指南建议定期超声心动图监测,但依从率低至41%。该数据集提供了来自Apple Watch的配对当日12导联临床心电图(ECG)和单导联可穿戴心电图记录,这些记录来自参加St. Jude终身队列研究(SJLIFE)的成人CCS。配对可穿戴和临床心电图的可用性使基于人工智能的远程心脏筛查工具的开发和验证成为可能,从而在这一人群中实现更精确的长期心血管监测。利用该数据集,研究人员可以评估使用临床心电图开发的人工智能模型是否可以在使用苹果手表的心电图时重复。
{"title":"Paired clinical 12 lead and apple watch electrocardiogram data repository from childhood cancer survivors authors","authors":"Oguz Akbilgic ,&nbsp;Ibrahim Karabayir ,&nbsp;Luke Patterson ,&nbsp;Stephanie B. Dixon ,&nbsp;Daniel A. Mulrooney ,&nbsp;Kirsten K. Ness ,&nbsp;Melissa M. Hudson","doi":"10.1016/j.dib.2026.112452","DOIUrl":"10.1016/j.dib.2026.112452","url":null,"abstract":"<div><div>Childhood cancer survivors (CCS), exposed to prior cardiotoxic treatments such as anthracyclines and chest radiation, are at lifelong risk of cardiovascular complications. Current guidelines recommend periodic echocardiographic surveillance, but adherence rates are as low as 41%. This dataset provides paired same-day 12-lead clinical electrocardiograms (ECG) and single-lead wearable ECG recordings from the Apple Watch, collected from adult CCS participating in the St. Jude Lifetime Cohort Study (SJLIFE). The availability of paired wearable and clinical ECGs enables the development and validation of remote AI-based cardiac screening tools, potentially leading to more precise long-term cardiovascular surveillance in this population. Using this dataset, researchers can assess whether an AI model developed using clinical ECG can be repeat when using ECG from an Apple Watch.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"65 ","pages":"Article 112452"},"PeriodicalIF":1.4,"publicationDate":"2026-01-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145976667","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Soil and crop data from a long-term organic fertilization trial in Sub-Sahelian market gardening 萨赫勒以南地区市场园艺长期有机施肥试验的土壤和作物数据
IF 1.4 Q3 MULTIDISCIPLINARY SCIENCES Pub Date : 2026-01-08 DOI: 10.1016/j.dib.2026.112456
Marie-Liesse Vermeire , Pathé Basse , Samuel Legros , Falilou Diallo , Anne Desnues , Frédéric Feder
Recycling the growing stock of organic waste products (OWP) from cities, factories, and farms is a key challenge for sustainable agriculture. However, it must be done with awareness of performances but also potential long-term environmental and health risks. In this context, the SOERE PRO observatory was established ("Systèmes d'Observation et d'Expérimentation pour la Recherche en Environnement - Produits Résiduaires Organiques'', a label granted by the French National Research Alliance for the Environment (AllEnvi) to recognize high-quality research infrastructures, which translates to "Long-term Observation and Experimentation Systems for Environmental Research - Organic Waste Products''), including the trial in Sangalkam, in the Dakar region of Senegal, where these data are collected. Since 2016, four fertilizer types - one mineral (synthetic) and three organic - have been applied annually to three successive vegetable crops (tomato, lettuce, carrot). The dataset currently covers the period 2016 - 2025, with data collection ongoing and new data to be added in the future. Manual weeding and hoeing is carried out regularly for each crop, no pesticides are used for crop protection on the trial. A comprehensive, multi-variable dataset is consistently documented, including soil physico-chemical parameters measured annually at three depths, organic waste product characterization, crop yield and quality parameters, and detailed management activities, making it particularly suitable for process-based modelling and long-term impact assessment. The originality of this dataset lies in its long duration, the diversity of organic and mineral fertilization strategies, the inclusion of multiple vegetable crops per year, and its location under Sub-Sahelian conditions, a context for which long-term agronomic datasets remain scarce. All soil, OWP and vegetables samples are stored in a sample bank in Dakar, and available for additional analyses. The objective of this dataset is to provide long-term, integrated information on crop productivity, crop quality, and soil responses to repeated organic and mineral fertilization in a Sub-Sahelian market-gardening system. The dataset is publicly available through a Dataverse repository for free (re)use in meta-analyses, process-based modelling, and environmental studies, notably to improve understanding of nutrient cycling, contaminant dynamics, soil biodiversity, and long-term soil functioning in Sub-Sahelian agroecosystems, and to support sustainable land management and food security in Southern countries under future climate change.
从城市、工厂和农场中回收越来越多的有机废物(OWP)是可持续农业的一个关键挑战。然而,在进行这项工作时,不仅要意识到业绩,还要注意潜在的长期环境和健康风险。在这方面,建立了SOERE PRO观测站(“环境研究的观察和实验系统-有机产品”,这是法国国家环境研究联盟(AllEnvi)授予的一个标签,以承认高质量的研究基础设施,其翻译为“环境研究的长期观察和实验系统-有机废物”),包括在桑卡尔卡姆的试验。在收集这些数据的塞内加尔达喀尔地区。自2016年以来,四种肥料——一种矿物(合成)和三种有机——每年连续施用于三种蔬菜作物(番茄、生菜、胡萝卜)。该数据集目前涵盖2016 - 2025年期间,数据收集正在进行中,未来将添加新数据。每个作物定期进行人工除草和锄地,试验中不使用农药进行作物保护。一个全面的、多变量的数据集被一致地记录下来,包括每年在三个深度测量的土壤物理化学参数、有机废物特性、作物产量和质量参数,以及详细的管理活动,使其特别适合基于过程的建模和长期影响评估。该数据集的独创性在于其持续时间长,有机和矿物施肥策略的多样性,每年包括多种蔬菜作物,以及其在萨赫勒以南条件下的位置,这是一个长期农艺数据集仍然稀缺的背景。所有土壤、土壤磷和蔬菜样本都储存在达喀尔的一个样本库中,供进一步分析使用。该数据集的目的是提供关于萨赫勒以南市场园艺系统中作物生产力、作物质量和土壤对重复施用有机和矿物肥料的反应的长期综合信息。该数据集可通过Dataverse存储库公开提供,供元分析、基于过程的建模和环境研究免费(重复)使用,特别是用于提高对萨赫勒以南农业生态系统中养分循环、污染物动态、土壤生物多样性和长期土壤功能的理解,并支持南方国家在未来气候变化下的可持续土地管理和粮食安全。
{"title":"Soil and crop data from a long-term organic fertilization trial in Sub-Sahelian market gardening","authors":"Marie-Liesse Vermeire ,&nbsp;Pathé Basse ,&nbsp;Samuel Legros ,&nbsp;Falilou Diallo ,&nbsp;Anne Desnues ,&nbsp;Frédéric Feder","doi":"10.1016/j.dib.2026.112456","DOIUrl":"10.1016/j.dib.2026.112456","url":null,"abstract":"<div><div>Recycling the growing stock of organic waste products (OWP) from cities, factories, and farms is a key challenge for sustainable agriculture. However, it must be done with awareness of performances but also potential long-term environmental and health risks. In this context, the SOERE PRO observatory was established (\"Systèmes d'Observation et d'Expérimentation pour la Recherche en Environnement - Produits Résiduaires Organiques'', a label granted by the French National Research Alliance for the Environment (AllEnvi) to recognize high-quality research infrastructures, which translates to \"Long-term Observation and Experimentation Systems for Environmental Research - Organic Waste Products''), including the trial in Sangalkam, in the Dakar region of Senegal, where these data are collected. Since 2016, four fertilizer types - one mineral (synthetic) and three organic - have been applied annually to three successive vegetable crops (tomato, lettuce, carrot). The dataset currently covers the period 2016 - 2025, with data collection ongoing and new data to be added in the future. Manual weeding and hoeing is carried out regularly for each crop, no pesticides are used for crop protection on the trial. A comprehensive, multi-variable dataset is consistently documented, including soil physico-chemical parameters measured annually at three depths, organic waste product characterization, crop yield and quality parameters, and detailed management activities, making it particularly suitable for process-based modelling and long-term impact assessment. The originality of this dataset lies in its long duration, the diversity of organic and mineral fertilization strategies, the inclusion of multiple vegetable crops per year, and its location under Sub-Sahelian conditions, a context for which long-term agronomic datasets remain scarce. All soil, OWP and vegetables samples are stored in a sample bank in Dakar, and available for additional analyses. The objective of this dataset is to provide long-term, integrated information on crop productivity, crop quality, and soil responses to repeated organic and mineral fertilization in a Sub-Sahelian market-gardening system. The dataset is publicly available through a Dataverse repository for free (re)use in meta-analyses, process-based modelling, and environmental studies, notably to improve understanding of nutrient cycling, contaminant dynamics, soil biodiversity, and long-term soil functioning in Sub-Sahelian agroecosystems, and to support sustainable land management and food security in Southern countries under future climate change.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"65 ","pages":"Article 112456"},"PeriodicalIF":1.4,"publicationDate":"2026-01-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145976671","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Dataset of RGB-D images of object collections from multiple viewpoints with aligned high-resolution 3D models of objects 多视点物体集合的RGB-D图像数据集,具有对齐的高分辨率物体3D模型
IF 1.4 Q3 MULTIDISCIPLINARY SCIENCES Pub Date : 2026-01-08 DOI: 10.1016/j.dib.2026.112450
Xinchao Song , Mingjun Li , Sean Banerjee , Natasha Kholgade Banerjee
We present the HILO dataset consisting of high-resolution 3D scanned models for 253 common-use objects and 32,256 multi-viewpoint RGB-D images with typically low-resolution data for 144 tabletop scenes consisting of collections of random sets of 10 objects drawn from the set of 253 objects. The dataset provides the 6 degree of freedom (6DOF) pose for all objects found in each of the 32,256 RGB-D images, obtained by performing precise 3D alignment of the 3D models to the RGB-D images. The dataset also contains metadata on object mass, short text descriptor, binning into everyday use classes, and aspect ratio and function categories, intrinsic parameter information for RGB-D sensors used in capture, and transformations between camera poses. Object 3D models in the dataset were acquired by scanning using a tabletop 3D scanner, and were manually inspected, cleaned, repaired, and exported as original ultra high-resolution at ∼1M vertices and simplified high-resolution meshes at ∼10k vertices. To capture the multi-view RGB-D images, we established an in-house testbed consisting of a turntable and two robotic manipulators to respectively cover azimuth angles and elevation angles, and span a hemisphere. Images were captured using two Microsoft Azure Kinect sensors mounted at the wrists of the robot, one per robot. We captured images over two distances forming hemispherical shells. We used in-house software written in python to control the turntable movement, robot motion, and image capture, as well as to perform camera calibration, processing to generate registered images and foreground masks, manual precise alignment of object models to images, and post-capture correction of misalignments in camera transformation parameters. The dataset provides value in enabling training and evaluation of algorithms for several tasks in computer vision, artificial intelligence (AI), and robotics such as object completion, recognition, segmentation, high-resolution structure generation, robotic grasp planning, and recognition of human-preferred grasp locations for human-robot collaboration.
我们展示了HILO数据集,包括253个常用对象的高分辨率3D扫描模型和32,256个多视点RGB-D图像,以及144个桌面场景的典型低分辨率数据,这些场景由从253个对象集中抽取的10个对象的随机集合组成。该数据集通过对3D模型与RGB-D图像进行精确的3D对齐,为32,256张RGB-D图像中的所有物体提供了6个自由度(6DOF)的姿态。该数据集还包含关于物体质量的元数据,短文本描述符,分成日常使用类,宽高比和功能类别,用于捕获的RGB-D传感器的内在参数信息,以及相机姿势之间的转换。数据集中的对象3D模型是通过使用桌面3D扫描仪扫描获得的,然后进行人工检查、清洗、修复,并在~ 1M顶点处导出为原始的超高分辨率网格,在~ 10k顶点处导出为简化的高分辨率网格。为了捕获多视角RGB-D图像,我们建立了一个内部测试平台,该平台由一个转台和两个机器人操作台组成,分别覆盖方位角和仰角,并跨越一个半球。图像是通过安装在机器人手腕上的两个微软Azure Kinect传感器捕获的,每个机器人一个。我们在两个距离上拍摄了形成半球形壳的图像。我们使用python编写的内部软件来控制转台运动,机器人运动和图像捕获,以及执行相机校准,处理以生成配准图像和前景蒙版,手动精确对齐对象模型到图像,以及捕获后相机变换参数的不校准校正。该数据集为计算机视觉,人工智能(AI)和机器人技术中的几个任务的算法训练和评估提供了价值,例如对象补全,识别,分割,高分辨率结构生成,机器人抓取规划以及识别人类首选的抓取位置以进行人机协作。
{"title":"Dataset of RGB-D images of object collections from multiple viewpoints with aligned high-resolution 3D models of objects","authors":"Xinchao Song ,&nbsp;Mingjun Li ,&nbsp;Sean Banerjee ,&nbsp;Natasha Kholgade Banerjee","doi":"10.1016/j.dib.2026.112450","DOIUrl":"10.1016/j.dib.2026.112450","url":null,"abstract":"<div><div>We present the HILO dataset consisting of high-resolution 3D scanned models for 253 common-use objects and 32,256 multi-viewpoint RGB-D images with typically low-resolution data for 144 tabletop scenes consisting of collections of random sets of 10 objects drawn from the set of 253 objects. The dataset provides the 6 degree of freedom (6DOF) pose for all objects found in each of the 32,256 RGB-D images, obtained by performing precise 3D alignment of the 3D models to the RGB-D images. The dataset also contains metadata on object mass, short text descriptor, binning into everyday use classes, and aspect ratio and function categories, intrinsic parameter information for RGB-D sensors used in capture, and transformations between camera poses. Object 3D models in the dataset were acquired by scanning using a tabletop 3D scanner, and were manually inspected, cleaned, repaired, and exported as original ultra high-resolution at ∼1M vertices and simplified high-resolution meshes at ∼10k vertices. To capture the multi-view RGB-D images, we established an in-house testbed consisting of a turntable and two robotic manipulators to respectively cover azimuth angles and elevation angles, and span a hemisphere. Images were captured using two Microsoft Azure Kinect sensors mounted at the wrists of the robot, one per robot. We captured images over two distances forming hemispherical shells. We used in-house software written in python to control the turntable movement, robot motion, and image capture, as well as to perform camera calibration, processing to generate registered images and foreground masks, manual precise alignment of object models to images, and post-capture correction of misalignments in camera transformation parameters. The dataset provides value in enabling training and evaluation of algorithms for several tasks in computer vision, artificial intelligence (AI), and robotics such as object completion, recognition, segmentation, high-resolution structure generation, robotic grasp planning, and recognition of human-preferred grasp locations for human-robot collaboration.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"65 ","pages":"Article 112450"},"PeriodicalIF":1.4,"publicationDate":"2026-01-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145976672","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Effects of raw and thermally processed spent coffee grounds on Miscanthus × giganteus plantation: Data description 生咖啡渣和热处理咖啡渣对芒草种植园的影响:数据描述
IF 1.4 Q3 MULTIDISCIPLINARY SCIENCES Pub Date : 2026-01-07 DOI: 10.1016/j.dib.2025.112432
Nicole Nawrot , Jacek Kluska
Cultivating Miscanthus × giganteus (M × g) energy crop on marginal soil supports phytoattenuation and provides high-energy biomass for biofuel production. Improving nutrient-poor soil with low-cost recovered organic amendments, such as spent coffee grounds (SCG) and SCG-derived biochar (BC) offers sustainable benefits. This data article presents the findings from a medium-term greenhouse experiment at the Gdansk University of Technology assessing M × g cultivation on marginal soil with SCG and BC amendments into soil. In a pot-scale experiment the medium term-effect on M × g biomass growth, photosynthesis parameters, root tissues development, as well as final elemental composition was examined. Soil pH and elemental composition were also determined. As global coffee consumption increases, large quantities of SCG are generated and often landfilled. Their beneficial reuse aligns with circular economy principles and Sustainable Development Goals (SDGs 7 and 13), providing both a short-term nutrient source and a means of improving soil quality and resilience. The article compiles five datasets detailing: (1) M × g growth parameters, tissue development, and photosynthetic indices, (2) nutrient and caffeine leaching behaviour; and (3) elemental composition of plants and soils following exposure. These datasets, available in the Bridge of Knowledge Gdansk University of Technology repository, provide a resource for environmental researchers, soil and plant scientists, biochar specialists, and decisionmakers working to restore marginal soil usability. This study promotes sustainable land management by demonstrating how organic wastes and biochar can be combined to improve crop performance, sequester carbon, and reduce nutrient losses while minimizing external fertilizer inputs.
在边缘土壤上种植芒草(M × g)能源作物支持植物衰减,并为生物燃料生产提供高能生物质。利用低成本的回收有机改良剂,如废咖啡渣(SCG)和源自SCG的生物炭(BC)改善营养贫乏的土壤,提供了可持续的效益。这篇数据文章介绍了格但斯克工业大学中期温室试验的结果,该试验评估了在边缘土壤上使用SCG和BC改良的M × g种植。在盆栽试验中,考察了中期对M × g生物量生长、光合参数、根组织发育和最终元素组成的影响。测定了土壤pH值和元素组成。随着全球咖啡消费量的增加,产生了大量的SCG,并经常被填埋。它们的有益再利用符合循环经济原则和可持续发展目标(可持续发展目标7和13),既提供短期营养来源,又提供改善土壤质量和恢复力的手段。本文编制了五个数据集,详细说明:(1)M × g生长参数、组织发育和光合指标;(2)养分和咖啡因浸出行为;(3)暴露后植物和土壤的元素组成。这些数据集可在格但斯克科技大学知识库的知识之桥中获得,为环境研究人员、土壤和植物科学家、生物炭专家以及致力于恢复边际土壤可用性的决策者提供了资源。这项研究通过展示有机废物和生物炭如何结合起来提高作物性能、固碳和减少养分损失,同时最大限度地减少外部肥料投入,促进了可持续土地管理。
{"title":"Effects of raw and thermally processed spent coffee grounds on Miscanthus × giganteus plantation: Data description","authors":"Nicole Nawrot ,&nbsp;Jacek Kluska","doi":"10.1016/j.dib.2025.112432","DOIUrl":"10.1016/j.dib.2025.112432","url":null,"abstract":"<div><div>Cultivating <em>Miscanthus × giganteus</em> (<em>M</em> <em>×</em> <em>g</em>) energy crop on marginal soil supports phytoattenuation and provides high-energy biomass for biofuel production. Improving nutrient-poor soil with low-cost recovered organic amendments, such as spent coffee grounds (SCG) and SCG-derived biochar (BC) offers sustainable benefits. This data article presents the findings from a medium-term greenhouse experiment at the Gdansk University of Technology assessing <em>M</em> <em>×</em> <em>g</em> cultivation on marginal soil with SCG and BC amendments into soil. In a pot-scale experiment the medium term-effect on <em>M</em> <em>×</em> <em>g</em> biomass growth, photosynthesis parameters, root tissues development, as well as final elemental composition was examined. Soil pH and elemental composition were also determined. As global coffee consumption increases, large quantities of SCG are generated and often landfilled. Their beneficial reuse aligns with circular economy principles and Sustainable Development Goals (SDGs 7 and 13), providing both a short-term nutrient source and a means of improving soil quality and resilience. The article compiles five datasets detailing: (1) <em>M</em> <em>×</em> <em>g</em> growth parameters, tissue development, and photosynthetic indices, (2) nutrient and caffeine leaching behaviour; and (3) elemental composition of plants and soils following exposure. These datasets, available in the Bridge of Knowledge Gdansk University of Technology repository, provide a resource for environmental researchers, soil and plant scientists, biochar specialists, and decisionmakers working to restore marginal soil usability. This study promotes sustainable land management by demonstrating how organic wastes and biochar can be combined to improve crop performance, sequester carbon, and reduce nutrient losses while minimizing external fertilizer inputs.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"64 ","pages":"Article 112432"},"PeriodicalIF":1.4,"publicationDate":"2026-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145973351","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Participatory and multi-disciplinary science dataset and surveys for the assessment of the microbiological and behavioural factors influencing fresh fruits and vegetables' waste at home 参与和多学科的科学数据集和调查,以评估影响家中新鲜水果和蔬菜浪费的微生物和行为因素
IF 1.4 Q3 MULTIDISCIPLINARY SCIENCES Pub Date : 2026-01-07 DOI: 10.1016/j.dib.2025.112434
Camille Marchal , Damien Ballan , Sarra Azib , Morgane Innocent , Bertrand Urien , Annick Tamaro , Marine Le Gall-Ely , Emmanuel Coton , Adeline Picot , Jérôme Mounier , Louis Coroller , Patrick Gabriel
Fresh fruits and vegetables (FFV) represent the largest part of food waste at the consumer level. This waste directly results from FFV physiological and microbiological spoilage, itself intricately linked to behavioural factors such as consumer practices, including purchase, storage and hygiene practices, but also consumers’ perceptions towards spoilage. Based on a dual approach combining microbiological and behavioural sciences, we examined the link between FFV waste produced by 49 volunteering French households, measured using connected bins, the microbial ecology of their storage compartments, using culture-dependent and -independent approaches, and their consumer behaviour, cleaning and storage practices, through in-depth interviews and a dedicated survey. An exploratory qualitative survey carried out on 17 individuals followed by two quantitative data collections on 1048 and 815 representative French consumers enabled us to identify anti-FFV waste practices and to cluster consumers according to their anti-FFV waste behaviours. Spoilage dynamics of commonly consumed FFV, according to storage temperature, microbial contamination level and the presence or absence of surface wounds, were also performed in controlled conditions. This citizen-science-based dataset covers a wide array of microbiological and behavioural factors related to domestic FFV waste, as well as real measurements of waste volumes thanks to the innovative use of connected bins. Altogether, this data could provide interesting insights into more effective and accessible guidelines for FFV waste reduction at the consumer level, and thus to a potential reduction of global food waste and its related costs.
在消费者层面,新鲜水果和蔬菜(FFV)是食物浪费的最大部分。这种浪费直接来自FFV生理和微生物腐败,其本身与消费者行为(包括购买、储存和卫生习惯)以及消费者对腐败的看法等行为因素密切相关。基于微生物学和行为科学相结合的双重方法,我们通过深入访谈和专门调查,研究了49个法国志愿家庭产生的FFV废物之间的联系,使用连接的垃圾箱进行测量,他们的储存隔间的微生物生态,使用培养依赖和独立的方法,以及他们的消费者行为,清洁和储存实践。我们对17个人进行了探索性定性调查,随后对1048名和815名具有代表性的法国消费者进行了两次定量数据收集,使我们能够确定反ffv浪费行为,并根据消费者的反ffv浪费行为对消费者进行分类。在受控条件下,根据储存温度、微生物污染水平和有无表面伤口,研究了通常消耗的FFV的腐败动力学。这个以公民科学为基础的数据集涵盖了与家庭FFV废物有关的一系列微生物和行为因素,以及由于创新使用连接垃圾箱而对废物量的实际测量。总的来说,这些数据可以为在消费者层面上更有效和更容易获得的FFV废物减少准则提供有趣的见解,从而可能减少全球食物浪费及其相关成本。
{"title":"Participatory and multi-disciplinary science dataset and surveys for the assessment of the microbiological and behavioural factors influencing fresh fruits and vegetables' waste at home","authors":"Camille Marchal ,&nbsp;Damien Ballan ,&nbsp;Sarra Azib ,&nbsp;Morgane Innocent ,&nbsp;Bertrand Urien ,&nbsp;Annick Tamaro ,&nbsp;Marine Le Gall-Ely ,&nbsp;Emmanuel Coton ,&nbsp;Adeline Picot ,&nbsp;Jérôme Mounier ,&nbsp;Louis Coroller ,&nbsp;Patrick Gabriel","doi":"10.1016/j.dib.2025.112434","DOIUrl":"10.1016/j.dib.2025.112434","url":null,"abstract":"<div><div>Fresh fruits and vegetables (FFV) represent the largest part of food waste at the consumer level. This waste directly results from FFV physiological and microbiological spoilage, itself intricately linked to behavioural factors such as consumer practices, including purchase, storage and hygiene practices, but also consumers’ perceptions towards spoilage. Based on a dual approach combining microbiological and behavioural sciences, we examined the link between FFV waste produced by 49 volunteering French households, measured using connected bins, the microbial ecology of their storage compartments, using culture-dependent and -independent approaches, and their consumer behaviour, cleaning and storage practices, through in-depth interviews and a dedicated survey. An exploratory qualitative survey carried out on 17 individuals followed by two quantitative data collections on 1048 and 815 representative French consumers enabled us to identify anti-FFV waste practices and to cluster consumers according to their anti-FFV waste behaviours. Spoilage dynamics of commonly consumed FFV, according to storage temperature, microbial contamination level and the presence or absence of surface wounds, were also performed in controlled conditions. This citizen-science-based dataset covers a wide array of microbiological and behavioural factors related to domestic FFV waste, as well as real measurements of waste volumes thanks to the innovative use of connected bins. Altogether, this data could provide interesting insights into more effective and accessible guidelines for FFV waste reduction at the consumer level, and thus to a potential reduction of global food waste and its related costs.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"65 ","pages":"Article 112434"},"PeriodicalIF":1.4,"publicationDate":"2026-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145976666","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Data in Brief
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1