首页 > 最新文献

Data in Brief最新文献

英文 中文
Dataset of high-speed camera measurements from impact-tested reinforced concrete beams 高速相机测量数据集从冲击测试的钢筋混凝土梁
IF 1.4 Q3 MULTIDISCIPLINARY SCIENCES Pub Date : 2026-01-21 DOI: 10.1016/j.dib.2026.112487
Viktor Peterson
Impact-loaded reinforced concrete beams often fail in shear. This becomes relevant for shelter design against ballistics or fragment impact, for instance. An experimental campaign was conducted to study the different types of shear failure and governing parameters. Eighteen reinforced concrete beams were tested by a 70 kg steel striker dropped from a 2.4 m height. The beams were loaded at different positions from the support with different amounts of transverse reinforcement. The beams were of reduced scale with a length of 0.80 m and a square 0.15 m × 0.15 m cross-section. The drop weight tests were monitored with shock accelerometers on the striker and beam centre, load cells under the supports measuring reaction forces, and a high-speed camera (HSC). High-speed camera measurements were recorded orthogonal to the surface with the aim of performing high-quality digital image correlation (DIC) analyses. The beams and striker were painted with a speckled pattern prior to testing for the DIC analyses. Camera recordings were conducted with a 1024 × 512 px resolution and 6 kHz sampling, resulting in a time resolution of about 0.17 ms. Accelerometer and load cell measurements were sampled at 19.2 kHz. The accelerometer on the striker was used to approximate the impact force, and beam acceleration can be used to synchronize the camera and DAQ recordings. The data may be used to calibrate finite element models, study the impact response of beams, or develop new mechanical models.
受冲击荷载的钢筋混凝土梁常因剪切而失效。例如,这与抵御弹道或碎片冲击的掩体设计有关。对不同类型的剪切破坏和控制参数进行了试验研究。18根钢筋混凝土梁被一个70公斤的钢锤从2.4米的高度落下测试。梁被加载在不同位置的支持与不同数量的横向钢筋。梁的尺寸缩小,长度为0.80 m,截面为0.15 m × 0.15 m。跌落重量测试的监测采用了冲击加速度计,在打击器和梁中心,在支撑下测量反作用力的称重传感器,以及高速摄像机(HSC)。高速相机测量数据与表面正交记录,目的是进行高质量的数字图像相关(DIC)分析。在进行DIC分析测试之前,梁和前锋被涂上了斑点图案。摄像机记录以1024 × 512像素分辨率和6 kHz采样进行,时间分辨率约为0.17 ms。加速度计和称重传感器测量以19.2 kHz采样。冲锋器上的加速度计用于近似冲击力,光束加速度可用于同步摄像机和DAQ记录。这些数据可用于校正有限元模型,研究梁的冲击响应,或开发新的力学模型。
{"title":"Dataset of high-speed camera measurements from impact-tested reinforced concrete beams","authors":"Viktor Peterson","doi":"10.1016/j.dib.2026.112487","DOIUrl":"10.1016/j.dib.2026.112487","url":null,"abstract":"<div><div>Impact-loaded reinforced concrete beams often fail in shear. This becomes relevant for shelter design against ballistics or fragment impact, for instance. An experimental campaign was conducted to study the different types of shear failure and governing parameters. Eighteen reinforced concrete beams were tested by a 70 kg steel striker dropped from a 2.4 m height. The beams were loaded at different positions from the support with different amounts of transverse reinforcement. The beams were of reduced scale with a length of 0.80 m and a square 0.15 m × 0.15 m cross-section. The drop weight tests were monitored with shock accelerometers on the striker and beam centre, load cells under the supports measuring reaction forces, and a high-speed camera (HSC). High-speed camera measurements were recorded orthogonal to the surface with the aim of performing high-quality digital image correlation (DIC) analyses. The beams and striker were painted with a speckled pattern prior to testing for the DIC analyses. Camera recordings were conducted with a 1024 × 512 px resolution and 6 kHz sampling, resulting in a time resolution of about 0.17 ms. Accelerometer and load cell measurements were sampled at 19.2 kHz. The accelerometer on the striker was used to approximate the impact force, and beam acceleration can be used to synchronize the camera and DAQ recordings. The data may be used to calibrate finite element models, study the impact response of beams, or develop new mechanical models.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"65 ","pages":"Article 112487"},"PeriodicalIF":1.4,"publicationDate":"2026-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146075164","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
COLLECTiEF dataset: A high-resolution indoor environmental dataset from European buildings across diverse climates supporting thermal, air-quality, and visual-comfort assessments COLLECTiEF数据集:来自不同气候条件下的欧洲建筑的高分辨率室内环境数据集,支持热、空气质量和视觉舒适度评估
IF 1.4 Q3 MULTIDISCIPLINARY SCIENCES Pub Date : 2026-01-21 DOI: 10.1016/j.dib.2026.112486
Italo Aldo Campodonico-Avendano , Silvia Erba , Panayiotis Papadopoulos , Salvatore Carlucci , Antonio Luparelli , Amedeo Ingrosso , Greta Tresoldi , Muhammad Salman Shahid , Frederic Wurtz , Benoit Delinchant , Per Martin Leinan , Stefano Cera , Peter Riederer , Runar Solli , Amin Moazami , Mohammadreza Aghaei
Indoor Environmental Quality directly affects public health, productivity, and well-being, while also playing a vital role in developing climate-neutral, energy-efficient, and resilient buildings. This paper presents a comprehensive dataset of indoor environmental parameters that affect thermal comfort, indoor air quality, and visual comfort, which was created under the European Union’s Horizon 2020 Project Collective Intelligence for Energy Flexibility. The dataset comprises high-resolution measurements of carbon dioxide, pollutants, volatile organic compounds, air temperature, relative humidity, and illuminance on a horizontal plane, collected over a two-year period at 1-minute intervals. Data were gathered from 14 pilot buildings across four European climates: Cyprus, France, Italy, and Norway, covering diverse building types such as schools, medical centres, sports arenas, residential complexes, universities, and elder care facilities, representing about 40 % of common European building categories. Sensors were installed in specific thermal zones within each building to monitor environmental conditions. All data is organized by building and zone and supplemented with standardized Brick metadata to ensure interoperability. This comprehensive dataset, with its broad geographic coverage, variety of building types, long-term high-frequency measurements, and multimodal data, provides a valuable resource for comparative IEQ research, cross-domain modelling, and integrated assessments of comfort, ventilation, and daylighting across different climates and operational settings and is available upon request under a non-disclosure agreement provided by the consortium.
室内环境质量直接影响到公众健康、生产力和福祉,同时在发展气候中性、节能和抗灾建筑方面也发挥着至关重要的作用。本文介绍了影响热舒适、室内空气质量和视觉舒适的室内环境参数的综合数据集,该数据集是在欧盟的地平线2020项目集体智能能源灵活性下创建的。该数据集包括二氧化碳、污染物、挥发性有机化合物、空气温度、相对湿度和水平面照度的高分辨率测量数据,收集时间为两年,间隔1分钟。数据来自欧洲四个气候地区的14座试点建筑:塞浦路斯、法国、意大利和挪威,涵盖了不同的建筑类型,如学校、医疗中心、运动场、住宅综合体、大学和老年人护理设施,约占欧洲常见建筑类别的40%。传感器安装在每栋建筑的特定热区,以监测环境状况。所有数据由建筑和区域组织,并辅以标准化的Brick元数据以确保互操作性。该综合数据集具有广泛的地理覆盖范围、各种建筑类型、长期高频测量和多模式数据,为比较IEQ研究、跨领域建模以及不同气候和操作设置下舒适性、通风和采光的综合评估提供了宝贵的资源,并可根据财团提供的保密协议要求提供。
{"title":"COLLECTiEF dataset: A high-resolution indoor environmental dataset from European buildings across diverse climates supporting thermal, air-quality, and visual-comfort assessments","authors":"Italo Aldo Campodonico-Avendano ,&nbsp;Silvia Erba ,&nbsp;Panayiotis Papadopoulos ,&nbsp;Salvatore Carlucci ,&nbsp;Antonio Luparelli ,&nbsp;Amedeo Ingrosso ,&nbsp;Greta Tresoldi ,&nbsp;Muhammad Salman Shahid ,&nbsp;Frederic Wurtz ,&nbsp;Benoit Delinchant ,&nbsp;Per Martin Leinan ,&nbsp;Stefano Cera ,&nbsp;Peter Riederer ,&nbsp;Runar Solli ,&nbsp;Amin Moazami ,&nbsp;Mohammadreza Aghaei","doi":"10.1016/j.dib.2026.112486","DOIUrl":"10.1016/j.dib.2026.112486","url":null,"abstract":"<div><div>Indoor Environmental Quality directly affects public health, productivity, and well-being, while also playing a vital role in developing climate-neutral, energy-efficient, and resilient buildings. This paper presents a comprehensive dataset of indoor environmental parameters that affect thermal comfort, indoor air quality, and visual comfort, which was created under the European Union’s Horizon 2020 Project <em>Collective Intelligence for Energy Flexibility</em>. The dataset comprises high-resolution measurements of carbon dioxide, pollutants, volatile organic compounds, air temperature, relative humidity, and illuminance on a horizontal plane, collected over a two-year period at 1-minute intervals. Data were gathered from 14 pilot buildings across four European climates: Cyprus, France, Italy, and Norway, covering diverse building types such as schools, medical centres, sports arenas, residential complexes, universities, and elder care facilities, representing about 40 % of common European building categories. Sensors were installed in specific thermal zones within each building to monitor environmental conditions. All data is organized by building and zone and supplemented with standardized Brick metadata to ensure interoperability. This comprehensive dataset, with its broad geographic coverage, variety of building types, long-term high-frequency measurements, and multimodal data, provides a valuable resource for comparative IEQ research, cross-domain modelling, and integrated assessments of comfort, ventilation, and daylighting across different climates and operational settings and is available upon request under a non-disclosure agreement provided by the consortium.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"65 ","pages":"Article 112486"},"PeriodicalIF":1.4,"publicationDate":"2026-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146075166","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Generated cultural heritage question–answer dataset: Durga in multi-dimensional perspectives 生成的文化遗产问答数据集:多维视角下的杜尔加
IF 1.4 Q3 MULTIDISCIPLINARY SCIENCES Pub Date : 2026-01-20 DOI: 10.1016/j.dib.2026.112495
Tri Lathif Mardi Suryanto , Aji Prasetya Wibawa , Hariyono , Andrew Nafalski , Gulsun Kurubacak Çakır
This dataset presents a valuable compilation of question–answer (QA) pairs derived from cultural texts and sources related to Durga mythology. A total of 21,395 QA pairs, encompassing textual materials such as scriptures, ritual narratives, temple inscriptions, and traditional storytelling records. Each entry includes the source reference, question, and corresponding answer, provided in a structured format compatible with Excel for seamless integration into downstream natural language processing (NLP) tasks. Data collection involved manual curation and annotation by domain experts, followed by preprocessing steps including text normalization, duplication removal, and verification of factual and contextual accuracy. The dataset is designed to support generative QA models, culturally aware chatbots, and digital preservation of heritage knowledge. It is particularly valuable for research in AI-driven cultural applications, educational tools, and digital humanities initiatives aiming to bridge traditional knowledge with computational methods. Researchers and practitioners may utilize the dataset for training generative models, creating interactive educational platforms, developing culturally sensitive AI agents, and supporting comparative studies in cross-cultural heritage. This openly accessible resource adheres to ethical standards, with proper attribution to source materials, and provides a foundational asset for both academic research and applied development in culturally informed artificial intelligence.
这个数据集呈现了一个有价值的问答(QA)对的汇编,这些问答来自与杜尔加神话有关的文化文本和来源。共有21395对QA,包括经文、仪式叙述、寺庙铭文和传统故事记录等文本材料。每个条目包括源参考、问题和相应的答案,以与Excel兼容的结构化格式提供,以便无缝集成到下游自然语言处理(NLP)任务中。数据收集涉及领域专家的手动管理和注释,然后是预处理步骤,包括文本规范化、重复删除以及事实和上下文准确性的验证。该数据集旨在支持生成式QA模型、具有文化意识的聊天机器人和遗产知识的数字化保存。它对于人工智能驱动的文化应用、教育工具和旨在将传统知识与计算方法联系起来的数字人文倡议的研究尤其有价值。研究人员和从业者可以利用该数据集来训练生成模型,创建交互式教育平台,开发具有文化敏感性的人工智能代理,并支持跨文化遗产的比较研究。这种可公开获取的资源符合道德标准,并适当注明源材料的出处,为人工智能的学术研究和应用开发提供了基础资产。
{"title":"Generated cultural heritage question–answer dataset: Durga in multi-dimensional perspectives","authors":"Tri Lathif Mardi Suryanto ,&nbsp;Aji Prasetya Wibawa ,&nbsp;Hariyono ,&nbsp;Andrew Nafalski ,&nbsp;Gulsun Kurubacak Çakır","doi":"10.1016/j.dib.2026.112495","DOIUrl":"10.1016/j.dib.2026.112495","url":null,"abstract":"<div><div>This dataset presents a valuable compilation of question–answer (QA) pairs derived from cultural texts and sources related to Durga mythology. A total of 21,395 QA pairs, encompassing textual materials such as scriptures, ritual narratives, temple inscriptions, and traditional storytelling records. Each entry includes the source reference, question, and corresponding answer, provided in a structured format compatible with Excel for seamless integration into downstream natural language processing (NLP) tasks. Data collection involved manual curation and annotation by domain experts, followed by preprocessing steps including text normalization, duplication removal, and verification of factual and contextual accuracy. The dataset is designed to support generative QA models, culturally aware chatbots, and digital preservation of heritage knowledge. It is particularly valuable for research in AI-driven cultural applications, educational tools, and digital humanities initiatives aiming to bridge traditional knowledge with computational methods. Researchers and practitioners may utilize the dataset for training generative models, creating interactive educational platforms, developing culturally sensitive AI agents, and supporting comparative studies in cross-cultural heritage. This openly accessible resource adheres to ethical standards, with proper attribution to source materials, and provides a foundational asset for both academic research and applied development in culturally informed artificial intelligence.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"65 ","pages":"Article 112495"},"PeriodicalIF":1.4,"publicationDate":"2026-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146075156","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Experimental dataset of the reverse water-gas shift reaction in a fixed-bed reactor setup under varying reactor conditions 不同反应器条件下固定床反应器中逆水气转换反应的实验数据集
IF 1.4 Q3 MULTIDISCIPLINARY SCIENCES Pub Date : 2026-01-20 DOI: 10.1016/j.dib.2026.112485
Enzo Komatz , Marion Andritz , Christoph Markowitsch
This data article presents a dataset of miniplant-scale reverse water-gas shift (rWGS) experiments conducted in a heated fixed-bed reactor under systematically varied operating conditions. The dataset contains processed measurements including reactor temperature, molar fractions of CO2, CO, H2, CH4, and derived quantities such as CO2 conversion and CO selectivity. The experiments cover a wide parameter space, including gas hourly space velocities of 8000, 14,000 and 20,000 h-1 with temperatures between 550 and 950 °C (increment of 50 K), and H2:CO2 feed ratios of 2:1, 2.5:1 and 3:1.
The dataset presents the steady-state values and links to the reproductible data processing step, based on a prior study, enabling Fairness of all steps from the initial measurements to the final processed variables. The processing workflow includes calibration of gas analysis signals, smoothing, dry-gas calculation, and uncertainty estimation.
These data provide value for validating mechanistic kinetic models, benchmarking computational fluid dynamics (CFD) reactor simulations, training machine learning models including physics-informed machine learning frameworks, and supporting thermodynamic model assessments. All raw and processed data are made publicly available in a long-term repository, ensuring FAIR access and enabling reuse by the scientific community.
本文介绍了在加热固定床反应器中在系统变化的操作条件下进行的小型工厂规模逆水气转换(rWGS)实验数据集。该数据集包含处理过的测量数据,包括反应器温度,CO2, CO, H2, CH4的摩尔分数,以及CO2转化率和CO选择性等衍生量。实验参数范围广,气体每小时空速为8000、14000和20000 h-1,温度为550 ~ 950℃(增量50 K), H2:CO2进料比为2:1、2.5:1和3:1。该数据集基于先前的研究,呈现了稳态值并链接到可重复的数据处理步骤,从而实现了从初始测量到最终处理变量的所有步骤的公平性。处理流程包括气体分析信号的校准、平滑、干气计算和不确定度估计。这些数据为验证机械动力学模型、计算流体动力学(CFD)反应堆模拟基准、训练机器学习模型(包括物理知识的机器学习框架)以及支持热力学模型评估提供了价值。所有原始和处理过的数据都在一个长期存储库中公开提供,确保公平访问并使科学界能够重复使用。
{"title":"Experimental dataset of the reverse water-gas shift reaction in a fixed-bed reactor setup under varying reactor conditions","authors":"Enzo Komatz ,&nbsp;Marion Andritz ,&nbsp;Christoph Markowitsch","doi":"10.1016/j.dib.2026.112485","DOIUrl":"10.1016/j.dib.2026.112485","url":null,"abstract":"<div><div>This data article presents a dataset of miniplant-scale reverse water-gas shift (rWGS) experiments conducted in a heated fixed-bed reactor under systematically varied operating conditions. The dataset contains processed measurements including reactor temperature, molar fractions of CO<sub>2</sub>, CO, H<sub>2</sub>, CH<sub>4</sub>, and derived quantities such as CO<sub>2</sub> conversion and CO selectivity. The experiments cover a wide parameter space, including gas hourly space velocities of 8000, 14,000 and 20,000 h<sup>-1</sup> with temperatures between 550 and 950 °C (increment of 50 K), and H<sub>2</sub>:CO<sub>2</sub> feed ratios of 2:1, 2.5:1 and 3:1.</div><div>The dataset presents the steady-state values and links to the reproductible data processing step, based on a prior study, enabling Fairness of all steps from the initial measurements to the final processed variables. The processing workflow includes calibration of gas analysis signals, smoothing, dry-gas calculation, and uncertainty estimation.</div><div>These data provide value for validating mechanistic kinetic models, benchmarking computational fluid dynamics (CFD) reactor simulations, training machine learning models including physics-informed machine learning frameworks, and supporting thermodynamic model assessments. All raw and processed data are made publicly available in a long-term repository, ensuring FAIR access and enabling reuse by the scientific community.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"65 ","pages":"Article 112485"},"PeriodicalIF":1.4,"publicationDate":"2026-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146075165","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A 3D point cloud dataset of Jining Qing Goats for segmentation analysis and body size measurement 济宁青山羊三维点云数据集的分割分析与体型测量
IF 1.4 Q3 MULTIDISCIPLINARY SCIENCES Pub Date : 2026-01-20 DOI: 10.1016/j.dib.2026.112496
Kai Zhang, Qin Ma, Yichen Liu, Xiaochen Shi
The rapid advancement of intelligent livestock farming and precision breeding has underscored the importance of non-contact body measurement and weight estimation. Based on 3D reconstruction, these techniques represent a critical pathway for the digital transformation of animal husbandry. However, there are no publicly available 3D point cloud datasets specific to Jining Qing Goats, particularly systematic data covering key developmental stages. To bridge this gap, we present a comprehensive dataset comprising multi-view 3D point clouds and standardized morphometric records of Jining Qing Goats. The dataset spans multiple age groups and emphasizes critical phases such as juvenile, growing, mature, and reproductive stages, thereby capturing a holistic representation of the breed’s life cycle. During data acquisition, two Microsoft Kinect DK depth cameras were positioned bilaterally to capture RGB and depth images simultaneously under relatively static conditions. Multi-view point clouds were registered using the Iterative Closest Point (ICP) algorithm, with the floor plane serving as a unified reference to align all scans within a global coordinate system. In parallel, manual measurements of six key morphometric traits, including body length, withers height, shoulder width, abdominal width, heart girth and hip width, were collected as validation references. The dataset consists of raw RGB images, depth maps, point cloud files, camera calibration parameters, and manually annotated measurement records, all of which are openly accessible. This resource supports a wide range of computer vision tasks such as livestock 3D reconstruction, automated morphometric measurement, and weight estimation, thereby facilitating digital transformation, intelligent management, and sustainable development in modern livestock farming.
畜禽智能养殖和精准养殖的快速发展凸显了非接触体测量和体重估算的重要性。基于三维重建,这些技术代表了畜牧业数字化转型的关键途径。然而,目前还没有针对济宁青山羊的公开的三维点云数据集,特别是覆盖关键发育阶段的系统数据。为了弥补这一差距,我们提出了一个综合数据集,包括济宁青山羊的多视图三维点云和标准化形态测量记录。该数据集跨越多个年龄组,并强调关键阶段,如幼年、生长、成熟和繁殖阶段,从而捕捉到品种生命周期的整体表现。在数据采集过程中,两台Microsoft Kinect DK深度摄像头被放置在两侧,在相对静态的条件下同时捕获RGB和深度图像。使用迭代最近点(ICP)算法注册多视图点云,地板平面作为统一参考,在全局坐标系内对齐所有扫描。同时,收集6个关键形态特征的人工测量值,包括体长、肩高、肩宽、腹宽、胸围和臀宽,作为验证参考。该数据集由原始RGB图像、深度图、点云文件、相机校准参数和手动注释的测量记录组成,所有这些都是开放访问的。该资源支持广泛的计算机视觉任务,如牲畜3D重建,自动形态测量和重量估计,从而促进现代畜牧业的数字化转型,智能管理和可持续发展。
{"title":"A 3D point cloud dataset of Jining Qing Goats for segmentation analysis and body size measurement","authors":"Kai Zhang,&nbsp;Qin Ma,&nbsp;Yichen Liu,&nbsp;Xiaochen Shi","doi":"10.1016/j.dib.2026.112496","DOIUrl":"10.1016/j.dib.2026.112496","url":null,"abstract":"<div><div>The rapid advancement of intelligent livestock farming and precision breeding has underscored the importance of non-contact body measurement and weight estimation. Based on 3D reconstruction, these techniques represent a critical pathway for the digital transformation of animal husbandry. However, there are no publicly available 3D point cloud datasets specific to Jining Qing Goats, particularly systematic data covering key developmental stages. To bridge this gap, we present a comprehensive dataset comprising multi-view 3D point clouds and standardized morphometric records of Jining Qing Goats. The dataset spans multiple age groups and emphasizes critical phases such as juvenile, growing, mature, and reproductive stages, thereby capturing a holistic representation of the breed’s life cycle. During data acquisition, two Microsoft Kinect DK depth cameras were positioned bilaterally to capture RGB and depth images simultaneously under relatively static conditions. Multi-view point clouds were registered using the Iterative Closest Point (ICP) algorithm, with the floor plane serving as a unified reference to align all scans within a global coordinate system. In parallel, manual measurements of six key morphometric traits, including body length, withers height, shoulder width, abdominal width, heart girth and hip width, were collected as validation references. The dataset consists of raw RGB images, depth maps, point cloud files, camera calibration parameters, and manually annotated measurement records, all of which are openly accessible. This resource supports a wide range of computer vision tasks such as livestock 3D reconstruction, automated morphometric measurement, and weight estimation, thereby facilitating digital transformation, intelligent management, and sustainable development in modern livestock farming.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"65 ","pages":"Article 112496"},"PeriodicalIF":1.4,"publicationDate":"2026-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146185204","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A comprehensive image dataset of American Sign Language hand gestures 美国手语手势的综合图像数据集
IF 1.4 Q3 MULTIDISCIPLINARY SCIENCES Pub Date : 2026-01-20 DOI: 10.1016/j.dib.2026.112492
Md. Famidul Islam Pranto, Md. Rifatul Islam, Md. Ali Akbor, Nabonita Ghosh, Md. Rahatun Alam, Sudipto Chaki, Md. Masudul Islam
We present ASL-HG, a comprehensive American Sign Language (ASL) image dataset designed to advance gesture recognition and assistive technologies. The collection contains 36,000 static images across 36 classes, covering the full English alphabet (A–Z) and digits (0–9). Data were captured from 10 volunteers in Mirpur, Dhaka, Bangladesh, with each participant contributing 100 samples per class, ensuring a balanced distribution across subjects, genders, and skin tones. Unlike many existing ASL datasets, ASL-HG explicitly distinguishes between the letter “O” and the digit “0″ by including the standard two-handed ASL “zero” sign used in practical alphanumeric communication. The dataset is released in two complementary forms: raw images with natural indoor and outdoor backgrounds, and a MediaPipe-processed version with hand-segmented crops and predefined 80–20 train–test splits. This design supports both custom pre-processing and immediate model training. ASL-HG is intended to serve as a benchmark resource for developing robust and fair ASL recognition systems, reducing communication barriers for deaf and speech-impaired users, and enabling broader research in gesture-based human–computer interaction.
我们提出了ASL- hg,一个全面的美国手语(ASL)图像数据集,旨在推进手势识别和辅助技术。该收藏包含36个类别的36000张静态图片,涵盖了完整的英语字母表(A-Z)和数字(0-9)。数据来自孟加拉国达卡米尔普尔的10名志愿者,每个参与者每班贡献100个样本,以确保不同科目、性别和肤色的平衡分布。与许多现有的ASL数据集不同,ASL- hg通过包括实际字母数字通信中使用的标准双手ASL“0”符号,明确区分字母“O”和数字“0″”。该数据集以两种互补的形式发布:具有自然室内和室外背景的原始图像,以及具有手工分割作物和预定义的80-20训练测试分割的mediapie处理版本。该设计支持自定义预处理和即时模型训练。ASL- hg旨在作为开发稳健和公平的ASL识别系统的基准资源,减少聋人和语言障碍用户的沟通障碍,并促进基于手势的人机交互的更广泛研究。
{"title":"A comprehensive image dataset of American Sign Language hand gestures","authors":"Md. Famidul Islam Pranto,&nbsp;Md. Rifatul Islam,&nbsp;Md. Ali Akbor,&nbsp;Nabonita Ghosh,&nbsp;Md. Rahatun Alam,&nbsp;Sudipto Chaki,&nbsp;Md. Masudul Islam","doi":"10.1016/j.dib.2026.112492","DOIUrl":"10.1016/j.dib.2026.112492","url":null,"abstract":"<div><div>We present ASL-HG, a comprehensive American Sign Language (ASL) image dataset designed to advance gesture recognition and assistive technologies. The collection contains 36,000 static images across 36 classes, covering the full English alphabet (A–Z) and digits (0–9). Data were captured from 10 volunteers in Mirpur, Dhaka, Bangladesh, with each participant contributing 100 samples per class, ensuring a balanced distribution across subjects, genders, and skin tones. Unlike many existing ASL datasets, ASL-HG explicitly distinguishes between the letter “O” and the digit “0″ by including the standard two-handed ASL “zero” sign used in practical alphanumeric communication. The dataset is released in two complementary forms: raw images with natural indoor and outdoor backgrounds, and a MediaPipe-processed version with hand-segmented crops and predefined 80–20 train–test splits. This design supports both custom pre-processing and immediate model training. ASL-HG is intended to serve as a benchmark resource for developing robust and fair ASL recognition systems, reducing communication barriers for deaf and speech-impaired users, and enabling broader research in gesture-based human–computer interaction.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"65 ","pages":"Article 112492"},"PeriodicalIF":1.4,"publicationDate":"2026-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146075155","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
4D (space + time) datasets of spruce wood enzymatic hydrolysis 云杉木材酶解的4D(空间+时间)数据集
IF 1.4 Q3 MULTIDISCIPLINARY SCIENCES Pub Date : 2026-01-20 DOI: 10.1016/j.dib.2026.112489
Solmaz Hossein Khani, Maxime Corré, Khadidja Ould Amer, Noah Remy, Berangère Lebas, Anouck Habrant, Gabriel Paës, Yassin Refahi
The conversion of lignocellulosic biomass from plant cell walls into bioproducts can contribute to reducing dependence on fossil sources and achieving sustainable development. Biotechnological conversion of lignocellulosic biomass has several advantages over other conversion approaches such as thermochemical and chemical conversions. These advantages include improved efficiency and specificity for desired products, ecological compatibility and reduced toxicity. Enzymatic transformation is a key step in biotechnological conversion. To achieve a cost-effective conversion, a comprehensive understanding of cell wall enzymatic hydrolysis is required. Despite progress, the enzymatic hydrolysis at microscale is comparatively understudied and lacks comprehensive investigation. Addressing this gap requires collection of time-lapse image datasets of cell wall enzymatic hydrolysis which is a technically demanding task. Furthermore, accurate processing of the time-lapse images to identify and track individual cell walls is particularly challenging, notably because of the sample drift present in the images. Recently, an efficient image processing pipeline, called AIMTrack, has been developed which uses an enhanced divide-and-conquer strategy to divide time-lapse images into clusters whose sizes are dynamically adjusted to the deconstruction extent. The image registrations are then limited to clusters and the resulting transformations are combined to correct sample drift across time-lapse images. Subsequently AIMTrack provides segmentation of time-lapse images where voxels belonging to the same cell walls are labelled with a unique identifier. The time-lapse image datasets presented here consist of time-lapse images of spruce wood cell walls acquired during enzymatic hydrolysis using a cellulolytic enzyme cocktail at two enzyme loadings of 15 and 30 FPU/g biomass. Control time-lapse datasets which are acquired under the identical conditions, but without addition of enzymes, are also included. Both control and hydrolysis datasets are processed using AIMTrack to track the cell walls from time-lapse images. The generated segmentations are also provided.
将植物细胞壁中的木质纤维素生物质转化为生物产品有助于减少对化石资源的依赖并实现可持续发展。与热化学和化学转化等其他转化方法相比,木质纤维素生物质的生物技术转化具有若干优势。这些优点包括提高效率和对所需产品的特异性,生态相容性和降低毒性。酶转化是生物技术转化的关键步骤。为了实现经济有效的转化,需要对细胞壁酶解有全面的了解。尽管取得了进展,但在微观尺度上的酶解研究相对不足,缺乏全面的研究。解决这一差距需要收集细胞壁酶解的延时图像数据集,这是一项技术要求很高的任务。此外,准确处理延时图像以识别和跟踪单个细胞壁尤其具有挑战性,特别是因为图像中存在样品漂移。近年来,人们开发了一种高效的图像处理管道AIMTrack,该管道采用一种增强的分而治之策略,将延时图像分成簇,并根据解构程度动态调整簇的大小。然后将图像配准限制在簇中,并将产生的转换组合在一起以纠正跨延时图像的样本漂移。随后,AIMTrack提供了延时图像的分割,其中属于相同细胞壁的体素被标记为唯一标识符。本文提供的延时图像数据集包括使用纤维素水解酶混合物在15和30 FPU/g生物量的两种酶负荷下酶水解过程中获得的云杉木细胞壁的延时图像。还包括在相同条件下获得的控制性时移数据集,但没有添加酶。使用AIMTrack对控制和水解数据集进行处理,从延时图像中跟踪细胞壁。还提供了生成的分段。
{"title":"4D (space + time) datasets of spruce wood enzymatic hydrolysis","authors":"Solmaz Hossein Khani,&nbsp;Maxime Corré,&nbsp;Khadidja Ould Amer,&nbsp;Noah Remy,&nbsp;Berangère Lebas,&nbsp;Anouck Habrant,&nbsp;Gabriel Paës,&nbsp;Yassin Refahi","doi":"10.1016/j.dib.2026.112489","DOIUrl":"10.1016/j.dib.2026.112489","url":null,"abstract":"<div><div>The conversion of lignocellulosic biomass from plant cell walls into bioproducts can contribute to reducing dependence on fossil sources and achieving sustainable development. Biotechnological conversion of lignocellulosic biomass has several advantages over other conversion approaches such as thermochemical and chemical conversions. These advantages include improved efficiency and specificity for desired products, ecological compatibility and reduced toxicity. Enzymatic transformation is a key step in biotechnological conversion. To achieve a cost-effective conversion, a comprehensive understanding of cell wall enzymatic hydrolysis is required. Despite progress, the enzymatic hydrolysis at microscale is comparatively understudied and lacks comprehensive investigation. Addressing this gap requires collection of time-lapse image datasets of cell wall enzymatic hydrolysis which is a technically demanding task. Furthermore, accurate processing of the time-lapse images to identify and track individual cell walls is particularly challenging, notably because of the sample drift present in the images. Recently, an efficient image processing pipeline, called AIMTrack, has been developed which uses an enhanced divide-and-conquer strategy to divide time-lapse images into clusters whose sizes are dynamically adjusted to the deconstruction extent. The image registrations are then limited to clusters and the resulting transformations are combined to correct sample drift across time-lapse images. Subsequently AIMTrack provides segmentation of time-lapse images where voxels belonging to the same cell walls are labelled with a unique identifier. The time-lapse image datasets presented here consist of time-lapse images of spruce wood cell walls acquired during enzymatic hydrolysis using a cellulolytic enzyme cocktail at two enzyme loadings of 15 and 30 FPU/g biomass. Control time-lapse datasets which are acquired under the identical conditions, but without addition of enzymes, are also included. Both control and hydrolysis datasets are processed using AIMTrack to track the cell walls from time-lapse images. The generated segmentations are also provided.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"65 ","pages":"Article 112489"},"PeriodicalIF":1.4,"publicationDate":"2026-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146075233","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ConcreteCARB: A comprehensive image dataset of concrete carbonation for computer vision tasks ConcreteCARB:用于计算机视觉任务的混凝土碳化的综合图像数据集
IF 1.4 Q3 MULTIDISCIPLINARY SCIENCES Pub Date : 2026-01-20 DOI: 10.1016/j.dib.2026.112493
José A. Guzmán-Torres, Sandra del C. Arguello-Hernández, Francisco J. Domínguez-Mota, Gerardo Tinoco-Guerrero, Elia M. Alonso-Guzmán
The ConcreteCARB dataset provides a comprehensive repository of 903 high-resolution images of concrete surfaces evaluated using the phenolphthalein test for carbonation detection. This data was collected under controlled laboratory conditions and aims to support artificial intelligence applications in civil engineering, especially in structural health monitoring tasks. The images are systematically organized into two distinct classes: “Carbonated Samples” and “No Carbonation Presence,” enabling binary classification approaches. All samples were manually tested, split, and visually labelled by expert engineers to ensure reliable ground-truth classification, in accordance with standardized procedures. The dataset includes images of concrete prism elements fabricated with varying mix designs, incorporating different water-cement ratios and additives, such as industrial silica waste and natural admixtures derived from Opuntia ficus-indica. The specimens were subjected to natural atmospheric carbonation conditions for 180 days, and their carbonation fronts were revealed by phenolphthalein staining. The samples were then split manually with a chisel and hammer, and photographic documentation was performed with a Samsung SM-S901U1 smartphone using predefined settings to ensure consistency and quality across the dataset. ConcreteCARB is intended for researchers, engineers, and data scientists working on machine learning, deep learning, and computer vision solutions for concrete diagnostics. It provides valuable training and benchmarking data for the development of automated detection, classification, and segmentation models for carbonation damage assessment. Furthermore, the dataset can serve as a foundational tool for cross-comparative studies on the efficacy of AI techniques in materials degradation analysis. The openly accessible nature of the dataset through a public repository supports reproducibility and encourages the extension of AI applications in concrete durability and sustainability studies.
ConcreteCARB数据集提供了903张混凝土表面高分辨率图像的综合存储库,使用酚酞测试进行碳化检测。这些数据是在受控的实验室条件下收集的,旨在支持人工智能在土木工程中的应用,特别是在结构健康监测任务中。这些图像被系统地组织成两个不同的类别:“碳酸化样本”和“无碳酸化存在”,这使得二元分类方法成为可能。所有样品都由专家工程师手工测试、分离和视觉标记,以确保根据标准化程序进行可靠的真实分类。该数据集包括混凝土棱柱体的图像,这些棱柱体采用不同的混合设计,采用不同的水灰比和添加剂,如工业二氧化硅废料和源自无花果树的天然外加剂。将样品置于自然大气碳酸化条件下180 d,通过酚酞染色显示其碳酸化锋。然后用凿子和锤子手动分割样本,并使用三星SM-S901U1智能手机进行摄影记录,使用预定义设置确保数据集的一致性和质量。ConcreteCARB适用于研究机器学习、深度学习和混凝土诊断计算机视觉解决方案的研究人员、工程师和数据科学家。它为开发用于碳酸化损害评估的自动检测、分类和分割模型提供了有价值的训练和基准数据。此外,该数据集可以作为人工智能技术在材料降解分析中的功效交叉比较研究的基础工具。通过公共存储库开放访问数据集的特性支持再现性,并鼓励人工智能应用在混凝土耐久性和可持续性研究中的扩展。
{"title":"ConcreteCARB: A comprehensive image dataset of concrete carbonation for computer vision tasks","authors":"José A. Guzmán-Torres,&nbsp;Sandra del C. Arguello-Hernández,&nbsp;Francisco J. Domínguez-Mota,&nbsp;Gerardo Tinoco-Guerrero,&nbsp;Elia M. Alonso-Guzmán","doi":"10.1016/j.dib.2026.112493","DOIUrl":"10.1016/j.dib.2026.112493","url":null,"abstract":"<div><div>The ConcreteCARB dataset provides a comprehensive repository of 903 high-resolution images of concrete surfaces evaluated using the phenolphthalein test for carbonation detection. This data was collected under controlled laboratory conditions and aims to support artificial intelligence applications in civil engineering, especially in structural health monitoring tasks. The images are systematically organized into two distinct classes: “Carbonated Samples” and “No Carbonation Presence,” enabling binary classification approaches. All samples were manually tested, split, and visually labelled by expert engineers to ensure reliable ground-truth classification, in accordance with standardized procedures. The dataset includes images of concrete prism elements fabricated with varying mix designs, incorporating different water-cement ratios and additives, such as industrial silica waste and natural admixtures derived from Opuntia ficus-indica. The specimens were subjected to natural atmospheric carbonation conditions for 180 days, and their carbonation fronts were revealed by phenolphthalein staining. The samples were then split manually with a chisel and hammer, and photographic documentation was performed with a Samsung SM-S901U1 smartphone using predefined settings to ensure consistency and quality across the dataset. ConcreteCARB is intended for researchers, engineers, and data scientists working on machine learning, deep learning, and computer vision solutions for concrete diagnostics. It provides valuable training and benchmarking data for the development of automated detection, classification, and segmentation models for carbonation damage assessment. Furthermore, the dataset can serve as a foundational tool for cross-comparative studies on the efficacy of AI techniques in materials degradation analysis. The openly accessible nature of the dataset through a public repository supports reproducibility and encourages the extension of AI applications in concrete durability and sustainability studies.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"65 ","pages":"Article 112493"},"PeriodicalIF":1.4,"publicationDate":"2026-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146075163","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Dataset of in-situ meteorological measurements for urban wind energy assessment in the southern region of the Dominican Republic 多米尼加共和国南部地区城市风能评估的现场气象测量数据集
IF 1.4 Q3 MULTIDISCIPLINARY SCIENCES Pub Date : 2026-01-19 DOI: 10.1016/j.dib.2026.112482
Alexander Vallejo-Díaz , Idalberto Herrera-Moya , Edwin Garabitos-Lara , Héctor David Morbán-Ramírez , Adermim Jhoswar Severino-Simeón , Anders Malmquist
This dataset provides in-situ wind measurements collected between March 2024 and June 2025 from four urban locations in the southern region of the Dominican Republic: San Cristóbal, Azua, Barahona, and San Juan de la Maguana. Measurements were performed using Davis Instruments Vantage PRO2 meteorological stations, strategically installed to characterize the urban wind resource for potential microgeneration applications. The dataset includes wind speed, wind direction, air temperature, relative humidity, and atmospheric pressure. Data processing involved quality control procedures, gap filling through interpolation techniques, and subsequent analysis for wind characterization. The analysis was carried out using WRPLOT View – Wind Rose Plotting Software Version 9.2.0 for wind rose generation, HOMER Pro – Microgrid Analysis Tool, Version 3.18.4 for renewable resource assessment, and Microsoft Excel for parameterization of the Weibull distribution function. In addition, derived metrics such as theoretical wind potential and the theoretically available wind potential were calculated for each location.
This dataset can serve as a valuable resource for preliminary renewable energy feasibility studies, particularly for screening and comparative assessments of small-scale or distributed generation in urban environments. It also supports urban energy planning and can be used to inform computational fluid dynamics (CFD) studies of urban wind flows, for example through boundary condition definition, model calibration, or comparative analysis, rather than direct validation of highly turbulent urban wind fields, and the design of hybrid wind–solar systems. Beyond energy applications, the data may be applied to urban climate studies, including the assessment of diurnal and seasonal variability and heat island effects, and to modeling the dispersion of air pollution in complex urban settings.
该数据集提供了在2024年3月至2025年6月期间从多米尼加共和国南部地区的四个城市地点收集的现场风测量数据:圣Cristóbal,阿祖阿,巴拉霍纳和圣胡安德拉马瓜纳。测量使用的是Davis Instruments Vantage PRO2气象站,这些气象站战略性地安装在城市风力资源特征上,用于潜在的微发电应用。数据集包括风速、风向、气温、相对湿度和大气压力。数据处理包括质量控制程序,通过插值技术填充间隙,以及随后的风特性分析。分析采用WRPLOT View -风玫瑰生成软件9.2.0版本,HOMER Pro -微电网分析工具3.18.4版本进行可再生资源评价,Microsoft Excel进行威布尔分布函数参数化。此外,还计算了每个地点的理论风势和理论可用风势等衍生指标。该数据集可作为初步可再生能源可行性研究的宝贵资源,特别是用于筛选和比较城市环境中小规模或分布式发电的评估。它还支持城市能源规划,并可用于城市风流的计算流体动力学(CFD)研究,例如通过边界条件定义、模型校准或比较分析,而不是直接验证高度湍流的城市风场,以及设计混合风能-太阳能系统。除了能源应用之外,这些数据还可用于城市气候研究,包括评估日和季节变率以及热岛效应,并可用于模拟复杂城市环境中空气污染的扩散。
{"title":"Dataset of in-situ meteorological measurements for urban wind energy assessment in the southern region of the Dominican Republic","authors":"Alexander Vallejo-Díaz ,&nbsp;Idalberto Herrera-Moya ,&nbsp;Edwin Garabitos-Lara ,&nbsp;Héctor David Morbán-Ramírez ,&nbsp;Adermim Jhoswar Severino-Simeón ,&nbsp;Anders Malmquist","doi":"10.1016/j.dib.2026.112482","DOIUrl":"10.1016/j.dib.2026.112482","url":null,"abstract":"<div><div>This dataset provides in-situ wind measurements collected between March 2024 and June 2025 from four urban locations in the southern region of the Dominican Republic: San Cristóbal, Azua, Barahona, and San Juan de la Maguana. Measurements were performed using Davis Instruments Vantage PRO2 meteorological stations, strategically installed to characterize the urban wind resource for potential microgeneration applications. The dataset includes wind speed, wind direction, air temperature, relative humidity, and atmospheric pressure. Data processing involved quality control procedures, gap filling through interpolation techniques, and subsequent analysis for wind characterization. The analysis was carried out using WRPLOT View – Wind Rose Plotting Software Version 9.2.0 for wind rose generation, HOMER Pro – Microgrid Analysis Tool, Version 3.18.4 for renewable resource assessment, and Microsoft Excel for parameterization of the Weibull distribution function. In addition, derived metrics such as theoretical wind potential and the theoretically available wind potential were calculated for each location.</div><div>This dataset can serve as a valuable resource for preliminary renewable energy feasibility studies, particularly for screening and comparative assessments of small-scale or distributed generation in urban environments. It also supports urban energy planning and can be used to inform computational fluid dynamics (CFD) studies of urban wind flows, for example through boundary condition definition, model calibration, or comparative analysis, rather than direct validation of highly turbulent urban wind fields, and the design of hybrid wind–solar systems. Beyond energy applications, the data may be applied to urban climate studies, including the assessment of diurnal and seasonal variability and heat island effects, and to modeling the dispersion of air pollution in complex urban settings.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"65 ","pages":"Article 112482"},"PeriodicalIF":1.4,"publicationDate":"2026-01-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146075117","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Towards sustainable management of Xylella fastidiosa vectors: An annotated image dataset for automated in-field detection of Aphrophoridae foam 对苛刻木杆菌载体的可持续管理:一个用于自动现场检测泡沫蚜的注释图像数据集
IF 1.4 Q3 MULTIDISCIPLINARY SCIENCES Pub Date : 2026-01-19 DOI: 10.1016/j.dib.2026.112477
Michele Elia , Angelo Cardellicchio , Michele Paradiso , Giuseppe Veronico , Arianna Rana , Antonio Petitti , Vito Renò , Simone Pascuzzi , Annalisa Milella
Insects feeding on xylem sap, such as adult Aphrophoridae spittlebugs, are vectors of the plant pathogenic xylem-limited bacterium Xylella fastidiosa (Xf), a causal agent of a number of severe diseases, including the Olive Quick Decline Syndrome (OQDS), which has decimated olive trees in the Mediterranean region. The Aphrophoridae life cycle and behaviour feature a weak stage, known as the juvenile stage, in which the insects live solitary on stems covered in a self-produced foamy fluid (froth) that protects them from dehydration and temperature stress. Juvenile vectors are ideal targets for a control intervention aimed at reducing transmission by adults. This paper presents the first, to the best of our knowledge, image dataset framing spittlebug froth samples in the field for the purpose of automated Aphrophoridae nymph identification. Images were captured using different devices including a consumer-grade RGB-D sensor, a digital reflex camera, and a smartphone camera. The dataset comprises 365 colour images, focusing on spittlebug foam. 211 of these images were captured in April 2024 during a two-day campaign. For these 211 images, a manual semantic annotation was performed, generating PNG binary masks that precisely distinguish spittlebug foam pixels from the background. To further enhance usability, labels are also provided in YOLO (You Only Look Once) format as text files, both for segmentation and object detection. The remaining 154 images were collected during a separate two-day campaign in 2025. These images are unannotated and are intended for further testing purposes. Overall, the dataset enables the development of both semantic segmentation models and object detectors for automated froth detection in natural images, thus facilitating the early identification of potentially harmful insects in sustainable pest management and control systems.
以木质部汁液为食的昆虫,如成年aphaphoridae口吐虫,是植物致病性木质部限制细菌苛养木杆菌(Xf)的媒介,Xf是许多严重疾病的病原体,包括橄榄树快速衰退综合征(OQDS),它使地中海地区的橄榄树大量死亡。aphaphoridae的生命周期和行为特征是一个较弱的阶段,称为幼年期,在这个阶段,昆虫独自生活在覆盖着自我产生的泡沫液体(泡沫)的茎上,以保护它们免受脱水和温度压力。青少年病媒是旨在减少成人传播的控制干预的理想目标。本文提出了第一个,据我们所知,图像数据集框架的吐沫虫泡沫样本在现场,目的是自动识别Aphrophoridae仙女。使用不同的设备拍摄图像,包括消费级RGB-D传感器,数码反光相机和智能手机相机。该数据集包括365张彩色图像,重点是吐痰虫泡沫。其中211张是在2024年4月为期两天的活动中拍摄的。对于这211张图像,我们执行了手动语义注释,生成了PNG二进制掩码,可以精确地将吐沫虫泡沫像素与背景区分开来。为了进一步增强可用性,标签还以YOLO(你只看一次)格式作为文本文件提供,用于分割和对象检测。其余154幅图像是在2025年的一个单独的为期两天的活动中收集的。这些图像未加注释,用于进一步测试。总体而言,该数据集支持语义分割模型和对象检测器的开发,用于自然图像中的自动泡沫检测,从而促进可持续害虫管理和控制系统中潜在有害昆虫的早期识别。
{"title":"Towards sustainable management of Xylella fastidiosa vectors: An annotated image dataset for automated in-field detection of Aphrophoridae foam","authors":"Michele Elia ,&nbsp;Angelo Cardellicchio ,&nbsp;Michele Paradiso ,&nbsp;Giuseppe Veronico ,&nbsp;Arianna Rana ,&nbsp;Antonio Petitti ,&nbsp;Vito Renò ,&nbsp;Simone Pascuzzi ,&nbsp;Annalisa Milella","doi":"10.1016/j.dib.2026.112477","DOIUrl":"10.1016/j.dib.2026.112477","url":null,"abstract":"<div><div>Insects feeding on xylem sap, such as adult <em>Aphrophoridae</em> spittlebugs, are vectors of the plant pathogenic xylem-limited bacterium <em>Xylella fastidiosa (Xf)</em>, a causal agent of a number of severe diseases, including the Olive Quick Decline Syndrome (OQDS), which has decimated olive trees in the Mediterranean region. The <em>Aphrophoridae</em> life cycle and behaviour feature a weak stage, known as the juvenile stage, in which the insects live solitary on stems covered in a self-produced foamy fluid (froth) that protects them from dehydration and temperature stress. Juvenile vectors are ideal targets for a control intervention aimed at reducing transmission by adults. This paper presents the first, to the best of our knowledge, image dataset framing spittlebug froth samples in the field for the purpose of automated <em>Aphrophoridae</em> nymph identification. Images were captured using different devices including a consumer-grade RGB-D sensor, a digital reflex camera, and a smartphone camera. The dataset comprises 365 colour images, focusing on spittlebug foam. 211 of these images were captured in April 2024 during a two-day campaign. For these 211 images, a manual semantic annotation was performed, generating PNG binary masks that precisely distinguish spittlebug foam pixels from the background. To further enhance usability, labels are also provided in YOLO (You Only Look Once) format as text files, both for segmentation and object detection. The remaining 154 images were collected during a separate two-day campaign in 2025. These images are unannotated and are intended for further testing purposes. Overall, the dataset enables the development of both semantic segmentation models and object detectors for automated froth detection in natural images, thus facilitating the early identification of potentially harmful insects in sustainable pest management and control systems.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"65 ","pages":"Article 112477"},"PeriodicalIF":1.4,"publicationDate":"2026-01-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146075160","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Data in Brief
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1