Data in Brief: Latest Articles
A semi-labelled dataset for fault detection in air handling units from a large-scale office
IF 1 Q3 MULTIDISCIPLINARY SCIENCES Pub Date : 2024-09-21 DOI: 10.1016/j.dib.2024.110956
Fault detection and diagnosis (FDD) in air handling units (AHUs) ensures building functions such as energy efficiency and occupant comfort by quickly identifying and diagnosing faults. Combining deep learning with FDD has demonstrated high generalization ability in this field. To support the development of deep learning models, this research constructed a dataset from real data collected at a large-scale office in South Korea. The raw AHU data were extracted from the Building Management System (BMS) at 1-h intervals, spanning November 2023 to May 2024. The dataset was partially labeled by annotation experts, who categorized the data into six types: normal condition, supply fan fault, total heating pump fault, return air temperature sensor fault, supply air temperature sensor fault, and valve position fault. Additionally, semi-supervised learning methods were applied to the constructed dataset as an application example. The main contributions of this dataset to the field are twofold. First, it is a unique dataset sourced from the real operational data of a large-scale office, of a kind not currently available in this domain. Second, the dataset's expert labeling adds significant value by ensuring accurate fault classification. Therefore, we hope that this dataset will encourage the development of robust FDD techniques that are more suitable for real-world applications.
Citations: 0
Innovation system functions: Survey data of additive manufacturing enterprises in South Africa
IF 1 Q3 MULTIDISCIPLINARY SCIENCES Pub Date : 2024-09-20 DOI: 10.1016/j.dib.2024.110943
This dataset comprises survey data from South African additive manufacturing (AM) enterprises, representing approximately 80% of the industry. The survey was designed using innovation indicators from the technology innovation system (TIS) framework to explore the dynamics within South African AM enterprises. Six TIS experts and two AM industry specialists validated the survey's comprehensiveness and relevance. A statistician reviewed the data collection process to ensure it was suitable for robust statistical analysis. Managers of South African AM enterprises were invited to complete the survey online, and the responses were systematically collected and stored. This dataset is a valuable resource for AM researchers and practitioners, facilitating market gap analysis of current materials and services and offering insights for future AM applications.
Citations: 0
Data on the financial performance of companies on BIST Sustainability 25 Index: An Entropy-based TOPSIS approach
IF 1 Q3 MULTIDISCIPLINARY SCIENCES Pub Date : 2024-09-19 DOI: 10.1016/j.dib.2024.110959
In this data article, the Entropy-based TOPSIS technique is employed to assess the cash flow-based financial performance of companies. The study encompasses data from companies listed on the Borsa Istanbul (İstanbul Stock Exchange) and included in the BIST Sustainability 25 Index between 2018 and 2022. The performance metrics considered in the dataset are grouped into categories including liquidity, operational efficiency, financial structure, and profitability ratios. The dataset is derived from company balance sheets, income statements, and cash flow statements.
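As a rough illustration of the Entropy-based TOPSIS technique named in the abstract above, the following minimal sketch derives criterion weights from Shannon entropy and then ranks alternatives by their relative closeness to the ideal solution. The decision matrix, the three "companies", and the `benefit` flags are hypothetical and not taken from the dataset. The sketch also assumes every value is strictly positive, which real financial ratios may not satisfy without preprocessing.

```python
import math


def entropy_weights(matrix):
    """Entropy weights for a decision matrix (rows = alternatives, columns = criteria).

    Assumes all entries are strictly positive.
    """
    m = len(matrix)
    n = len(matrix[0])
    k = 1.0 / math.log(m)
    divergences = []
    for j in range(n):
        column = [row[j] for row in matrix]
        total = sum(column)
        shares = [x / total for x in column]
        entropy = -k * sum(p * math.log(p) for p in shares if p > 0)
        divergences.append(1.0 - entropy)  # more dispersion -> larger weight
    weight_sum = sum(divergences)
    return [d / weight_sum for d in divergences]


def topsis(matrix, weights, benefit):
    """Relative closeness of each alternative to the ideal solution (0..1)."""
    n = len(matrix[0])
    # Vector-normalise each column, then apply the weights.
    norms = [math.sqrt(sum(row[j] ** 2 for row in matrix)) for j in range(n)]
    v = [[weights[j] * row[j] / norms[j] for j in range(n)] for row in matrix]
    ideal = [max(r[j] for r in v) if benefit[j] else min(r[j] for r in v) for j in range(n)]
    anti = [min(r[j] for r in v) if benefit[j] else max(r[j] for r in v) for j in range(n)]
    scores = []
    for r in v:
        d_pos = math.sqrt(sum((r[j] - ideal[j]) ** 2 for j in range(n)))
        d_neg = math.sqrt(sum((r[j] - anti[j]) ** 2 for j in range(n)))
        scores.append(d_neg / (d_pos + d_neg))
    return scores


# Hypothetical ratios for three companies: liquidity, efficiency, profitability.
matrix = [
    [2.0, 0.8, 0.15],
    [1.5, 0.6, 0.10],
    [1.0, 0.9, 0.05],
]
benefit = [True, True, True]  # assumption: higher is better for every criterion
weights = entropy_weights(matrix)
scores = topsis(matrix, weights, benefit)
print(weights, scores)
```

The first company is better than the second on every criterion, so its closeness score is higher. Cost-type criteria would be marked `False` in `benefit`.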
Citations: 0
A dental intraoral image dataset of gingivitis for image captioning
IF 1 Q3 MULTIDISCIPLINARY SCIENCES Pub Date : 2024-09-19 DOI: 10.1016/j.dib.2024.110960
One of the most striking topics in Artificial Intelligence (AI) is image captioning, which aims to integrate computer vision and natural language processing to create descriptions for each image. In this paper, we propose a new dataset designed specifically for image captioning in gingivitis diagnosis using deep learning. It includes 1,096 high-resolution intraoral images of 12 anterior teeth and surrounding gingival tissue that were collected under controlled conditions with professional-grade photography equipment. Each image features detailed labels and descriptive captions. The labeling process involved three periodontists with over ten years of experience who assigned Modified Gingival Index (MGI) scores to each tooth in the images, achieving high inter-rater reliability through a rigorous calibration process. Captions were then created by the same periodontists, offering diverse descriptions of gingivitis severity and locations. The dataset is organized into training, validation, and testing subsets for ease of access. This dataset supports the development of advanced image captioning algorithms and is a valuable educational resource for integrating real-world data into dental research and curriculum.
Citations: 0
Dataset of 16S rRNA gene sequences of 111 healthy and Newcastle disease infected caecal samples from multiple chicken breeds of Pakistan
IF 1 Q3 MULTIDISCIPLINARY SCIENCES Pub Date : 2024-09-19 DOI: 10.1016/j.dib.2024.110957
The article presents a processed dataset from amplicon sequencing of the V4 region of the 16S rRNA gene to recover bacterial and archaeal taxa from the caeca of multiple chicken breeds of Pakistan. These include breeds commonly raised at a commercial level: Naked Neck, Black Australorp, Rhode Island Red, White Layer, and Broiler. All the breeds were challenged with Newcastle Disease Virus (NDV), and vaccination against the disease was also explored. This resulted in samples belonging to four treatment groups: Control; Vaccinated; Vaccinated and Challenged; and Non-vaccinated and Challenged. The birds were raised on an antibiotic-free diet in a semi-controlled farming setup. 16S rRNA gene amplicon sequencing was performed on caecal DNA from day-old and mature chicken samples (22 weeks for Naked Neck, Black Australorp, Rhode Island Red and White Layer; 8 weeks for Broiler) of the four groups. The paired-end reads from all the samples were quality trimmed, error corrected, and overlapped, and unique Operational Taxonomic Units (OTUs) were then obtained at 99% similarity. Using predictive modelling, the MetaCyc functional pathways and KEGG orthologs were also recovered. The generated data may be used to explore microbial interactions in the gastrointestinal tract with respect to NDV vaccination and infection, and to improve understanding of chicken health and productivity.
Citations: 0
Dataset on thermal conductivity of composted olive cake (COC)
IF 1 Q3 MULTIDISCIPLINARY SCIENCES Pub Date : 2024-09-19 DOI: 10.1016/j.dib.2024.110939
Thermal properties play a critical role in compost used as a soil amendment for agricultural applications, especially green-roof buildings. Despite this importance, there remains insufficient information on the thermal conductivity (K) of composted olive cake (COC) and how it is influenced by its bulk density and water content. This dataset shows how K is affected by these two parameters and indicates the potential use of COC as cheap padding in geothermal heat storage and green-roof building applications. The thermal conductivities of 30 COC samples were measured experimentally at different moisture contents and bulk densities using a hot wire technique. The results revealed that thermal conductivity increases linearly as both bulk density and water content increase. It increased from 0.10 to 0.60 W/(m K) at saturation levels ranging from dry to 90%. The highest thermal conductivity, 0.60 W/(m K), was observed at a water content of 90%. Therefore, COC might be used as an inexpensive padding in geothermal heat storage applications and as an eco-friendly insulation pad in green-roof buildings, leading to passive energy savings. Overall, the study provides important insights into the thermal properties of COC and its potential as a sustainable insulation material.
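The linear trend reported above can be sketched as a straight-line interpolation between the two stated endpoints, 0.10 W/(m K) when dry and 0.60 W/(m K) at 90% saturation. This is only an illustration under strong assumptions: "dry" is taken as 0% saturation, bulk density is held fixed, and a single straight line between the endpoints is assumed. The function name and constants are hypothetical and nothing here is fitted to the 30 measured samples.

```python
K_DRY = 0.10   # W/(m K), endpoint quoted in the abstract (dry)
K_WET = 0.60   # W/(m K), endpoint quoted in the abstract (90% saturation)
S_MAX = 0.90   # upper saturation bound covered by the reported range


def conductivity_estimate(saturation):
    """Illustrative linear estimate of thermal conductivity in W/(m K).

    `saturation` is a fraction between 0.0 (assumed dry) and 0.90.
    """
    if not 0.0 <= saturation <= S_MAX:
        raise ValueError("saturation outside the reported range")
    return K_DRY + (K_WET - K_DRY) * saturation / S_MAX


print(conductivity_estimate(0.45))
```

A real analysis would instead regress conductivity on both bulk density and water content using the published measurements.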
Citations: 0
Water sorption isotherms and mid-infrared spectra of dried parchment coffee beans (Coffee arabica L.) processed by wet and semi-dry postharvest methods. A dataset for estimating water sorption and thermodynamic properties.
IF 1 Q3 MULTIDISCIPLINARY SCIENCES Pub Date : 2024-09-19 DOI: 10.1016/j.dib.2024.110958
This work presents a dataset of water sorption isotherms for dried parchment coffee beans processed by wet and semi-dry postharvest methods, together with their mid-infrared spectral data. The experimental water sorption isotherms were determined using the Dynamic Dewpoint Isotherm (DDI) method. The measurements were taken over a water activity range of 0.1 to 0.85 at temperatures of 25, 35, and 45 °C. To spectrally characterize the beans, Attenuated Total Reflection Fourier Transform Infrared (ATR-FTIR) spectroscopy was used. The dataset comprises Excel files with the experimental data acquired for both postharvest methods and the experimental conditions assessed. This dataset serves as a reliable and valuable tool for researchers, coffee producers, and decision-makers, who can use it as the basis for mathematically computing parameters related to coffee shelf life and hygroscopic behavior, and for developing suitable packaging materials and containers that maximize the quality of coffee beans in terms of sensory flavors and moisture stability. Furthermore, the experimental data provide a reliable tool for optimizing the coffee storage process and gaining insights into the water-sorption process.
Citations: 0
Multi-datasets for different keyboard key sound recognition
IF 1 Q3 MULTIDISCIPLINARY SCIENCES Pub Date : 2024-09-19 DOI: 10.1016/j.dib.2024.110949
Keyboard acoustic recognition is a pivotal area within cybersecurity and human-computer interaction, where the identification and analysis of keyboard sounds are used to enhance security measures. The performance of acoustic-based security systems can be influenced by factors such as the platform used, typing style, and environmental noise. To address these variations and provide a comprehensive resource, we present the Multi-Keyboard Acoustic (MKA) Datasets. These extensive datasets, meticulously gathered by a team in the Computer Science Department at the University of Halabja, include recordings from six widely-used platforms: HP, Lenovo, MSI, Mac, Messenger, and Zoom. The MKA datasets have structured data for each platform, including raw recordings, segmented sound files, and matrices derived from these sounds. They can be used by researchers in keylogging detection, cybersecurity, and other fields related to acoustic emanation attacks on keyboards. Moreover, the datasets capture the intricacies of typing behaviour with both hands and all ten fingers by carefully segmenting and pre-processing the data using the Praat tool, thus ensuring high-quality and dependable data. This comprehensive approach allows researchers to explore various aspects of keyboard sound recognition, contributing to the development of robust recognition algorithms and enhanced security measures. The MKA Datasets stand as one of the largest and most detailed datasets in this domain, offering significant potential for advancing research and improving defences against acoustic-based threats.
Citations: 0
Lidar-derived structural-complexity data across four experimental forests
IF 1 Q3 MULTIDISCIPLINARY SCIENCES Pub Date : 2024-09-18 DOI: 10.1016/j.dib.2024.110955
Structural complexity refers to the three-dimensional arrangement and variability of both biotic and abiotic components of an ecosystem. Metrics that characterize structural complexity are often used to manage various aspects of ecosystem function, such as light transmittance, wildlife habitat, and biological diversity. Additionally, these metrics aid in evaluating resilience to disturbance events, including hurricanes, bark-beetle outbreaks, and wildfire. Recent advances in wildland fire modelling have facilitated the integration of forest structural complexity metrics into the QUIC-Fire model, enabling real-time prediction of fire spread and behaviour by simulating interactions between fire, weather, topography, and forest structure. While QUIC-Fire is designed to be highly adaptable, model performance depends on the availability and accuracy of local data inputs. Expanding the model's usability across different regions can be facilitated by the availability of more comprehensive and high-quality data. Thus, the primary goal behind the data products we developed was to establish a basis for collaborative research across various disciplines, particularly within the focal areas of the Southern Research Station, such as forestry, wildland fire, hydrology, soil science, and cultural resources at Bent Creek, Coweeta, Escambia, and Hitchiti Experimental Forests (EFs).
Airborne laser scanning (ALS) was used to collect point-cloud data for each EF during the leaf-off season to minimize interference from foliage. Subsequent processing of the raw lidar data involved outlier detection and filtering, ground and non-ground classification, and the computation of a variety of metrics representing various aspects of topography and forest structure at both the pixel-level and the tree-level. Pixel-level topographic data products include: digital elevation model (DEM), slope, aspect, topographic position index (TPI), topographic roughness index (TRI), roughness, and flow direction. Forest structural-complexity metrics include canopy height, foliar height diversity (FHD), vertical distribution ratio (VDR), canopy rugosity, crown relief ratio (CRR), understory complexity index (UCI), vertical complexity index (VCI), canopy cover, mean vegetation height, and the standard deviation of vegetation height. Tree-level data products were computed from the point cloud using multiple algorithms to perform individual tree detection (ITD) and individual tree segmentation (ITS). The datasets have been harmonized and are openly accessible through the USDA Forest Service Research Data Archive.
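As a rough illustration of how pixel-level metrics such as foliar height diversity (FHD), mean vegetation height, and the standard deviation of vegetation height can be derived from height-normalized lidar returns, here is a minimal sketch. It assumes FHD is computed as the Shannon entropy of returns across fixed-width height bins, which is a common convention; the abstract does not state the dataset's exact definition, bin width, or height cutoff, so `bin_width`, `min_height`, the function names, and the sample heights are all hypothetical.

```python
import math
from collections import Counter


def foliar_height_diversity(heights, bin_width=1.0, min_height=0.0):
    """Shannon entropy of returns across vertical height bins.

    Assumption: FHD = -sum(p_i * ln(p_i)), where p_i is the share of
    returns falling in height bin i. bin_width and min_height are
    illustrative defaults, not values taken from the dataset.
    """
    kept = [h for h in heights if h >= min_height]
    if not kept:
        return 0.0
    # Assign each return to a height bin and count returns per bin.
    counts = Counter(int(h // bin_width) for h in kept)
    total = len(kept)
    return -sum((c / total) * math.log(c / total) for c in counts.values())


def height_summary(heights):
    """Mean and population standard deviation of vegetation height."""
    n = len(heights)
    mean = sum(heights) / n
    variance = sum((h - mean) ** 2 for h in heights) / n
    return mean, math.sqrt(variance)


# Hypothetical heights above ground (metres) for the returns in one pixel.
pixel_heights = [0.5, 1.5, 2.5, 3.5]
fhd = foliar_height_diversity(pixel_heights)
mean_h, sd_h = height_summary(pixel_heights)
```

Returns spread evenly over many height bins give a high FHD, while a pixel whose returns all fall in one bin gives an FHD of zero. In practice the published rasters should be used directly; this sketch only shows the kind of calculation behind such a metric.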
Citations: 0
Dataset on recovered waste post-earthquake 2016 in Manabi Province, Ecuador for recycling and reuse
IF 1 Q3 MULTIDISCIPLINARY SCIENCES Pub Date : 2024-09-18 DOI: 10.1016/j.dib.2024.110953
The earthquake in Ecuador on 16 April 2016 generated large volumes of debris and waste. This dataset contains data on recovered and reused disaster materials. Data were collected through a census survey of scrap dealers who handled debris and waste from the 2016 earthquake (n = 62). The dataset was compiled to document the waste generated during the 2016 earthquake and to compare it with the pre-disaster period (2015) and with 2019, when the data were collected. The recovered disaster materials include plastic, metal, cardboard, paper, glass, other recyclable materials, and reused materials. The database also makes it possible to observe the response time of small- and medium-sized scrap businesses as dealers engaged in commercial transactions of disaster materials, and it shows the phases of the disaster-waste recovery process. In addition, the dataset includes perceived profits and actual earnings of scrap businesses after the earthquake. Given the large volume of waste and debris generated, this database can provide useful data for evaluating disaster waste management, an important task in post-disaster recovery.
Citations: 0