首页 > 最新文献

Data in Brief最新文献

英文 中文
Developing a comprehensive BACnet attack dataset: A step towards improved cybersecurity in building automation systems. 开发全面的BACnet攻击数据集:朝着提高楼宇自动化系统网络安全迈出的一步。
IF 1 Q3 MULTIDISCIPLINARY SCIENCES Pub Date : 2024-12-03 eCollection Date: 2024-12-01 DOI: 10.1016/j.dib.2024.111192
Seyed Amirhossein Moosavi, Mojtaba Asgari, Seyed Reza Kamel

With the development of smart buildings, the risks of cyber-attacks against them have also increased. One of the popular and evolving protocols used for communication between devices in smart buildings, especially HVAC systems, is the BACnet protocol. Machine learning algorithms and neural networks require datasets of normal traffic and real attacks to develop intrusion detection (IDS) and prevention (IPS) systems that can detect anomalies and prevent attacks. Real traffic datasets for these networks are often unavailable due to confidentiality reasons. To address this, we propose a framework that uses existing real datasets and converts them into BACnet protocol network traffic with detailed network behaviour. In this method, a virtual machine is prepared for each controller based on real scenarios, and by creating a simulator for the controller on the virtual machine, real data previously collected under real conditions from existing datasets is injected into the network with the same date and time during the simulation. We performed three types of attacks, including Falsifying, Modifying, and covert channel attacks on the network. For covert channel attacks, the message was modelled in three forms: Plain text, hashed using SHA3-256, and encrypted using AES-256. Network traffic was recorded using Wireshark software in pcap format. The advantage of the generated dataset is that since we used real data, the data behaviour aligns with real conditions.

随着智能建筑的发展,网络攻击的风险也在增加。BACnet协议是智能建筑(尤其是暖通空调系统)中用于设备之间通信的流行和不断发展的协议之一。机器学习算法和神经网络需要正常流量和真实攻击的数据集来开发能够检测异常并防止攻击的入侵检测(IDS)和防御(IPS)系统。由于保密原因,这些网络的真实流量数据集通常是不可用的。为了解决这个问题,我们提出了一个使用现有真实数据集并将其转换为具有详细网络行为的BACnet协议网络流量的框架。该方法根据真实场景为每个控制器准备一个虚拟机,通过在虚拟机上为控制器创建一个模拟器,将之前在真实条件下从已有数据集中采集到的真实数据,在模拟过程中以相同的日期和时间注入网络。我们对网络进行了三种类型的攻击,包括伪造、修改和隐蔽通道攻击。对于隐蔽通道攻击,消息以三种形式建模:纯文本、使用SHA3-256散列和使用AES-256加密。使用Wireshark软件以pcap格式记录网络流量。生成的数据集的优点是,由于我们使用的是真实数据,因此数据行为与真实条件保持一致。
{"title":"Developing a comprehensive BACnet attack dataset: A step towards improved cybersecurity in building automation systems.","authors":"Seyed Amirhossein Moosavi, Mojtaba Asgari, Seyed Reza Kamel","doi":"10.1016/j.dib.2024.111192","DOIUrl":"10.1016/j.dib.2024.111192","url":null,"abstract":"<p><p>With the development of smart buildings, the risks of cyber-attacks against them have also increased. One of the popular and evolving protocols used for communication between devices in smart buildings, especially HVAC systems, is the BACnet protocol. Machine learning algorithms and neural networks require datasets of normal traffic and real attacks to develop intrusion detection (IDS) and prevention (IPS) systems that can detect anomalies and prevent attacks. Real traffic datasets for these networks are often unavailable due to confidentiality reasons. To address this, we propose a framework that uses existing real datasets and converts them into BACnet protocol network traffic with detailed network behaviour. In this method, a virtual machine is prepared for each controller based on real scenarios, and by creating a simulator for the controller on the virtual machine, real data previously collected under real conditions from existing datasets is injected into the network with the same date and time during the simulation. We performed three types of attacks, including Falsifying, Modifying, and covert channel attacks on the network. For covert channel attacks, the message was modelled in three forms: Plain text, hashed using SHA3-256, and encrypted using AES-256. Network traffic was recorded using Wireshark software in pcap format. The advantage of the generated dataset is that since we used real data, the data behaviour aligns with real conditions.</p>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"57 ","pages":"111192"},"PeriodicalIF":1.0,"publicationDate":"2024-12-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11683266/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142906613","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A dataset of deep learning performance from cross-base data encoding on MNIST and MNIST-C. 基于MNIST和MNIST- c的跨基数据编码的深度学习性能数据集。
IF 1 Q3 MULTIDISCIPLINARY SCIENCES Pub Date : 2024-12-03 eCollection Date: 2024-12-01 DOI: 10.1016/j.dib.2024.111194
Lawrence McKnight, Chandra Jaiswal, Issa AlHmoud, Balakrishna Gokaraju

Effective data representation in machine learning and deep learning is paramount. For an algorithm or neural network to capture patterns in data and be able to make reliable predictions, the data must appropriately describe the problem domain. Although there exists much literature on data preprocessing for machine learning and data science applications, novel data representation methods for enhancing machine learning model performance remain highly absent within the literature. This dataset is a compilation of convolutional neural network model performance trained and tested on a wide range of numerical base representations of the MNIST and MNIST-C datasets. This performance data can be further analysed by the research community to uncover trends in model performance against the numerical base of its data. This dataset can be used to produce more research of the same nature, testing cross-base data encoding on machine learning training and testing data for a wide range of real-world applications.

有效的数据表示在机器学习和深度学习中是至关重要的。为了让算法或神经网络捕获数据中的模式并能够做出可靠的预测,数据必须恰当地描述问题域。虽然有很多关于机器学习和数据科学应用的数据预处理的文献,但用于增强机器学习模型性能的新颖数据表示方法在文献中仍然非常缺乏。该数据集是卷积神经网络模型性能的汇编,在MNIST和MNIST- c数据集的广泛数值基础表示上进行了训练和测试。研究团体可以进一步分析这些性能数据,以根据数据的数值基础揭示模型性能的趋势。该数据集可用于产生更多相同性质的研究,在机器学习训练和广泛的现实世界应用中测试跨基数据编码。
{"title":"A dataset of deep learning performance from cross-base data encoding on MNIST and MNIST-C.","authors":"Lawrence McKnight, Chandra Jaiswal, Issa AlHmoud, Balakrishna Gokaraju","doi":"10.1016/j.dib.2024.111194","DOIUrl":"https://doi.org/10.1016/j.dib.2024.111194","url":null,"abstract":"<p><p>Effective data representation in machine learning and deep learning is paramount. For an algorithm or neural network to capture patterns in data and be able to make reliable predictions, the data must appropriately describe the problem domain. Although there exists much literature on data preprocessing for machine learning and data science applications, novel data representation methods for enhancing machine learning model performance remain highly absent within the literature. This dataset is a compilation of convolutional neural network model performance trained and tested on a wide range of numerical base representations of the MNIST and MNIST-C datasets. This performance data can be further analysed by the research community to uncover trends in model performance against the numerical base of its data. This dataset can be used to produce more research of the same nature, testing cross-base data encoding on machine learning training and testing data for a wide range of real-world applications.</p>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"57 ","pages":"111194"},"PeriodicalIF":1.0,"publicationDate":"2024-12-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11697575/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142930887","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Dataset on ITS and some chloroplast DNA regions of Boehmeria holosericea Blume in Vietnam. 越南全绢波马属植物ITS及部分叶绿体DNA区数据集。
IF 1 Q3 MULTIDISCIPLINARY SCIENCES Pub Date : 2024-12-03 eCollection Date: 2024-12-01 DOI: 10.1016/j.dib.2024.111193
Quan Huu Nguyen, Trinh Van Nguyen, Thuy Thi Xuan Vi, Thuy Thi Thu Vu, Lan Thi Ngoc Nguyen, Yen Thi Hai Nguyen, Hung Duc Nguyen, Tan Quang Tu, Mau Hoang Chu

Species of the Boehmeria genus have the potential to be natural medicines and have industrial fibre production uses. Many species of this genus are morphologically similar and are difficult to distinguish, especially when their morphology is distorted. This dataset includes sequence information of several DNA regions isolated from the genome of Boehmeria holosericea, namely ITS (from the nuclear genome), matK, trnL-trnF, trnH-psbA, and rpoC1 (from the chloroplast genome) and phylogenetic analysis results based on the isolated sequences. On the phylogenetic tree based on the matK gene sequence, B. holosericea is grouped with B. umbrosa, B. clidemioides, B. spicata, and B. macrophylla with a bootstrap coefficient of 100%. In the phylogenetic tree based on the trnH-psbA spacer region sequences, B. holosericea was grouped with B. clidemioides (a bootstrap coefficient of 96%). In the phylogenetic tree based on the rpoC1 gene sequences, B. holosericea was grouped with B. spicata (a bootstrap coefficient of 100%). In the phylogenetic tree based on the ITS region sequences, B. holosericea was grouped with B. macrophylla (a bootstrap coefficient of 73%), and based on the trnL-trnF spacer region, B. holosericea was grouped with B. pilociuscula (a bootstrap coefficient of 16%). Two genes, matK and rpoC1 and the trnH-psbA region from the chloroplast genome, are potential DNA barcode candidates that could aid in the species identification of B. holosericea. This dataset the first report on the ITS, matK, trnL-trnF, trnH-psbA, and rpoC1 sequences and the phylogeny of B. holosericea.

苧麻属的物种有可能成为天然药物,并具有工业纤维生产用途。该属的许多物种形态相似,很难区分,尤其是当它们的形态扭曲时。本数据集包括从苧麻基因组中分离出的几个 DNA 区域的序列信息,即 ITS(来自核基因组)、matK、trnL-trnF、trnH-psbA 和 rpoC1(来自叶绿体基因组),以及基于分离序列的系统发生分析结果。在基于 matK 基因序列的系统发生树上,B. holosericea 与 B. umbrosa、B. clidemioides、B. spicata 和 B. macrophylla 被归为一类,引导系数为 100%。在基于 trnH-psbA spacer 区域序列的系统发生树中,B. holosericea 与 B. clidemioides 被归为一类(bootstrap coefficient 为 96%)。在基于 rpoC1 基因序列的系统发生树中,B. holosericea 与 B. spicata 被归为一类(引导系数为 100%)。在基于 ITS 区域序列的系统发生树中,B. holosericea 与 B. macrophylla 被归为一类(bootstrap 系数为 73%);基于 trnL-trnF spacer 区域,B. holosericea 与 B. pilociuscula 被归为一类(bootstrap 系数为 16%)。叶绿体基因组中的两个基因 matK 和 rpoC1 以及 trnH-psbA 区域是潜在的 DNA 条形码候选者,可帮助鉴定全丝核菌的物种。该数据集首次报道了 ITS、matK、trnL-trnF、trnH-psbA 和 rpoC1 序列以及 B. holosericea 的系统发育。
{"title":"Dataset on ITS and some chloroplast DNA regions of <i>Boehmeria holosericea</i> Blume in Vietnam.","authors":"Quan Huu Nguyen, Trinh Van Nguyen, Thuy Thi Xuan Vi, Thuy Thi Thu Vu, Lan Thi Ngoc Nguyen, Yen Thi Hai Nguyen, Hung Duc Nguyen, Tan Quang Tu, Mau Hoang Chu","doi":"10.1016/j.dib.2024.111193","DOIUrl":"10.1016/j.dib.2024.111193","url":null,"abstract":"<p><p>Species of the <i>Boehmeria</i> genus have the potential to be natural medicines and have industrial fibre production uses. Many species of this genus are morphologically similar and are difficult to distinguish, especially when their morphology is distorted. This dataset includes sequence information of several DNA regions isolated from the genome of <i>Boehmeria holosericea</i>, namely ITS (from the nuclear genome), <i>matK</i>, trnL-trnF, trnH-psbA, and <i>rpoC1</i> (from the chloroplast genome) and phylogenetic analysis results based on the isolated sequences. On the phylogenetic tree based on the matK gene sequence, B. holosericea is grouped with <i>B. umbrosa, B. clidemioides, B. spicata, and B. macrophylla</i> with a bootstrap coefficient of 100%. In the phylogenetic tree based on the trnH-psbA spacer region sequences, <i>B. holosericea</i> was grouped with B. clidemioides (a bootstrap coefficient of 96%). In the phylogenetic tree based on the <i>rpoC1</i> gene sequences, <i>B. holosericea</i> was grouped with <i>B. spicata</i> (a bootstrap coefficient of 100%). In the phylogenetic tree based on the ITS region sequences, <i>B. holosericea</i> was grouped with B<i>. macrophylla</i> (a bootstrap coefficient of 73%), and based on the trnL-trnF spacer region, <i>B. holosericea</i> was grouped with <i>B. pilociuscula</i> (a bootstrap coefficient of 16%). Two genes, <i>matK</i> and <i>rpoC1</i> and the trnH-psbA region from the chloroplast genome, are potential DNA barcode candidates that could aid in the species identification of <i>B. holosericea</i>. This dataset the first report on the ITS, <i>matK</i>, trnL-trnF, trnH-psbA, and <i>rpoC1</i> sequences and the phylogeny of <i>B. holosericea.</i></p>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"57 ","pages":"111193"},"PeriodicalIF":1.0,"publicationDate":"2024-12-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11683258/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142906590","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Aalto Gear Fault datasets for deep-learning based diagnosis. 用于深度学习诊断的阿尔托齿轮故障数据集。
IF 1 Q3 MULTIDISCIPLINARY SCIENCES Pub Date : 2024-12-02 eCollection Date: 2024-12-01 DOI: 10.1016/j.dib.2024.111171
Zacharias Dahl, Aleksanteri Hämäläinen, Aku Karhinen, Jesse Miettinen, Andre Böhme, Samuel Lillqvist, Sampo Haikonen, Raine Viitala

Accurate system health state prediction through deep learning requires extensive and varied data. However, real-world data scarcity poses a challenge for developing robust fault diagnosis models. This study introduces two extensive datasets, Aalto Shim Dataset and Aalto Gear Fault Dataset, collected under controlled laboratory conditions, aimed at advancing deep learning-based fault diagnosis. The datasets encompass a wide range of gear faults, including synthetic and realistic failure modes, replicated on a downsized azimuth thruster testbench equipped with multiple sensors. The data features various fault types and severities under different operating conditions. The comprehensive data collected, along with the methodologies for creating synthetic faults and replicating common gear failures, provide valuable resources for developing and testing intelligent fault diagnosis models, enhancing their generalization and robustness across diverse scenarios.

通过深度学习进行准确的系统健康状态预测需要广泛而多样的数据。然而,现实世界的数据稀缺性对开发鲁棒故障诊断模型提出了挑战。本研究引入了两个广泛的数据集,Aalto Shim数据集和Aalto Gear故障数据集,在受控的实验室条件下收集,旨在推进基于深度学习的故障诊断。数据集涵盖了广泛的齿轮故障,包括合成和实际故障模式,并在配备多个传感器的小型化方位推进器试验台上进行了复制。这些数据在不同的运行条件下具有不同的故障类型和严重程度。收集到的综合数据,以及创建综合故障和复制常见齿轮故障的方法,为开发和测试智能故障诊断模型提供了宝贵的资源,增强了它们在不同场景下的泛化和鲁棒性。
{"title":"Aalto Gear Fault datasets for deep-learning based diagnosis.","authors":"Zacharias Dahl, Aleksanteri Hämäläinen, Aku Karhinen, Jesse Miettinen, Andre Böhme, Samuel Lillqvist, Sampo Haikonen, Raine Viitala","doi":"10.1016/j.dib.2024.111171","DOIUrl":"10.1016/j.dib.2024.111171","url":null,"abstract":"<p><p>Accurate system health state prediction through deep learning requires extensive and varied data. However, real-world data scarcity poses a challenge for developing robust fault diagnosis models. This study introduces two extensive datasets, Aalto Shim Dataset and Aalto Gear Fault Dataset, collected under controlled laboratory conditions, aimed at advancing deep learning-based fault diagnosis. The datasets encompass a wide range of gear faults, including synthetic and realistic failure modes, replicated on a downsized azimuth thruster testbench equipped with multiple sensors. The data features various fault types and severities under different operating conditions. The comprehensive data collected, along with the methodologies for creating synthetic faults and replicating common gear failures, provide valuable resources for developing and testing intelligent fault diagnosis models, enhancing their generalization and robustness across diverse scenarios.</p>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"57 ","pages":"111171"},"PeriodicalIF":1.0,"publicationDate":"2024-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11683272/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142906663","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Photovoltaic module dataset for automated fault detection and analysis in large photovoltaic systems using photovoltaic module fault detection. 光伏组件故障自动检测与分析数据集,在大型光伏系统中使用光伏组件故障检测。
IF 1 Q3 MULTIDISCIPLINARY SCIENCES Pub Date : 2024-12-02 eCollection Date: 2024-12-01 DOI: 10.1016/j.dib.2024.111184
Rotimi-Williams Bello, Pius A Owolawi, Etienne A van Wyk, Chunling Du

Solar energy has become the fastest growing renewable and alternative source of energy. However, there is little or no open-source datasets to advance research knowledge in photovoltaic related systems. The work presented in this article is a step towards deriving Photo-Voltaic Module Dataset (PVMD) of thermal images and ensuring they are publicly available. The work provides a PVMD dataset comprising a total of 1000 self-acquired and augmented images. The dataset includes both permanent and temporal anomalies, namely Hotspots, Cracks, and Shadings. The dataset was collected on September 5, 2024 at the Soshanguve South Campus, Tshwane University of Technology, South Africa using DJI Mavic 3 Thermal's high-resolution thermal and visual imaging capabilities. DJI Mavic 3 Thermal coupled with its advanced flight features makes it an excellent tool for precise and efficient inspections of PV systems. The laboratory experiment performed on the dataset lasted one week. The work aims to provide supervised dataset good enough to support research method in providing a comprehensive and efficient approach to monitoring and maintaining large PV systems. Extensive analysis of the thermal data reveals the anomalies as indicative of faults in the solar cells of PV module, thereby opening up advancement in solar energy research. Because the data comes from a single-day collection and one week laboratory experiment, it makes the data more suitable for testing algorithms designed for fault detection. The dataset is publicly and freely available to the scientific community at 10.17632/5ssmfpgrpc.1.

太阳能已成为发展最快的可再生能源和替代能源。然而,很少或没有开源数据集来推进光伏相关系统的研究知识。本文中介绍的工作是向获得热图像的光伏模块数据集(PVMD)并确保它们公开可用迈出的一步。该工作提供了一个由1000张自获取和增强图像组成的PVMD数据集。数据集包括永久和时间异常,即热点、裂缝和阴影。该数据集于2024年9月5日在南非Tshwane科技大学Soshanguve南校区收集,使用大疆Mavic 3 Thermal的高分辨率热成像和视觉成像功能。DJI Mavic 3 Thermal加上其先进的飞行功能,使其成为光伏系统精确有效检查的绝佳工具。在数据集上进行的实验室实验持续了一周。这项工作旨在提供足够好的监督数据集,以支持研究方法,为监测和维护大型光伏系统提供全面有效的方法。通过对热数据的广泛分析,揭示了光伏组件太阳能电池的异常是故障的指示,从而开辟了太阳能研究的进展。由于数据来自于一天的采集和一周的实验室实验,因此更适合于为故障检测设计的测试算法。该数据集在10.17632/5ssmfpgrpc.1上公开并免费提供给科学界。
{"title":"Photovoltaic module dataset for automated fault detection and analysis in large photovoltaic systems using photovoltaic module fault detection.","authors":"Rotimi-Williams Bello, Pius A Owolawi, Etienne A van Wyk, Chunling Du","doi":"10.1016/j.dib.2024.111184","DOIUrl":"https://doi.org/10.1016/j.dib.2024.111184","url":null,"abstract":"<p><p>Solar energy has become the fastest growing renewable and alternative source of energy. However, there is little or no open-source datasets to advance research knowledge in photovoltaic related systems. The work presented in this article is a step towards deriving Photo-Voltaic Module Dataset (PVMD) of thermal images and ensuring they are publicly available. The work provides a PVMD dataset comprising a total of 1000 self-acquired and augmented images. The dataset includes both permanent and temporal anomalies, namely Hotspots, Cracks, and Shadings. The dataset was collected on September 5, 2024 at the Soshanguve South Campus, Tshwane University of Technology, South Africa using DJI Mavic 3 Thermal's high-resolution thermal and visual imaging capabilities. DJI Mavic 3 Thermal coupled with its advanced flight features makes it an excellent tool for precise and efficient inspections of PV systems. The laboratory experiment performed on the dataset lasted one week. The work aims to provide supervised dataset good enough to support research method in providing a comprehensive and efficient approach to monitoring and maintaining large PV systems. Extensive analysis of the thermal data reveals the anomalies as indicative of faults in the solar cells of PV module, thereby opening up advancement in solar energy research. Because the data comes from a single-day collection and one week laboratory experiment, it makes the data more suitable for testing algorithms designed for fault detection. The dataset is publicly and freely available to the scientific community at 10.17632/5ssmfpgrpc.1.</p>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"57 ","pages":"111184"},"PeriodicalIF":1.0,"publicationDate":"2024-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11683254/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142906689","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Field data on diversity and vegetation structure of natural regeneration in a chronosequence of abandoned gold-mining lands in a tropical Amazon forest. 热带亚马逊雨林金矿废弃地自然更新多样性与植被结构时序野外数据
IF 1 Q3 MULTIDISCIPLINARY SCIENCES Pub Date : 2024-12-02 eCollection Date: 2024-12-01 DOI: 10.1016/j.dib.2024.111183
Jorge Garate-Quispe, Ramiro Canahuire-Robles, Marx Herrera-Machaca, Sufer Baez-Quispe, Gabriel Alarcón-Aguirre

Anthropogenic activities (e.g., logging, gold-mining, agriculture, and uncontrolled urban expansion) threaten the forests in the southeast of the Peruvian Amazon, one of the most diverse ecosystems worldwide. However, gold-mining generates the most severe impacts on ecosystems and limits its resilience. The natural regeneration of degraded areas in the southeastern Peruvian Amazon have not been studied deeply. The dataset contains floristic inventories of previously uncharacterized or poorly studied secondary forests degraded and abandoned by goldmining activities and an intact forest in the Tres Islas indigenous community, Madre de Dios region, in southeastern Peru. The data presented was obtained from 12 plots (20 m × 60 m) established in three successional forests abandoned by gold mining and an intact forest (without mining impacts), where all trees with a stem diameter at breast height greater than 1 cm were inventoried. To the best of our knowledge, this is the only dataset in the southwest of the Peruvian Amazon that compares the natural colonization after gold-mining and intact forests. This dataset can be useful for long-term study and monitoring of structure and tree diversity in relatively understudied yet important secondary forests after gold-mining abandonment. Also, this dataset could be used to analyze the successional trajectory process of vegetation and the recovery of aboveground biomass. Furthermore, the data could be used to investigate the effects of functional traits and types of mining on vegetation recovery. Hence, understanding the successional processes will help to improve restoration, reforestation, or reclamation strategies for the recovery of degraded lands in the Amazon.

人类活动(如伐木、金矿开采、农业和不受控制的城市扩张)威胁着秘鲁亚马逊东南部的森林,这是世界上最多样化的生态系统之一。然而,金矿开采对生态系统的影响最为严重,并限制了生态系统的恢复能力。秘鲁亚马逊东南部退化地区的自然再生尚未得到深入研究。该数据集包含了秘鲁东南部马德雷德迪奥斯地区特雷斯群岛土著社区一片完整森林的植物区系清单,其中包括以前未被描述或研究较少的次生林,以及因金矿开采活动而退化和废弃的次生林。本文的数据来自3个金矿开采废弃的演替森林和一个完整森林(没有采矿影响)中的12个样地(20 m × 60 m),其中所有茎粗胸高大于1 cm的树木都被调查。据我们所知,这是秘鲁亚马逊西南部唯一一个比较金矿开采后自然殖民化和完整森林的数据集。该数据集可用于长期研究和监测研究相对较少但重要的次生林在放弃金矿开采后的结构和树木多样性。该数据集还可用于分析植被演替轨迹过程和地上生物量恢复。此外,这些数据可用于研究功能性状和采矿类型对植被恢复的影响。因此,了解演替过程将有助于改善亚马逊退化土地的恢复、再造林或复垦策略。
{"title":"Field data on diversity and vegetation structure of natural regeneration in a chronosequence of abandoned gold-mining lands in a tropical Amazon forest.","authors":"Jorge Garate-Quispe, Ramiro Canahuire-Robles, Marx Herrera-Machaca, Sufer Baez-Quispe, Gabriel Alarcón-Aguirre","doi":"10.1016/j.dib.2024.111183","DOIUrl":"10.1016/j.dib.2024.111183","url":null,"abstract":"<p><p>Anthropogenic activities (e.g., logging, gold-mining, agriculture, and uncontrolled urban expansion) threaten the forests in the southeast of the Peruvian Amazon, one of the most diverse ecosystems worldwide. However, gold-mining generates the most severe impacts on ecosystems and limits its resilience. The natural regeneration of degraded areas in the southeastern Peruvian Amazon have not been studied deeply. The dataset contains floristic inventories of previously uncharacterized or poorly studied secondary forests degraded and abandoned by goldmining activities and an intact forest in the Tres Islas indigenous community, Madre de Dios region, in southeastern Peru. The data presented was obtained from 12 plots (20 m × 60 m) established in three successional forests abandoned by gold mining and an intact forest (without mining impacts), where all trees with a stem diameter at breast height greater than 1 cm were inventoried. To the best of our knowledge, this is the only dataset in the southwest of the Peruvian Amazon that compares the natural colonization after gold-mining and intact forests. This dataset can be useful for long-term study and monitoring of structure and tree diversity in relatively understudied yet important secondary forests after gold-mining abandonment. Also, this dataset could be used to analyze the successional trajectory process of vegetation and the recovery of aboveground biomass. Furthermore, the data could be used to investigate the effects of functional traits and types of mining on vegetation recovery. Hence, understanding the successional processes will help to improve restoration, reforestation, or reclamation strategies for the recovery of degraded lands in the Amazon.</p>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"57 ","pages":"111183"},"PeriodicalIF":1.0,"publicationDate":"2024-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11665693/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142881783","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Microbial community assembly across agricultural soil mineral mesocosms revealed by 16S rRNA gene amplicon sequencing data 16S rRNA基因扩增子测序数据揭示农业土壤矿物中生态系统中微生物群落的聚集
IF 1 Q3 MULTIDISCIPLINARY SCIENCES Pub Date : 2024-12-01 DOI: 10.1016/j.dib.2024.111125
Daniel Lee , Fernanda C C Oliveira , Richard T. Conant , Minjae Kim
Increasing atmospheric carbon dioxide (CO2) concentrations are impacting the global climate, resulting in significant interest in soil carbon sequestration as a mitigation strategy. While recognized that mineral-associated organic matter (MAOM) in soils is mainly formed through microbial activity, our understanding of microbial-derived MAOM formation processes remains limited due to the complexity of the soil environment. To gain insights into this issue, we incubated fresh soil samples for 45 days with one of three mineral additions: Sand, Kaolinite+Sand, or Illite+Sand. 16S rRNA V3/V4 gene amplicon sequencing was then conducted on samples using an Illumina NextSeq 2000 flow cell. The reads were analyzed and taxonomically assigned with QIIME2 v2023.5.1 and SILVA 138. The dataset has been made publicly available through NCBI GenBank under BioProject ID PRJNA1124235. This dataset is important and useful as it provides valuable insights into the interactions between soil minerals and microbial communities, which can inform strategies for enhancing soil carbon sequestration and mitigating climate change. Moreover, it serves as a crucial reference for future studies, offering a foundational understanding of microbial dynamics in soil systems and guiding further research in microbial ecology and carbon cycling.
大气中二氧化碳(CO2)浓度的增加正在影响全球气候,因此人们对将土壤固碳作为一种缓解战略产生了极大的兴趣。虽然认识到土壤中的矿物相关有机质(MAOM)主要通过微生物活动形成,但由于土壤环境的复杂性,我们对微生物衍生的MAOM形成过程的了解仍然有限。为了深入了解这个问题,我们将新鲜土壤样品与三种矿物添加物之一孵育了45天:沙子,高岭石+沙子或伊利石+沙子。16S rRNA V3/V4基因扩增子测序采用Illumina NextSeq 2000流式细胞仪。用QIIME2 v2023.5.1和SILVA 138对reads进行分析和分类。该数据集已通过NCBI GenBank公开提供,BioProject ID为PRJNA1124235。该数据集非常重要和有用,因为它为土壤矿物质和微生物群落之间的相互作用提供了有价值的见解,可以为加强土壤固碳和减缓气候变化的战略提供信息。此外,它还为未来的研究提供了重要的参考,为土壤系统微生物动力学的认识提供了基础,并指导了微生物生态学和碳循环的进一步研究。
{"title":"Microbial community assembly across agricultural soil mineral mesocosms revealed by 16S rRNA gene amplicon sequencing data","authors":"Daniel Lee ,&nbsp;Fernanda C C Oliveira ,&nbsp;Richard T. Conant ,&nbsp;Minjae Kim","doi":"10.1016/j.dib.2024.111125","DOIUrl":"10.1016/j.dib.2024.111125","url":null,"abstract":"<div><div>Increasing atmospheric carbon dioxide (CO<sub>2</sub>) concentrations are impacting the global climate, resulting in significant interest in soil carbon sequestration as a mitigation strategy. While recognized that mineral-associated organic matter (MAOM) in soils is mainly formed through microbial activity, our understanding of microbial-derived MAOM formation processes remains limited due to the complexity of the soil environment. To gain insights into this issue, we incubated fresh soil samples for 45 days with one of three mineral additions: Sand, Kaolinite+Sand, or Illite+Sand. 16S rRNA V3/V4 gene amplicon sequencing was then conducted on samples using an Illumina NextSeq 2000 flow cell. The reads were analyzed and taxonomically assigned with QIIME2 v2023.5.1 and SILVA 138. The dataset has been made publicly available through NCBI GenBank under BioProject ID PRJNA1124235. This dataset is important and useful as it provides valuable insights into the interactions between soil minerals and microbial communities, which can inform strategies for enhancing soil carbon sequestration and mitigating climate change. Moreover, it serves as a crucial reference for future studies, offering a foundational understanding of microbial dynamics in soil systems and guiding further research in microbial ecology and carbon cycling.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"57 ","pages":"Article 111125"},"PeriodicalIF":1.0,"publicationDate":"2024-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142745172","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Dataset of Centella Asiatica leaves for quality assessment and machine learning applications 用于质量评估和机器学习应用的积雪草叶片数据集
IF 1 Q3 MULTIDISCIPLINARY SCIENCES Pub Date : 2024-12-01 DOI: 10.1016/j.dib.2024.111150
Rohini Jadhav , Mayuri Molawade , Amol Bhosle , Yogesh Suryawanshi , Kailas Patil , Prawit Chumchu
Centella asiatica is a significant medicinal herb extensively used in traditional oriental medicine and gaining global popularity. The primary constituents of Centella asiatica leaves are triterpenoid saponins, which are predominantly believed to be responsible for its therapeutic properties. Ensuring the use of high-quality leaves in herbal medicine preparation is crucial across all medicinal practices. To address this quality control issue using machine learning applications, we have developed an image dataset of Centella asiatica leaves. The images were captured using Samsung Galaxy M21 mobile phones and depict the leaves in “Dried,” “Healthy,” and “Unhealthy” states. These states are further divided into “Single” and “Multiple” leaves categories, with “Single” leaves being further classified into “Front” and “Back” views to facilitate a comprehensive study. The images were pre-processed and standardized to 1024 × 768 dimensions, resulting in a dataset comprising a total of 9094 images. This dataset is instrumental in the development and evaluation of image recognition algorithms, serving as a foundational resource for computer vision research. Moreover, it provides a valuable platform for testing and validating algorithms in areas such as image categorization and object detection. For researchers exploring the medicinal potential of Centella asiatica in traditional medicine, this dataset offers critical information on the plantʼs health, thereby advancing research in herbal medicine and ethnopharmacology.
积雪草(Centella asiatica)是一种重要的中草药,在传统东方医学中被广泛使用,并受到全球的欢迎。积雪草叶的主要成分是三萜皂苷,主要被认为是负责其治疗特性。确保在草药制剂中使用高质量的叶子在所有医疗实践中都至关重要。为了使用机器学习应用程序解决这个质量控制问题,我们开发了积雪草叶子的图像数据集。这些照片是用三星Galaxy M21手机拍摄的,描绘了“干燥”、“健康”和“不健康”状态下的叶子。这些状态进一步分为“单”叶和“多”叶类别,其中“单”叶进一步分为“正面”和“背面”视图,以方便全面研究。对图像进行预处理并标准化为1024 × 768维,得到的数据集共包含9094张图像。该数据集有助于图像识别算法的开发和评估,是计算机视觉研究的基础资源。此外,它还为图像分类和目标检测等领域的算法测试和验证提供了一个有价值的平台。对于探索积雪草在传统医学中的药用潜力的研究人员来说,该数据集提供了关于植物健康的关键信息,从而推进了草药和民族药理学的研究。
{"title":"Dataset of Centella Asiatica leaves for quality assessment and machine learning applications","authors":"Rohini Jadhav ,&nbsp;Mayuri Molawade ,&nbsp;Amol Bhosle ,&nbsp;Yogesh Suryawanshi ,&nbsp;Kailas Patil ,&nbsp;Prawit Chumchu","doi":"10.1016/j.dib.2024.111150","DOIUrl":"10.1016/j.dib.2024.111150","url":null,"abstract":"<div><div><em>Centella asiatica</em> is a significant medicinal herb extensively used in traditional oriental medicine and gaining global popularity. The primary constituents of <em>Centella asiatica</em> leaves are triterpenoid saponins, which are predominantly believed to be responsible for its therapeutic properties. Ensuring the use of high-quality leaves in herbal medicine preparation is crucial across all medicinal practices. To address this quality control issue using machine learning applications, we have developed an image dataset of <em>Centella asiatica</em> leaves. The images were captured using Samsung Galaxy M21 mobile phones and depict the leaves in “Dried,” “Healthy,” and “Unhealthy” states. These states are further divided into “Single” and “Multiple” leaves categories, with “Single” leaves being further classified into “Front” and “Back” views to facilitate a comprehensive study. The images were pre-processed and standardized to 1024 × 768 dimensions, resulting in a dataset comprising a total of 9094 images. This dataset is instrumental in the development and evaluation of image recognition algorithms, serving as a foundational resource for computer vision research. Moreover, it provides a valuable platform for testing and validating algorithms in areas such as image categorization and object detection. For researchers exploring the medicinal potential of <em>Centella asiatica</em> in traditional medicine, this dataset offers critical information on the plantʼs health, thereby advancing research in herbal medicine and ethnopharmacology.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"57 ","pages":"Article 111150"},"PeriodicalIF":1.0,"publicationDate":"2024-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142756798","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Climate data dynamics: A high-volume real world structured weather dataset 气候数据动力学:一个大容量的真实世界结构化天气数据集
IF 1 Q3 MULTIDISCIPLINARY SCIENCES Pub Date : 2024-12-01 DOI: 10.1016/j.dib.2024.111156
Md Zubair , Md. Nafiz Ishtiaque Mahee , Khondaker Masfiq Reza , Md. Shahidul Salim , Nasim Ahmed
The dataset at hand is a unique resource, officially procured from the Bangladesh Meteorological Department, the sole government institution that diligently monitors weather through 35 strategically placed weather stations across the nation. This dataset is a treasure trove of actual data spanning several decades, from the inception of each weather station to the present. It has been meticulously restructured and processed into four (Rainfall, Temperature, Humidity, and Sunshine) key weather parameters, presented in a highly organized and accessible format. This format not only facilitates its use in the machine-learning training process but also opens up avenues for its application in climate research, weather forecasting, and a myriad of other statistical and machine-learning applications.
手头的数据集是一种独特的资源,官方从孟加拉国气象部门采购,这是唯一一个通过全国35个战略位置的气象站勤奋监测天气的政府机构。这个数据集是一个实际数据的宝库,跨越了几十年,从每个气象站开始到现在。它被精心重组和处理成四个(降雨量、温度、湿度和阳光)关键天气参数,以高度组织和可访问的格式呈现。这种格式不仅有助于其在机器学习训练过程中的使用,而且还为其在气候研究、天气预报以及无数其他统计和机器学习应用中的应用开辟了途径。
{"title":"Climate data dynamics: A high-volume real world structured weather dataset","authors":"Md Zubair ,&nbsp;Md. Nafiz Ishtiaque Mahee ,&nbsp;Khondaker Masfiq Reza ,&nbsp;Md. Shahidul Salim ,&nbsp;Nasim Ahmed","doi":"10.1016/j.dib.2024.111156","DOIUrl":"10.1016/j.dib.2024.111156","url":null,"abstract":"<div><div>The dataset at hand is a unique resource, officially procured from the Bangladesh Meteorological Department, the sole government institution that diligently monitors weather through 35 strategically placed weather stations across the nation. This dataset is a treasure trove of actual data spanning several decades, from the inception of each weather station to the present. It has been meticulously restructured and processed into four (Rainfall, Temperature, Humidity, and Sunshine) key weather parameters, presented in a highly organized and accessible format. This format not only facilitates its use in the machine-learning training process but also opens up avenues for its application in climate research, weather forecasting, and a myriad of other statistical and machine-learning applications.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"57 ","pages":"Article 111156"},"PeriodicalIF":1.0,"publicationDate":"2024-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142756799","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
BD-freshwater-fish: An image dataset from Bangladesh for AI-powered automatic fish species classification and detection toward smart aquaculture bd -淡水鱼:来自孟加拉国的图像数据集,用于人工智能驱动的鱼类自动分类和智能水产养殖检测
IF 1 Q3 MULTIDISCIPLINARY SCIENCES Pub Date : 2024-12-01 DOI: 10.1016/j.dib.2024.111132
Pranajit Kumar Das , Md. Abu Kawsar , Puspendu Biswas Paul , Md. Abdullah Al Mamun Hridoy , Md. Sanowar Hossain , Sabyasachi Niloy
There are about 33,000 different species of fish and they are visually identified using variety of traits, i.e., size and shape of body, head's size and shape, skin pattern, fin pattern, mouth pattern, scale pattern, and eye pattern etc. In traditional manner, identifying these fish species is always difficult with necked eye. Identification and detection of fish species from images using deep learning and computer vision based techniques is challenging topic among researchers worldwide as an interesting problem. Automatic fish species classification and detection has practical importance for both smart aquaculture and fish industry. AI powered deep learning and computer vision based automatic fish species recognition and sorting system becoming significant factor for making aquaculture industry more productive and sustainable. However, the performance of machine learning classifier greatly depends on the size of image dataset and the quality of the images in the dataset. This article demonstrate BD-Freshwater-Fish, an image dataset contain 4389 images of 12 different species captured in natural environment using HD mobile camera from local fish market of Sylhet and Jessore district of Bangladesh. Twelve (12) different data classes are: Rohu (Labeo rohita), Catla (Catla catla), Mrigal (Cirrhinus cirrhosus), Grass Carp (Ctenopharyngodon idella), Common Carp (Cyprinus carpio), Mirror Carp (Cyprinus carpio var. specularis), Black Rohu (Labeo calbasu), Silver Carp (Hypophthalmichthys molitrix), Striped Catfish (Pangasius pangasius), Nile Tilapia (Oreochromis niloticus), Long-whiskered Catfish (Sperata aor), Freshwater Shark (Wallago attu) has been included in the dataset with a different number of images of different species. The BD-Freshwater-Fish dataset is hosted by Department of Computer Science and Engineering mutually with the help of the Department of Aquaculture, Sylhet Agricultural University, Sylhet, Bangladesh.
世界上大约有33000种不同的鱼类,它们可以通过各种特征进行视觉识别,比如身体的大小和形状、头部的大小和形状、皮肤的形状、鳍的形状、嘴的形状、鳞片的形状和眼睛的形状等。在传统的方法中,用颈眼识别这些鱼类总是很困难的。利用深度学习和基于计算机视觉的技术从图像中识别和检测鱼类是一个具有挑战性的话题,也是一个有趣的问题。鱼类品种自动分类与检测对智能水产养殖和渔业都具有重要的现实意义。人工智能驱动的深度学习和基于计算机视觉的鱼类自动识别和分类系统成为提高水产养殖业生产力和可持续性的重要因素。然而,机器学习分类器的性能在很大程度上取决于图像数据集的大小和数据集中图像的质量。本文展示了bd - fresh - fish,这是一个图像数据集,包含12个不同物种的4389张图像,使用高清移动相机从孟加拉国Sylhet和Jessore地区的当地鱼市在自然环境中拍摄。十二(12)个不同的数据类是:罗虎(Labeo rohita)、鲶鱼(Catla Catla)、鲫鱼(Cirrhinus)、草鱼(Ctenopharyngodon idella)、鲤鱼(Cyprinus carpio)、镜鲤(Cyprinus carpio var. specularis)、黑罗虎(Labeo calbasu)、鲢鱼(Hypophthalmichthys molitrix)、条纹鲶鱼(Pangasius Pangasius)、尼罗罗非鱼(Oreochromis niloticus)、长须鲶鱼(Sperata aor)、淡水鲨鱼(Wallago attu)已被包含在不同物种的不同数量的图像中。bd -淡水-鱼类数据集由计算机科学与工程系在孟加拉国锡尔赫特农业大学水产养殖系的帮助下共同托管。
{"title":"BD-freshwater-fish: An image dataset from Bangladesh for AI-powered automatic fish species classification and detection toward smart aquaculture","authors":"Pranajit Kumar Das ,&nbsp;Md. Abu Kawsar ,&nbsp;Puspendu Biswas Paul ,&nbsp;Md. Abdullah Al Mamun Hridoy ,&nbsp;Md. Sanowar Hossain ,&nbsp;Sabyasachi Niloy","doi":"10.1016/j.dib.2024.111132","DOIUrl":"10.1016/j.dib.2024.111132","url":null,"abstract":"<div><div>There are about 33,000 different species of fish and they are visually identified using variety of traits, i.e., size and shape of body, head's size and shape, skin pattern, fin pattern, mouth pattern, scale pattern, and eye pattern etc. In traditional manner, identifying these fish species is always difficult with necked eye. Identification and detection of fish species from images using deep learning and computer vision based techniques is challenging topic among researchers worldwide as an interesting problem. Automatic fish species classification and detection has practical importance for both smart aquaculture and fish industry. AI powered deep learning and computer vision based automatic fish species recognition and sorting system becoming significant factor for making aquaculture industry more productive and sustainable. However, the performance of machine learning classifier greatly depends on the size of image dataset and the quality of the images in the dataset. This article demonstrate <em>BD-Freshwater-Fish</em>, an image dataset contain 4389 images of 12 different species captured in natural environment using HD mobile camera from local fish market of Sylhet and Jessore district of Bangladesh. Twelve (12) different data classes are: Rohu (<em>Labeo rohita</em>), Catla (<em>Catla catla</em>), Mrigal (<em>Cirrhinus cirrhosus</em>), Grass Carp (<em>Ctenopharyngodon idella)</em>, Common Carp (<em>Cyprinus carpio</em>), Mirror Carp (<em>Cyprinus carpio</em> var. specularis), Black Rohu (<em>Labeo calbasu</em>), Silver Carp (<em>Hypophthalmichthys molitrix),</em> Striped Catfish (<em>Pangasius pangasius</em>), Nile Tilapia (<em>Oreochromis niloticus</em>), Long-whiskered Catfish (<em>Sperata aor</em>), Freshwater Shark (<em>Wallago attu</em>) has been included in the dataset with a different number of images of different species. The <em>BD-Freshwater-Fish</em> dataset is hosted by Department of Computer Science and Engineering mutually with the help of the Department of Aquaculture, Sylhet Agricultural University, Sylhet, Bangladesh.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"57 ","pages":"Article 111132"},"PeriodicalIF":1.0,"publicationDate":"2024-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142745174","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Data in Brief
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1