首页 > 最新文献

Data in Brief最新文献

英文 中文
A transcriptome sequence dataset characterizing eggs, nymphs and adults of Oxycarenus hyalinipennis, the cotton seed bug 棉籽虫卵、若虫和成虫转录组序列数据集
IF 1.4 Q3 MULTIDISCIPLINARY SCIENCES Pub Date : 2026-04-01 Epub Date: 2026-02-05 DOI: 10.1016/j.dib.2026.112532
Sam D. Heraghty, Aijun Zhang, Daniel Kuhar, Dawn E. Gundersen-Rindal, Michael E. Sparks
The cotton seed bug, Oxycarenus hyalinipennis, is an agricultural pest that has recently been detected in the United States and has the potential to cause extensive economic damage to the cotton production industry. Currently, there are no transcriptomic resources for this species. The data reported here will serve to help guide future efforts to create additional reference resources as well as facilitate the development of population control strategies. These data could also be of use towards identifying protein coding genes in a cotton seed bug genome assembly. A total of 13,384 differentially expressed genes was identified, which collectively encoded 40,871 distinct transcripts, of which 18,842 could be annotated with a reference protein in the NCBI NR database, 13,233 with Pfam protein families and 8,089 with GO Gene Ontology terms. These transcripts could, for example, be targeted for future functional genomics work.
棉花籽虫,透明质氧虫,是最近在美国发现的一种农业害虫,有可能对棉花生产工业造成广泛的经济损失。目前,没有关于该物种的转录组学资源。这里报告的数据将有助于指导今后创造更多参考资源的工作,并促进人口控制战略的发展。这些数据也可用于鉴定棉籽虫基因组组装中的蛋白质编码基因。共鉴定出13,384个差异表达基因,共编码40,871个不同的转录本,其中18,842个可以用NCBI NR数据库中的参考蛋白进行注释,13,233个可以用Pfam蛋白家族进行注释,8,089个可以用GO基因本体术语进行注释。例如,这些转录本可以成为未来功能基因组学研究的目标。
{"title":"A transcriptome sequence dataset characterizing eggs, nymphs and adults of Oxycarenus hyalinipennis, the cotton seed bug","authors":"Sam D. Heraghty,&nbsp;Aijun Zhang,&nbsp;Daniel Kuhar,&nbsp;Dawn E. Gundersen-Rindal,&nbsp;Michael E. Sparks","doi":"10.1016/j.dib.2026.112532","DOIUrl":"10.1016/j.dib.2026.112532","url":null,"abstract":"<div><div>The cotton seed bug, <em>Oxycarenus hyalinipennis,</em> is an agricultural pest that has recently been detected in the United States and has the potential to cause extensive economic damage to the cotton production industry. Currently, there are no transcriptomic resources for this species. The data reported here will serve to help guide future efforts to create additional reference resources as well as facilitate the development of population control strategies. These data could also be of use towards identifying protein coding genes in a cotton seed bug genome assembly. A total of 13,384 differentially expressed genes was identified, which collectively encoded 40,871 distinct transcripts, of which 18,842 could be annotated with a reference protein in the NCBI NR database, 13,233 with Pfam protein families and 8,089 with GO Gene Ontology terms. These transcripts could, for example, be targeted for future functional genomics work.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"65 ","pages":"Article 112532"},"PeriodicalIF":1.4,"publicationDate":"2026-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146185141","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Whole genome sequencing data analysis identified a cefotaxime-resistant Empedobacter brevis GBW-1 isolate from ground beef encoding a novel metallo-beta-lactamase variant, blaEBR-6 全基因组测序数据分析发现,从碎牛肉中分离出一株耐头孢噻肟短恩培多杆菌GBW-1,该菌株编码一种新型金属β -内酰胺酶变体blaEBR-6
IF 1.4 Q3 MULTIDISCIPLINARY SCIENCES Pub Date : 2026-04-01 Epub Date: 2026-02-06 DOI: 10.1016/j.dib.2026.112547
Daniel Jones , Praful Aggarwal , Jamison Trewyn , Poojhaa Shanmugam , Kyle Leistikow , Troy Skwor
While investigating foodstuffs for ESBL-producing Aeromonas species on ampicillin dextrin agar with vancomycin and cefotaxime, a multidrug-resistant Empedobacter brevis strain GBW-1 was identified from ground beef. Phylogenetic analysis supports the interconnectedness of environment, humans and food driving this species' evolutionary development. Antimicrobial susceptibility testing demonstrated resistance to gentamicin, carbapenems and third-generation cephalosporins. Data collection from whole genome sequencing of this strain detected a 3.74 Mb genome with 32.8% GC content containing 3780 coding genes. Among these genes, at least three known antimicrobial resistance (AMR) genes were identified from the dataset with qacG, vanT gene within the vanG cluster, and a novel variant of the metallo-β-lactamase blaEBR-6. This homologue, EBR-6, was compared against previously known EBR variants and was found to be closest to EBR-3 with an 84.98% amino acid identity match. Data collection from in silico molecular docking experiments predicted these mutations change the binding to meropenem. Furthermore, nearly 100 annotated regions associated with mobile genetic elements, including the presence of tra operons, were identified on the genome. Together, this dataset provides, genomic, phenotypic, and in silico data that may be reused to monitor the evolution of EBR from a One Health perspective.
在用万古霉素和头孢他肟对氨苄西林糊精琼脂对食品中产生esbl的气单胞菌进行调查时,从碎牛肉中鉴定出一株多重耐药的短恩培多杆菌菌株GBW-1。系统发育分析支持环境、人类和食物的相互联系,推动了这个物种的进化发展。抗菌药物敏感性试验显示对庆大霉素、碳青霉烯类和第三代头孢菌素耐药。全基因组测序结果显示,该菌株基因组全长3.74 Mb, GC含量32.8%,编码基因3780个。在这些基因中,至少有三个已知的抗菌素耐药性(AMR)基因从数据集中鉴定出,其中qacG基因,vanG簇中的vanT基因,以及金属β-内酰胺酶blaEBR-6的新变体。该同源物EBR-6与先前已知的EBR变体进行了比较,发现与EBR-3最接近,氨基酸同源性为84.98%。从硅分子对接实验中收集的数据预测,这些突变改变了与美罗培南的结合。此外,在基因组上鉴定了近100个与移动遗传元件相关的注释区域,包括反操纵子的存在。总的来说,该数据集提供了基因组、表型和计算机数据,可以重用这些数据,从One Health的角度监测EBR的演变。
{"title":"Whole genome sequencing data analysis identified a cefotaxime-resistant Empedobacter brevis GBW-1 isolate from ground beef encoding a novel metallo-beta-lactamase variant, blaEBR-6","authors":"Daniel Jones ,&nbsp;Praful Aggarwal ,&nbsp;Jamison Trewyn ,&nbsp;Poojhaa Shanmugam ,&nbsp;Kyle Leistikow ,&nbsp;Troy Skwor","doi":"10.1016/j.dib.2026.112547","DOIUrl":"10.1016/j.dib.2026.112547","url":null,"abstract":"<div><div>While investigating foodstuffs for ESBL-producing <em>Aeromonas</em> species on ampicillin dextrin agar with vancomycin and cefotaxime, a multidrug-resistant <em>Empedobacter brevis</em> strain GBW-1 was identified from ground beef. Phylogenetic analysis supports the interconnectedness of environment, humans and food driving this species' evolutionary development. Antimicrobial susceptibility testing demonstrated resistance to gentamicin, carbapenems and third-generation cephalosporins. Data collection from whole genome sequencing of this strain detected a 3.74 Mb genome with 32.8% GC content containing 3780 coding genes. Among these genes, at least three known antimicrobial resistance (AMR) genes were identified from the dataset with <em>qacG, vanT</em> gene within the <em>vanG</em> cluster, and a novel variant of the metallo-β-lactamase <em>bla</em><sub>EBR-6</sub>. This homologue, EBR-6, was compared against previously known EBR variants and was found to be closest to EBR-3 with an 84.98% amino acid identity match. Data collection from <em>in silico</em> molecular docking experiments predicted these mutations change the binding to meropenem. Furthermore, nearly 100 annotated regions associated with mobile genetic elements, including the presence of <em>tra</em> operons, were identified on the genome. Together, this dataset provides, genomic, phenotypic, and <em>in</em> silico data that may be reused to monitor the evolution of EBR from a One Health perspective.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"65 ","pages":"Article 112547"},"PeriodicalIF":1.4,"publicationDate":"2026-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146185143","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Dataset on resource allocation and usage for a private cloud 关于私有云资源分配和使用的数据集
IF 1.4 Q3 MULTIDISCIPLINARY SCIENCES Pub Date : 2026-04-01 Epub Date: 2026-01-29 DOI: 10.1016/j.dib.2026.112514
Paola Marques, Mariana Mendes, Thiago Emmanuel Pereira, Giovanni Farias
While public cloud providers dominate the commercial landscape, private clouds are widely adopted by academic and research institutions to meet specific governance and operational requirements. There are multiple available datasets about resource usage of public clouds; however, datasets capturing usage patterns in private clouds remain scarce, which limits research in this area. This work presents a dataset comprising over 64 million records collected from a private OpenStack-based cloud operated by the Distributed Systems Laboratory at the Federal University of Campina Grande, Brazil. Data was continuously gathered over nearly twelve months (May 23, 2024 to May 16, 2025), periodically querying OpenStack APIs and monitoring services every five minutes. The dataset captures different aspects of the infrastructure, allocation quotas, user-to-project associations (as OpenStack groups users into projects), server (virtual machines) specifications, and resource utilization for users and projects. Entries are timestamped, enabling temporal analyses of system dynamics. Sensitive attributes, such as user names, project names, IP addresses, and server names were protected, leaving only system-generated UUIDs. By offering a detailed, time-stamped, view of a private cloud, this dataset provides a valuable resource for cloud computing research, helping to bridge the gap in publicly available datasets from non-commercial cloud environments. The dataset is valuable not only for academic institutions but also for companies considering cloud repatriation.
虽然公共云提供商在商业领域占据主导地位,但私有云被学术和研究机构广泛采用,以满足特定的治理和运营需求。关于公有云的资源使用有多个可用的数据集;然而,捕获私有云使用模式的数据集仍然很少,这限制了该领域的研究。这项工作展示了一个包含超过6400万条记录的数据集,这些记录来自一个由巴西坎皮纳格兰德联邦大学分布式系统实验室运营的基于openstack的私有云。连续收集数据近12个月(2024年5月23日- 2025年5月16日),每5分钟周期性查询OpenStack api和监控服务。该数据集捕获了基础设施、分配配额、用户到项目的关联(因为OpenStack将用户分组到项目中)、服务器(虚拟机)规范以及用户和项目的资源利用率的不同方面。条目有时间戳,支持对系统动力学进行时间分析。敏感属性,如用户名、项目名、IP地址和服务器名受到保护,只留下系统生成的uuid。通过提供详细的、带有时间戳的私有云视图,该数据集为云计算研究提供了宝贵的资源,有助于弥合来自非商业云环境的公开可用数据集的差距。该数据集不仅对学术机构很有价值,对考虑云回归的公司也很有价值。
{"title":"Dataset on resource allocation and usage for a private cloud","authors":"Paola Marques,&nbsp;Mariana Mendes,&nbsp;Thiago Emmanuel Pereira,&nbsp;Giovanni Farias","doi":"10.1016/j.dib.2026.112514","DOIUrl":"10.1016/j.dib.2026.112514","url":null,"abstract":"<div><div>While public cloud providers dominate the commercial landscape, private clouds are widely adopted by academic and research institutions to meet specific governance and operational requirements. There are multiple available datasets about resource usage of public clouds; however, datasets capturing usage patterns in private clouds remain scarce, which limits research in this area. This work presents a dataset comprising over 64 million records collected from a private OpenStack-based cloud operated by the Distributed Systems Laboratory at the Federal University of Campina Grande, Brazil. Data was continuously gathered over nearly twelve months (May 23, 2024 to May 16, 2025), periodically querying OpenStack APIs and monitoring services every five minutes. The dataset captures different aspects of the infrastructure, allocation quotas, user-to-project associations (as OpenStack groups users into projects), server (virtual machines) specifications, and resource utilization for users and projects. Entries are timestamped, enabling temporal analyses of system dynamics. Sensitive attributes, such as user names, project names, IP addresses, and server names were protected, leaving only system-generated UUIDs. By offering a detailed, time-stamped, view of a private cloud, this dataset provides a valuable resource for cloud computing research, helping to bridge the gap in publicly available datasets from non-commercial cloud environments. The dataset is valuable not only for academic institutions but also for companies considering cloud repatriation.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"65 ","pages":"Article 112514"},"PeriodicalIF":1.4,"publicationDate":"2026-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146185206","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Agri-vision Bangladesh: A multi-crop augmented image dataset for automated disease diagnosis in Bottle Gourd, Zucchini, Papaya, and Tomato Agri-vision Bangladesh:用于葫芦、西葫芦、木瓜和番茄疾病自动诊断的多作物增强图像数据集
IF 1.4 Q3 MULTIDISCIPLINARY SCIENCES Pub Date : 2026-04-01 Epub Date: 2026-01-29 DOI: 10.1016/j.dib.2026.112528
Md Masum Billah , Md. Anisur Rahman , Saifuddin Sagor , Sanzida Parvin , Mohammad Shorif Uddin
This article introduces Agri-Vision Bangladesh, a comprehensive, augmented image dataset designed to advance automated disease diagnosis in four economically vital agricultural crops: Bottle Gourd (Lagenaria siceraria), Zucchini (Cucurbita pepo), Papaya (Carica papaya), and Tomato (Solanum lycopersicum). Addressing the scarcity of region-specific agricultural data, a total of 5266 original images were acquired directly from diverse agricultural fields in Bangladesh using a SONY ALPHA 7 II full-frame camera under natural lighting conditions. The dataset encompasses 28 distinct classes, covering a wide spectrum of biotic stressors including viral (Mosaic Virus, Leaf Curl), fungal (Downy Mildew, Anthracnose, Alternaria Blight), bacterial (Bacterial Blight, Xanthomonas), and pest-induced damage (Insect Hole, White Spot), alongside Healthy samples. To ensure scientific reliability, each image underwent a rigorous two-stage validation process by senior agronomists. To tackle class imbalance and facilitate the training of data-intensive Deep Learning models, the dataset was expanded using a Python-based augmentation pipeline incorporating geometric transformations (rotation, flipping) and photometric adjustments (noise, brightness) resulting in a final repository of 28,000 images (5266 original and 22,734 augmented). All files are standardized to 512×512 pixels in JPG format. This expert-validated resource serves as a critical benchmark for developing robust computer vision algorithms (e.g., CNNs, Vision Transformers) for precision agriculture, enabling research into fine-grained classification, object detection, and cross-crop transfer learning in subtropical farming environments.
本文介绍了Agri-Vision Bangladesh,这是一个全面的增强图像数据集,旨在推进四种经济上至关重要的农作物的自动疾病诊断:葫芦(Lagenaria siceraria)、西葫芦(Cucurbita pepo)、木瓜(Carica Papaya)和番茄(Solanum lycopersicum)。为了解决特定区域农业数据的稀缺性,在自然光条件下,使用索尼ALPHA 7 II全画幅相机直接从孟加拉国不同的农业领域获取了总共5266张原始图像。该数据集包含28个不同的类别,涵盖了广泛的生物压力源,包括病毒(花叶病毒,卷曲叶病毒),真菌(霜霉病,炭疽病,疫病),细菌(细菌性疫病,黄单胞菌)和害虫引起的损害(虫洞,白斑),以及健康样本。为了确保科学的可靠性,每张图像都经过了高级农学家严格的两阶段验证过程。为了解决类不平衡问题并促进数据密集型深度学习模型的训练,使用基于python的增强管道扩展数据集,该管道包含几何变换(旋转,翻转)和光度调整(噪声,亮度),最终生成28,000张图像的存储库(5266张原始图像和22,734张增强图像)。所有文件都标准化为512×512像素的JPG格式。该专家验证的资源可作为开发用于精准农业的鲁棒计算机视觉算法(例如cnn, vision Transformers)的关键基准,使研究能够在亚热带农业环境中进行细粒度分类,目标检测和跨作物转移学习。
{"title":"Agri-vision Bangladesh: A multi-crop augmented image dataset for automated disease diagnosis in Bottle Gourd, Zucchini, Papaya, and Tomato","authors":"Md Masum Billah ,&nbsp;Md. Anisur Rahman ,&nbsp;Saifuddin Sagor ,&nbsp;Sanzida Parvin ,&nbsp;Mohammad Shorif Uddin","doi":"10.1016/j.dib.2026.112528","DOIUrl":"10.1016/j.dib.2026.112528","url":null,"abstract":"<div><div>This article introduces Agri-Vision Bangladesh, a comprehensive, augmented image dataset designed to advance automated disease diagnosis in four economically vital agricultural crops: Bottle Gourd (<em>Lagenaria siceraria</em>), Zucchini (<em>Cucurbita pepo</em>), Papaya (Carica papaya), and Tomato (<em>Solanum lycopersicum</em>). Addressing the scarcity of region-specific agricultural data, a total of 5266 original images were acquired directly from diverse agricultural fields in Bangladesh using a SONY ALPHA 7 II full-frame camera under natural lighting conditions. The dataset encompasses 28 distinct classes, covering a wide spectrum of biotic stressors including viral (Mosaic Virus, Leaf Curl), fungal (Downy Mildew, Anthracnose, Alternaria Blight), bacterial (Bacterial Blight, Xanthomonas), and pest-induced damage (Insect Hole, White Spot), alongside Healthy samples. To ensure scientific reliability, each image underwent a rigorous two-stage validation process by senior agronomists. To tackle class imbalance and facilitate the training of data-intensive Deep Learning models, the dataset was expanded using a Python-based augmentation pipeline incorporating geometric transformations (rotation, flipping) and photometric adjustments (noise, brightness) resulting in a final repository of 28,000 images (5266 original and 22,734 augmented). All files are standardized to 512×512 pixels in JPG format. This expert-validated resource serves as a critical benchmark for developing robust computer vision algorithms (e.g., CNNs, Vision Transformers) for precision agriculture, enabling research into fine-grained classification, object detection, and cross-crop transfer learning in subtropical farming environments.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"65 ","pages":"Article 112528"},"PeriodicalIF":1.4,"publicationDate":"2026-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146185207","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Visualizing archaeobotanical data: A comprehensive photographic record of desiccated plant remains from an early modern context at Santi Quattro Coronati, Rome 可视化的考古植物学数据:在罗马的Santi Quattro Coronati的早期现代背景下,对干燥的植物遗骸进行了全面的摄影记录
IF 1.4 Q3 MULTIDISCIPLINARY SCIENCES Pub Date : 2026-04-01 Epub Date: 2026-01-13 DOI: 10.1016/j.dib.2026.112468
Claudia Moricca , Rachele Nicolini , Lucrezia Masci , Lia Barelli , Simona Morretta , Raffaele Pugliese , Laura Sadori
<div><div>The “Santi Quattro Coronati – archaeobotanical plates” dataset presents a comprehensive photographic collection of carpological remains recovered from a pit in the complex of Santi Quattro Coronati (Rome, Italy). The deposit, dated between the late 15th and the mid-16th century, yielded a diverse assemblage of desiccated plant remains. The dataset is novel in that it provides the complete photographic documentation of all identified taxa from a single Early Modern archaeological context, a chronological phase that remains underrepresented in Italian archaeobotanical research.</div><div>The photographic documentation focuses on a representative sample of each taxon identified in the archaeobotanical analysis, with particular attention to the best-preserved specimens. When multiple plant parts of the same taxon were present, all were included. The dataset also includes fragile and rarely illustrated plant parts, such as cereal rachis fragments, tunics and basal plates of onion and garlic, grapevine tendrils and legume seed coats. These are often excluded from reference atlases due to their low archaeological survivability and the consequent scarcity of well-preserved comparative specimens.</div><div>High-resolution images were acquired using a Leica MC205C stereomicroscope equipped with a Leica IC80HD camera and the Leica Application Suite v.4.5.0 software. Illumination was provided by the Leica LED5000 HDI™ dome system, ensuring constant, diffuse light conditions. A column of images was captured for each specimen and processed with Helicon Focus v.7.0.1 Pro through focus stacking to obtain a single fully focused image. Depending on specimen size and complexity, between 9 and 127 photographs were used per perspective. Larger samples, unsuitable for microscopic observation, were photographed using a Canon digital camera under controlled illumination. Post-processing was performed with GIMP, applying standard tools for background cleaning and masking. Each final plate includes a scale bar for size reference.</div><div>The dataset is organized alphabetically by plant family and taxon. For each taxon, one or more plates are provided, displaying specimens from one to three perspectives to represent their 3D morphology. Nomenclature follows the taxonomy used in the original publication of the assemblage and has been updated according to the most recent checklist of the Italian vascular flora. A metadata .xls file is provided to facilitate consultation, reuse, comparison and integration with other archaeobotanical datasets.</div><div>This dataset offers a well-documented comparative visual reference for species/genus identification and for assessing the preservation state and morphological integrity of desiccated archaeobotanical remains. Offering detailed photographic records of New World plant taxa previously identified in this context, the study enhances accessibility and understanding of these materials through visual reference. Despite bein
“Santi Quattro Coronati -考古植物板块”数据集展示了从意大利罗马的Santi Quattro Coronati复合体的一个坑中恢复的人类学遗骸的综合摄影集合。该矿床的历史可以追溯到15世纪晚期到16世纪中期,发现了各种各样的干枯植物遗骸。该数据集的新颖之处在于,它提供了来自单一的早期现代考古背景的所有已识别分类群的完整照片文档,这是意大利考古植物学研究中尚未充分代表的时间顺序阶段。摄影文献集中于考古植物学分析中确定的每个分类单元的代表性样本,特别关注保存最完好的标本。当同一分类单元的多个植物部分存在时,所有部分都被包括在内。该数据集还包括易碎且很少展示的植物部分,如谷物轴片、洋葱和大蒜的外衣和基板、葡萄藤卷须和豆类种皮。由于它们的考古存续能力较低,因而缺乏保存完好的比较标本,因此经常被排除在参考地图集之外。使用配备徕卡IC80HD相机的徕卡MC205C立体显微镜和Leica Application Suite v.4.5.0软件获取高分辨率图像。照明由徕卡LED5000 HDI™穹顶系统提供,确保恒定的漫射光条件。每个标本采集一列图像,用Helicon Focus v.7.0.1 Pro进行对焦叠加处理,得到一张完全聚焦的图像。根据标本的大小和复杂程度,每个视角使用9到127张照片。较大的样本,不适合显微镜观察,使用佳能数码相机在受控照明下拍摄。使用GIMP进行后处理,使用标准工具进行背景清理和遮盖。每个最终板包括一个比例尺的尺寸参考。数据集按植物科和分类单元的字母顺序组织。对于每个分类单元,提供一个或多个板,从一个到三个角度展示标本,以表示它们的三维形态。命名法遵循汇编原始出版物中使用的分类法,并根据意大利维管植物区系的最新清单进行了更新。提供了一个元数据。xls文件,以便与其他考古植物数据集进行查阅、重用、比较和集成。该数据集为物种/属鉴定和评估干燥考古植物遗骸的保存状态和形态完整性提供了一个有充分记录的比较视觉参考。该研究提供了在此背景下发现的新大陆植物分类群的详细照片记录,通过视觉参考提高了对这些材料的可及性和理解。尽管受到单一背景的限制,该数据集代表了考古植物学的最佳实践,鼓励其他研究人员分享他们所研究的人类学组合的完整照片文档,从而支持开放科学和逐步构建扩展的视觉参考集合。该数据集主要用于研究早期现代背景的考古植物学家和环境考古学家,但它也可以为研究其他年代和地点的干枯植物遗骸的研究人员提供服务。
{"title":"Visualizing archaeobotanical data: A comprehensive photographic record of desiccated plant remains from an early modern context at Santi Quattro Coronati, Rome","authors":"Claudia Moricca ,&nbsp;Rachele Nicolini ,&nbsp;Lucrezia Masci ,&nbsp;Lia Barelli ,&nbsp;Simona Morretta ,&nbsp;Raffaele Pugliese ,&nbsp;Laura Sadori","doi":"10.1016/j.dib.2026.112468","DOIUrl":"10.1016/j.dib.2026.112468","url":null,"abstract":"&lt;div&gt;&lt;div&gt;The “Santi Quattro Coronati – archaeobotanical plates” dataset presents a comprehensive photographic collection of carpological remains recovered from a pit in the complex of Santi Quattro Coronati (Rome, Italy). The deposit, dated between the late 15th and the mid-16th century, yielded a diverse assemblage of desiccated plant remains. The dataset is novel in that it provides the complete photographic documentation of all identified taxa from a single Early Modern archaeological context, a chronological phase that remains underrepresented in Italian archaeobotanical research.&lt;/div&gt;&lt;div&gt;The photographic documentation focuses on a representative sample of each taxon identified in the archaeobotanical analysis, with particular attention to the best-preserved specimens. When multiple plant parts of the same taxon were present, all were included. The dataset also includes fragile and rarely illustrated plant parts, such as cereal rachis fragments, tunics and basal plates of onion and garlic, grapevine tendrils and legume seed coats. These are often excluded from reference atlases due to their low archaeological survivability and the consequent scarcity of well-preserved comparative specimens.&lt;/div&gt;&lt;div&gt;High-resolution images were acquired using a Leica MC205C stereomicroscope equipped with a Leica IC80HD camera and the Leica Application Suite v.4.5.0 software. Illumination was provided by the Leica LED5000 HDI™ dome system, ensuring constant, diffuse light conditions. A column of images was captured for each specimen and processed with Helicon Focus v.7.0.1 Pro through focus stacking to obtain a single fully focused image. Depending on specimen size and complexity, between 9 and 127 photographs were used per perspective. Larger samples, unsuitable for microscopic observation, were photographed using a Canon digital camera under controlled illumination. Post-processing was performed with GIMP, applying standard tools for background cleaning and masking. Each final plate includes a scale bar for size reference.&lt;/div&gt;&lt;div&gt;The dataset is organized alphabetically by plant family and taxon. For each taxon, one or more plates are provided, displaying specimens from one to three perspectives to represent their 3D morphology. Nomenclature follows the taxonomy used in the original publication of the assemblage and has been updated according to the most recent checklist of the Italian vascular flora. A metadata .xls file is provided to facilitate consultation, reuse, comparison and integration with other archaeobotanical datasets.&lt;/div&gt;&lt;div&gt;This dataset offers a well-documented comparative visual reference for species/genus identification and for assessing the preservation state and morphological integrity of desiccated archaeobotanical remains. Offering detailed photographic records of New World plant taxa previously identified in this context, the study enhances accessibility and understanding of these materials through visual reference. Despite bein","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"65 ","pages":"Article 112468"},"PeriodicalIF":1.4,"publicationDate":"2026-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146036501","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A reference-grade genome assembly data of sika deer in Hokkaido, Japan, Cervus nippon yesoensis 日本北海道梅花鹿(Cervus nippon yesoensis)参考级基因组组装数据
IF 1.4 Q3 MULTIDISCIPLINARY SCIENCES Pub Date : 2026-04-01 Epub Date: 2025-12-27 DOI: 10.1016/j.dib.2025.112423
Yuki Matsumoto , Junco Nagata , Yukiko Matsuura , Hayato Iijima
Sika deer (Cervus nippon) is naturally distributed across East Asia and includes 14 subspecies, showing phenotypic and genetic diversity. In this study, we constructed a de novo genome assembly of wild sika deer using one of the largest subspecies, C. n. yesoensis. We used HiFi, high quality long-read based on Pacific Bioscience to assemble our novel genome assembly CerNipYes1.0. The genome size of CerNipYes1.0 is estimated to be 3.1Gb, which is 0.6Gb larger than the other genome assembly of sika deer previously reported. The number of scaffolds is 1,810 and N50 length achieved 77Mb. Compleasm, a genome completeness evaluation tool based on Benchmarking Universal Single-Copy Orthologs (BUSCO) indicated that 12,562 (99.75%) genes are completed as genes with comparing to database. Our results indicate that CerNipYes1.0 is valuable to study the molecular biology, phylogeny and evolution of the Cervidae and its genome.
梅花鹿(Cervus nippon)自然分布于东亚地区,包括14个亚种,表现出表型和遗传多样性。在这项研究中,我们利用野生梅花鹿最大的亚种之一C. n. yesoensis构建了一个全新的基因组组装。我们使用HiFi,高质量的长读基于太平洋生物科学组装我们的新基因组组装CerNipYes1.0。CerNipYes1.0的基因组大小估计为3.1Gb,比先前报道的其他梅花鹿基因组大0.6Gb。支架数量为1810个,N50长度达到77Mb。基于BUSCO (Benchmarking Universal Single-Copy Orthologs)的基因组完整性评估工具Compleasm表明,与数据库比较,有12562个(99.75%)基因被完成为基因。结果表明,CerNipYes1.0在研究蛇科动物及其基因组的分子生物学、系统发育和进化方面具有重要的应用价值。
{"title":"A reference-grade genome assembly data of sika deer in Hokkaido, Japan, Cervus nippon yesoensis","authors":"Yuki Matsumoto ,&nbsp;Junco Nagata ,&nbsp;Yukiko Matsuura ,&nbsp;Hayato Iijima","doi":"10.1016/j.dib.2025.112423","DOIUrl":"10.1016/j.dib.2025.112423","url":null,"abstract":"<div><div>Sika deer (<em>Cervus nippon</em>) is naturally distributed across East Asia and includes 14 subspecies, showing phenotypic and genetic diversity. In this study, we constructed a de novo genome assembly of wild sika deer using one of the largest subspecies, <em>C. n. yesoensis</em>. We used HiFi, high quality long-read based on Pacific Bioscience to assemble our novel genome assembly CerNipYes1.0. The genome size of CerNipYes1.0 is estimated to be 3.1Gb, which is 0.6Gb larger than the other genome assembly of sika deer previously reported. The number of scaffolds is 1,810 and N50 length achieved 77Mb. Compleasm, a genome completeness evaluation tool based on Benchmarking Universal Single-Copy Orthologs (BUSCO) indicated that 12,562 (99.75%) genes are completed as genes with comparing to database. Our results indicate that CerNipYes1.0 is valuable to study the molecular biology, phylogeny and evolution of the Cervidae and its genome.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"65 ","pages":"Article 112423"},"PeriodicalIF":1.4,"publicationDate":"2026-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146075244","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
BrinjalFruitX: A field-collected image dataset for machine learning and deep learning-based disease identification in brinjal fruits BrinjalFruitX:用于机器学习和基于深度学习的茄子果实疾病识别的现场采集图像数据集
IF 1.4 Q3 MULTIDISCIPLINARY SCIENCES Pub Date : 2026-04-01 Epub Date: 2026-01-21 DOI: 10.1016/j.dib.2026.112490
Abu Kowshir Bitto , Md. Zahid Hasan , Md. Hasan Imam Bijoy , Khalid Been Badruzzaman Biplob , Mohammad Mahadi Hassan , Mohammad Shohel Rana , Abdul Kadar Muhammad Masum
Brinjal (Solanum melongena) or eggplant is one of the four most essential vegetable crops that are grown in Bangladesh and contribute significantly to the agricultural industry of the country. Brinjal supports the livelihood of numerous small farmers; however, brinjal is severely susceptible to various fruit diseases, which have serious impacts on yield quality and may cause considerable economic losses. While most existing plant disease datasets primarily focus on leaf-related disorders, only a limited number include fruit-related diseases and even those contain very few classes. This gap is significant because fruit diseases directly affect crop quality, market value, and overall yield. This is why we present here a new and comprehensive dataset that is unparalleled, exclusively for brinjal fruit diseases. This data set consists of 1823 high-quality, labelled images, across five distinct classes: Phomopsis Blight, Shoot and Fruit Borer, Fruit Cracking, Wet Rot, and Healthy Fruit. The images were collected from real farm conditions in numerous areas of Bangladesh to ensure a robust sample of varied environmental and farming practices impacting the growth of diseases. This dataset is designed with the unique aim to support plant disease research and enhance training of deep learning models for autonomous disease detection. Lastly, the dataset will allow early disease detection, enhancing crop management practice, reduction of losses, and increasing farmers' economic returns. The release of this dataset will encourage agricultural research as well as practical use in precision agriculture.
茄子(茄)或茄子是孟加拉国种植的四种最重要的蔬菜作物之一,对该国的农业作出了重大贡献。茄子支撑着无数小农的生计;然而,茄子极易发生各种果实病害,严重影响产量品质,并可能造成巨大的经济损失。虽然大多数现有的植物疾病数据集主要关注与叶片相关的疾病,但只有有限数量的数据集包括与水果相关的疾病,甚至这些疾病包含的类别也很少。这一差距是显著的,因为水果病害直接影响作物品质、市场价值和总产量。这就是为什么我们在这里提出一个新的和全面的数据集,是无与伦比的,专门为茄子果实疾病。该数据集由1823张高质量的、带标签的图像组成,分为五个不同的类别:油菜枯萎病、茎和果实蛀虫、果实开裂、湿腐病和健康水果。这些图像是从孟加拉国许多地区的真实农场条件中收集的,以确保对影响疾病增长的各种环境和农业做法提供可靠的样本。该数据集的设计具有独特的目的,以支持植物病害研究和增强深度学习模型的训练,用于自主病害检测。最后,该数据集将有助于早期发现疾病,加强作物管理实践,减少损失,并提高农民的经济回报。该数据集的发布将鼓励农业研究以及精准农业的实际应用。
{"title":"BrinjalFruitX: A field-collected image dataset for machine learning and deep learning-based disease identification in brinjal fruits","authors":"Abu Kowshir Bitto ,&nbsp;Md. Zahid Hasan ,&nbsp;Md. Hasan Imam Bijoy ,&nbsp;Khalid Been Badruzzaman Biplob ,&nbsp;Mohammad Mahadi Hassan ,&nbsp;Mohammad Shohel Rana ,&nbsp;Abdul Kadar Muhammad Masum","doi":"10.1016/j.dib.2026.112490","DOIUrl":"10.1016/j.dib.2026.112490","url":null,"abstract":"<div><div>Brinjal (Solanum melongena) or eggplant is one of the four most essential vegetable crops that are grown in Bangladesh and contribute significantly to the agricultural industry of the country. Brinjal supports the livelihood of numerous small farmers; however, brinjal is severely susceptible to various fruit diseases, which have serious impacts on yield quality and may cause considerable economic losses. While most existing plant disease datasets primarily focus on leaf-related disorders, only a limited number include fruit-related diseases and even those contain very few classes. This gap is significant because fruit diseases directly affect crop quality, market value, and overall yield. This is why we present here a new and comprehensive dataset that is unparalleled, exclusively for brinjal fruit diseases. This data set consists of 1823 high-quality, labelled images, across five distinct classes: Phomopsis Blight, Shoot and Fruit Borer, Fruit Cracking, Wet Rot, and Healthy Fruit. The images were collected from real farm conditions in numerous areas of Bangladesh to ensure a robust sample of varied environmental and farming practices impacting the growth of diseases. This dataset is designed with the unique aim to support plant disease research and enhance training of deep learning models for autonomous disease detection. Lastly, the dataset will allow early disease detection, enhancing crop management practice, reduction of losses, and increasing farmers' economic returns. The release of this dataset will encourage agricultural research as well as practical use in precision agriculture.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"65 ","pages":"Article 112490"},"PeriodicalIF":1.4,"publicationDate":"2026-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146185076","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A fully synthetic textual dataset of student learning habits and preferences generated using a large language model 使用大型语言模型生成的学生学习习惯和偏好的完全合成文本数据集
IF 1.4 Q3 MULTIDISCIPLINARY SCIENCES Pub Date : 2026-04-01 Epub Date: 2026-01-28 DOI: 10.1016/j.dib.2026.112512
Mehedi Hasan
Educational data mining and learning analytics have become important research areas for supporting pedagogical analysis, algorithm development, and privacy-preserving educational research. The advancement of natural language processing (NLP) methods in educational contexts depends on the availability of structured and well-documented textual datasets; however, access to real student data is often restricted due to ethical, legal, and privacy concerns. This article presents a fully synthetic textual dataset of student learning habits and preferences generated using a large language model (LLM). The dataset contains 10,000 CSV-formatted records representing fictional students and includes attributes such as education level, study hours, preferred learning methods, learning challenges, motivation levels, opinions on online learning, and primary devices used for study. Data generation was performed using structured prompting strategies with explicitly defined controlled vocabularies to ensure internal consistency and reproducibility while avoiding the use of any real personal information. The resulting dataset follows intentionally controlled and near-uniform distributions, with variables generated under independent constraints. This design limits its suitability for modelling real-world stochastic behaviour or discovering natural correlations but makes it appropriate for benchmarking educational NLP pipelines, evaluating synthetic data generation techniques, and conducting privacy-preserving survey and machine learning experiments.
教育数据挖掘和学习分析已成为支持教学分析、算法开发和隐私保护教育研究的重要研究领域。自然语言处理(NLP)方法在教育环境中的进步取决于结构化和文档完备的文本数据集的可用性;然而,由于道德、法律和隐私方面的考虑,访问真实的学生数据往往受到限制。本文介绍了使用大型语言模型(LLM)生成的学生学习习惯和偏好的完整合成文本数据集。该数据集包含10,000条csv格式的记录,代表虚构的学生,包括教育水平、学习时间、首选学习方法、学习挑战、动机水平、对在线学习的看法以及用于学习的主要设备等属性。数据生成使用具有明确定义的受控词汇表的结构化提示策略来执行,以确保内部一致性和可重复性,同时避免使用任何真实的个人信息。结果数据集遵循有意控制和接近均匀的分布,变量在独立约束下生成。这种设计限制了其对真实世界随机行为建模或发现自然相关性的适用性,但使其适合于对教育NLP管道进行基准测试,评估合成数据生成技术,以及进行隐私保护调查和机器学习实验。
{"title":"A fully synthetic textual dataset of student learning habits and preferences generated using a large language model","authors":"Mehedi Hasan","doi":"10.1016/j.dib.2026.112512","DOIUrl":"10.1016/j.dib.2026.112512","url":null,"abstract":"<div><div>Educational data mining and learning analytics have become important research areas for supporting pedagogical analysis, algorithm development, and privacy-preserving educational research. The advancement of natural language processing (NLP) methods in educational contexts depends on the availability of structured and well-documented textual datasets; however, access to real student data is often restricted due to ethical, legal, and privacy concerns. This article presents a fully synthetic textual dataset of student learning habits and preferences generated using a large language model (LLM). The dataset contains 10,000 CSV-formatted records representing fictional students and includes attributes such as education level, study hours, preferred learning methods, learning challenges, motivation levels, opinions on online learning, and primary devices used for study. Data generation was performed using structured prompting strategies with explicitly defined controlled vocabularies to ensure internal consistency and reproducibility while avoiding the use of any real personal information. The resulting dataset follows intentionally controlled and near-uniform distributions, with variables generated under independent constraints. This design limits its suitability for modelling real-world stochastic behaviour or discovering natural correlations but makes it appropriate for benchmarking educational NLP pipelines, evaluating synthetic data generation techniques, and conducting privacy-preserving survey and machine learning experiments.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"65 ","pages":"Article 112512"},"PeriodicalIF":1.4,"publicationDate":"2026-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146185080","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Species climate index data for United Kingdom invertebrates 英国无脊椎动物的物种气候指数数据
IF 1.4 Q3 MULTIDISCIPLINARY SCIENCES Pub Date : 2026-04-01 Epub Date: 2026-01-28 DOI: 10.1016/j.dib.2026.112500
Robin J. Pakeman
Numerous approaches have been used to assess the response of species to changing climate. One of the simplest is the calculation of indices which describe the climate of areas occupied by different species and uses them to assess community level change or to assess if species’ trends are predictable from the climate of their ranges. The paper describes the calculation of Species Climate Indices for 4924 UK invertebrate species from freshwater and terrestrial ecosystem by combining information from occurrence records and historical climate data. The indices calculated are the mean January temperature, mean July temperature and mean annual precipitation of 10 km x 10 km squares occupied by the species during the period used for calculating the climate data (1991–2020). These data have been used to assess if trends in occupancy are correlated to species’ climate indices [1] but are also ideally used for looking at trends within communities if repeat sampling has been carried out.
许多方法被用来评估物种对气候变化的反应。其中最简单的一种是计算描述不同物种所占地区气候的指数,并用它们来评估群落水平的变化,或评估物种的趋势是否可以从其范围的气候预测。本文结合发生记录和历史气候资料,计算了英国淡水和陆地生态系统中4924种无脊椎动物的物种气候指数。计算的指数为计算气候资料所用期间(1991-2020年)各物种所占10 km × 10 km平方的1月平均气温、7月平均气温和年平均降水量。这些数据被用来评估占用趋势是否与物种的气候指数[1]相关,但如果进行了重复采样,也可以用来观察群落内的趋势。
{"title":"Species climate index data for United Kingdom invertebrates","authors":"Robin J. Pakeman","doi":"10.1016/j.dib.2026.112500","DOIUrl":"10.1016/j.dib.2026.112500","url":null,"abstract":"<div><div>Numerous approaches have been used to assess the response of species to changing climate. One of the simplest is the calculation of indices which describe the climate of areas occupied by different species and uses them to assess community level change or to assess if species’ trends are predictable from the climate of their ranges. The paper describes the calculation of Species Climate Indices for 4924 UK invertebrate species from freshwater and terrestrial ecosystem by combining information from occurrence records and historical climate data. The indices calculated are the mean January temperature, mean July temperature and mean annual precipitation of 10 km x 10 km squares occupied by the species during the period used for calculating the climate data (1991–2020). These data have been used to assess if trends in occupancy are correlated to species’ climate indices [<span><span>1</span></span>] but are also ideally used for looking at trends within communities if repeat sampling has been carried out.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"65 ","pages":"Article 112500"},"PeriodicalIF":1.4,"publicationDate":"2026-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146185138","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Dataset of 12,161 steel rebar tests from sudanese construction projects (2016-2022) 2016-2022年苏丹建筑项目12161根钢筋试验数据集
IF 1.4 Q3 MULTIDISCIPLINARY SCIENCES Pub Date : 2026-04-01 Epub Date: 2026-01-15 DOI: 10.1016/j.dib.2026.112469
Amged O. Abdelatif, Abdelrahim H. Abdelrahim, Gamar-Aldwla S. Shangray, Mohammed-Alfatih Mustafa, Mustafa M. Abaker, Yahia A. Idris, Abdelrahim M. Yousif
This data article describes a comprehensive dataset comprising 12,161 individual steel reinforcement bar tensile tests (3,898 test reports) collected from various construction projects across Sudan between 2016 and 2022. The data was systematically extracted from official test reports generated by the University of Khartoum, Faculty of Engineering, Department of Civil Engineering, Material and Structures Testing Laboratory. The purpose of this dataset is to establish a verified, large-scale baseline of material performance for Sudanese reinforcement steel, providing transparent and verifiable raw values of key mechanical and dimensional properties for locally sourced rebars with tested diameters ranging from 8 mm to 32 mm. This data is intended for reuse to conduct rigorous analyses on steel reinforcement quality and characteristic properties in Sudan, offering a unique baseline for regional construction quality and providing a representative performance benchmark applicable to other developing countries.
这篇数据文章描述了一个全面的数据集,其中包括2016年至2022年间从苏丹各地的各种建筑项目收集的12,161个单独的钢筋拉伸试验(3,898个试验报告)。数据系统地摘自喀土穆大学土木工程系工程学院、材料和结构测试实验室编制的正式测试报告。该数据集的目的是为苏丹钢筋建立一个经过验证的大规模材料性能基线,为当地采购的钢筋提供透明和可验证的关键机械和尺寸性能原始值,测试直径范围为8毫米至32毫米。这些数据旨在重新利用,以便对苏丹的钢筋质量和特性进行严格分析,为区域建筑质量提供独特的基线,并提供适用于其他发展中国家的具有代表性的性能基准。
{"title":"Dataset of 12,161 steel rebar tests from sudanese construction projects (2016-2022)","authors":"Amged O. Abdelatif,&nbsp;Abdelrahim H. Abdelrahim,&nbsp;Gamar-Aldwla S. Shangray,&nbsp;Mohammed-Alfatih Mustafa,&nbsp;Mustafa M. Abaker,&nbsp;Yahia A. Idris,&nbsp;Abdelrahim M. Yousif","doi":"10.1016/j.dib.2026.112469","DOIUrl":"10.1016/j.dib.2026.112469","url":null,"abstract":"<div><div>This data article describes a comprehensive dataset comprising 12,161 individual steel reinforcement bar tensile tests (3,898 test reports) collected from various construction projects across Sudan between 2016 and 2022. The data was systematically extracted from official test reports generated by the University of Khartoum, Faculty of Engineering, Department of Civil Engineering, Material and Structures Testing Laboratory. The purpose of this dataset is to establish a verified, large-scale baseline of material performance for Sudanese reinforcement steel, providing transparent and verifiable raw values of key mechanical and dimensional properties for locally sourced rebars with tested diameters ranging from 8 mm to 32 mm. This data is intended for reuse to conduct rigorous analyses on steel reinforcement quality and characteristic properties in Sudan, offering a unique baseline for regional construction quality and providing a representative performance benchmark applicable to other developing countries.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"65 ","pages":"Article 112469"},"PeriodicalIF":1.4,"publicationDate":"2026-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146075167","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Data in Brief
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1