Pub Date : 2026-01-15DOI: 10.1016/j.dib.2026.112473
Justyna Kadłuczka , Tatsiana Chubukova , Przemysław Mielczarek , Agata Maziak , Adam Roman , Emilija Napieralska , Katarzyna Z. Kuter
The dataset shows proteomic results (timsTOF Pro 2 (Bruker)) obtained using an originally developed method of adult rat brain isolation of astrocytes or microglia from the same sample. Mechano-enzymatic dissociation and FACS sorting retrieved pure, separate cellular fractions from the substantia nigra. Results come from an animal model of early Parkinson’s disease of selective nigrostriatal dopaminergic system neuron degeneration by 6-OHDA, combined with 7-day-long astrocyte dysfunction and death induced by fluorocitrate. Astrocyte and neuron death both induce microglial activation, but to varying degrees and through different mechanisms. Previous studies did not allow for assigning changes in common mechanisms (such as, for example, energy metabolism) to a specific cell type in tissue, while in vitro studies lack functional dimension. This research enables the identification of clear information on mechanisms within each cell type, originating from a multidimensional environment, while maintaining the functional and tissue-specific context. Comparison of astrocyte death-induced vs neuron death-induced microglia activation processes can be analysed using this dataset. Raw data are available via ProteomeXchange with identifiers PXD066353 and PXD067265.
该数据集显示了蛋白质组学结果(timsTOF Pro 2 (Bruker)),使用最初开发的方法从相同样品中分离成年大鼠脑星形胶质细胞或小胶质细胞获得。机械酶解和FACS分选从黑质中提取了纯的、分离的细胞组分。结果来自于6-羟多巴胺诱导的选择性黑质纹状体多巴胺能系统神经元变性,并伴7天星形胶质细胞功能障碍和氟柠檬酸致死亡的早期帕金森病动物模型。星形胶质细胞和神经元死亡都能诱导小胶质细胞活化,但程度和机制不同。以前的研究不允许将共同机制的变化(例如,能量代谢)分配给组织中的特定细胞类型,而体外研究缺乏功能维度。这项研究能够在维持功能和组织特异性背景的同时,从多维环境中确定每种细胞类型机制的清晰信息。星形胶质细胞死亡诱导与神经元死亡诱导的小胶质细胞激活过程的比较可以使用该数据集进行分析。原始数据可通过ProteomeXchange与标识符PXD066353和PXD067265。
{"title":"Glial cell-specific proteomic data from the substantia nigra of a rat 6-OHDA and fluorocitrate model of astrocyte death and microglial activation","authors":"Justyna Kadłuczka , Tatsiana Chubukova , Przemysław Mielczarek , Agata Maziak , Adam Roman , Emilija Napieralska , Katarzyna Z. Kuter","doi":"10.1016/j.dib.2026.112473","DOIUrl":"10.1016/j.dib.2026.112473","url":null,"abstract":"<div><div>The dataset shows proteomic results (timsTOF Pro 2 (Bruker)) obtained using an originally developed method of adult rat brain isolation of astrocytes or microglia from the same sample. Mechano-enzymatic dissociation and FACS sorting retrieved pure, separate cellular fractions from the substantia nigra. Results come from an animal model of early Parkinson’s disease of selective nigrostriatal dopaminergic system neuron degeneration by 6-OHDA, combined with 7-day-long astrocyte dysfunction and death induced by fluorocitrate. Astrocyte and neuron death both induce microglial activation, but to varying degrees and through different mechanisms. Previous studies did not allow for assigning changes in common mechanisms (such as, for example, energy metabolism) to a specific cell type in tissue, while in vitro studies lack functional dimension. This research enables the identification of clear information on mechanisms within each cell type, originating from a multidimensional environment, while maintaining the functional and tissue-specific context. Comparison of astrocyte death-induced vs neuron death-induced microglia activation processes can be analysed using this dataset. Raw data are available via ProteomeXchange with identifiers PXD066353 and PXD067265.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"65 ","pages":"Article 112473"},"PeriodicalIF":1.4,"publicationDate":"2026-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146075315","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-01-15DOI: 10.1016/j.dib.2026.112480
Yuhan Guo , Xihan Mu , Xiang Lyu , Dasheng Fan , Chengzhuo Lei , Ruiqiang Wu , Donghui Xie , Guangjian Yan
Medium- to high-resolution satellite data, such as Sentinel-2 and Landsat-8, have significantly enhanced the accuracy of vegetation monitoring. However, canopy reflectance and vegetation indices are affected by the bidirectional reflectance distribution function (BRDF) effect, introducing uncertainties in vegetation phenology monitoring and parameter retrieval. Herein, a multi-angle reflectance dataset was developed using unmanned aerial vehicles (UAVs) to investigate the angular effect over the crop canopy across various growth stages. Multi-angle measurements were acquired over two 10 m × 10 m plots of wheat and peach trees in Yukou Town, Pinggu District, Beijing, China, during their respective key growth stages (29 Feb to 21 May in 2024 for wheat; 11 Apr to 12 Jun in 2024 for peach trees). Wheat measurements were obtained using a DJI Matrice 350 RTK equipped with a Cubert ULTRIS X20 Plus hyperspectral imager, while peach tree data were captured with an Agrowing 61MP Sextuple multispectral imager. UAV flights used a spherical-helical trajectory to maximize solar-view geometry coverage, yielding over 1,800 angular observations under clear-sky conditions. All UAV imagery was post-processed and stored in text and Excel formats. The dataset provides high-quality, multi-angle spectral measurements for two representative crops during multiple growth stages, enabling in-depth investigations into BRDF characteristics, angular sensitivity of spectral metrics, and temporal spectral dynamics. Furthermore, the data support the validation of physical models for vegetation remote sensing, vegetation parameter inversion, and the training of machine-learning models for more remote sensing applications.
{"title":"A multi-angle reflectance dataset of wheat and peach trees with unmanned aerial vehicle imagery","authors":"Yuhan Guo , Xihan Mu , Xiang Lyu , Dasheng Fan , Chengzhuo Lei , Ruiqiang Wu , Donghui Xie , Guangjian Yan","doi":"10.1016/j.dib.2026.112480","DOIUrl":"10.1016/j.dib.2026.112480","url":null,"abstract":"<div><div>Medium- to high-resolution satellite data, such as Sentinel-2 and Landsat-8, have significantly enhanced the accuracy of vegetation monitoring. However, canopy reflectance and vegetation indices are affected by the bidirectional reflectance distribution function (BRDF) effect, introducing uncertainties in vegetation phenology monitoring and parameter retrieval. Herein, a multi-angle reflectance dataset was developed using unmanned aerial vehicles (UAVs) to investigate the angular effect over the crop canopy across various growth stages. Multi-angle measurements were acquired over two 10 m × 10 m plots of wheat and peach trees in Yukou Town, Pinggu District, Beijing, China, during their respective key growth stages (29 Feb to 21 May in 2024 for wheat; 11 Apr to 12 Jun in 2024 for peach trees). Wheat measurements were obtained using a DJI Matrice 350 RTK equipped with a Cubert ULTRIS X20 Plus hyperspectral imager, while peach tree data were captured with an Agrowing 61MP Sextuple multispectral imager. UAV flights used a spherical-helical trajectory to maximize solar-view geometry coverage, yielding over 1,800 angular observations under clear-sky conditions. All UAV imagery was post-processed and stored in text and Excel formats. The dataset provides high-quality, multi-angle spectral measurements for two representative crops during multiple growth stages, enabling in-depth investigations into BRDF characteristics, angular sensitivity of spectral metrics, and temporal spectral dynamics. Furthermore, the data support the validation of physical models for vegetation remote sensing, vegetation parameter inversion, and the training of machine-learning models for more remote sensing applications.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"65 ","pages":"Article 112480"},"PeriodicalIF":1.4,"publicationDate":"2026-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146075158","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
This dataset provides a comprehensive, multidimensional phytochemical characterization of Lekhniya Mahakashaya (LMK), a classical Ayurvedic formulation used for the Treatment of obesity and metabolic disorders. Three complementary analytical platforms were employed: High-Resolution Liquid Chromatography-Mass Spectrometry/Mass Spectrometry (HRLC-MS/MS) Orbitrap, High-Performance Thin Layer Chromatography (HPTLC), and Fourier-Transform Infrared (FTIR) spectroscopy. For HRLC-MS/MS analysis, Hydroalcoholic extracts of LMK were prepared and analysed in both Positive and negative ionisation modes using an Orbitrap mass spectrometer. The dataset includes 2034 metabolomics-identified compounds: 1712 in positive ion mode and 322 in negative ion mode, with detailed retention times, molecular weights, and fragmentation patterns, suitable for compound annotation, metabolite networking, and cheminformatics-based correlation studies. HPTLC fingerprinting was performed using methanolic extracts (2–10 µL) on silica gel 60 F₂₅₄ plates, which yielded 7–8 reproducible peaks across the Rf range 0.12–0.89 under 254 nm, 366 nm, and 540 nm, confirming LMK’s polyherbal complexity. Marker-based quantification revealed that berberine (0.24 % w/w) and curcumin (0.31 % w/w) were performed using validated HPTLC protocols, and calibration curves are included for reproducibility. FTIR Spectroscopic data encompass 19 absorption peaks (3278–0468 cm⁻¹), representing hydroxyl, aliphatic, unsaturated, sulfur-, nitrogen-, and halogen-containing functional groups, which highlights LMK’s diverse phytochemical matrix. This dataset is structured for pharmacological exploration, quality control, and phytochemical standardisation of LMK and associated Ayurvedic formulations. This dataset is a reference resource. Additionally, the dataset can be used for molecular docking validation, network pharmacology mapping, metabolomics comparisons, and future drug discovery. To promote transparency, encourage computational or experimental reuse, and support integrative research on traditional medicine, all raw chromatograms, spectrum files, and processed data tables are made available in widely accessible formats.
{"title":"Multi-analytical dataset on Lekhaniya Mahakashaya: HRLC-MS/MS Orbitrap profiling, HPTLC fingerprinting with marker estimation, and FTIR spectroscopy","authors":"Narayan Singh, Anjali Upadhyay, Debajyoti Chakraborty, Girimalla Patil, Pramod Yadav, Galib R, Pradeep Kumar Prajapati","doi":"10.1016/j.dib.2026.112464","DOIUrl":"10.1016/j.dib.2026.112464","url":null,"abstract":"<div><div>This dataset provides a comprehensive, multidimensional phytochemical characterization of <em>Lekhniya Mahakashaya</em> (LMK), a classical Ayurvedic formulation used for the Treatment of obesity and metabolic disorders. Three complementary analytical platforms were employed: High-Resolution Liquid Chromatography-Mass Spectrometry/Mass Spectrometry (HRLC-MS/MS) Orbitrap, High-Performance Thin Layer Chromatography (HPTLC), and Fourier-Transform Infrared (FTIR) spectroscopy. For HRLC-MS/MS analysis, Hydroalcoholic extracts of LMK were prepared and analysed in both Positive and negative ionisation modes using an Orbitrap mass spectrometer. The dataset includes 2034 metabolomics-identified compounds: 1712 in positive ion mode and 322 in negative ion mode, with detailed retention times, molecular weights, and fragmentation patterns, suitable for compound annotation, metabolite networking, and cheminformatics-based correlation studies. HPTLC fingerprinting was performed using methanolic extracts (2–10 µL) on silica gel 60 F₂₅₄ plates, which yielded 7–8 reproducible peaks across the Rf range 0.12–0.89 under 254 nm, 366 nm, and 540 nm, confirming LMK’s polyherbal complexity. Marker-based quantification revealed that berberine (0.24 % w/w) and curcumin (0.31 % w/w) were performed using validated HPTLC protocols, and calibration curves are included for reproducibility. FTIR Spectroscopic data encompass 19 absorption peaks (3278–0468 cm⁻¹), representing hydroxyl, aliphatic, unsaturated, sulfur-, nitrogen-, and halogen-containing functional groups, which highlights LMK’s diverse phytochemical matrix. This dataset is structured for pharmacological exploration, quality control, and phytochemical standardisation of LMK and associated Ayurvedic formulations. This dataset is a reference resource. Additionally, the dataset can be used for molecular docking validation, network pharmacology mapping, metabolomics comparisons, and future drug discovery. To promote transparency, encourage computational or experimental reuse, and support integrative research on traditional medicine, all raw chromatograms, spectrum files, and processed data tables are made available in widely accessible formats.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"65 ","pages":"Article 112464"},"PeriodicalIF":1.4,"publicationDate":"2026-01-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146075124","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
This dataset presents experimental data on the performance of a photovoltaic (PV) solar-powered water pumping system installed in a coffee plantation in Chiang Mai province, Thailand. The system performance was evaluated through controlled experiments using response surface methodology (RSM). Three independent variables were systematically varied: solar irradiance (300–900 W/m²), panel inclination (15–35°), and panel surface temperature (30–60°C). A total of 15 experimental runs were conducted, and the pumping efficiency (%) was recorded under each condition. Statistical analyses, including analysis of variance (ANOVA) and regression modeling, were applied to evaluate the effects of the individual variables and their interactions on system performance. The dataset includes raw and processed measurements, regression coefficients, and response surface parameters, enabling replication and further analysis. Perturbation plots, 3D surface plots, and contour plots provide detailed visualizations of the relationships between environmental factors and system efficiency. The optimal operating conditions were identified at a solar irradiance of 600 W/m², a panel inclination of 25°, and a panel surface temperature of 45°C, corresponding to a predicted maximum efficiency of 76.3–77.0%.
This dataset can be reused for designing optimized solar water pumping systems, validating predictive models, and comparing system performance under different environmental conditions or geographic locations. It also serves as a reference for researchers in renewable energy system optimization and agricultural water management. The data provide high-resolution, experimentally validated information on the combined effects of solar irradiance, panel inclination, and panel surface temperature on PV water pumping efficiency. Unlike previous studies, it includes detailed quantitative analysis specific to coffee-growing regions in Northern Thailand, along with regression models and visualizations that can guide both experimental replication and predictive modeling under similar climatic and agricultural conditions
{"title":"Dataset on the performance of a photovoltaic solar water pump in coffee plantations using response surface methodology (RSM)","authors":"Nopparat Suriyachai, Torpong Kreetachat, Saksit Imman","doi":"10.1016/j.dib.2026.112467","DOIUrl":"10.1016/j.dib.2026.112467","url":null,"abstract":"<div><div>This dataset presents experimental data on the performance of a photovoltaic (PV) solar-powered water pumping system installed in a coffee plantation in Chiang Mai province, Thailand. The system performance was evaluated through controlled experiments using response surface methodology (RSM). Three independent variables were systematically varied: solar irradiance (300–900 W/m²), panel inclination (15–35°), and panel surface temperature (30–60°C). A total of 15 experimental runs were conducted, and the pumping efficiency (%) was recorded under each condition. Statistical analyses, including analysis of variance (ANOVA) and regression modeling, were applied to evaluate the effects of the individual variables and their interactions on system performance. The dataset includes raw and processed measurements, regression coefficients, and response surface parameters, enabling replication and further analysis. Perturbation plots, 3D surface plots, and contour plots provide detailed visualizations of the relationships between environmental factors and system efficiency. The optimal operating conditions were identified at a solar irradiance of 600 W/m², a panel inclination of 25°, and a panel surface temperature of 45°C, corresponding to a predicted maximum efficiency of 76.3–77.0%.</div><div>This dataset can be reused for designing optimized solar water pumping systems, validating predictive models, and comparing system performance under different environmental conditions or geographic locations. It also serves as a reference for researchers in renewable energy system optimization and agricultural water management. The data provide high-resolution, experimentally validated information on the combined effects of solar irradiance, panel inclination, and panel surface temperature on PV water pumping efficiency. Unlike previous studies, it includes detailed quantitative analysis specific to coffee-growing regions in Northern Thailand, along with regression models and visualizations that can guide both experimental replication and predictive modeling under similar climatic and agricultural conditions</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"65 ","pages":"Article 112467"},"PeriodicalIF":1.4,"publicationDate":"2026-01-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146036170","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-01-14DOI: 10.1016/j.dib.2026.112466
Gerardo Roa-Dabike , Trevor J. Cox , Jon P. Barker , Bruno M. Fazenda , Simone Graetzer , Rebecca R. Vos , Michael A. Akeroyd , Jennifer Firth , William M. Whitmer , Scott Bannister , Alinka Greasley
This paper presents CLIP, a dataset of 11,072 popular western music signals sourced from independent artists, accompanied by ground truth lyrics, and lyric intelligibility scores from listening tests. The dataset is designed to facilitate music information retrieval (MIR) research using machine learning. It was created to allow the development of algorithms to predict lyric intelligibility for the Cadenza ICASSP 2026 Signal Processing Grand Challenge. Currently, it is the only publicly available large-scale dataset for such a task. The music was sourced from the Free Music Archive (FMA) dataset and is unlikely to be familiar to listeners. We excluded tracks whose license did not allow derivative works and those that did not have English singing. Ground truth transcriptions were generated by seven native English speakers, resulting in 3700 excerpts of 5 to 10 words each from 1452 different songs. A hearing loss simulation was also applied to the stereo audio. This resulted in 11,100 music signals with no, mild or moderate hearing loss. This was done so more diverse hearing is represented in the dataset. Human transcriptions were then collected via an online listening experiment. Participants self-reported as having normal-hearing and being native English speakers. They listened to each music signal twice before transcribing each line. Final intelligibility scores were the ratio of matching words between the listening test responses and the ground truth transcriptions. The final dataset consists of audio, ground truth lyrics, intelligibility scores and associated metadata.
{"title":"The Cadenza lyric intelligibility prediction (CLIP) dataset","authors":"Gerardo Roa-Dabike , Trevor J. Cox , Jon P. Barker , Bruno M. Fazenda , Simone Graetzer , Rebecca R. Vos , Michael A. Akeroyd , Jennifer Firth , William M. Whitmer , Scott Bannister , Alinka Greasley","doi":"10.1016/j.dib.2026.112466","DOIUrl":"10.1016/j.dib.2026.112466","url":null,"abstract":"<div><div>This paper presents CLIP, a dataset of 11,072 popular western music signals sourced from independent artists, accompanied by ground truth lyrics, and lyric intelligibility scores from listening tests. The dataset is designed to facilitate music information retrieval (MIR) research using machine learning. It was created to allow the development of algorithms to predict lyric intelligibility for the Cadenza ICASSP 2026 Signal Processing Grand Challenge. Currently, it is the only publicly available large-scale dataset for such a task. The music was sourced from the Free Music Archive (FMA) dataset and is unlikely to be familiar to listeners. We excluded tracks whose license did not allow derivative works and those that did not have English singing. Ground truth transcriptions were generated by seven native English speakers, resulting in 3700 excerpts of 5 to 10 words each from 1452 different songs. A hearing loss simulation was also applied to the stereo audio. This resulted in 11,100 music signals with no, mild or moderate hearing loss. This was done so more diverse hearing is represented in the dataset. Human transcriptions were then collected via an online listening experiment. Participants self-reported as having normal-hearing and being native English speakers. They listened to each music signal twice before transcribing each line. Final intelligibility scores were the ratio of matching words between the listening test responses and the ground truth transcriptions. The final dataset consists of audio, ground truth lyrics, intelligibility scores and associated metadata.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"65 ","pages":"Article 112466"},"PeriodicalIF":1.4,"publicationDate":"2026-01-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146036497","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-01-14DOI: 10.1016/j.dib.2026.112470
Eric Gooden, Nicole Holden
The data reports the results of an experiment used to examine whether short-horizon investors differ from long-horizon investors in their sensitivity to truthful disclosure regarding negative earnings news. We used an experimental methodology to isolate the impact of Investment Horizon and Forthcomingness on investors' long-term management credibility assessments. Specifically, participants assumed the role of either a long-horizon current investor (already owned shares) or a short-horizon prospective investor (contemplating an investment decision) in a fictional company. Participants were then given identical information regarding the firm and asked to make an initial credibility assessment. Subsequently, participants either received forthcoming disclosure from company management regarding negative earnings news (Forthcomingness) or the participant did not receive disclosure. Short-horizon investors then made an investment decision regarding an investment position in the firm, and all participants received negative earnings news regarding the firm. Participants returned two-weeks later to make final credibility assessments of company management as part of the post-experimental questionnaire.
{"title":"Long-term management reporting credibility data from an accounting experiment","authors":"Eric Gooden, Nicole Holden","doi":"10.1016/j.dib.2026.112470","DOIUrl":"10.1016/j.dib.2026.112470","url":null,"abstract":"<div><div>The data reports the results of an experiment used to examine whether short-horizon investors differ from long-horizon investors in their sensitivity to truthful disclosure regarding negative earnings news. We used an experimental methodology to isolate the impact of Investment Horizon and Forthcomingness on investors' long-term management credibility assessments. Specifically, participants assumed the role of either a long-horizon current investor (already owned shares) or a short-horizon prospective investor (contemplating an investment decision) in a fictional company. Participants were then given identical information regarding the firm and asked to make an initial credibility assessment. Subsequently, participants either received forthcoming disclosure from company management regarding negative earnings news (Forthcomingness) or the participant did not receive disclosure. Short-horizon investors then made an investment decision regarding an investment position in the firm, and all participants received negative earnings news regarding the firm. Participants returned two-weeks later to make final credibility assessments of company management as part of the post-experimental questionnaire.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"65 ","pages":"Article 112470"},"PeriodicalIF":1.4,"publicationDate":"2026-01-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146075119","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-01-14DOI: 10.1016/j.dib.2026.112465
Ha-Nam Nguyen , Hoai-Nam Nguyen , Thi-Thu Ngo
In the context of digital transformation in education, digital competence is one of the significant essential requirements for future teachers. This study surveyed and analyzed the digital competence structure of 1439 pre-service teachers in different regions of Viet Nam. We utilized a self-assessment questionnaire based on the Digital Kids Asia-Pacific (DKAP) framework, with references to TPACK (Technological Pedagogical Content Knowledge), DigComp (Digital Competence Framework for Citizens), and DigCompEdu frameworks. The dataset provided detailed information on each participant’s self-evaluated digital proficiency in five categories, along with demographic variables such as gender and subject specialization. The core of this data file locates itself in such potential to inform teacher training programs and educational policy by offering evidence on prowess and weakness in future teachers’ digital competence.
{"title":"Survey data on digital competence assessment among pre-service teachers in Vietnam","authors":"Ha-Nam Nguyen , Hoai-Nam Nguyen , Thi-Thu Ngo","doi":"10.1016/j.dib.2026.112465","DOIUrl":"10.1016/j.dib.2026.112465","url":null,"abstract":"<div><div>In the context of digital transformation in education, digital competence is one of the significant essential requirements for future teachers. This study surveyed and analyzed the digital competence structure of 1439 pre-service teachers in different regions of Viet Nam. We utilized a self-assessment questionnaire based on the Digital Kids Asia-Pacific (DKAP) framework, with references to TPACK (Technological Pedagogical Content Knowledge), DigComp (Digital Competence Framework for Citizens), and DigCompEdu frameworks. The dataset provided detailed information on each participant’s self-evaluated digital proficiency in five categories, along with demographic variables such as gender and subject specialization. The core of this data file locates itself in such potential to inform teacher training programs and educational policy by offering evidence on prowess and weakness in future teachers’ digital competence.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"65 ","pages":"Article 112465"},"PeriodicalIF":1.4,"publicationDate":"2026-01-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146075120","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-01-14DOI: 10.1016/j.dib.2026.112472
Leandro da Silva Gomes , Gustavo Henrique de Queiroz Stabile , Tahisa Neitzel Kuck , Felipe Augusto Pereira de Figueiredo , Elcio Hideiti Shiguemori , Dimas Irion Alves
The Brazilian Amazon Rainforest holds a large ecological and economic importance and is considered one of the most biodiverse regions on the planet. The region faces numerous challenges from illegal human activities that threaten its sustainability and well-being, which are often supported by the construction of unauthorized airstrips. Additionally, due to its persistent cloud cover, which often hinders monitoring with optical satellites, Synthetic Aperture Radar (SAR) imagery provides a crucial alternative for the region surveillance. Thus, this dataset was developed to support the training and evaluation of machine learning techniques, including deep learning models for detecting and segmenting airstrips in the Brazilian Amazon Rainforest using SAR imagery. The dataset comprises images from the Sentinel-1 satellite, acquired primarily between 2021 and 2024, covering 1040 locations of known airstrips sourced from the MapBiomas project (published in 2023, based on 2021 reference data). For the change detection task, historical “before” images were selected from the period between 2014 and 2021 to capture the pre-construction state. The data is structured to support three distinct machine learning tasks: object detection (e.g., YOLOv8), semantic segmentation (e.g., U-Net), and change detection. For each task, specific images and annotations are provided. Additionally, geospatial files (Shapefile, GeoPackage) are included to facilitate the integration and visualization of the dataset in a GIS environment. The data is valuable for researchers in remote sensing, computer vision, environmental monitoring, security and defense, enabling the development of automated systems to monitor irregular activities in remote forest regions. The dataset is available at a Mendeley Data repository: https://data.mendeley.com/datasets/x7rn78ymtn/1
{"title":"A Sentinel-1 SAR imagery dataset for airstrips detection and segmentation in the Brazilian Amazon Rainforest","authors":"Leandro da Silva Gomes , Gustavo Henrique de Queiroz Stabile , Tahisa Neitzel Kuck , Felipe Augusto Pereira de Figueiredo , Elcio Hideiti Shiguemori , Dimas Irion Alves","doi":"10.1016/j.dib.2026.112472","DOIUrl":"10.1016/j.dib.2026.112472","url":null,"abstract":"<div><div>The Brazilian Amazon Rainforest holds a large ecological and economic importance and is considered one of the most biodiverse regions on the planet. The region faces numerous challenges from illegal human activities that threaten its sustainability and well-being, which are often supported by the construction of unauthorized airstrips. Additionally, due to its persistent cloud cover, which often hinders monitoring with optical satellites, Synthetic Aperture Radar (SAR) imagery provides a crucial alternative for the region surveillance. Thus, this dataset was developed to support the training and evaluation of machine learning techniques, including deep learning models for detecting and segmenting airstrips in the Brazilian Amazon Rainforest using SAR imagery. The dataset comprises images from the Sentinel-1 satellite, acquired primarily between 2021 and 2024, covering 1040 locations of known airstrips sourced from the MapBiomas project (published in 2023, based on 2021 reference data). For the change detection task, historical “before” images were selected from the period between 2014 and 2021 to capture the pre-construction state. The data is structured to support three distinct machine learning tasks: object detection (e.g., YOLOv8), semantic segmentation (e.g., U-Net), and change detection. For each task, specific images and annotations are provided. Additionally, geospatial files (Shapefile, GeoPackage) are included to facilitate the integration and visualization of the dataset in a GIS environment. The data is valuable for researchers in remote sensing, computer vision, environmental monitoring, security and defense, enabling the development of automated systems to monitor irregular activities in remote forest regions. The dataset is available at a Mendeley Data repository: <span><span>https://data.mendeley.com/datasets/x7rn78ymtn/1</span><svg><path></path></svg></span></div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"65 ","pages":"Article 112472"},"PeriodicalIF":1.4,"publicationDate":"2026-01-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146036171","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-01-13DOI: 10.1016/j.dib.2026.112468
Claudia Moricca , Rachele Nicolini , Lucrezia Masci , Lia Barelli , Simona Morretta , Raffaele Pugliese , Laura Sadori
<div><div>The “Santi Quattro Coronati – archaeobotanical plates” dataset presents a comprehensive photographic collection of carpological remains recovered from a pit in the complex of Santi Quattro Coronati (Rome, Italy). The deposit, dated between the late 15th and the mid-16th century, yielded a diverse assemblage of desiccated plant remains. The dataset is novel in that it provides the complete photographic documentation of all identified taxa from a single Early Modern archaeological context, a chronological phase that remains underrepresented in Italian archaeobotanical research.</div><div>The photographic documentation focuses on a representative sample of each taxon identified in the archaeobotanical analysis, with particular attention to the best-preserved specimens. When multiple plant parts of the same taxon were present, all were included. The dataset also includes fragile and rarely illustrated plant parts, such as cereal rachis fragments, tunics and basal plates of onion and garlic, grapevine tendrils and legume seed coats. These are often excluded from reference atlases due to their low archaeological survivability and the consequent scarcity of well-preserved comparative specimens.</div><div>High-resolution images were acquired using a Leica MC205C stereomicroscope equipped with a Leica IC80HD camera and the Leica Application Suite v.4.5.0 software. Illumination was provided by the Leica LED5000 HDI™ dome system, ensuring constant, diffuse light conditions. A column of images was captured for each specimen and processed with Helicon Focus v.7.0.1 Pro through focus stacking to obtain a single fully focused image. Depending on specimen size and complexity, between 9 and 127 photographs were used per perspective. Larger samples, unsuitable for microscopic observation, were photographed using a Canon digital camera under controlled illumination. Post-processing was performed with GIMP, applying standard tools for background cleaning and masking. Each final plate includes a scale bar for size reference.</div><div>The dataset is organized alphabetically by plant family and taxon. For each taxon, one or more plates are provided, displaying specimens from one to three perspectives to represent their 3D morphology. Nomenclature follows the taxonomy used in the original publication of the assemblage and has been updated according to the most recent checklist of the Italian vascular flora. A metadata .xls file is provided to facilitate consultation, reuse, comparison and integration with other archaeobotanical datasets.</div><div>This dataset offers a well-documented comparative visual reference for species/genus identification and for assessing the preservation state and morphological integrity of desiccated archaeobotanical remains. Offering detailed photographic records of New World plant taxa previously identified in this context, the study enhances accessibility and understanding of these materials through visual reference. Despite bein
“Santi Quattro Coronati -考古植物板块”数据集展示了从意大利罗马的Santi Quattro Coronati复合体的一个坑中恢复的人类学遗骸的综合摄影集合。该矿床的历史可以追溯到15世纪晚期到16世纪中期,发现了各种各样的干枯植物遗骸。该数据集的新颖之处在于,它提供了来自单一的早期现代考古背景的所有已识别分类群的完整照片文档,这是意大利考古植物学研究中尚未充分代表的时间顺序阶段。摄影文献集中于考古植物学分析中确定的每个分类单元的代表性样本,特别关注保存最完好的标本。当同一分类单元的多个植物部分存在时,所有部分都被包括在内。该数据集还包括易碎且很少展示的植物部分,如谷物轴片、洋葱和大蒜的外衣和基板、葡萄藤卷须和豆类种皮。由于它们的考古存续能力较低,因而缺乏保存完好的比较标本,因此经常被排除在参考地图集之外。使用配备徕卡IC80HD相机的徕卡MC205C立体显微镜和Leica Application Suite v.4.5.0软件获取高分辨率图像。照明由徕卡LED5000 HDI™穹顶系统提供,确保恒定的漫射光条件。每个标本采集一列图像,用Helicon Focus v.7.0.1 Pro进行对焦叠加处理,得到一张完全聚焦的图像。根据标本的大小和复杂程度,每个视角使用9到127张照片。较大的样本,不适合显微镜观察,使用佳能数码相机在受控照明下拍摄。使用GIMP进行后处理,使用标准工具进行背景清理和遮盖。每个最终板包括一个比例尺的尺寸参考。数据集按植物科和分类单元的字母顺序组织。对于每个分类单元,提供一个或多个板,从一个到三个角度展示标本,以表示它们的三维形态。命名法遵循汇编原始出版物中使用的分类法,并根据意大利维管植物区系的最新清单进行了更新。提供了一个元数据。xls文件,以便与其他考古植物数据集进行查阅、重用、比较和集成。该数据集为物种/属鉴定和评估干燥考古植物遗骸的保存状态和形态完整性提供了一个有充分记录的比较视觉参考。该研究提供了在此背景下发现的新大陆植物分类群的详细照片记录,通过视觉参考提高了对这些材料的可及性和理解。尽管受到单一背景的限制,该数据集代表了考古植物学的最佳实践,鼓励其他研究人员分享他们所研究的人类学组合的完整照片文档,从而支持开放科学和逐步构建扩展的视觉参考集合。该数据集主要用于研究早期现代背景的考古植物学家和环境考古学家,但它也可以为研究其他年代和地点的干枯植物遗骸的研究人员提供服务。
{"title":"Visualizing archaeobotanical data: A comprehensive photographic record of desiccated plant remains from an early modern context at Santi Quattro Coronati, Rome","authors":"Claudia Moricca , Rachele Nicolini , Lucrezia Masci , Lia Barelli , Simona Morretta , Raffaele Pugliese , Laura Sadori","doi":"10.1016/j.dib.2026.112468","DOIUrl":"10.1016/j.dib.2026.112468","url":null,"abstract":"<div><div>The “Santi Quattro Coronati – archaeobotanical plates” dataset presents a comprehensive photographic collection of carpological remains recovered from a pit in the complex of Santi Quattro Coronati (Rome, Italy). The deposit, dated between the late 15th and the mid-16th century, yielded a diverse assemblage of desiccated plant remains. The dataset is novel in that it provides the complete photographic documentation of all identified taxa from a single Early Modern archaeological context, a chronological phase that remains underrepresented in Italian archaeobotanical research.</div><div>The photographic documentation focuses on a representative sample of each taxon identified in the archaeobotanical analysis, with particular attention to the best-preserved specimens. When multiple plant parts of the same taxon were present, all were included. The dataset also includes fragile and rarely illustrated plant parts, such as cereal rachis fragments, tunics and basal plates of onion and garlic, grapevine tendrils and legume seed coats. These are often excluded from reference atlases due to their low archaeological survivability and the consequent scarcity of well-preserved comparative specimens.</div><div>High-resolution images were acquired using a Leica MC205C stereomicroscope equipped with a Leica IC80HD camera and the Leica Application Suite v.4.5.0 software. Illumination was provided by the Leica LED5000 HDI™ dome system, ensuring constant, diffuse light conditions. A column of images was captured for each specimen and processed with Helicon Focus v.7.0.1 Pro through focus stacking to obtain a single fully focused image. Depending on specimen size and complexity, between 9 and 127 photographs were used per perspective. Larger samples, unsuitable for microscopic observation, were photographed using a Canon digital camera under controlled illumination. Post-processing was performed with GIMP, applying standard tools for background cleaning and masking. Each final plate includes a scale bar for size reference.</div><div>The dataset is organized alphabetically by plant family and taxon. For each taxon, one or more plates are provided, displaying specimens from one to three perspectives to represent their 3D morphology. Nomenclature follows the taxonomy used in the original publication of the assemblage and has been updated according to the most recent checklist of the Italian vascular flora. A metadata .xls file is provided to facilitate consultation, reuse, comparison and integration with other archaeobotanical datasets.</div><div>This dataset offers a well-documented comparative visual reference for species/genus identification and for assessing the preservation state and morphological integrity of desiccated archaeobotanical remains. Offering detailed photographic records of New World plant taxa previously identified in this context, the study enhances accessibility and understanding of these materials through visual reference. Despite bein","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"65 ","pages":"Article 112468"},"PeriodicalIF":1.4,"publicationDate":"2026-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146036501","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-01-13DOI: 10.1016/j.dib.2026.112461
Tom Marti , Cécile Costa , Emmanuel de Salis , Laura Brambilla , Stefano Carrino
This dataset presents acoustic emission (AE) recordings collected from woodboring insect-infested and non-infested wood samples and cultural heritage objects. Data acquisition was conducted across four institutions: Haute École Arc (HE-Arc), Switzerland; Canadian Museum of History (CMH), Canada; National Gallery of Canada (NGC), Canada; and Musée National de l'Automobile (MNA), France; from April to July 2025.
The recordings were captured using Vallen VS900-M sensors with AEP5 preamplifiers set to 34dB gain and AMSY-6 4-channel chassis, employing continuous acoustic emission monitoring at 2 MHz sampling rate. Each experiment utilized three sensors positioned on test objects and one reference sensor facing up to record ambient noise conditions. The dataset comprises approximately 440.9 hours of recordings distributed across the four collection sites.
The dataset includes four main components: raw Vallen AE database files (.tradb format), processed statistical data exported as CSV files, contextual images documenting setups and sensor placements, and Python script for statistical data processing. Each experiment is documented with duration, material specifications, coupling methods (renaissance wax, cyclododecane, or mechanical fastening), environmental conditions, and infestation labels.
The dataset's structure enables multiple research applications. The time-series statistical features and binary classification labels (infested/non-infested) provide a foundation for supervised machine learning model development. The diverse experimental conditions across four geographic locations, varying coupling methods, and different ambient environments offer opportunities to evaluate model generalization and robustness. Reference sensor recordings captured simultaneously with each experiment allow for ambient noise characterization studies and development of noise filtering methodologies. The combination of raw acoustic data and contextual documentation makes this dataset suitable for comparative studies of different signal processing approaches and feature extraction techniques in acoustic emission analysis for heritage conservation applications.
该数据集展示了从木材钻孔昆虫感染和非昆虫感染的木材样本和文化遗产中收集的声发射(AE)记录。数据采集在四个机构进行:瑞士的Haute École Arc (HE-Arc);加拿大历史博物馆,加拿大;加拿大国家美术馆(NGC),加拿大;和法国mus National de l'Automobile (MNA);从2025年4月到7月。录音采用Vallen VS900-M传感器,AEP5前置放大器设置为34dB增益,AMSY-6 4通道机箱,采用连续声发射监测,采样率为2 MHz。每个实验使用放置在测试对象上的三个传感器和一个向上的参考传感器来记录环境噪声条件。该数据集包括分布在四个收集点的大约440.9小时的录音。数据集包括四个主要组成部分:原始valenae数据库文件(;tradb格式),处理的统计数据导出为CSV文件,记录设置和传感器位置的上下文图像,以及用于统计数据处理的Python脚本。每个实验都记录了持续时间、材料规格、耦合方法(再生蜡、环十二烷或机械紧固)、环境条件和虫害标签。数据集的结构支持多种研究应用。时间序列统计特征和二元分类标签(出没/未出没)为监督式机器学习模型的开发提供了基础。四个地理位置的不同实验条件、不同的耦合方法和不同的环境为评估模型的泛化和鲁棒性提供了机会。与每个实验同时捕获的参考传感器记录允许环境噪声特性研究和噪声过滤方法的发展。原始声学数据和上下文文档的结合使该数据集适合于在遗产保护应用的声发射分析中对不同信号处理方法和特征提取技术进行比较研究。
{"title":"A dataset of acoustics emissions recordings of woodboring insects in wood and cultural objects, context images and remarks","authors":"Tom Marti , Cécile Costa , Emmanuel de Salis , Laura Brambilla , Stefano Carrino","doi":"10.1016/j.dib.2026.112461","DOIUrl":"10.1016/j.dib.2026.112461","url":null,"abstract":"<div><div>This dataset presents acoustic emission (AE) recordings collected from woodboring insect-infested and non-infested wood samples and cultural heritage objects. Data acquisition was conducted across four institutions: Haute École Arc (HE-Arc), Switzerland; Canadian Museum of History (CMH), Canada; National Gallery of Canada (NGC), Canada; and Musée National de l'Automobile (MNA), France; from April to July 2025.</div><div>The recordings were captured using Vallen VS900-M sensors with AEP5 preamplifiers set to 34dB gain and AMSY-6 4-channel chassis, employing continuous acoustic emission monitoring at 2 MHz sampling rate. Each experiment utilized three sensors positioned on test objects and one reference sensor facing up to record ambient noise conditions. The dataset comprises approximately 440.9 hours of recordings distributed across the four collection sites.</div><div>The dataset includes four main components: raw Vallen AE database files (.tradb format), processed statistical data exported as CSV files, contextual images documenting setups and sensor placements, and Python script for statistical data processing. Each experiment is documented with duration, material specifications, coupling methods (renaissance wax, cyclododecane, or mechanical fastening), environmental conditions, and infestation labels.</div><div>The dataset's structure enables multiple research applications. The time-series statistical features and binary classification labels (infested/non-infested) provide a foundation for supervised machine learning model development. The diverse experimental conditions across four geographic locations, varying coupling methods, and different ambient environments offer opportunities to evaluate model generalization and robustness. Reference sensor recordings captured simultaneously with each experiment allow for ambient noise characterization studies and development of noise filtering methodologies. The combination of raw acoustic data and contextual documentation makes this dataset suitable for comparative studies of different signal processing approaches and feature extraction techniques in acoustic emission analysis for heritage conservation applications.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"65 ","pages":"Article 112461"},"PeriodicalIF":1.4,"publicationDate":"2026-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146036169","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}