This work presents the MuscleTracker Hand Movement dataset, containing Surface Electromyography (sEMG) data from the right arm of 49 healthy subjects without neuromuscular or cardiovascular issues. Subjects performed five hand movements-pronation with extended fingers, flexion, extension, pronation with flexed fingers, and relaxation-while standing, with one hand palm-down. Data was recorded from two sEMG channels using Biopac MP36 (1000 Hz) and MuscleTracker (512 Hz), with three and four repetitions per device, respectively, for each movement. The dataset includes 825 samples, along with subject details such as gender, age, physical condition, and, for MuscleTracker subjects, anthropometric measurements. This data supports machine-learning development for classifying hand gestures in sEMG signals, with applications in prosthetics control and human-computer interaction. In addition, validation experiments were performed to validate the database and stablish a comparison baseline.
{"title":"Surface electromyography dataset from different movements of the hand using a portable and a non-portable device.","authors":"Rita Q Fuentes-Aguilar, Dusthon Llorente-Vidrio, Leobardo Campos-Macias, Eduardo Morales-Vargas","doi":"10.1016/j.dib.2024.111079","DOIUrl":"10.1016/j.dib.2024.111079","url":null,"abstract":"<p><p>This work presents the MuscleTracker Hand Movement dataset, containing Surface Electromyography (sEMG) data from the right arm of 49 healthy subjects without neuromuscular or cardiovascular issues. Subjects performed five hand movements-pronation with extended fingers, flexion, extension, pronation with flexed fingers, and relaxation-while standing, with one hand palm-down. Data was recorded from two sEMG channels using Biopac MP36 (1000 Hz) and MuscleTracker (512 Hz), with three and four repetitions per device, respectively, for each movement. The dataset includes 825 samples, along with subject details such as gender, age, physical condition, and, for MuscleTracker subjects, anthropometric measurements. This data supports machine-learning development for classifying hand gestures in sEMG signals, with applications in prosthetics control and human-computer interaction. In addition, validation experiments were performed to validate the database and stablish a comparison baseline.</p>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"57 ","pages":"111079"},"PeriodicalIF":1.0,"publicationDate":"2024-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11648142/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142834369","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-11-19eCollection Date: 2024-12-01DOI: 10.1016/j.dib.2024.111144
Milena Mesa-Lavista, Paola Romo-Letechipía, José Álvarez-Pérez, Ricardo González-Alcorta, Jorge H Chávez-Gómez, G Fajardo-San Miguel
Masonry is a construction material composed of units (blocks or bricks) joined with mortar. It is one of the most widely used materials in construction resisting both vertical and horizontal forces in single and multi-family housing buildings. A correct union between the units and the mortar (interface) is essential, as is determining the resistance from the applied loads. There is a divergence in how the mortar is bedded in the construction of masonry walls. In some countries, such as Canada and Australia, regulations require that the mortar be placed in face shell bedding when hollow blocks are used. However, in countries like Mexico, regulations establish that it be placed in the net area, and construction practices often differ. Much research has been conducted to study the compressive behavior of mortar bedding in masonry of hollow concrete blocks. However, fewer studies have focused on shear behavior. This paper presents the dataset of experimental laboratory tests on wallettes built with hollow concrete blocks. Two methods of mortar bedding were employed: over the net area and the lateral face. The values obtained can be used to compare the shear strength in hollow concrete block masonry and the shear failure. Additionally, they can be useful for calibrating numerical models.
{"title":"Shear strength dataset of hollow concrete block masonry with different mortar bedding.","authors":"Milena Mesa-Lavista, Paola Romo-Letechipía, José Álvarez-Pérez, Ricardo González-Alcorta, Jorge H Chávez-Gómez, G Fajardo-San Miguel","doi":"10.1016/j.dib.2024.111144","DOIUrl":"10.1016/j.dib.2024.111144","url":null,"abstract":"<p><p>Masonry is a construction material composed of units (blocks or bricks) joined with mortar. It is one of the most widely used materials in construction resisting both vertical and horizontal forces in single and multi-family housing buildings. A correct union between the units and the mortar (interface) is essential, as is determining the resistance from the applied loads. There is a divergence in how the mortar is bedded in the construction of masonry walls. In some countries, such as Canada and Australia, regulations require that the mortar be placed in face shell bedding when hollow blocks are used. However, in countries like Mexico, regulations establish that it be placed in the net area, and construction practices often differ. Much research has been conducted to study the compressive behavior of mortar bedding in masonry of hollow concrete blocks. However, fewer studies have focused on shear behavior. This paper presents the dataset of experimental laboratory tests on wallettes built with hollow concrete blocks. Two methods of mortar bedding were employed: over the net area and the lateral face. The values obtained can be used to compare the shear strength in hollow concrete block masonry and the shear failure. Additionally, they can be useful for calibrating numerical models.</p>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"57 ","pages":"111144"},"PeriodicalIF":1.0,"publicationDate":"2024-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11647165/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142834366","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-11-19eCollection Date: 2025-02-01DOI: 10.1016/j.dib.2024.111148
Salma Kazemi Rashed, Malou Arvidsson, Rafsan Ahmed, Sonja Aits
Many forms of bioimage analysis involve the detection of objects and their outlines. In the context of microscopy-based high-throughput drug and genomic screening and even in smaller scale microscopy experiments, the objects that most often need to be detected are cells. In order to develop and benchmark algorithms and neural networks that can perform this task, high-quality datasets with annotated cell outlines are needed. We have created a dataset, named Aitslab_bioimaging2, consisting of 60 fluorescence microscopy images with EGFP-Galectin-3 labelled cells and their hand-labelled outlines. Images were acquired on a Thermo Fischer CX7 high-content imaging system at 20x magnification created as part of an RNA interference screen with a modified U2OS osteosarcoma cell line. Outlines were labelled by three annotators, who had high inter-annotator agreement between them and with a biomedical expert, who labelled some of the objects for comparison and reviewed a subset of the labels, making minor corrections as needed. The dataset comprises over 2200 annotated cell objects in total, making it sufficient in size to train high-performing neural networks for instance or semantic segmentation. Labels can also easily be converted to boxes for object detection tasks. The dataset is already pre-divided into training, development, and test sets. Matching nuclear staining and outlines are available for part of the dataset from a previous publication (dataset Aitslab_bioimaging1) [1].
{"title":"An annotated high-content fluorescence microscopy dataset with EGFP-Galectin-3-stained cells and manually labelled outlines.","authors":"Salma Kazemi Rashed, Malou Arvidsson, Rafsan Ahmed, Sonja Aits","doi":"10.1016/j.dib.2024.111148","DOIUrl":"10.1016/j.dib.2024.111148","url":null,"abstract":"<p><p>Many forms of bioimage analysis involve the detection of objects and their outlines. In the context of microscopy-based high-throughput drug and genomic screening and even in smaller scale microscopy experiments, the objects that most often need to be detected are cells. In order to develop and benchmark algorithms and neural networks that can perform this task, high-quality datasets with annotated cell outlines are needed. We have created a dataset, named Aitslab_bioimaging2, consisting of 60 fluorescence microscopy images with EGFP-Galectin-3 labelled cells and their hand-labelled outlines. Images were acquired on a Thermo Fischer CX7 high-content imaging system at 20x magnification created as part of an RNA interference screen with a modified U2OS osteosarcoma cell line. Outlines were labelled by three annotators, who had high inter-annotator agreement between them and with a biomedical expert, who labelled some of the objects for comparison and reviewed a subset of the labels, making minor corrections as needed. The dataset comprises over 2200 annotated cell objects in total, making it sufficient in size to train high-performing neural networks for instance or semantic segmentation. Labels can also easily be converted to boxes for object detection tasks. The dataset is already pre-divided into training, development, and test sets. Matching nuclear staining and outlines are available for part of the dataset from a previous publication (dataset Aitslab_bioimaging1) [1].</p>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"58 ","pages":"111148"},"PeriodicalIF":1.0,"publicationDate":"2024-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11751569/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143022102","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
It is challenging for teachers to monitor each student's emotional state in real-time, making personalized learning difficult to achieve. Previous emotion recognition methods, such as support vector machines, are limited by technology and fail to meet practical application requirements. However, the development of deep learning technology offers new solutions for facial expression recognition, which makes emotional interaction and personalized support in education possible. Until now, there has been a lack of facial expression datasets in real classroom settings. To fill this gap, this study collected facial expression data in a real classroom, preprocessed it using OpenCV, and established the first real-world facial expression dataset. The emotion categories include surprise, happiness, neutrality, confusion, and boredom. The dataset was rigorously screened and contains a total of 5,527 images, divided into training, validation, and test sets. This dataset provides a reliable foundation for future research and applications in educational technology, particularly in the development of real-time emotion recognition models to enhance personalized learning and teaching effectiveness. The rigorous data collection and preprocessing approach ensures the dataset's quality and authenticity, addressing the limitations of existing datasets collected in laboratory settings.
{"title":"Annotated emotional image datasets of Chinese university students in real classrooms for deep learning.","authors":"Chengliang Wang, Haoming Wang, Zihui Hu, Xiaojiao Chen","doi":"10.1016/j.dib.2024.111147","DOIUrl":"https://doi.org/10.1016/j.dib.2024.111147","url":null,"abstract":"<p><p>It is challenging for teachers to monitor each student's emotional state in real-time, making personalized learning difficult to achieve. Previous emotion recognition methods, such as support vector machines, are limited by technology and fail to meet practical application requirements. However, the development of deep learning technology offers new solutions for facial expression recognition, which makes emotional interaction and personalized support in education possible. Until now, there has been a lack of facial expression datasets in real classroom settings. To fill this gap, this study collected facial expression data in a real classroom, preprocessed it using OpenCV, and established the first real-world facial expression dataset. The emotion categories include surprise, happiness, neutrality, confusion, and boredom. The dataset was rigorously screened and contains a total of 5,527 images, divided into training, validation, and test sets. This dataset provides a reliable foundation for future research and applications in educational technology, particularly in the development of real-time emotion recognition models to enhance personalized learning and teaching effectiveness. The rigorous data collection and preprocessing approach ensures the dataset's quality and authenticity, addressing the limitations of existing datasets collected in laboratory settings.</p>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"57 ","pages":"111147"},"PeriodicalIF":1.0,"publicationDate":"2024-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11648130/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142834180","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-11-17eCollection Date: 2024-12-01DOI: 10.1016/j.dib.2024.111141
Serkan Varol, Serkan Catma
This article summarizes a database for analyzing the impact of structural and environmental characteristics on residential property values in Hamilton County, TN. The original dataset consists of house characteristics data for 873 residential sales between January 1, 2023, and September 25, 2023. Using Google's API tools and Point to Edge computations, several geographical variables-including the distance to green recreational areas, surrounding facilities, restaurants, air quality index, walk score-were gathered. The aggregated information can help forecast Hamilton County's housing market with accuracy and correctly assess the environmental impact on housing prices. More specifically, the value of any environmental amenity in the study area can be implicitly estimated using this dataset.
本文总结了一个数据库,用于分析田纳西州汉密尔顿县结构和环境特征对住宅物业价值的影响。原始数据集包括2023年1月1日至2023年9月25日期间873套住宅销售的房屋特征数据。使用谷歌的API工具和Point to Edge计算,收集了几个地理变量,包括到绿色休闲区域的距离、周围设施、餐馆、空气质量指数、步行得分。汇总的信息可以帮助准确预测汉密尔顿县的房地产市场,并正确评估环境对房价的影响。更具体地说,研究区域内任何环境设施的价值都可以使用该数据集进行隐式估计。
{"title":"Unlocking insights in real estate markets: Integrating geo-spatial data for comprehensive property valuation and urban planning.","authors":"Serkan Varol, Serkan Catma","doi":"10.1016/j.dib.2024.111141","DOIUrl":"10.1016/j.dib.2024.111141","url":null,"abstract":"<p><p>This article summarizes a database for analyzing the impact of structural and environmental characteristics on residential property values in Hamilton County, TN. The original dataset consists of house characteristics data for 873 residential sales between January 1, 2023, and September 25, 2023. Using Google's API tools and Point to Edge computations, several geographical variables-including the distance to green recreational areas, surrounding facilities, restaurants, air quality index, walk score-were gathered. The aggregated information can help forecast Hamilton County's housing market with accuracy and correctly assess the environmental impact on housing prices. More specifically, the value of any environmental amenity in the study area can be implicitly estimated using this dataset.</p>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"57 ","pages":"111141"},"PeriodicalIF":1.0,"publicationDate":"2024-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11647156/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142834309","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-11-17eCollection Date: 2024-12-01DOI: 10.1016/j.dib.2024.111139
Julia Rieber, Gabriella Meier-Bürgisser, Iris Miescher, Franz E Weber, Petra Wolint, Yao Yang, Esteban Ongini, Athanasios Milionis, Jess G Snedeker, Maurizio Calcagni, Johanna Buschmann
The first set of data refers to Insulin-like Growth Factor-1 (IGF-1) protein incorporation via emulsion electrospinning into a DegraPolⓇ random fiber mesh and its characterization. Specifically, the fiber thickness was assessed and compared to pure DegraPolⓇ fibers without IGF-1 (control). Furthermore, the mechanical properties of these meshes were assessed and data on ultimate tensile stress, Young's modulus and ultimate fracture strain are presented for ring specimen and rectangular pieces taken from electrospun tubes in the transverse direction as well as rectangular pieces taken in the axial direction of the electrospun tube. Moreover, the static and the dynamic water contact angles were determined. The second set of data represents morphological aspects, such as the cytoskeletal aspect ratio (i.e. length of the cell divided by its width) for rabbit Achilles tenocytes stimulated in vitro with 1, 10, and 100 ng/mL IGF-1 supplementation compared to the corresponding cell culture without IGF-1 (control). Furthermore, qPCR was performed and collagen I, ki67 and tenomodulin gene expression data are presented for rabbit Achilles tenocytes in vitro with 0.1, 1 and 10 ng/mL IGF-1 supplementation, respectively, as well as with a supplementation of released IGF-1 from the DegraPol mesh (concentration was 1 ng/mL).
{"title":"Mechanical, water contact angle and fiber thickness data for Insulin-like growth gactor-1 (IGF-1) incorporated in electrospun random DegraPol<sup>Ⓡ</sup> fibers and IGF-1 impact on tenocyte aspect ratio and gene expression data.","authors":"Julia Rieber, Gabriella Meier-Bürgisser, Iris Miescher, Franz E Weber, Petra Wolint, Yao Yang, Esteban Ongini, Athanasios Milionis, Jess G Snedeker, Maurizio Calcagni, Johanna Buschmann","doi":"10.1016/j.dib.2024.111139","DOIUrl":"10.1016/j.dib.2024.111139","url":null,"abstract":"<p><p>The first set of data refers to Insulin-like Growth Factor-1 (IGF-1) protein incorporation via emulsion electrospinning into a DegraPol<sup>Ⓡ</sup> random fiber mesh and its characterization. Specifically, the fiber thickness was assessed and compared to pure DegraPol<sup>Ⓡ</sup> fibers without IGF-1 (control). Furthermore, the mechanical properties of these meshes were assessed and data on ultimate tensile stress, Young's modulus and ultimate fracture strain are presented for ring specimen and rectangular pieces taken from electrospun tubes in the transverse direction as well as rectangular pieces taken in the axial direction of the electrospun tube. Moreover, the static and the dynamic water contact angles were determined. The second set of data represents morphological aspects, such as the cytoskeletal aspect ratio (i.e. length of the cell divided by its width) for rabbit Achilles tenocytes stimulated <i>in vitro</i> with 1, 10, and 100 ng/mL IGF-1 supplementation compared to the corresponding cell culture without IGF-1 (control). Furthermore, qPCR was performed and collagen I, ki67 and tenomodulin gene expression data are presented for rabbit Achilles tenocytes in vitro with 0.1, 1 and 10 ng/mL IGF-1 supplementation, respectively, as well as with a supplementation of released IGF-1 from the DegraPol mesh (concentration was 1 ng/mL).</p>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"57 ","pages":"111139"},"PeriodicalIF":1.0,"publicationDate":"2024-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11647151/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142834356","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-11-16eCollection Date: 2024-12-01DOI: 10.1016/j.dib.2024.111145
V P Vera-Ávila, R R Rivera-Durón, Onofre Orozco-López, M S Soriano-García, J Ricardo Sevilla-Escoboza, Javier M Buldú
Some real-world phenomena and human-made problems have been modeled as networks where the objects form pairwise interactions. However, this is a limited approach when the existence of high-order interactions is inherent in a system, such as the brain, social networks and ecosystems. The way in which these high-order interactions affect the collective behavior of a complex system is still an open question. For this reason, it is necessary to analyze theoretically, numerically and experimentally the consequences of higher-order interactions in complex systems. Here, we provide experimental datasets of the dynamics of three nonlinear electronic oscillators, namely, Rössler oscillators, interacting into a simplicial complex whose connections rely on both linear (diffusive) and nonlinear (high-order) coupling. It is well-known that Rössler systems only achieve the synchronization when they are coupled by means of or variable. Considering this fact, we designed our experiment considering four scenarios. The first one, when both linear and nonlinear coupling functions are introduced through the variable. The second one, occurring when linear coupling is introduced through the variable and the nonlinear coupling through the variable. The third case happens when the linear coupling is introduced through the variable whereas nonlinear coupling goes through the variable. The last case, when both linear and nonlinear coupling are introduced through the variable. For each scenario, we acquired 10000 times series when both the linear and nonlinear coupling strengths were modified. Each time series contained 30000 temporal points. These datasets are useful to corroborate the conditions to reach the synchronized state varying the linear/non-linear coupling strengths and to test new metrics for better understanding the effects of higher-order interactions in complex networks.
{"title":"Experimental datasets on synchronization in simplicial complexes.","authors":"V P Vera-Ávila, R R Rivera-Durón, Onofre Orozco-López, M S Soriano-García, J Ricardo Sevilla-Escoboza, Javier M Buldú","doi":"10.1016/j.dib.2024.111145","DOIUrl":"https://doi.org/10.1016/j.dib.2024.111145","url":null,"abstract":"<p><p>Some real-world phenomena and human-made problems have been modeled as networks where the objects form pairwise interactions. However, this is a limited approach when the existence of high-order interactions is inherent in a system, such as the brain, social networks and ecosystems. The way in which these high-order interactions affect the collective behavior of a complex system is still an open question. For this reason, it is necessary to analyze theoretically, numerically and experimentally the consequences of higher-order interactions in complex systems. Here, we provide experimental datasets of the dynamics of three nonlinear electronic oscillators, namely, Rössler oscillators, interacting into a simplicial complex whose connections rely on both linear (diffusive) and nonlinear (high-order) coupling. It is well-known that Rössler systems only achieve the synchronization when they are coupled by means of <math><mi>x</mi></math> or <math><mi>y</mi></math> variable. Considering this fact, we designed our experiment considering four scenarios. The first one, when both linear and nonlinear coupling functions are introduced through the <math><mi>x</mi></math> variable. The second one, occurring when linear coupling is introduced through the <math><mi>x</mi></math> variable and the nonlinear coupling through the <math><mi>y</mi></math> variable. The third case happens when the linear coupling is introduced through the <math><mi>y</mi></math> variable whereas nonlinear coupling goes through the <math><mi>x</mi></math> variable. The last case, when both linear and nonlinear coupling are introduced through the <math><mi>y</mi></math> variable. For each scenario, we acquired 10000 times series when both the linear and nonlinear coupling strengths were modified. Each time series contained 30000 temporal points. These datasets are useful to corroborate the conditions to reach the synchronized state varying the linear/non-linear coupling strengths and to test new metrics for better understanding the effects of higher-order interactions in complex networks.</p>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"57 ","pages":"111145"},"PeriodicalIF":1.0,"publicationDate":"2024-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11697596/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142930899","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-11-16eCollection Date: 2024-12-01DOI: 10.1016/j.dib.2024.111146
Vandermi Silva, Gabriel Tavares, Carlos Freitas, Felipe Maklouf, Chavdar Ivanov, Raimundo Barreto, Rosiane de Freitas
The demand for mobile coverage with adequate signal quality has triggered criticism due to the maturity of the Internet's diffusion in today's society. However, with the deployment of 5G networks, even 5G NSA by 4G LTE, the complexity of the operating environment of mobile networks has increased. To evaluate the behavior of mobile networks in terms of signal quality and other important metrics for mobile telephony, we developed a dataset consisting of 33 radio parameters that can collect up to 736,974 records generated daily by smartphones and tablets. The dataset comprises samples collected in cities situated on the banks of the Amazon and Negro rivers. To create the dataset, an application was designed for the Android operating system using the Kotlin programming language, which can collect data in real time and generate a CSV file. After post-processing the collected data with data science techniques, the filtered dataset was stored in the Mendeley public repository. We divided the data into three regions: the metropolitan area of Manaus, the middle Solimões River, and the middle Amazonas River. To improve the performance of the experiments, the database was separated according to the cities and locations collected.
{"title":"A multi-device and multi-operator dataset from mobile network coverage on Android devices.","authors":"Vandermi Silva, Gabriel Tavares, Carlos Freitas, Felipe Maklouf, Chavdar Ivanov, Raimundo Barreto, Rosiane de Freitas","doi":"10.1016/j.dib.2024.111146","DOIUrl":"10.1016/j.dib.2024.111146","url":null,"abstract":"<p><p>The demand for mobile coverage with adequate signal quality has triggered criticism due to the maturity of the Internet's diffusion in today's society. However, with the deployment of 5G networks, even 5G NSA by 4G LTE, the complexity of the operating environment of mobile networks has increased. To evaluate the behavior of mobile networks in terms of signal quality and other important metrics for mobile telephony, we developed a dataset consisting of 33 radio parameters that can collect up to 736,974 records generated daily by smartphones and tablets. The dataset comprises samples collected in cities situated on the banks of the Amazon and Negro rivers. To create the dataset, an application was designed for the Android operating system using the Kotlin programming language, which can collect data in real time and generate a CSV file. After post-processing the collected data with data science techniques, the filtered dataset was stored in the Mendeley public repository. We divided the data into three regions: the metropolitan area of Manaus, the middle Solimões River, and the middle Amazonas River. To improve the performance of the experiments, the database was separated according to the cities and locations collected.</p>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"57 ","pages":"111146"},"PeriodicalIF":1.0,"publicationDate":"2024-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11647131/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142834089","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-11-16eCollection Date: 2024-12-01DOI: 10.1016/j.dib.2024.111138
Anugrayani Bustamin, Andi M Rizky, Elly Warni, Intan Sari Areni, Indrabayu
Voice is a one of media for human communication and interaction. Emotions conveyed through voice, such as laughter or tears, can communicate messages more quickly than spoken or written language. In sentiment analysis, the emotional component is crucial for reflecting human perceptions and opinions. This paper introduces IndoWaveSentiment, a dataset of emotional voice recordings categorized into five classes: neutral, happy, surprised, disgusted, and disappointed. The data collection took place in a recording studio with ten actors, evenly split between men and women. Each actor repeated the same sentence in Bahasa Indonesia three times for each emotion class, and the recordings were saved in .wav format. The annotation process was manually conducted using Audacity and validated through a questionnaire-based sampling technique that supports audio data. This dataset is valuable for researchers in Signal Processing and Artificial Intelligence, aiding the development of classification models within Machine Learning.
{"title":"IndoWaveSentiment: Indonesian audio dataset for emotion classification.","authors":"Anugrayani Bustamin, Andi M Rizky, Elly Warni, Intan Sari Areni, Indrabayu","doi":"10.1016/j.dib.2024.111138","DOIUrl":"10.1016/j.dib.2024.111138","url":null,"abstract":"<p><p>Voice is a one of media for human communication and interaction. Emotions conveyed through voice, such as laughter or tears, can communicate messages more quickly than spoken or written language. In sentiment analysis, the emotional component is crucial for reflecting human perceptions and opinions. This paper introduces IndoWaveSentiment, a dataset of emotional voice recordings categorized into five classes: neutral, happy, surprised, disgusted, and disappointed. The data collection took place in a recording studio with ten actors, evenly split between men and women. Each actor repeated the same sentence in Bahasa Indonesia three times for each emotion class, and the recordings were saved in .wav format. The annotation process was manually conducted using Audacity and validated through a questionnaire-based sampling technique that supports audio data. This dataset is valuable for researchers in Signal Processing and Artificial Intelligence, aiding the development of classification models within Machine Learning.</p>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"57 ","pages":"111138"},"PeriodicalIF":1.0,"publicationDate":"2024-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11647155/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142834355","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
This dataset provides a numerical simulation of pavement mechanical behavior under Traffic Speed Deflectometer (TSD) measurements. It consists of simulated deflection slope data for various pavement structures and subgrade properties, generated using the Alizé-LCPC software, a standard tool in French pavement engineering. The dataset addresses limitations in traditional Falling Weight Deflectometer (FWD) methods, offering a more accurate and computationally efficient approach for estimating the Subgrade Resilient Modulus (MR) using machine learning models. This resource is valuable for researchers aiming to enhance pavement evaluation methods and develop predictive models for road infrastructure maintenance and assessment. The data are openly accessible, facilitating widespread research collaboration and the application of advanced data analytics in pavement engineering.
{"title":"Numerical data for modelling pavement deflection behaviour under the TSD.","authors":"Abdelgader Abdelmuhsen, Jean-Michel Simonin, Franziska Schmidt, Denis Lievre, Alexis Cothenet, Murilo Freitas, Amine Ihamouten","doi":"10.1016/j.dib.2024.111135","DOIUrl":"https://doi.org/10.1016/j.dib.2024.111135","url":null,"abstract":"<p><p>This dataset provides a numerical simulation of pavement mechanical behavior under Traffic Speed Deflectometer (TSD) measurements. It consists of simulated deflection slope data for various pavement structures and subgrade properties, generated using the Alizé-LCPC software, a standard tool in French pavement engineering. The dataset addresses limitations in traditional Falling Weight Deflectometer (FWD) methods, offering a more accurate and computationally efficient approach for estimating the Subgrade Resilient Modulus (M<sub>R</sub>) using machine learning models. This resource is valuable for researchers aiming to enhance pavement evaluation methods and develop predictive models for road infrastructure maintenance and assessment. The data are openly accessible, facilitating widespread research collaboration and the application of advanced data analytics in pavement engineering.</p>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"57 ","pages":"111135"},"PeriodicalIF":1.0,"publicationDate":"2024-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11647148/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142834357","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}