This paper presents a dataset obtained from an RT2-qPCR array analysis of rat pancreatic RIN-m cells treated with two monocarbonyl analogs of curcumin (MACs), C66 and B2BrBC in the presence or absence of streptozotocin (STZ). The array quantified the expression of 84 genes associated with the onset, development, and progression of diabetes. This dataset provides information on the gene expression profiles of pancreatic cells modulated by two specific MACs in a diabetic context. The data can serve as a foundation for developing new hypotheses, designing follow-up experiments, and identifying novel targets for treatment. It can be used to investigate further the molecular mechanisms underlying the therapeutic effects of these MACs and in comparative studies using other experimental antidiabetic compounds.
Biocultural diversity is important for environmental justice, human wellbeing, and sustainable development. Yet it is threatened by landscape degradation and overexploitation. When species go extinct, there is a co-occurring loss of associated cultural elements, and marginalized cultures are the ones that suffer the most from these losses. Here, we present BioCultBase/Borneo, a database of local uses of plants and their cultural contexts from the biologically and culturally hyper-diverse island of Borneo. The database has been developed from secondary data extracted from scientific literature, but is intended to be a live repository that welcomes contributions from academics, researchers and the general public. BioCultBase/Borneo database currently covers 1319 confirmed plant species and plant parts used for 23 use categories. These uses are reported from 39 ethnic communities of Borneo, together representing at least 2242 unique ecocultural links. The ethnicities represented in the database cover 13 % of the 306 officially recognized ethnicities of Borneo. Developing the database further will enhance access to ecocultural data that can be used for developing policy and practises relevant for a broader range of peoples.
The study of beach morphology holds significant importance in coastal management, offering insights into coastal and environmental processes. It involves analyzing physical characteristics and beach features such as profile shape, slope, sediment composition, and grain size, as well as changes in elevation due to both erosion and accretion over time. Furthermore, studying changes in beach morphology is essential in predicting and monitoring coastal inundation events, especially in the context of rising sea levels and subsidence in some areas. However, having access to high-frequency oblique imagery and beach elevation datasets to document and confirm coastal forcing events and understand their impact on beach morphology is a notable challenge. This paper describes a one-year dataset comprising bi-monthly topographic surveys and imagery collected daily at 30 min increments at the beach adjacent to Horace Caldwell Pier in Port Aransas, Texas. The data collection started in February 2023 and ended in January 2024. The dataset includes 18 topographic surveys, 6879 beach images, and ocean/wave videos that can be combined with colocated National Oceanic and Atmospheric Administration metocean measurements. The one-year temporal span of the dataset allows for the observation and analysis of seasonal variations, contributing to a deeper understanding of coastal dynamics in the study area. Furthermore, a study that combines survey measurements with camera imagery is rare and provides valuable information on conditions before, after, and between surveys and periods of inundation. The imagery enables monitoring of inundation events, while the topographic surveys facilitate the analysis of their impact on beach morphology, including beach erosion and accretion. Various products, including beach profiles, contours, slope maps, triangular irregular networks, and digital elevation models, were derived from the topographic dataset, allowing in depth analysis of beach morphology. Additionally, the dataset contains a time series of four wet/dry shoreline delineations per day and their corresponding elevation extracted by combining the imagery with the digital elevation models. Thus, this paper provides a high-frequency morphological dataset and a machine learning-ready dataset suitable for predicting coastal inundation.
In modern complex mechanical systems, machine faults typically occur in multiple components simultaneously, and the domain of collected sensor data changes continuously due to variations in operating conditions. Deep learning-based fault diagnosis approaches have recently been enhanced to address these real-world industrial challenges. Comprehensive labeled data covering compound fault scenarios and multi-domain conditions are crucial for exploring these issues. However, existing multi-domain datasets focus on a limited range of operating conditions, such as motor rotating speeds and loads. This limits their applicability to real-world industrial scenarios. To bridge this gap, we present a novel multi-domain dataset that incorporates these basic conditions and extends to various bearing types and compound machine faults. The deep groove ball bearing, the cylindrical roller bearing, and the tapered roller bearing were utilized to provide data that reflect diverse mechanical interactions between the shaft and the bearing. Vibration data were collected using a USB digital accelerometer at two sampling rates and six rotating speeds, encompassing three single bearing faults, seven single rotating component faults, and 21 compound faults of the bearing and rotating component. Additionally, the dataset provides spectrograms of vibration data using short-time Fourier transform (STFT) for data-driven analysis with a 2-D input. This dataset encompasses more complex compound fault and domain shift problems than those presented in conventional public vibration datasets, thereby aiding researchers in studying intelligent fault diagnosis methods based on deep learning.
A synthetic population is a distribution of synthetic agents that replicates the demographic distribution of a real-world population based on census records. This paper presents an end-to-end model to generate a synthetic population of residents in Gothenburg, Sweden, along with activity schedules and mobility patterns for present and past populations. Using a stochastic modelling approach, we describe the model and present its corresponding dataset. The model is designed for applications in neighbourhood planning and includes detailed replicas of people in different neighbourhoods of Gothenburg organised as persons, households, houses, buildings, and daily activity chains. While the persons, households, and houses are synthetic replicas, they are connected to existing buildings. The model considers the allocation of primary and secondary locations based on a gravity model, realistic routing for active, public, and private motorised modes of transportation and allows users to introduce new buildings and amenities if needed. The model aims to impute national-level mobility patterns from a household travel survey and apply them locally to capture the nuances of a neighbourhood's built environment and demographic composition.
This CIDACC dataset was created to determine the cell population of Chlorella vulgaris microalga during cultivation. Chlorella vulgaris has diverse applications, including use as food supplement, biofuel production, and pollutant removal. High resolution images were collected using a microscope and annotated, focusing on computer vision and machine learning models creation for automatic Chlorella cell detection, counting, size and geometry estimation. The dataset comprises 628 images, organized into hierarchical folders for easy access. Detailed segmentation masks and bounding boxes were generated using external tools enhancing the dataset's utility. The dataset's efficacy was demonstrated through preliminary experiments using deep learning architecture such as object detection and localization algorithms, as well as image segmentation algorithms, achieving high precision and accuracy. This dataset is a valuable tool for advancing computer vision applications in microalgae research and other related fields. The dataset is particularly challenging due to its dynamic nature and the complex correlations it presents across various application domains, including cell analysis in medical research. Its intricacies not only push the boundaries of current computer vision algorithms but also offer significant potential for advancements in diverse fields such as biomedical imaging, environmental monitoring, and biotechnological innovations.