Tianwei Zhou, Wenwen Zhang, Ben Niu, Pengcheng He, Guanghui Yue
To address the challenge of parameter adjustment in complex environments, this paper introduces a transfer learning-based parameter control framework via deep reinforcement learning for multiobjective evolutionary algorithms (MOEAs). To avoid the requirement for accurate Pareto front information, this framework is proposed with comprehensive global-state information, including basic problem features, the relative position of individuals, the distribution of fitness value, and the grid-IGD. Building on this framework, four reinforced multiobjective evolutionary algorithms (r-MOEAs) are proposed and tested on four DTLZ benchmarks and eight WFG benchmarks. The results of the comparative analyses reveal that compared with the original MOEAs, the four r-MOEAs exhibit faster convergence and stronger robustness. It is also confirmed that our proposed parameter control framework has the capability to learn knowledge from different experiences and improve the performance of MOEAs.
{"title":"Parameter Control Framework for Multiobjective Evolutionary Computation Based on Deep Reinforcement Learning","authors":"Tianwei Zhou, Wenwen Zhang, Ben Niu, Pengcheng He, Guanghui Yue","doi":"10.1155/2024/6740701","DOIUrl":"10.1155/2024/6740701","url":null,"abstract":"<p>To address the challenge of parameter adjustment in complex environments, this paper introduces a transfer learning-based parameter control framework via deep reinforcement learning for multiobjective evolutionary algorithms (MOEAs). To avoid the requirement for accurate Pareto front information, this framework is proposed with comprehensive global-state information, including basic problem features, the relative position of individuals, the distribution of fitness value, and the grid-IGD. Building on this framework, four reinforced multiobjective evolutionary algorithms (r-MOEAs) are proposed and tested on four DTLZ benchmarks and eight WFG benchmarks. The results of the comparative analyses reveal that compared with the original MOEAs, the four r-MOEAs exhibit faster convergence and stronger robustness. It is also confirmed that our proposed parameter control framework has the capability to learn knowledge from different experiences and improve the performance of MOEAs.</p>","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":null,"pages":null},"PeriodicalIF":7.0,"publicationDate":"2024-01-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139389093","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Reza Rohani Sarvestani, Ali Gholami, Reza Boostani
There is growing interest in developing linear/nonlinear feature fusion methods that fuse the elicited features from two different sources of information for achieving a higher recognition rate. In this regard, canonical correlation analysis (CCA), cross-modal factor analysis, and probabilistic CCA (PCCA) have been introduced to better deal with data variability and uncertainty. In our previous research, we formerly developed the kernel version of PCCA (KPCCA) to capture both nonlinear and probabilistic relation between the features of two different source signals. However, KPCCA is only able to estimate latent variables, which are statistically correlated between the features of two independent modalities. To overcome this drawback, we propose a kernel version of the probabilistic dependent-independent CCA (PDICCA) method to capture the nonlinear relation between both dependent and independent latent variables. We have compared the proposed method to PDICCA, CCA, KCCA, cross-modal factor analysis (CFA), and kernel CFA methods over the eNTERFACE and RML datasets for audio-visual emotion recognition and the M2VTS dataset for audio-visual speech recognition. Empirical results on the three datasets indicate the superiority of both the PDICCA and Kernel PDICCA methods to their counterparts.
{"title":"Kernel Probabilistic Dependent-Independent Canonical Correlation Analysis","authors":"Reza Rohani Sarvestani, Ali Gholami, Reza Boostani","doi":"10.1155/2024/7393431","DOIUrl":"10.1155/2024/7393431","url":null,"abstract":"<p>There is growing interest in developing linear/nonlinear feature fusion methods that fuse the elicited features from two different sources of information for achieving a higher recognition rate. In this regard, canonical correlation analysis (CCA), cross-modal factor analysis, and probabilistic CCA (PCCA) have been introduced to better deal with data variability and uncertainty. In our previous research, we formerly developed the kernel version of PCCA (KPCCA) to capture both nonlinear and probabilistic relation between the features of two different source signals. However, KPCCA is only able to estimate latent variables, which are statistically correlated between the features of two independent modalities. To overcome this drawback, we propose a kernel version of the probabilistic dependent-independent CCA (PDICCA) method to capture the nonlinear relation between both dependent and independent latent variables. We have compared the proposed method to PDICCA, CCA, KCCA, cross-modal factor analysis (CFA), and kernel CFA methods over the eNTERFACE and RML datasets for audio-visual emotion recognition and the M2VTS dataset for audio-visual speech recognition. Empirical results on the three datasets indicate the superiority of both the PDICCA and Kernel PDICCA methods to their counterparts.</p>","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":null,"pages":null},"PeriodicalIF":7.0,"publicationDate":"2024-01-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139387814","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Optimal scheduling of microgrids (MGs) is a crucial component of smart grid optimization, playing a vital role in minimizing energy consumption and environmental degradation. However, existing methods tend to consider only a single optimization and do not consider the multiobjective optimization problem of MGs in a comprehensive and integrated way. This study proposes a comprehensive multiobjective optimal scheduling methodology for renewable energy MGs, incorporating demand-side management (DSM) considerations. Initially, a DSM multiobjective optimization model is formulated, focusing on the load shifting of controllable devices within the MG to refine the electricity consumption structure. This model contemplates the renewable energy consumption of the MG, customer electricity purchase costs, and load smoothness. Subsequently, a multiobjective optimization model for grid-connected MGs, encompassing wind and photovoltaic power generation, is constructed with the dual objectives of economic and environmental optimization for the MG. Ultimately, a multimodal multiobjective optimization algorithm, amalgamating a local convergence index and an environment selection strategy, is proposed to solve the model. The experimental results show that compared with other methods, the proposed method in this paper can reduce the integrated cost by 32.6% and 38.9% in summer and 19.4% and 40.2% in winter. This stands out as a unique contribution in the field of MG optimization, as it integrates DSM considerations into a multiobjective optimization model. This methodology achieves a balance between minimizing energy consumption and environmental degradation while also enhancing economic efficiency.
{"title":"Enhanced Multiobjective Optimization Algorithm for Intelligent Grid Management of Renewable Energy Sources","authors":"Xue Han, JiKe Ding, Honglin Cheng","doi":"10.1155/2024/4541163","DOIUrl":"10.1155/2024/4541163","url":null,"abstract":"<p>Optimal scheduling of microgrids (MGs) is a crucial component of smart grid optimization, playing a vital role in minimizing energy consumption and environmental degradation. However, existing methods tend to consider only a single optimization and do not consider the multiobjective optimization problem of MGs in a comprehensive and integrated way. This study proposes a comprehensive multiobjective optimal scheduling methodology for renewable energy MGs, incorporating demand-side management (DSM) considerations. Initially, a DSM multiobjective optimization model is formulated, focusing on the load shifting of controllable devices within the MG to refine the electricity consumption structure. This model contemplates the renewable energy consumption of the MG, customer electricity purchase costs, and load smoothness. Subsequently, a multiobjective optimization model for grid-connected MGs, encompassing wind and photovoltaic power generation, is constructed with the dual objectives of economic and environmental optimization for the MG. Ultimately, a multimodal multiobjective optimization algorithm, amalgamating a local convergence index and an environment selection strategy, is proposed to solve the model. The experimental results show that compared with other methods, the proposed method in this paper can reduce the integrated cost by 32.6% and 38.9% in summer and 19.4% and 40.2% in winter. This stands out as a unique contribution in the field of MG optimization, as it integrates DSM considerations into a multiobjective optimization model. This methodology achieves a balance between minimizing energy consumption and environmental degradation while also enhancing economic efficiency.</p>","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":null,"pages":null},"PeriodicalIF":7.0,"publicationDate":"2024-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139390957","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pouya Khani, Vahid Solouk, Hashem Kalbkhani, Farid Ahmadi
Functional near-infrared spectroscopy (fNIRS) is a low-cost and noninvasive method to measure the hemodynamic responses of cortical brain activities and has received great attention in brain-computer interface (BCI) applications. In this paper, we present a method based on deep learning and the time-frequency map (TFM) of fNIRS signals to classify the three motor execution tasks including right-hand tapping, left-hand tapping, and foot tapping. To simultaneously obtain the TFM and consider the correlation among channels, we propose to utilize the two-dimensional discrete orthonormal Stockwell transform (2D-DOST). The TFMs for oxygenated hemoglobin (HbO), reduced hemoglobin (HbR), and two linear combinations of them are obtained and then we propose three fusion schemes for combining their deep information extracted by the convolutional neural network (CNN). Two CNNs, LeNet and MobileNet, are considered and their structures are modified to maximize the accuracy. Due to the lack of enough signals for training CNNs, data augmentation based on the Wasserstein generative adversarial network (WGAN) is performed. Several simulations are performed to assess the performance of the proposed method in three-class and binary scenarios. The results present the efficiency of the proposed method in different scenarios. Also, the proposed method outperforms the recently introduced methods.
{"title":"Fusion of Deep Features from 2D-DOST of fNIRS Signals for Subject-Independent Classification of Motor Execution Tasks","authors":"Pouya Khani, Vahid Solouk, Hashem Kalbkhani, Farid Ahmadi","doi":"10.1155/2023/3178284","DOIUrl":"https://doi.org/10.1155/2023/3178284","url":null,"abstract":"Functional near-infrared spectroscopy (fNIRS) is a low-cost and noninvasive method to measure the hemodynamic responses of cortical brain activities and has received great attention in brain-computer interface (BCI) applications. In this paper, we present a method based on deep learning and the time-frequency map (TFM) of fNIRS signals to classify the three motor execution tasks including right-hand tapping, left-hand tapping, and foot tapping. To simultaneously obtain the TFM and consider the correlation among channels, we propose to utilize the two-dimensional discrete orthonormal Stockwell transform (2D-DOST). The TFMs for oxygenated hemoglobin (HbO), reduced hemoglobin (HbR), and two linear combinations of them are obtained and then we propose three fusion schemes for combining their deep information extracted by the convolutional neural network (CNN). Two CNNs, LeNet and MobileNet, are considered and their structures are modified to maximize the accuracy. Due to the lack of enough signals for training CNNs, data augmentation based on the Wasserstein generative adversarial network (WGAN) is performed. Several simulations are performed to assess the performance of the proposed method in three-class and binary scenarios. The results present the efficiency of the proposed method in different scenarios. Also, the proposed method outperforms the recently introduced methods.","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":null,"pages":null},"PeriodicalIF":7.0,"publicationDate":"2023-12-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138953694","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Zeeshan Ikram Butt, Iftikhar Ahmad, Muhammad Asif Zahoor Raja, Syed Ibrar Hussain, Muhammad Shoaib, Hira Ilyas
In this investigative study, the electro-magneto hydrodynamic (EMHD) influence on a nano viscous fluid model is scrutinized by designing an artificial neural network (ANN) paradigm using a neuro-heuristic approach (NHA) through the combination of GAs (genetic algorithms) and one of the most efficient locally searching solver SQP (sequential quadratic programming), i.e., NHA-GA-SQP. The fluid flow for the proposed problem is initially interpreted in the form of PDEs and then utilization of suitable similarity transformation on these PDEs yields in terms of a stiff nonlinear system of ODEs. The numerical results of the suggested fluidic model based on the variation of its physically existing parameters are calculated through the NHA-GA-SQP solver to detect the variation in velocity, thermal gradient, and concentration during the fluid flow. A detailed analysis of obtained outcomes through the NHA-GA-SQP algorithm and their comparison with the reference results estimated via the Adams method are presented. The calculation of the proposed solver’s accuracy, stability, and consistency through various statistical operators is also involved in the current inspection.
{"title":"Neuro-Heuristic Computational Intelligence Approach for Optimization of Electro-Magneto-Hydrodynamic Influence on a Nano Viscous Fluid Flow","authors":"Zeeshan Ikram Butt, Iftikhar Ahmad, Muhammad Asif Zahoor Raja, Syed Ibrar Hussain, Muhammad Shoaib, Hira Ilyas","doi":"10.1155/2023/7626478","DOIUrl":"https://doi.org/10.1155/2023/7626478","url":null,"abstract":"In this investigative study, the electro-magneto hydrodynamic (EMHD) influence on a nano viscous fluid model is scrutinized by designing an artificial neural network (ANN) paradigm using a neuro-heuristic approach (NHA) through the combination of GAs (genetic algorithms) and one of the most efficient locally searching solver SQP (sequential quadratic programming), i.e., NHA-GA-SQP. The fluid flow for the proposed problem is initially interpreted in the form of PDEs and then utilization of suitable similarity transformation on these PDEs yields in terms of a stiff nonlinear system of ODEs. The numerical results of the suggested fluidic model based on the variation of its physically existing parameters are calculated through the NHA-GA-SQP solver to detect the variation in velocity, thermal gradient, and concentration during the fluid flow. A detailed analysis of obtained outcomes through the NHA-GA-SQP algorithm and their comparison with the reference results estimated via the Adams method are presented. The calculation of the proposed solver’s accuracy, stability, and consistency through various statistical operators is also involved in the current inspection.","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":null,"pages":null},"PeriodicalIF":7.0,"publicationDate":"2023-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138967394","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Data-hunger is a persistent challenge in machine learning, particularly in the field of image processing based on convolutional neural networks (CNNs). This study systematically investigates the factors contributing to data-hunger in machine-learning-based image-processing algorithms. The results revealed that the proliferation of model parameters, the lack of interpretability, and the complexity of model structure are significant factors influencing data-hunger. Based on these findings, this paper introduces a novel semi-white-box neural network model construction strategy. This approach effectively reduces the number of model parameters while enhancing the interpretability of model components. It accomplishes this by constraining uninterpretable processes within the model and leveraging prior knowledge of image processing for model. Rather than relying on a single all-in-one model, a semi-white-box model is composed of multiple smaller models, each responsible for extracting fundamental semantic features. The final output is derived from these features and prior knowledge. The proposed strategy holds the potential to substantially decrease data requirements under specific data source conditions while improving the interpretability of model components. Validation experiments are conducted on well-established datasets, including MNIST, Fashion MNIST, CIFAR, and generated data. The results demonstrate the superiority of the semi-white-box strategy over the traditional all-in-one approach in terms of accuracy when trained with equivalent data volumes. Impressively, on the tested datasets, a simplified semi-white-box model achieves performance close to that of ResNet while utilizing a small number of parameters. Furthermore, the semi-white-box strategy offers improved interpretability and parameter reusability features that are challenging to achieve with the all-in-one approach. In conclusion, this paper contributes to mitigating data-hunger challenges in machine-learning-based image processing through the introduction of a novel semi-white-box model construction strategy, backed by empirical evidence of its effectiveness.
{"title":"Semi-White-Box Strategy: Enhancing Data Efficiency and Interpretability of Convolutional Neural Networks in Image Processing","authors":"Qi Wang, Jianchao Zeng, Pinle Qin, Pengcheng Zhao, Rui Chai, Zhaomin Yang, Jianshan Zhang","doi":"10.1155/2023/9227348","DOIUrl":"https://doi.org/10.1155/2023/9227348","url":null,"abstract":"Data-hunger is a persistent challenge in machine learning, particularly in the field of image processing based on convolutional neural networks (CNNs). This study systematically investigates the factors contributing to data-hunger in machine-learning-based image-processing algorithms. The results revealed that the proliferation of model parameters, the lack of interpretability, and the complexity of model structure are significant factors influencing data-hunger. Based on these findings, this paper introduces a novel semi-white-box neural network model construction strategy. This approach effectively reduces the number of model parameters while enhancing the interpretability of model components. It accomplishes this by constraining uninterpretable processes within the model and leveraging prior knowledge of image processing for model. Rather than relying on a single all-in-one model, a semi-white-box model is composed of multiple smaller models, each responsible for extracting fundamental semantic features. The final output is derived from these features and prior knowledge. The proposed strategy holds the potential to substantially decrease data requirements under specific data source conditions while improving the interpretability of model components. Validation experiments are conducted on well-established datasets, including MNIST, Fashion MNIST, CIFAR, and generated data. The results demonstrate the superiority of the semi-white-box strategy over the traditional all-in-one approach in terms of accuracy when trained with equivalent data volumes. Impressively, on the tested datasets, a simplified semi-white-box model achieves performance close to that of ResNet while utilizing a small number of parameters. Furthermore, the semi-white-box strategy offers improved interpretability and parameter reusability features that are challenging to achieve with the all-in-one approach. In conclusion, this paper contributes to mitigating data-hunger challenges in machine-learning-based image processing through the introduction of a novel semi-white-box model construction strategy, backed by empirical evidence of its effectiveness.","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":null,"pages":null},"PeriodicalIF":7.0,"publicationDate":"2023-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138997125","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Ahmad Salah, Ahmed Salah Fathalla, Esraa Eldesouky, Wei Li, Ahmed Mohamed Mahmoud Ibrahim
The thermal issues generated from friction are the key obstacle in the high-performance machining of titanium alloys. The friction between the workpiece being cut and the cutting tool is the dominant parameter that affects the heat generation during the machining processes, i.e., the temperature inside the cutting zone and the consumed cutting energy. Besides, the complexity is associated with the nature of the friction phenomenon. However, there are limited efforts to forecast the friction coefficient during the machining operations. In this work, the friction coefficients between the titanium alloy against zirconia ceramics lubricated by minimum quantity lubrication were recorded and measured using a universal mechanical tester pin-on-disc tribometer. Then, we proposed two models for forecasting the friction coefficient which are trained and tested on the recorded data. The two predictive models are based on autoregressive integrated moving average and gated recurrent unit deep neural network methods. The proposed models are evaluated through a set of exhaustive experiments. These experiments demonstrated that the proposed models can efficiently be used to reduce power consumption dedicated to monitoring the friction coefficients. Besides, they can reduce or avoid surface thermal damage by predicting the high level of friction coefficients in advance, which can be used as an alert to enable or readjust the lubrication parameters (fluid pressure, fluid flow rate, etc.) to maintain lower ranges of friction coefficients and power consumption.
{"title":"Forecasting the Friction Coefficient of Rubbing Zirconia Ceramics by Titanium Alloy","authors":"Ahmad Salah, Ahmed Salah Fathalla, Esraa Eldesouky, Wei Li, Ahmed Mohamed Mahmoud Ibrahim","doi":"10.1155/2023/6681886","DOIUrl":"https://doi.org/10.1155/2023/6681886","url":null,"abstract":"The thermal issues generated from friction are the key obstacle in the high-performance machining of titanium alloys. The friction between the workpiece being cut and the cutting tool is the dominant parameter that affects the heat generation during the machining processes, i.e., the temperature inside the cutting zone and the consumed cutting energy. Besides, the complexity is associated with the nature of the friction phenomenon. However, there are limited efforts to forecast the friction coefficient during the machining operations. In this work, the friction coefficients between the titanium alloy against zirconia ceramics lubricated by minimum quantity lubrication were recorded and measured using a universal mechanical tester pin-on-disc tribometer. Then, we proposed two models for forecasting the friction coefficient which are trained and tested on the recorded data. The two predictive models are based on autoregressive integrated moving average and gated recurrent unit deep neural network methods. The proposed models are evaluated through a set of exhaustive experiments. These experiments demonstrated that the proposed models can efficiently be used to reduce power consumption dedicated to monitoring the friction coefficients. Besides, they can reduce or avoid surface thermal damage by predicting the high level of friction coefficients in advance, which can be used as an alert to enable or readjust the lubrication parameters (fluid pressure, fluid flow rate, etc.) to maintain lower ranges of friction coefficients and power consumption.","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":null,"pages":null},"PeriodicalIF":7.0,"publicationDate":"2023-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139008577","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
T. Chellatamilan, S. Narayanasamy, Lalit Garg, Kathiravan Srinivasan, Sardar M. N. Islam
The work of text summarization in question-and-answer systems has gained tremendous popularity recently and has influenced numerous real-world applications for efficient decision-making processes. In this regard, the exponential growth of COVID-19-related healthcare records has necessitated the extraction of fine-grained results to forecast or estimate the potential course of the disease. Machine learning and deep learning models are frequently used to extract relevant insights from textual data sources. However, in order to summarize the textual information relevant to coronavirus, we have concentrated on a number of natural language processing (NLP) models in this research, including Bidirectional Encoder Representations of Transformers (BERT), Sequence-to-Sequence, and Attention models. This ensemble model is built on the previously mentioned models, which primarily concentrate on the segmented context terms included in the textual input. Most crucially, this research has concentrated on two key variations: grouping-related sentences using hierarchical clustering approaches and the distributional semantics of the terms found in the COVID-19 dataset. The gist evaluation (ROUGE) score result shows a significant and respectable accuracy of 0.40 average recalls.
{"title":"Ensemble Text Summarization Model for COVID-19-Associated Datasets","authors":"T. Chellatamilan, S. Narayanasamy, Lalit Garg, Kathiravan Srinivasan, Sardar M. N. Islam","doi":"10.1155/2023/3106631","DOIUrl":"https://doi.org/10.1155/2023/3106631","url":null,"abstract":"The work of text summarization in question-and-answer systems has gained tremendous popularity recently and has influenced numerous real-world applications for efficient decision-making processes. In this regard, the exponential growth of COVID-19-related healthcare records has necessitated the extraction of fine-grained results to forecast or estimate the potential course of the disease. Machine learning and deep learning models are frequently used to extract relevant insights from textual data sources. However, in order to summarize the textual information relevant to coronavirus, we have concentrated on a number of natural language processing (NLP) models in this research, including Bidirectional Encoder Representations of Transformers (BERT), Sequence-to-Sequence, and Attention models. This ensemble model is built on the previously mentioned models, which primarily concentrate on the segmented context terms included in the textual input. Most crucially, this research has concentrated on two key variations: grouping-related sentences using hierarchical clustering approaches and the distributional semantics of the terms found in the COVID-19 dataset. The gist evaluation (ROUGE) score result shows a significant and respectable accuracy of 0.40 average recalls.","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":null,"pages":null},"PeriodicalIF":7.0,"publicationDate":"2023-12-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138981026","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The accurate prediction of traffic flow is paramount for the advancement of intelligent transportation systems. Despite this, current prediction models only account for either temporal or spatial features in isolation, without considering their interaction, impeding the model’s ability to express itself. In light of this, we propose the graph differential equations network (GDENet), an approach that can effectively mine spatiotemporal correlation. Specifically, we propose a spatiotemporal feature integrator (STFI), which alleviates the error caused by the deviation of the sampling distribution from the overall distribution. By incorporating temporal information into the model for training and combining it with spatial features, we thoroughly explore the spatiotemporal intrinsic association. When compared to state-of-the-art methods, our proposed algorithm reduces memory consumption and elevates computational efficiency and the practical value. We conduct experiments with real-world datasets, and our proposed model outperformed advanced prediction models.
{"title":"GDENet: Graph Differential Equation Network for Traffic Flow Prediction","authors":"Yanming Miao, Xianghong Tang, Qi Wang, Liya Yu","doi":"10.1155/2023/7099652","DOIUrl":"https://doi.org/10.1155/2023/7099652","url":null,"abstract":"The accurate prediction of traffic flow is paramount for the advancement of intelligent transportation systems. Despite this, current prediction models only account for either temporal or spatial features in isolation, without considering their interaction, impeding the model’s ability to express itself. In light of this, we propose the graph differential equations network (GDENet), an approach that can effectively mine spatiotemporal correlation. Specifically, we propose a spatiotemporal feature integrator (STFI), which alleviates the error caused by the deviation of the sampling distribution from the overall distribution. By incorporating temporal information into the model for training and combining it with spatial features, we thoroughly explore the spatiotemporal intrinsic association. When compared to state-of-the-art methods, our proposed algorithm reduces memory consumption and elevates computational efficiency and the practical value. We conduct experiments with real-world datasets, and our proposed model outperformed advanced prediction models.","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":null,"pages":null},"PeriodicalIF":7.0,"publicationDate":"2023-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138585152","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Uǧur Kılıç, Işıl Karabey Aksakallı, Gülşah Tümüklü Özyer, T. Aksakallı, B. Özyer, Ş. Adanur
In diagnosing kidney stone disease, clinical specialists often apply medical imaging techniques such as CT and US. Among these imaging techniques, is frequently chosen as the primary examination method in emergency services due to its low cost, accessibility, and low radiation levels. However, interpreting the images by inexperienced specialists can be challenging due to the low image quality and the presence of noise. In this study, we propose a computer-aided diagnosis system based on deep neural networks to assist clinical specialists in detecting kidney stones using Direct Urinary System (DUSX) images. Firstly, in consultation with clinical specialists, we created a new dataset composed of 630 DUSX images and presented it publicly. We also defined preprocessing steps that incorporate image enhancement techniques such as GF, LoG, BF, HE, CLAHE, and CBC to enable deep neural networks to perceive the images more clearly. With these techniques, we considered the noise reduction in the DUSX images and enhanced the poor quality, especially in terms of contrast. For each preprocessing step, we created models to detect kidney stones using YOLOv4 and Mask R-CNN architectures, which are common CNN-based object detectors. We examined the effects of the preprocessing steps on these models. To the best of our knowledge, the combination of BF and CLAHE which is called CBC in this study, has not been applied before in the literature to enhance DUSX images. In addition, this study is the first in its field in which the YOLOv4 and Mask R-CNN architectures have been used for the detection of kidney stones. The experimental results demonstrated the most accurate method is the YOLOv4 model, which includes the CBC preprocessing step, as the result model. This model shows that the accuracy rate, precision, recall, and F1-score were found as 96.1%, 99.3% 96.5%, and 97.9% respectively in the test set. According to these performance metrics, we expect that the proposed model will help to reduce the unnecessary radiation exposure and associated medical costs that come with CT scans.
{"title":"Exploring the Effect of Image Enhancement Techniques with Deep Neural Networks on Direct Urinary System (DUSX) Images for Automated Kidney Stone Detection","authors":"Uǧur Kılıç, Işıl Karabey Aksakallı, Gülşah Tümüklü Özyer, T. Aksakallı, B. Özyer, Ş. Adanur","doi":"10.1155/2023/3801485","DOIUrl":"https://doi.org/10.1155/2023/3801485","url":null,"abstract":"In diagnosing kidney stone disease, clinical specialists often apply medical imaging techniques such as CT and US. Among these imaging techniques, is frequently chosen as the primary examination method in emergency services due to its low cost, accessibility, and low radiation levels. However, interpreting the images by inexperienced specialists can be challenging due to the low image quality and the presence of noise. In this study, we propose a computer-aided diagnosis system based on deep neural networks to assist clinical specialists in detecting kidney stones using Direct Urinary System (DUSX) images. Firstly, in consultation with clinical specialists, we created a new dataset composed of 630 DUSX images and presented it publicly. We also defined preprocessing steps that incorporate image enhancement techniques such as GF, LoG, BF, HE, CLAHE, and CBC to enable deep neural networks to perceive the images more clearly. With these techniques, we considered the noise reduction in the DUSX images and enhanced the poor quality, especially in terms of contrast. For each preprocessing step, we created models to detect kidney stones using YOLOv4 and Mask R-CNN architectures, which are common CNN-based object detectors. We examined the effects of the preprocessing steps on these models. To the best of our knowledge, the combination of BF and CLAHE which is called CBC in this study, has not been applied before in the literature to enhance DUSX images. In addition, this study is the first in its field in which the YOLOv4 and Mask R-CNN architectures have been used for the detection of kidney stones. The experimental results demonstrated the most accurate method is the YOLOv4 model, which includes the CBC preprocessing step, as the result model. This model shows that the accuracy rate, precision, recall, and F1-score were found as 96.1%, 99.3% 96.5%, and 97.9% respectively in the test set. According to these performance metrics, we expect that the proposed model will help to reduce the unnecessary radiation exposure and associated medical costs that come with CT scans.","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":null,"pages":null},"PeriodicalIF":7.0,"publicationDate":"2023-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138596966","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}