Power Line Communication (PLC) serves as a medium for communication over power lines, utilizing the existing power grid for information transmission. It offers a low-cost, highly scalable signal transmission method and has the potential to become the preferred technology for providing broadband in smart homes, offices, and smart grids. However, noise in the power line channel, especially impulse noise, seriously affects the communication quality of PLC. In this paper, we establish an OFDM-PLC system with higher-order modulation and polar codes. The higher-order QAM or APSK modulation technique is employed to increase the signal transmission rate and the system performance is analyzed. To combat the impulse noise in the PLC channel, we first model it using a superposition of multi-damping sinusoidal functions and then introduce the polar coding scheme to suppress the impulse noise in the system, thereby improving the transmission reliability. Simulation results verify that the proposed polar coding scheme based on the OFDM-PLC system can improve the Bit Error Rate (BER) performance of PLC channel transmission under impulse interference.
{"title":"Polar Codes and a M-ary Modulation-Based OFDM-PLC System","authors":"Ziyi Wang, Yichen Wang, R. Chen","doi":"10.3390/info14070360","DOIUrl":"https://doi.org/10.3390/info14070360","PeriodicalId":13622,"journal":{"name":"Inf. Comput."},"publicationDate":"2023-06-25","publicationTypes":"Journal Article","platform":"Semanticscholar","paperid":"90688734"}
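The impulse-noise model mentioned in the abstract above, a superposition of damped sinusoids, can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function name, the parameter layout, and every numeric value below are assumptions chosen only for illustration.

```python
import math


def damped_sinusoid_impulse(t, components, t_start=0.0):
    """Evaluate one impulse at time t (seconds) as a sum of damped sinusoids.

    components is a list of (amplitude, frequency_hz, damping_per_s, phase_rad)
    tuples. The impulse is zero before t_start and decays afterwards.
    """
    if t < t_start:
        return 0.0
    dt = t - t_start
    return sum(
        a * math.exp(-alpha * dt) * math.sin(2.0 * math.pi * f * dt + phi)
        for a, f, alpha, phi in components
    )


# Hypothetical parameters (not taken from the paper): two damped tones.
COMPONENTS = [
    (1.0, 500e3, 2e5, 0.0),
    (0.5, 2e6, 5e5, 0.0),
]

# Sample the impulse at an assumed 10 MHz rate, starting 1 microsecond in.
SAMPLE_RATE_HZ = 10e6
samples = [
    damped_sinusoid_impulse(n / SAMPLE_RATE_HZ, COMPONENTS, t_start=1e-6)
    for n in range(200)
]
```

Each component contributes an oscillation whose envelope shrinks as exp(-damping * dt), so the magnitude of the impulse at any time is bounded by the sum of the component envelopes.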
Evianita Dewi Fajrianti, N. Funabiki, S. Sukaridhoto, Y. Panduman, Dezheng Kong, Shihao Fang, Anak Agung Surya Pradhana
Outdoor navigation systems based on GPS (Global Positioning System) are now widely used on smartphones around the world. However, indoor navigation systems are still under development due to the complex structure of indoor environments, including multiple floors, many rooms, steps, and elevators. In this paper, we present the design and implementation of the Indoor Navigation System using Unity and Smartphone (INSUS). INSUS shows an arrow indicating the moving direction on the camera view, based on the smartphone’s augmented reality (AR) technology. To trace the user location, it utilizes the Simultaneous Localization and Mapping (SLAM) technique with the smartphone’s gyroscope and camera to track users’ movements inside a building after initializing the current location with a QR code. Unity is introduced to obtain the 3D information of the target indoor environment for Visual SLAM. The data are stored in the IoT application server called SEMAR for visualizations. We implemented a prototype system of INSUS inside buildings at two universities. We found that scanning QR codes with the smartphone held roughly perpendicular, at an angle between 60° and 100°, achieves the highest QR code detection accuracy. We also found that the phone’s tilt angle influences the navigation success rate, with tilt angles of 90° to 100° giving better navigation success than lower tilt angles. INSUS also proved to be a robust navigation system, as evidenced by nearly identical navigation success rates in scenarios with and without disturbance. Furthermore, the questionnaire responses showed that INSUS generally received positive feedback and that there is support for improving the system.
{"title":"INSUS: Indoor Navigation System Using Unity and Smartphone for User Ambulation Assistance","authors":"Evianita Dewi Fajrianti, N. Funabiki, S. Sukaridhoto, Y. Panduman, Dezheng Kong, Shihao Fang, Anak Agung Surya Pradhana","doi":"10.3390/info14070359","DOIUrl":"https://doi.org/10.3390/info14070359","PeriodicalId":13622,"journal":{"name":"Inf. Comput."},"publicationDate":"2023-06-24","publicationTypes":"Journal Article","platform":"Semanticscholar","paperid":"83528790"}
In this paper, a new method that computes the aesthetics of a melody fragment, starting from dissonances, is proposed. While music generated with artificial intelligence applications may be produced considerably more quickly than human-composed music, it has the drawback of not being appreciated like a human composition and is often perceived by humans as artificial. Supervised machine learning aimed at improving the quality of the large number of generated melodies requires grades for them, and asking humans to provide these grades is a challenge. Therefore, it would be preferable if the aesthetics of artificial-intelligence-generated music were calculated by an algorithm. The method proposed in this paper is based on a neural network and a mathematical formula, which was developed with the help of a study in which 108 students evaluated the aesthetics of several melodies. For evaluation, numerical values generated by this method were compared with ratings provided by human listeners in a second study with 30 student participants, and with scores generated by an existing method developed by psychologists and three other methods developed by musicians. Our method achieved a Pearson correlation of 0.49 with human aesthetic scores, which is a much better result than the other methods obtained. Additionally, our method distinguished between human-composed melodies and artificial-intelligence-generated scores in the same way that human listeners did.
{"title":"Measurement of Music Aesthetics Using Deep Neural Networks and Dissonances","authors":"Razvan Paroiu, Stefan Trausan-Matu","doi":"10.3390/info14070358","DOIUrl":"https://doi.org/10.3390/info14070358","PeriodicalId":13622,"journal":{"name":"Inf. Comput."},"publicationDate":"2023-06-24","publicationTypes":"Journal Article","platform":"Semanticscholar","paperid":"83039248"}
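The evaluation metric reported in the abstract above is the Pearson correlation between method scores and human ratings. The sketch below shows how that coefficient is computed; the function name and the sample ratings are illustrative assumptions, not data from the study.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    # Sum of co-deviations and the two sums of squared deviations.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0.0 or var_y == 0.0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_x * var_y)


# Made-up ratings for four melodies (illustration only).
human_scores = [3.0, 4.5, 2.0, 5.0]
method_scores = [2.5, 4.0, 3.0, 4.5]
r = pearson(human_scores, method_scores)
```

The coefficient ranges from -1 to 1, so a value of 0.49 indicates a moderate positive linear relationship between the computed scores and the human ratings.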
Anđelka Štilić, A. Puška, Darko Božanić, Duško Tešić
Identifying the best contractor is a critical component of the project life cycle in the construction industry. The investor must use effective and efficient strategies to create a competitive bidding environment in public projects. The research presented in this paper was conducted to demonstrate the competitive nature of public procurements, where contractors compete to present the best bid and win the contract. To award the contract, the best offer must be selected. Based on different strategies and multi-criteria decision-making approaches, this study proposes a method for identifying the most suitable of eight bidding strategies across four different lots for landslide rehabilitation in the Brčko district. The results reveal the optimal approach to follow to minimize time and financial losses in the case of landslide rehabilitation during periods of market instability. These findings validate the efficiency of decision-making support based on bidding strategies. The proposed method allows for a compromise between the completion date and the lowest bid made by the winning contractor.
{"title":"Multi-Criteria Decision-Making in Public Procurement: An Empirical Study of Contractor Selection for Landslide Rehabilitation","authors":"Anđelka Štilić, A. Puška, Darko Božanić, Duško Tešić","doi":"10.3390/info14070357","DOIUrl":"https://doi.org/10.3390/info14070357","PeriodicalId":13622,"journal":{"name":"Inf. Comput."},"publicationDate":"2023-06-24","publicationTypes":"Journal Article","platform":"Semanticscholar","paperid":"82465086"}
The metaverse represents an immersive digital environment that has garnered significant attention as a result of its potential to revolutionize various industry sectors and its profound societal impact. While academic interest in the metaverse has surged, a dearth of comprehensive review articles employing bibliometric techniques remains. This study seeks to address this gap by analyzing 595 metaverse-related journal articles using bibliometric and topic modeling techniques, marking the first of its kind to investigate the bibliometric profile of metaverse research. The findings reveal exponential growth in metaverse research since 2020, identifying major trends, prolific authors, and the most active journals in the field. A keyword co-occurrence analysis further uncovers four significant clusters of metaverse-related interests, highlighting its unique facets and underscoring its far-reaching implications across various sectors, including education, healthcare, retail, and tourism. This study emphasizes the need for more research and collaboration in advancing the metaverse field and presents 27 research questions for future investigation. This comprehensive analysis serves as a foundation for understanding the current state of metaverse research and its potential trajectory.
{"title":"Mapping Metaverse Research: Identifying Future Research Areas Based on Bibliometric and Topic Modeling Techniques","authors":"Abderahman Rejeb, Karim Rejeb, Horst Treiblmaier","doi":"10.3390/info14070356","DOIUrl":"https://doi.org/10.3390/info14070356","PeriodicalId":13622,"journal":{"name":"Inf. Comput."},"publicationDate":"2023-06-22","publicationTypes":"Journal Article","platform":"Semanticscholar","paperid":"90099722"}
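The keyword co-occurrence analysis named in the abstract above starts from counting how often two keywords appear on the same article. A minimal sketch of that counting step follows; the function name, the normalization (trimming and lowercasing), and the sample keyword lists are assumptions for illustration, and the clustering step used in the study is not reproduced here.

```python
from collections import Counter
from itertools import combinations


def keyword_cooccurrence(keyword_lists):
    """Count unordered keyword pairs that appear together on an article.

    Each element of keyword_lists holds the keywords of one article.
    Pairs are returned as alphabetically sorted tuples.
    """
    counts = Counter()
    for keywords in keyword_lists:
        # Normalize and deduplicate so each pair counts once per article.
        unique = sorted({k.strip().lower() for k in keywords if k.strip()})
        counts.update(combinations(unique, 2))
    return counts


# Hypothetical keyword lists for three articles (illustration only).
articles = [
    ["Metaverse", "Education"],
    ["metaverse", "education", "healthcare"],
    ["Healthcare", "Retail"],
]
pair_counts = keyword_cooccurrence(articles)
```

The resulting pair counts form the weighted edges of a keyword network, which a bibliometric tool can then partition into clusters.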
João Monge, Gonçalo Ribeiro, A. Raimundo, O. Postolache, Joel Santos
Health monitoring is crucial in hospitals and rehabilitation centers. Challenges can affect the reliability and accuracy of health data. Human error, patient compliance concerns, time, money, technology, and environmental factors might cause these issues. In order to improve patient care, healthcare providers must address these challenges. We propose a non-intrusive smart sensing system that uses a SensFloor smart carpet and an inertial measurement unit (IMU) wearable sensor on the user’s back to monitor position and gait characteristics. Furthermore, we implemented machine learning (ML) algorithms to analyze the data collected from the SensFloor and IMU sensors. The system generates real-time data that are stored in the cloud and are accessible to physical therapists and patients. Additionally, the system’s real-time dashboards provide a comprehensive analysis of the user’s gait and balance, enabling personalized training plans with tailored exercises and better rehabilitation outcomes. Using non-invasive smart sensing technology, our proposed solution enables healthcare facilities to monitor patients’ health and enhance their physical rehabilitation plans.
{"title":"AI-Based Smart Sensing and AR for Gait Rehabilitation Assessment","authors":"João Monge, Gonçalo Ribeiro, A. Raimundo, O. Postolache, Joel Santos","doi":"10.3390/info14070355","DOIUrl":"https://doi.org/10.3390/info14070355","PeriodicalId":13622,"journal":{"name":"Inf. Comput."},"publicationDate":"2023-06-22","publicationTypes":"Journal Article","platform":"Semanticscholar","paperid":"72626622"}
Haleh Asgarinia, Andrés Chomczyk Penedo, Beatriz Esteves, David Bruce Lewis
News about personal data breaches and abusive data practices, such as the Cambridge Analytica case, has called into question the trustworthiness of certain actors that control personal data. Innovations in the field of personal information management systems that address this issue have regained traction in recent years, coinciding with the emergence of new decentralized technologies. However, the mistakes of the past will be avoided only through ethically and legally responsible developments. This contribution explores how current data management schemes are insufficient to adequately safeguard data subjects and, in particular, focuses on making these data flows transparent to provide an adequate level of accountability. To showcase this, and with the goal of enhancing transparency to foster trust, this paper investigates solutions for standardizing machine-readable policies that express personal data processing activities, and their application to decentralized personal data stores, as an example of ethically, legally, and technically responsible innovation in this field.
{"title":"\"Who Should I Trust with My Data?\" Ethical and Legal Challenges for Innovation in New Decentralized Data Management Technologies","authors":"Haleh Asgarinia, Andrés Chomczyk Penedo, Beatriz Esteves, David Bruce Lewis","doi":"10.3390/info14070351","DOIUrl":"https://doi.org/10.3390/info14070351","PeriodicalId":13622,"journal":{"name":"Inf. Comput."},"publicationDate":"2023-06-21","publicationTypes":"Journal Article","platform":"Semanticscholar","paperid":"77991119"}
Georgios Prapas, Kosmas Glavas, Katerina D. Tzimourta, A. Tzallas, M. Tsipouras
Brain-computer interfaces (BCIs) are becoming an increasingly popular technology, used in a variety of fields such as medicine, gaming, and lifestyle. This paper describes a 3D non-invasive BCI game that uses a Muse 2 EEG headband to acquire electroencephalogram (EEG) data and the OpenViBE platform to process the signals and classify them into three different mental states: left motor imagery, right motor imagery, and eye blink. The game was developed to assess user adjustment and improvement in a BCI environment after training. The classification algorithm used is a Multi-Layer Perceptron (MLP), which achieved 96.94% accuracy. A total of 33 subjects participated in the experiment and successfully controlled an avatar using mental commands to collect coins. The online metrics employed for this BCI system are the average game score, the average number of clusters, and the average user improvement.
{"title":"Mind the Move: Developing a Brain-Computer Interface Game with Left-Right Motor Imagery","authors":"Georgios Prapas, Kosmas Glavas, Katerina D. Tzimourta, A. Tzallas, M. Tsipouras","doi":"10.3390/info14070354","DOIUrl":"https://doi.org/10.3390/info14070354","PeriodicalId":13622,"journal":{"name":"Inf. Comput."},"publicationDate":"2023-06-21","publicationTypes":"Journal Article","platform":"Semanticscholar","paperid":"79951584"}
Aymen Lakehal, S. Lepreux, Christos Efstratiou, C. Kolski, Pavlos Nicolaou
Smartphone map-based pedestrian navigation is known to have a negative effect on the long-term acquisition of spatial knowledge and memorisation of landmarks. Landmark-based navigation has been proposed as an approach that can overcome such limitations. In this work, we investigate how different interaction technologies, namely smartphones and augmented reality (AR) glasses, can affect the acquisition of spatial knowledge when used to support landmark-based pedestrian navigation. We conducted a study involving 20 participants, using smartphones or augmented reality glasses for pedestrian navigation. We studied the effects of these systems on landmark memorisation and spatial knowledge acquisition over a period of time. Our results show statistically significant differences in spatial knowledge acquisition between the two technologies, with the augmented reality glasses enabling better memorisation of landmarks and paths.
{"title":"Spatial Knowledge Acquisition for Pedestrian Navigation: A Comparative Study between Smartphones and AR Glasses","authors":"Aymen Lakehal, S. Lepreux, Christos Efstratiou, C. Kolski, Pavlos Nicolaou","doi":"10.3390/info14070353","DOIUrl":"https://doi.org/10.3390/info14070353","PeriodicalId":13622,"journal":{"name":"Inf. Comput."},"publicationDate":"2023-06-21","publicationTypes":"Journal Article","platform":"Semanticscholar","paperid":"81532270"}
Musab T. S. Al-Kaltakchi, Ahmad Saeed Mohammad, W. Woo
Speech separation is a well-known problem, especially when only one sound mixture is available. Estimating the Ideal Binary Mask (IBM) is one solution to this problem. Recent research has focused on the supervised classification approach, for which extracting features from the sources is a critical challenge. Speech separation has been accomplished using a variety of feature extraction models. The majority of them, however, concentrate on a single feature, and the complementary nature of various features has not been thoroughly investigated. In this paper, we propose a deep neural network (DNN) ensemble architecture to fully explore the complementary nature of the diverse features obtained from raw acoustic features. We examined the penultimate discriminative representations instead of employing the features acquired from the output layer. The learned representations were also fused to produce a new feature vector, which was then classified using the Extreme Learning Machine (ELM). In addition, a genetic algorithm (GA) was created to optimize the parameters globally. The experimental results showed that our proposed system fully exploited the various features and produced a high-quality IBM under different conditions.
{"title":"Ensemble System of Deep Neural Networks for Single-Channel Audio Separation","authors":"Musab T. S. Al-Kaltakchi, Ahmad Saeed Mohammad, W. Woo","doi":"10.3390/info14070352","DOIUrl":"https://doi.org/10.3390/info14070352","PeriodicalId":13622,"journal":{"name":"Inf. Comput."},"publicationDate":"2023-06-21","publicationTypes":"Journal Article","platform":"Semanticscholar","paperid":"89763307"}
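The Ideal Binary Mask referred to in the abstract above is conventionally defined per time-frequency unit: a unit is kept (1) when the local target-to-interference ratio exceeds a threshold, and discarded (0) otherwise. The sketch below follows that standard definition; the function name, the 0 dB default threshold, and the toy energy values are assumptions, and the DNN ensemble, ELM, and GA stages of the paper are not shown.

```python
import math


def ideal_binary_mask(target_energy, interferer_energy, lc_db=0.0):
    """Build an Ideal Binary Mask from per-unit energies.

    Both inputs are 2D lists (time frames x frequency bins) of non-negative
    energies. A unit is set to 1 when its local SNR in dB exceeds lc_db.
    """
    mask = []
    for target_row, interferer_row in zip(target_energy, interferer_energy):
        row = []
        for s, n in zip(target_row, interferer_row):
            if s == 0.0:
                row.append(0)  # no target energy at all
            elif n == 0.0:
                row.append(1)  # target present, no interference
            else:
                snr_db = 10.0 * math.log10(s / n)
                row.append(1 if snr_db > lc_db else 0)
        mask.append(row)
    return mask


# Toy energies for 2 frames x 2 bins (illustration only).
target = [[4.0, 1.0], [0.5, 9.0]]
interferer = [[1.0, 2.0], [0.5, 3.0]]
mask = ideal_binary_mask(target, interferer)
```

Because the mask needs the separate target and interferer energies, it is available only when the sources are known; a supervised system such as the one described estimates it from the mixture alone.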