首页 > 最新文献

2022 2nd International Conference on Artificial Intelligence (ICAI)最新文献

英文 中文
Transferable Learning Architecture for Scalable Visual Quality Inspection 可扩展视觉质量检测的可转移学习架构
Pub Date : 2022-03-30 DOI: 10.1109/ICAI55435.2022.9773637
M. Talha, Sheikh Faisal Rashid, Zain Iftikhar, Muhammad Touseef Afzal, Liu Ying
In recent years, convolutional neural networks (CNNs) have become a de facto standard in computer vision for object detection and recognition. At present, CNNs have been used in many application areas including the automation of industrial manufacturing processes. But using CNN in a real-time environment to track defects on products has many shortcomings like long training time, large data requirements, slow inference time, dynamic environment, and hardware dependency. This paper evaluates the state-of-the-art CNN architectures for object detection to address the mentioned challenges and provide the best possible solution. A set of pre-trained models has been trained on just 781 annotated images by applying transfer learning. Experimental results showed that Faster RCNN with VGG-16 backbone outperforms the other models in case of accuracy and mAP. But RetinaNet with an FPN backbone has the fastest inference time on multi-scaled defects. Paper also presents the deployment pipeline for inference on mobile devices to use in a real-time environment without any special hardware. In addition, an improved dataset of submersible pump impellers, based on the existing Kaggle dataset is introduced.
近年来,卷积神经网络(cnn)已经成为计算机视觉对象检测和识别的事实上的标准。目前,cnn已经应用于包括工业制造过程自动化在内的许多应用领域。但在实时环境下使用CNN跟踪产品缺陷存在训练时间长、数据需求大、推理时间慢、环境动态性强、依赖硬件等缺点。本文评估了用于目标检测的最先进的CNN架构,以解决上述挑战并提供最佳解决方案。通过应用迁移学习,一组预训练模型仅在781张带注释的图像上进行了训练。实验结果表明,采用VGG-16骨干网的Faster RCNN在准确率和mAP方面都优于其他模型。而采用FPN骨干网的retanet对多尺度缺陷的推理速度最快。本文还介绍了在移动设备上进行推理的部署管道,以便在不需要任何特殊硬件的情况下在实时环境中使用。此外,在现有Kaggle数据集的基础上,介绍了一种改进的潜水泵叶轮数据集。
{"title":"Transferable Learning Architecture for Scalable Visual Quality Inspection","authors":"M. Talha, Sheikh Faisal Rashid, Zain Iftikhar, Muhammad Touseef Afzal, Liu Ying","doi":"10.1109/ICAI55435.2022.9773637","DOIUrl":"https://doi.org/10.1109/ICAI55435.2022.9773637","url":null,"abstract":"In recent years, convolutional neural networks (CNNs) have become a de facto standard in computer vision for object detection and recognition. At present, CNNs have been used in many application areas including the automation of industrial manufacturing processes. But using CNN in a real-time environment to track defects on products has many shortcomings like long training time, large data requirements, slow inference time, dynamic environment, and hardware dependency. This paper evaluates the state-of-the-art CNN architectures for object detection to address the mentioned challenges and provide the best possible solution. A set of pre-trained models has been trained on just 781 annotated images by applying transfer learning. Experimental results showed that Faster RCNN with VGG-16 backbone outperforms the other models in case of accuracy and mAP. But RetinaNet with an FPN backbone has the fastest inference time on multi-scaled defects. Paper also presents the deployment pipeline for inference on mobile devices to use in a real-time environment without any special hardware. In addition, an improved dataset of submersible pump impellers, based on the existing Kaggle dataset is introduced.","PeriodicalId":146842,"journal":{"name":"2022 2nd International Conference on Artificial Intelligence (ICAI)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-03-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133773444","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
COVID-19 Spread Prediction and Its Impact on the Stock market price COVID-19的传播预测及其对股票市场价格的影响
Pub Date : 2022-03-30 DOI: 10.1109/ICAI55435.2022.9773481
Musa Khan, G. M. Khan
Predicting the Covid-19 spread and its impact on the stock market is an important research challenge these days. In order to obtain the best forecasting model, we have exploited neuro-evolutionary technique Cartesian genetic programming evolved artificial neural network (CGPANN) based solution to predict the future cases of COVID-19 up to 6-days in advance. This helps authorities and paramedical staff to take precautionary measures on time which helps in counteracting the spreading of the virus. The rising number of COVID cases has caused a significant impact on the stock market. CGPANN being the best performer for the time series prediction model seems ideal for the case under consideration. The proposed model achieved an accuracy as high as 98% predicting COVID-19 cases for the next six days. When compared with other contemporary models CGPANN seems to perform well ahead in terms of accuracy.
预测新冠病毒的传播及其对股市的影响是目前一项重要的研究挑战。为了获得最佳预测模型,我们利用基于神经进化技术的基于笛卡尔遗传规划进化人工神经网络(CGPANN)的解决方案,提前6天预测未来COVID-19病例。这有助于当局和辅助医务人员及时采取预防措施,有助于遏制病毒的传播。新型冠状病毒感染症(COVID - 19)患者不断增加,给股市带来了巨大影响。CGPANN作为时间序列预测模型的最佳性能似乎是考虑中的情况的理想选择。该模型预测未来6天COVID-19病例的准确率高达98%。与其他当代模型相比,CGPANN似乎在准确性方面表现得很好。
{"title":"COVID-19 Spread Prediction and Its Impact on the Stock market price","authors":"Musa Khan, G. M. Khan","doi":"10.1109/ICAI55435.2022.9773481","DOIUrl":"https://doi.org/10.1109/ICAI55435.2022.9773481","url":null,"abstract":"Predicting the Covid-19 spread and its impact on the stock market is an important research challenge these days. In order to obtain the best forecasting model, we have exploited neuro-evolutionary technique Cartesian genetic programming evolved artificial neural network (CGPANN) based solution to predict the future cases of COVID-19 up to 6-days in advance. This helps authorities and paramedical staff to take precautionary measures on time which helps in counteracting the spreading of the virus. The rising number of COVID cases has caused a significant impact on the stock market. CGPANN being the best performer for the time series prediction model seems ideal for the case under consideration. The proposed model achieved an accuracy as high as 98% predicting COVID-19 cases for the next six days. When compared with other contemporary models CGPANN seems to perform well ahead in terms of accuracy.","PeriodicalId":146842,"journal":{"name":"2022 2nd International Conference on Artificial Intelligence (ICAI)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-03-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133394080","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Review on Different Approaches for Assessing Student Attentiveness in Classroom using Behavioural Elements 运用行为要素评价学生课堂注意力的不同方法综述
Pub Date : 2022-03-30 DOI: 10.1109/ICAI55435.2022.9773418
Kainat, Sara Ali, Fahad Iqbal Khawaja, Yasar Avaz, Muhammad Saiid
Analyzing one's participation and attention may be useful in a variety of contexts, like work situations such as driving a car, defusing a bomb, and many learning environments. Increasing the student's involvement and participation in the classroom has been proven to improve learning results. Attention is core for effective learning, yet analyzing attention is a tricky task. People have been working on attention analysis for decades, and as a result, current learning systems contain methods for monitoring and reporting on students' attention states. Facial features and eye movements are some of the important behavioural features to access attentiveness. Approaches such as EEG signals, gaze detection, head and body posture detection are used in this context as they provide rich information about a person's behavior and thoughts. It also gives essential information for interpreting their nonverbal, cues. These are referred to be “honest signals” since they are unconscious patterns that reveal the focus of our attention. They give vital indications concerning teaching methods and students' responses to various conscious and unconscious teaching tactics inside the classroom. Examining verbal and nonverbal conduct in the classroom can give valuable input to the instructor. This paper will go through various approaches available for analyzing student attentiveness for effective learning in the classroom. Integrating different technical approaches with Machine learning and Deep learning models accuracy up to 90% can be observed in different research with minimum error.
分析一个人的参与和注意力可能在各种情况下都很有用,比如开车、拆除炸弹等工作情况,以及许多学习环境。提高学生在课堂上的参与度和参与度已被证明可以提高学习效果。注意力是有效学习的核心,然而分析注意力是一项棘手的任务。几十年来,人们一直在研究注意力分析,因此,当前的学习系统包含了监测和报告学生注意力状态的方法。面部特征和眼球运动是获得注意力的一些重要行为特征。脑电图信号、凝视检测、头部和身体姿势检测等方法在这种情况下被使用,因为它们提供了关于一个人的行为和思想的丰富信息。这也为解读他们的非语言暗示提供了必要的信息。这些被称为“诚实的信号”,因为它们是揭示我们注意力焦点的无意识模式。他们对教学方法和学生对课堂内各种有意识和无意识的教学策略的反应给出了重要的指示。检查课堂上的言语和非言语行为可以为教师提供有价值的输入。本文将通过各种方法来分析学生在课堂上有效学习的注意力。将不同的技术方法与机器学习和深度学习模型相结合,可以在不同的研究中以最小的误差观察到高达90%的精度。
{"title":"A Review on Different Approaches for Assessing Student Attentiveness in Classroom using Behavioural Elements","authors":"Kainat, Sara Ali, Fahad Iqbal Khawaja, Yasar Avaz, Muhammad Saiid","doi":"10.1109/ICAI55435.2022.9773418","DOIUrl":"https://doi.org/10.1109/ICAI55435.2022.9773418","url":null,"abstract":"Analyzing one's participation and attention may be useful in a variety of contexts, like work situations such as driving a car, defusing a bomb, and many learning environments. Increasing the student's involvement and participation in the classroom has been proven to improve learning results. Attention is core for effective learning, yet analyzing attention is a tricky task. People have been working on attention analysis for decades, and as a result, current learning systems contain methods for monitoring and reporting on students' attention states. Facial features and eye movements are some of the important behavioural features to access attentiveness. Approaches such as EEG signals, gaze detection, head and body posture detection are used in this context as they provide rich information about a person's behavior and thoughts. It also gives essential information for interpreting their nonverbal, cues. These are referred to be “honest signals” since they are unconscious patterns that reveal the focus of our attention. They give vital indications concerning teaching methods and students' responses to various conscious and unconscious teaching tactics inside the classroom. Examining verbal and nonverbal conduct in the classroom can give valuable input to the instructor. This paper will go through various approaches available for analyzing student attentiveness for effective learning in the classroom. Integrating different technical approaches with Machine learning and Deep learning models accuracy up to 90% can be observed in different research with minimum error.","PeriodicalId":146842,"journal":{"name":"2022 2nd International Conference on Artificial Intelligence (ICAI)","volume":"67 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-03-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116282760","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Wheat Rust Disease Classification using Edge-AI 基于Edge-AI的小麦锈病分类
Pub Date : 2022-03-30 DOI: 10.1109/ICAI55435.2022.9773489
Ihsan UI Haq, R. Mumtaz, M. Talha, Zunaira Shafaq, M. Owais
Wheat leaf rust is considered one of the most detrimental fungal diseases that spread rapidly after its first appearance and can significantly damage the entire crop field. This can lead to a severe decline in wheat yield, posing a serious threat to food security considering an unceasing growth in the country's population. The conventional method of wheat rust detection is visual inspection, which is an ineffective and unsuitable approach for large agricultural lands. Additionally, such monitoring is solely dependent on the farmer's knowledge base and experience. Towards such an end, an Edge AI-based system for detecting and classifying wheat leaves into healthy and rusted leaves in real-time is proposed. The dataset collected is analyzed with several machine learning-based classifiers where Random Forest outperformed with a classification accuracy of 97.3% and 82.8% using Gray Level Co-occurrence Matrix (GLCM) and binary feature extraction techniques respectively. In addition, a Deep Convolution Neural Network (DCNN) model is explored to classify rusted and healthy leaves, which showed an accuracy of 88.33 %. This trained DCNN model is also deployed on the edge device for real-time classification of wheat rust disease. The developed system would contribute to promoting technology-based solutions over old farming practices and assist in minimizing the spread of wheat rust disease.
小麦叶锈病被认为是最有害的真菌病害之一,它在首次出现后迅速传播,可以严重损害整个农田。这可能导致小麦产量严重下降,考虑到该国人口的不断增长,对粮食安全构成严重威胁。传统的小麦锈病检测方法是目测检测,对于大面积农田来说目测检测是一种无效且不适合的方法。此外,这种监测完全依赖于农民的知识基础和经验。为此,提出了一种基于边缘人工智能的小麦健康叶片和锈蚀叶片实时检测与分类系统。使用几种基于机器学习的分类器对收集到的数据集进行分析,其中Random Forest使用灰度共生矩阵(GLCM)和二元特征提取技术分别以97.3%和82.8%的分类准确率优于随机森林。此外,利用深度卷积神经网络(Deep Convolution Neural Network, DCNN)模型对锈叶和健康叶进行分类,准确率达到88.33%。该训练好的DCNN模型也被部署在边缘设备上,用于小麦锈病的实时分类。发达的系统将有助于促进以技术为基础的解决方案,而不是旧的耕作方式,并有助于最大限度地减少小麦锈病的传播。
{"title":"Wheat Rust Disease Classification using Edge-AI","authors":"Ihsan UI Haq, R. Mumtaz, M. Talha, Zunaira Shafaq, M. Owais","doi":"10.1109/ICAI55435.2022.9773489","DOIUrl":"https://doi.org/10.1109/ICAI55435.2022.9773489","url":null,"abstract":"Wheat leaf rust is considered one of the most detrimental fungal diseases that spread rapidly after its first appearance and can significantly damage the entire crop field. This can lead to a severe decline in wheat yield, posing a serious threat to food security considering an unceasing growth in the country's population. The conventional method of wheat rust detection is visual inspection, which is an ineffective and unsuitable approach for large agricultural lands. Additionally, such monitoring is solely dependent on the farmer's knowledge base and experience. Towards such an end, an Edge AI-based system for detecting and classifying wheat leaves into healthy and rusted leaves in real-time is proposed. The dataset collected is analyzed with several machine learning-based classifiers where Random Forest outperformed with a classification accuracy of 97.3% and 82.8% using Gray Level Co-occurrence Matrix (GLCM) and binary feature extraction techniques respectively. In addition, a Deep Convolution Neural Network (DCNN) model is explored to classify rusted and healthy leaves, which showed an accuracy of 88.33 %. This trained DCNN model is also deployed on the edge device for real-time classification of wheat rust disease. The developed system would contribute to promoting technology-based solutions over old farming practices and assist in minimizing the spread of wheat rust disease.","PeriodicalId":146842,"journal":{"name":"2022 2nd International Conference on Artificial Intelligence (ICAI)","volume":"69 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-03-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126100245","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Wheat Crop Field and Yield Prediction using Remote Sensing and Machine Learning 基于遥感和机器学习的小麦作物田与产量预测
Pub Date : 2022-03-30 DOI: 10.1109/ICAI55435.2022.9773663
Maheen Ayub, N. A. Khan, R. Haider
Agriculture plays an important role in the growth of a country's economy. Crop area and yield predictions using machine learning are important investigation domains in current research fields. Wheat is the most important food crop in Pakistan which is cultivated in the Rabi season. Weather conditions, Remote Sensing (RS) data, and Machine learning (ML) technologies can be used to forecast wheat yield before actual harvesting to assist the management of wheat production, trade, and storage. In this paper, a supervised ML based framework is proposed that extracts features/Vegetation Indices (VIs) including Enhanced Vegetation Index (EVI), Normalized Difference Vegetation Index (NDVI), Red Edge Normalized Difference Vegetation Index (RENDVI), and Normalized Difference Moisture Index (NDMI) from Sentinel-2 Satellite images and contributes for: estimation of wheat area, and identification of most effective VIs in wheat area estimation, prediction of wheat yield, and identification of most effective VIs and meteorological parameters in wheat yield prediction. In the initial experimental setup, good performance output obtained using the Random Forest (RF) machine learning algorithm therefore in this framework RF machine learning algorithm is focused on wheat area estimation and generation of Land Use Land Cover (LULC) maps which is capable of estimating area with an accuracy of 84%, consumer's accuracy of 81 %, producer's accuracy of 83% and kappa statistics of 0.80. LULC maps are used for wheat yield prediction. Multivariate regression forward stepwise technique is applied for yield prediction and selection of effective VIs and meteorological parameters. The adjusted coefficient of determination (R2) between reported and predicted yield found 0.84 with an error of 46.14 Kg/ha for yield prediction.
农业在一个国家的经济发展中起着重要作用。利用机器学习进行作物面积和产量预测是当前研究领域的重要研究领域。小麦是巴基斯坦最重要的粮食作物,在拉比季节种植。天气条件、遥感(RS)数据和机器学习(ML)技术可用于在实际收获前预测小麦产量,以协助小麦生产、贸易和储存的管理。本文提出了一个基于监督机器学习的框架,从Sentinel-2卫星图像中提取特征/植被指数(VIs),包括增强植被指数(EVI)、归一化植被指数(NDVI)、红边归一化植被指数(RENDVI)和归一化水分指数(NDMI),并对以下方面做出了贡献:小麦面积估算与最有效VIs识别;小麦产量预测与最有效VIs与气象参数识别。在最初的实验设置中,使用随机森林(RF)机器学习算法获得了良好的性能输出,因此在该框架中,RF机器学习算法专注于小麦面积估计和土地利用土地覆盖(LULC)地图的生成,该算法能够以84%的精度估计面积,消费者的精度为81%,生产者的精度为83%,kappa统计量为0.80。LULC地图用于小麦产量预测。采用多元正逐步回归技术进行产量预测和有效VIs及气象参数的选择。报告产量与预测产量的校正决定系数(R2)为0.84,预测误差为46.14 Kg/ha。
{"title":"Wheat Crop Field and Yield Prediction using Remote Sensing and Machine Learning","authors":"Maheen Ayub, N. A. Khan, R. Haider","doi":"10.1109/ICAI55435.2022.9773663","DOIUrl":"https://doi.org/10.1109/ICAI55435.2022.9773663","url":null,"abstract":"Agriculture plays an important role in the growth of a country's economy. Crop area and yield predictions using machine learning are important investigation domains in current research fields. Wheat is the most important food crop in Pakistan which is cultivated in the Rabi season. Weather conditions, Remote Sensing (RS) data, and Machine learning (ML) technologies can be used to forecast wheat yield before actual harvesting to assist the management of wheat production, trade, and storage. In this paper, a supervised ML based framework is proposed that extracts features/Vegetation Indices (VIs) including Enhanced Vegetation Index (EVI), Normalized Difference Vegetation Index (NDVI), Red Edge Normalized Difference Vegetation Index (RENDVI), and Normalized Difference Moisture Index (NDMI) from Sentinel-2 Satellite images and contributes for: estimation of wheat area, and identification of most effective VIs in wheat area estimation, prediction of wheat yield, and identification of most effective VIs and meteorological parameters in wheat yield prediction. In the initial experimental setup, good performance output obtained using the Random Forest (RF) machine learning algorithm therefore in this framework RF machine learning algorithm is focused on wheat area estimation and generation of Land Use Land Cover (LULC) maps which is capable of estimating area with an accuracy of 84%, consumer's accuracy of 81 %, producer's accuracy of 83% and kappa statistics of 0.80. LULC maps are used for wheat yield prediction. Multivariate regression forward stepwise technique is applied for yield prediction and selection of effective VIs and meteorological parameters. The adjusted coefficient of determination (R2) between reported and predicted yield found 0.84 with an error of 46.14 Kg/ha for yield prediction.","PeriodicalId":146842,"journal":{"name":"2022 2nd International Conference on Artificial Intelligence (ICAI)","volume":"28 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-03-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122453662","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Evolving computationally efficient prediction model for Stock Volatility using CGPANN 基于CGPANN的股票波动率快速预测模型
Pub Date : 2022-03-30 DOI: 10.1109/ICAI55435.2022.9773706
Niaz Muhammad, Syed Waqar Shah, G. M. Khan
Financial market volatility has become one of the most difficult applications for stock price forecasting in ongoing situations. The current statistical models for stock price forecasting are too rigid and inefficient to appropriately deal with the uncertainty and volatility inherent in stock data. CGPANN-CGP based ANNs and LSTM are the most common methods used these days to predict such dynamics in time series data. In comparison to other methodologies, studies have demonstrated that the application of Cartesian genetic programming evolved Artificial Neural Networks (CGPANNs) to time series forecasting problems produces better results, and LSTM can be competitive at times. CGPANN provides the ability to train both structure, topology, and weights of network to achieve the global optimum solution. The prediction model is trained on the behavior of stock exchange patterns and is based on trends in historical daily stock prices. The proposed CGPANN and LSTM models produced competitive results of 98.86% and 98.52% respectively. However, CGPANN architecture is capable computationally efficient than LSTM and its ability of quick predictions makes it ideal for real-time applications.
金融市场波动已成为当前形势下股票价格预测最困难的应用之一。现有的股票价格预测统计模型过于僵化,效率低下,无法恰当地处理股票数据固有的不确定性和波动性。基于CGPANN-CGP的神经网络和LSTM是目前最常用的预测时间序列数据动态的方法。与其他方法相比,研究表明,将笛卡尔遗传规划进化的人工神经网络(CGPANNs)应用于时间序列预测问题可以产生更好的结果,并且LSTM有时可以具有竞争力。CGPANN提供了训练网络结构、拓扑和权值的能力,以实现全局最优解。预测模型是根据股票交易模式的行为进行训练的,并基于历史每日股票价格的趋势。所提出的CGPANN和LSTM模型的竞争结果分别为98.86%和98.52%。然而,CGPANN体系结构的计算效率比LSTM高,其快速预测的能力使其成为实时应用的理想选择。
{"title":"Evolving computationally efficient prediction model for Stock Volatility using CGPANN","authors":"Niaz Muhammad, Syed Waqar Shah, G. M. Khan","doi":"10.1109/ICAI55435.2022.9773706","DOIUrl":"https://doi.org/10.1109/ICAI55435.2022.9773706","url":null,"abstract":"Financial market volatility has become one of the most difficult applications for stock price forecasting in ongoing situations. The current statistical models for stock price forecasting are too rigid and inefficient to appropriately deal with the uncertainty and volatility inherent in stock data. CGPANN-CGP based ANNs and LSTM are the most common methods used these days to predict such dynamics in time series data. In comparison to other methodologies, studies have demonstrated that the application of Cartesian genetic programming evolved Artificial Neural Networks (CGPANNs) to time series forecasting problems produces better results, and LSTM can be competitive at times. CGPANN provides the ability to train both structure, topology, and weights of network to achieve the global optimum solution. The prediction model is trained on the behavior of stock exchange patterns and is based on trends in historical daily stock prices. The proposed CGPANN and LSTM models produced competitive results of 98.86% and 98.52% respectively. However, CGPANN architecture is capable computationally efficient than LSTM and its ability of quick predictions makes it ideal for real-time applications.","PeriodicalId":146842,"journal":{"name":"2022 2nd International Conference on Artificial Intelligence (ICAI)","volume":"90 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-03-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116002570","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Automated Dubbing and Facial Synchronization using Deep Learning 使用深度学习的自动配音和面部同步
Pub Date : 2022-03-30 DOI: 10.1109/ICAI55435.2022.9773697
Saad A. Bazaz, AbdurRehman Subhani, Syed Z.A. Hadi
With the recent global boom in video content creation and consumption during the pandemic, linguistics remains the only barrier in producing im-mersive content for global communities. To solve this, content creators use a manual dubbing process, where voice actors are hired to produce a “voiceover” over the video. We aim to break down the language barrier and thus make “videos for everyone”. We propose an end-to-end architecture that automatically translates videos and produces synchronized dubbed voices using deep learning models, in a specified target language. Our architecture takes a modular approach, allowing the user to tweak each component or replace it with a better one. We present our results from said architecture, and describe possible future motivations to scale this to accommodate multiple languages and multiple use cases. A sample of our results can be found here: https://youtu.be/eGB-gL6bDr4
随着最近全球视频内容创作和消费在疫情期间的繁荣,语言学仍然是为全球社区制作沉浸式内容的唯一障碍。为了解决这个问题,内容创作者使用手动配音过程,即聘请配音演员为视频制作“画外音”。我们的目标是打破语言障碍,制作“人人都能看的视频”。我们提出了一个端到端架构,该架构使用深度学习模型以指定的目标语言自动翻译视频并产生同步的配音声音。我们的架构采用模块化方法,允许用户调整每个组件或用更好的组件替换它。我们展示了从上述体系结构中得到的结果,并描述了将来扩展该体系结构以适应多种语言和多种用例的可能动机。我们的结果样本可以在这里找到:https://youtu.be/eGB-gL6bDr4
{"title":"Automated Dubbing and Facial Synchronization using Deep Learning","authors":"Saad A. Bazaz, AbdurRehman Subhani, Syed Z.A. Hadi","doi":"10.1109/ICAI55435.2022.9773697","DOIUrl":"https://doi.org/10.1109/ICAI55435.2022.9773697","url":null,"abstract":"With the recent global boom in video content creation and consumption during the pandemic, linguistics remains the only barrier in producing im-mersive content for global communities. To solve this, content creators use a manual dubbing process, where voice actors are hired to produce a “voiceover” over the video. We aim to break down the language barrier and thus make “videos for everyone”. We propose an end-to-end architecture that automatically translates videos and produces synchronized dubbed voices using deep learning models, in a specified target language. Our architecture takes a modular approach, allowing the user to tweak each component or replace it with a better one. We present our results from said architecture, and describe possible future motivations to scale this to accommodate multiple languages and multiple use cases. A sample of our results can be found here: https://youtu.be/eGB-gL6bDr4","PeriodicalId":146842,"journal":{"name":"2022 2nd International Conference on Artificial Intelligence (ICAI)","volume":"53 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-03-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127274927","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Segmentation of Brain Tumor from Medical Images with Novel U-Shaped Encoder Decoder Architecture 基于新型u型编码器结构的医学图像脑肿瘤分割
Pub Date : 2022-03-30 DOI: 10.1109/ICAI55435.2022.9773383
Farzana Mushtaq, Faisal Rehman, Hira Akram, Sameen Butt, Syeda Fareeha Batool, Maheen Jafer, Nadeem Sarfaraz, Anza Gul
One of the Challenging tasks in medical field and computer vision is automatic brain segmentation with MRI (Magnetic Resonance Images). From the literature, the importance of deep neural networks is cleared as they have provided effective results in brain tumor segmentation problem in terms of accuracy and time. Mostly the training time is issued due to image features and for this purpose extra computational power is required to train the neural network model. The gradient problem is overcome in this study to fine tune the Novel unit model. CNNs & U-Shaped encoder decoder architectures produce effective result than other neural networks in terms of accuracy and time. The comparison is also performed in this study to show the robustness of U- Shaped encoder decoder architecture. Novel encoder and decoder model accuracy is 0.947 %that is better than other neural networks e.g., CNNs. Further this model is roughly three time faster than other models in terms of training time that's why less computation power is required to train this model.
利用核磁共振图像对大脑进行自动分割是医学领域和计算机视觉领域具有挑战性的任务之一。从文献来看,深度神经网络的重要性是明确的,因为它在准确性和时间上为脑肿瘤分割问题提供了有效的结果。大多数情况下,训练时间是根据图像的特征来分配的,为此需要额外的计算能力来训练神经网络模型。本文克服了梯度问题,对新单元模型进行了微调。cnn & u型编码器和解码器结构在精度和时间方面比其他神经网络产生更有效的结果。本研究还进行了比较,以证明U型编码器解码器架构的鲁棒性。新颖的编码器和解码器模型精度为0.947%,优于cnn等其他神经网络。此外,就训练时间而言,这个模型大约比其他模型快三倍,这就是为什么训练这个模型所需的计算能力更少。
{"title":"Segmentation of Brain Tumor from Medical Images with Novel U-Shaped Encoder Decoder Architecture","authors":"Farzana Mushtaq, Faisal Rehman, Hira Akram, Sameen Butt, Syeda Fareeha Batool, Maheen Jafer, Nadeem Sarfaraz, Anza Gul","doi":"10.1109/ICAI55435.2022.9773383","DOIUrl":"https://doi.org/10.1109/ICAI55435.2022.9773383","url":null,"abstract":"One of the Challenging tasks in medical field and computer vision is automatic brain segmentation with MRI (Magnetic Resonance Images). From the literature, the importance of deep neural networks is cleared as they have provided effective results in brain tumor segmentation problem in terms of accuracy and time. Mostly the training time is issued due to image features and for this purpose extra computational power is required to train the neural network model. The gradient problem is overcome in this study to fine tune the Novel unit model. CNNs & U-Shaped encoder decoder architectures produce effective result than other neural networks in terms of accuracy and time. The comparison is also performed in this study to show the robustness of U- Shaped encoder decoder architecture. Novel encoder and decoder model accuracy is 0.947 %that is better than other neural networks e.g., CNNs. Further this model is roughly three time faster than other models in terms of training time that's why less computation power is required to train this model.","PeriodicalId":146842,"journal":{"name":"2022 2nd International Conference on Artificial Intelligence (ICAI)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-03-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124480940","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Constraint Free Early Warning System for Flood Using Multivariate LSTM Network 基于多元LSTM网络的无约束洪水预警系统
Pub Date : 2022-03-30 DOI: 10.1109/ICAI55435.2022.9773495
Touqir Gohar, L. Hasan, G. M. Khan, Mehreen Mubashir
Floods are the world's most damaging natural disasters, which not only claim thousands of human lives but also result in huge damage to infrastructure. Floods if forecasted in advance can help in the reduction of damages. Flood prediction especially long term is a complex task as it involves many hydrological and metrological parameters. For the short and medium-term, machine learning methods seem to have contributed to a great extent in simulating mathematical modelling of the physical flow processes of floods. However, these developed model's performance lacks generalization. Such systems trained on one geographical location's data have degraded performance when exploited for another location. In this paper, Long Short-Term Memory (LSTM) machine learning algorithm was applied where the hourly river level, river flow, and rainfall data from Brooklyn station was used as input data to the model and test for one hour, two hours, four hours, six hours, eight hours, and twelve hours in advance for river level prediction at Hoppers Crossing station. The developed algorithm achieved an accuracy of 98% for one hour and 97.2 %, 96.14 %, 94.67%,94.61 %, and 93.55% for two, four, six, eight, and twelve hours respectively. These systems not only forecast the future water level but also help in estimating the water level in case of a sensor failure. Multivariate modelling is utilized to predict the unknown parameter from the given other parametric values, thus not only predicting the forecasted water level but also reporting the sensor failure.
洪水是世界上最具破坏性的自然灾害,它不仅夺去成千上万人的生命,而且对基础设施造成巨大破坏。如果提前预报洪水,可以帮助减少损失。洪水预测是一项复杂的任务,因为它涉及许多水文和气象参数。从短期和中期来看,机器学习方法似乎在很大程度上有助于模拟洪水物理流动过程的数学建模。然而,这些开发的模型的性能缺乏泛化。这种在一个地理位置的数据上训练的系统在用于另一个位置时性能会下降。本文采用LSTM (Long - Short-Term Memory)机器学习算法,将布鲁克林站每小时的河流水位、河流流量和降雨量数据作为模型的输入数据,并在Hoppers Crossing站提前1小时、2小时、4小时、6小时、8小时和12小时进行水位预测测试。该算法1小时的准确率为98%,2小时、4小时、6小时、8小时和12小时的准确率分别为97.2%、96.14%、94.67%、94.61%和93.55%。这些系统不仅可以预测未来的水位,还可以在传感器故障的情况下帮助估计水位。利用多元模型从给定的其他参数值中预测未知参数,不仅可以预测预测水位,还可以报告传感器故障。
{"title":"Constraint Free Early Warning System for Flood Using Multivariate LSTM Network","authors":"Touqir Gohar, L. Hasan, G. M. Khan, Mehreen Mubashir","doi":"10.1109/ICAI55435.2022.9773495","DOIUrl":"https://doi.org/10.1109/ICAI55435.2022.9773495","url":null,"abstract":"Floods are the world's most damaging natural disasters, which not only claim thousands of human lives but also result in huge damage to infrastructure. Floods if forecasted in advance can help in the reduction of damages. Flood prediction especially long term is a complex task as it involves many hydrological and metrological parameters. For the short and medium-term, machine learning methods seem to have contributed to a great extent in simulating mathematical modelling of the physical flow processes of floods. However, these developed model's performance lacks generalization. Such systems trained on one geographical location's data have degraded performance when exploited for another location. In this paper, Long Short-Term Memory (LSTM) machine learning algorithm was applied where the hourly river level, river flow, and rainfall data from Brooklyn station was used as input data to the model and test for one hour, two hours, four hours, six hours, eight hours, and twelve hours in advance for river level prediction at Hoppers Crossing station. The developed algorithm achieved an accuracy of 98% for one hour and 97.2 %, 96.14 %, 94.67%,94.61 %, and 93.55% for two, four, six, eight, and twelve hours respectively. These systems not only forecast the future water level but also help in estimating the water level in case of a sensor failure. Multivariate modelling is utilized to predict the unknown parameter from the given other parametric values, thus not only predicting the forecasted water level but also reporting the sensor failure.","PeriodicalId":146842,"journal":{"name":"2022 2nd International Conference on Artificial Intelligence (ICAI)","volume":"48 8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-03-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132994465","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Man in the Middle Attack Detection for MQTT based IoT devices using different Machine Learning Algorithms 使用不同机器学习算法对基于MQTT的物联网设备进行中间人攻击检测
Pub Date : 2022-03-30 DOI: 10.1109/ICAI55435.2022.9773590
Ali Bin Mazhar Sultan, S. Mehmood, Hamza Zahid
The usage of appropriate data communication protocols is critical for long-term Internet of Things (IoT) implementation and operation. The publish/subscribe-based Message Queuing Telemetry Transport (MQTT) protocol is widely used in the IoT world. Cyber threats on devices and networks using MQTT protocols are expected to rise with the protocol's growing popularity among IoT manufacturers. Among these threats is the man in the middle (MiTM) threat, in which an attacker listens in on or modifies traffic between two parties by intercepting conversations between them. In this paper we have implemented five different machine learning model on an open-source dataset and evaluated different parameters like accuracy, precision, recall, F1 score and most importantly training time and test time because most of IoT network are hosted on resource constrained devices like Raspberry Pi.
使用合适的数据通信协议对于物联网(IoT)的长期实施和运行至关重要。基于发布/订阅的消息队列遥测传输(MQTT)协议广泛应用于物联网领域。随着MQTT协议在物联网制造商中的日益普及,对使用MQTT协议的设备和网络的网络威胁预计会上升。在这些威胁中有中间人(MiTM)威胁,攻击者通过拦截双方之间的对话来监听或修改双方之间的流量。在本文中,我们在一个开源数据集上实现了五种不同的机器学习模型,并评估了不同的参数,如准确性,精度,召回率,F1分数以及最重要的训练时间和测试时间,因为大多数物联网网络托管在资源受限的设备上,如树莓派。
{"title":"Man in the Middle Attack Detection for MQTT based IoT devices using different Machine Learning Algorithms","authors":"Ali Bin Mazhar Sultan, S. Mehmood, Hamza Zahid","doi":"10.1109/ICAI55435.2022.9773590","DOIUrl":"https://doi.org/10.1109/ICAI55435.2022.9773590","url":null,"abstract":"The usage of appropriate data communication protocols is critical for long-term Internet of Things (IoT) implementation and operation. The publish/subscribe-based Message Queuing Telemetry Transport (MQTT) protocol is widely used in the IoT world. Cyber threats on devices and networks using MQTT protocols are expected to rise with the protocol's growing popularity among IoT manufacturers. Among these threats is the man in the middle (MiTM) threat, in which an attacker listens in on or modifies traffic between two parties by intercepting conversations between them. In this paper we have implemented five different machine learning model on an open-source dataset and evaluated different parameters like accuracy, precision, recall, F1 score and most importantly training time and test time because most of IoT network are hosted on resource constrained devices like Raspberry Pi.","PeriodicalId":146842,"journal":{"name":"2022 2nd International Conference on Artificial Intelligence (ICAI)","volume":"302 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-03-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121266329","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
期刊
2022 2nd International Conference on Artificial Intelligence (ICAI)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1