首页 > 最新文献

2021 24th International Conference on Computer and Information Technology (ICCIT)最新文献

英文 中文
An Improved K-means Clustering Algorithm for Multi-dimensional Multi-cluster data Using Meta-heuristics 基于元启发式的多维多聚类数据改进k -均值聚类算法
Pub Date : 2021-12-18 DOI: 10.1109/ICCIT54785.2021.9689836
Faisal Bin Ashraf, Abdul Matin, Md. Shafiur Raihan Shafi, Muhammad Usama Islam
k-means is the most widely used clustering algorithm which is an unsupervised technique that needs assumptions of centroids to begin the process. Hence, the problem is NP-hard and needs careful consideration and optimization to get a better quality of clusters of data. In this work, a meta-heuristic based genetic algorithm is proposed to optimize the centroid initialization process. The proposed method includes tournament selection, probability-based mutation, and elitism that leads to finding the optimal centroids for the clusters of a given dataset. Nine different and diversified datasets were used to test the performance of the proposed method in terms of the davies-bouldin index and it performed better in all the datasets than the standard k-means and minibatch k-means algorithm.
K-means是使用最广泛的聚类算法,它是一种无监督技术,需要假设质心来开始聚类过程。因此,这个问题是np困难的,需要仔细考虑和优化,以获得更好质量的数据簇。本文提出了一种基于元启发式的遗传算法来优化质心初始化过程。提出的方法包括锦标赛选择、基于概率的突变和精英主义,精英主义导致找到给定数据集的簇的最佳质心。用9个不同的数据集测试了该方法在davies-bouldin指数方面的性能,结果表明该方法在所有数据集上的性能都优于标准k-means和minibatch k-means算法。
{"title":"An Improved K-means Clustering Algorithm for Multi-dimensional Multi-cluster data Using Meta-heuristics","authors":"Faisal Bin Ashraf, Abdul Matin, Md. Shafiur Raihan Shafi, Muhammad Usama Islam","doi":"10.1109/ICCIT54785.2021.9689836","DOIUrl":"https://doi.org/10.1109/ICCIT54785.2021.9689836","url":null,"abstract":"k-means is the most widely used clustering algorithm which is an unsupervised technique that needs assumptions of centroids to begin the process. Hence, the problem is NP-hard and needs careful consideration and optimization to get a better quality of clusters of data. In this work, a meta-heuristic based genetic algorithm is proposed to optimize the centroid initialization process. The proposed method includes tournament selection, probability-based mutation, and elitism that leads to finding the optimal centroids for the clusters of a given dataset. Nine different and diversified datasets were used to test the performance of the proposed method in terms of the davies-bouldin index and it performed better in all the datasets than the standard k-means and minibatch k-means algorithm.","PeriodicalId":166450,"journal":{"name":"2021 24th International Conference on Computer and Information Technology (ICCIT)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133686801","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Toxicity Classification on Music Lyrics Using Machine Learning Algorithms 使用机器学习算法对音乐歌词进行毒性分类
Pub Date : 2021-12-18 DOI: 10.1109/ICCIT54785.2021.9689865
Md. Abdus Siddique, Md Imran Sarker, R. Ghosh, K. Gosh
Music lyrics have a broad scope of impacts on our day-to-day life. The connection between music and the cerebrum has been extensively studied as far as feeling and intellectual interaction. From school children to strict adherents, the audience has the right to taste great music. For example, men presented with physically rough verses tend to more generalized perspectives toward ladies. Listening to particularly toxic or nontoxic songs can affect our mood. Music recommendation system follows different features based on the user’s historical data. The listener’s mode could be improved if the recommendation system filters out toxicity. In this study, we classify lyrics in terms of toxicity and nontoxicity from different music genres and artists using machine learning (ML) algorithms. The toxicity and nontoxicity have been measured using high valence and low valence. From the results, we found that Random Forest (RF) is a much more effective toxicity classification classifier, giving an overall accuracy of 93%.
歌词对我们的日常生活有着广泛的影响。音乐和大脑之间的联系在感觉和智力互动方面得到了广泛的研究。从学生到严格的信徒,听众都有权利品尝伟大的音乐。例如,看到身体粗糙的诗句的男性倾向于对女性有更笼统的看法。听特别有毒或无害的歌曲会影响我们的情绪。音乐推荐系统根据用户的历史数据遵循不同的特征。如果推荐系统能够过滤掉有害信息,那么听众的模式将会得到改善。在这项研究中,我们使用机器学习(ML)算法,根据不同音乐流派和艺术家的毒性和非毒性对歌词进行分类。用高价和低价测定了其毒性和无毒性。从结果中,我们发现随机森林(RF)是一种更有效的毒性分类器,总体准确率为93%。
{"title":"Toxicity Classification on Music Lyrics Using Machine Learning Algorithms","authors":"Md. Abdus Siddique, Md Imran Sarker, R. Ghosh, K. Gosh","doi":"10.1109/ICCIT54785.2021.9689865","DOIUrl":"https://doi.org/10.1109/ICCIT54785.2021.9689865","url":null,"abstract":"Music lyrics have a broad scope of impacts on our day-to-day life. The connection between music and the cerebrum has been extensively studied as far as feeling and intellectual interaction. From school children to strict adherents, the audience has the right to taste great music. For example, men presented with physically rough verses tend to more generalized perspectives toward ladies. Listening to particularly toxic or nontoxic songs can affect our mood. Music recommendation system follows different features based on the user’s historical data. The listener’s mode could be improved if the recommendation system filters out toxicity. In this study, we classify lyrics in terms of toxicity and nontoxicity from different music genres and artists using machine learning (ML) algorithms. The toxicity and nontoxicity have been measured using high valence and low valence. From the results, we found that Random Forest (RF) is a much more effective toxicity classification classifier, giving an overall accuracy of 93%.","PeriodicalId":166450,"journal":{"name":"2021 24th International Conference on Computer and Information Technology (ICCIT)","volume":"53 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130106170","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
A Deep Neural Network for Multi-class Dry Beans Classification 基于深度神经网络的多类干豆分类
Pub Date : 2021-12-18 DOI: 10.1109/ICCIT54785.2021.9689905
M. Hasan, Muhammad Usama Islam, M. Sadeq
The technological explosion has paved the way for agriculture to flourish exponentially thus contributing to better yield of crops through the aid of machine learning, the Internet of things, mechanical systems in agriculture. In our research work, we have investigated various types of dry beans followed by a deep neural network based approach to classify the beans automatically. The results shows that our approach had an accuracy of 93.44%, and an F-1 score of 94.57%, with the dataset that consisted of 7 varieties of dry beans. Our results, which performed substantially better in comparison to traditional machine learning approaches aided us to devise further research scopes in the field of agricultural machine learning.
技术爆炸为农业的迅猛发展铺平了道路,从而通过机器学习、物联网和农业机械系统的帮助,提高了农作物的产量。在我们的研究工作中,我们研究了各种类型的干豆,然后采用基于深度神经网络的方法对豆类进行自动分类。结果表明,在7个干豆品种的数据集上,我们的方法准确率为93.44%,F-1得分为94.57%。与传统的机器学习方法相比,我们的结果表现得更好,这有助于我们在农业机器学习领域设计进一步的研究范围。
{"title":"A Deep Neural Network for Multi-class Dry Beans Classification","authors":"M. Hasan, Muhammad Usama Islam, M. Sadeq","doi":"10.1109/ICCIT54785.2021.9689905","DOIUrl":"https://doi.org/10.1109/ICCIT54785.2021.9689905","url":null,"abstract":"The technological explosion has paved the way for agriculture to flourish exponentially thus contributing to better yield of crops through the aid of machine learning, the Internet of things, mechanical systems in agriculture. In our research work, we have investigated various types of dry beans followed by a deep neural network based approach to classify the beans automatically. The results shows that our approach had an accuracy of 93.44%, and an F-1 score of 94.57%, with the dataset that consisted of 7 varieties of dry beans. Our results, which performed substantially better in comparison to traditional machine learning approaches aided us to devise further research scopes in the field of agricultural machine learning.","PeriodicalId":166450,"journal":{"name":"2021 24th International Conference on Computer and Information Technology (ICCIT)","volume":"83 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134071986","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Feature and Performance Based Comparative Study on Serverless Frameworks 基于特性和性能的无服务器框架比较研究
Pub Date : 2021-12-18 DOI: 10.1109/ICCIT54785.2021.9689779
Sabiha Nasrin, T.I.M. Fahim Sahryer, A. B. M. A. Al Islam, Jannatun Noor
We conduct experiments on the different public clouds provided in this paper to bring out a comparative study, to help the developers understand the diversity of the platforms. This study will help them choose a suitable platform for their desired application. The contemporary and future usage of this diverse nature of cloud computing is hugely demanding. Thus, we elaborate our study on serverless cloud computing to suffice the demand. Serverless prevents a great deal of unessential consumption of power and is a pay-as-you-go service. This technology has added a great impact on software and application development. Although the major obstacle to this development field is that there is not enough documentation on how the big companies provide this facility and how their architecture is built. The comparative study on this diverse platform is missing in the literature. Therefore, our research is based on the on-demand serverless use cases and comparative study with necessary measures. This can be effective and efficient to use for further serverless implementation. Hence, we and others can follow our research for understanding the technical complexity.
我们在本文提供的不同公有云上进行了实验,进行了对比研究,帮助开发人员了解平台的多样性。这项研究将帮助他们为他们想要的应用选择合适的平台。当前和未来对云计算的这种多样性的使用要求非常高。因此,我们对无服务器云计算进行了详细的研究,以满足需求。无服务器防止了大量不必要的电力消耗,并且是一种随用随付的服务。该技术对软件和应用程序开发产生了巨大的影响。尽管该开发领域的主要障碍是没有足够的文档说明大公司如何提供该设施以及如何构建其体系结构。文献中缺乏对这一多样化平台的比较研究。因此,我们的研究是基于按需无服务器用例和必要措施的比较研究。这对于进一步的无服务器实现来说是有效和高效的。因此,我们和其他人可以跟随我们的研究来理解技术的复杂性。
{"title":"Feature and Performance Based Comparative Study on Serverless Frameworks","authors":"Sabiha Nasrin, T.I.M. Fahim Sahryer, A. B. M. A. Al Islam, Jannatun Noor","doi":"10.1109/ICCIT54785.2021.9689779","DOIUrl":"https://doi.org/10.1109/ICCIT54785.2021.9689779","url":null,"abstract":"We conduct experiments on the different public clouds provided in this paper to bring out a comparative study, to help the developers understand the diversity of the platforms. This study will help them choose a suitable platform for their desired application. The contemporary and future usage of this diverse nature of cloud computing is hugely demanding. Thus, we elaborate our study on serverless cloud computing to suffice the demand. Serverless prevents a great deal of unessential consumption of power and is a pay-as-you-go service. This technology has added a great impact on software and application development. Although the major obstacle to this development field is that there is not enough documentation on how the big companies provide this facility and how their architecture is built. The comparative study on this diverse platform is missing in the literature. Therefore, our research is based on the on-demand serverless use cases and comparative study with necessary measures. This can be effective and efficient to use for further serverless implementation. Hence, we and others can follow our research for understanding the technical complexity.","PeriodicalId":166450,"journal":{"name":"2021 24th International Conference on Computer and Information Technology (ICCIT)","volume":"204 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133095510","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Assessment of Rehabilitation Exercises from Depth Sensor Data 从深度传感器数据评估康复训练
Pub Date : 2021-12-18 DOI: 10.1109/ICCIT54785.2021.9689826
Shehzan Haider Chowdhury, Murshed Al Amin, A. M. Rahman, M. A. Amin, A. Ali
Assessing the rehabilitation exercises are essential in the recovery and treatment of various musculoskeletal conditions following surgery. According to reports, over 90% of all rehabilitative exercise sessions are conducted in a home environment. As the number of patients grows, this method becomes prohibitively expensive. Providing technology support for home-based rehabilitation is an excellent approach to address this. The patient remains at home and does the exercises in front of the camera, with the footage or data being sent to the physician for comments on the exercises. In this paper, we propose two machine learning-based models to assess the quality of exercises where the data is captured by such kinect 3D sensors. The proposed models consist of a long short-term memory(LSTM) network which uses the time series skeletal data to predict the quality of the exercises. The first model uses the predefined features proposed by the physicians. For the second model, we extract features using graph convolutional network(GCN) on the skeletal data where each node represents a body part or joint in the body and the edges represent the connection between the body parts. We conclude that LSTM is more accurate at predicting the results when GCN features are used.
评估康复训练在手术后各种肌肉骨骼疾病的恢复和治疗中是必不可少的。据报道,超过90%的康复训练是在家庭环境中进行的。随着患者数量的增加,这种方法变得非常昂贵。为家庭康复提供技术支持是解决这一问题的极好方法。病人呆在家里,在摄像头前做练习,录像或数据会被发送给医生,让医生对练习做出评价。在本文中,我们提出了两个基于机器学习的模型来评估由kinect 3D传感器捕获的数据的练习质量。该模型由一个长短期记忆(LSTM)网络组成,该网络使用时间序列骨骼数据来预测练习的质量。第一个模型使用医生提出的预定义特征。对于第二个模型,我们在骨骼数据上使用图卷积网络(GCN)提取特征,其中每个节点代表身体的一个部位或关节,边缘代表身体部位之间的连接。我们得出结论,当使用GCN特征时,LSTM在预测结果方面更准确。
{"title":"Assessment of Rehabilitation Exercises from Depth Sensor Data","authors":"Shehzan Haider Chowdhury, Murshed Al Amin, A. M. Rahman, M. A. Amin, A. Ali","doi":"10.1109/ICCIT54785.2021.9689826","DOIUrl":"https://doi.org/10.1109/ICCIT54785.2021.9689826","url":null,"abstract":"Assessing the rehabilitation exercises are essential in the recovery and treatment of various musculoskeletal conditions following surgery. According to reports, over 90% of all rehabilitative exercise sessions are conducted in a home environment. As the number of patients grows, this method becomes prohibitively expensive. Providing technology support for home-based rehabilitation is an excellent approach to address this. The patient remains at home and does the exercises in front of the camera, with the footage or data being sent to the physician for comments on the exercises. In this paper, we propose two machine learning-based models to assess the quality of exercises where the data is captured by such kinect 3D sensors. The proposed models consist of a long short-term memory(LSTM) network which uses the time series skeletal data to predict the quality of the exercises. The first model uses the predefined features proposed by the physicians. For the second model, we extract features using graph convolutional network(GCN) on the skeletal data where each node represents a body part or joint in the body and the edges represent the connection between the body parts. We conclude that LSTM is more accurate at predicting the results when GCN features are used.","PeriodicalId":166450,"journal":{"name":"2021 24th International Conference on Computer and Information Technology (ICCIT)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115608225","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
The Eigenvalue Distribution of Hankel Matrix: A Tool for Spectral Estimation From Noisy Data 汉克尔矩阵的特征值分布:一种从噪声数据中估计频谱的工具
Pub Date : 2021-12-18 DOI: 10.1109/ICCIT54785.2021.9689845
Ahsanul Islam, Md Rakibul Hasan, Md. Zakir Hossain, M. Hasan
One of the key challenges of digital signal processing is to estimate sinusoidal components of an unknown signal. Researchers and engineers have been adopting various methods to analyze noisy signals and extract essential features of a given signal. Singular spectrum analysis (SSA) has been a popular and effective tool for extracting sinusoidal components of an unknown noisy signal. The process of singular spectrum analysis includes embedding time series into a Hankel matrix. The eigenvalue distribution of the Hankel matrix exhibits significant properties that can be used to estimate an unknown signal’s rhythmic components and frequency response. This paper proposes a method that utilizes the Hankel matrix’s eigenvalue distribution to estimate sinusoidal components from the frequency spectrum of a noisy signal. Firstly, an autoregressive (AR) model has been utilized for simulating time series employed to observe eigenvalue distributions and frequency spectrum. Nevertheless, the approach has been tested on real-life speech data to prove the applicability of the proposed mechanism on spectral estimation. Overall, results on both simulated and real data confirm the acceptability of the proposed method. This study suggests that eigenvalue distribution can be a helpful tool for estimating the frequency response of an unknown time series. Since the autoregressive model can be used to model various real-life data analyses, this study on eigenvalue distribution and frequency spectrum can be utilized in those real-life data. This approach will help estimate frequency response and identify rhythmic components of an unknown time series based on eigenvalue distribution.
数字信号处理的关键挑战之一是估计未知信号的正弦分量。研究人员和工程师一直在采用各种方法来分析噪声信号并提取给定信号的基本特征。奇异谱分析(SSA)是一种从未知噪声信号中提取正弦分量的有效方法。奇异谱分析过程包括将时间序列嵌入到汉克尔矩阵中。汉克尔矩阵的特征值分布具有重要的性质,可用于估计未知信号的节奏成分和频率响应。本文提出了一种利用汉克尔矩阵的特征值分布从噪声信号的频谱中估计正弦分量的方法。首先,利用自回归(AR)模型对观测特征值分布和频谱的时间序列进行模拟。尽管如此,该方法已在实际语音数据上进行了测试,以证明所提出的机制在频谱估计上的适用性。总体而言,仿真和实际数据的结果证实了所提方法的可接受性。本研究表明,特征值分布可以作为估计未知时间序列频率响应的有用工具。由于自回归模型可用于模拟各种实际数据分析,因此本研究的特征值分布和频谱可用于这些实际数据。这种方法将有助于估计频率响应和识别基于特征值分布的未知时间序列的节奏分量。
{"title":"The Eigenvalue Distribution of Hankel Matrix: A Tool for Spectral Estimation From Noisy Data","authors":"Ahsanul Islam, Md Rakibul Hasan, Md. Zakir Hossain, M. Hasan","doi":"10.1109/ICCIT54785.2021.9689845","DOIUrl":"https://doi.org/10.1109/ICCIT54785.2021.9689845","url":null,"abstract":"One of the key challenges of digital signal processing is to estimate sinusoidal components of an unknown signal. Researchers and engineers have been adopting various methods to analyze noisy signals and extract essential features of a given signal. Singular spectrum analysis (SSA) has been a popular and effective tool for extracting sinusoidal components of an unknown noisy signal. The process of singular spectrum analysis includes embedding time series into a Hankel matrix. The eigenvalue distribution of the Hankel matrix exhibits significant properties that can be used to estimate an unknown signal’s rhythmic components and frequency response. This paper proposes a method that utilizes the Hankel matrix’s eigenvalue distribution to estimate sinusoidal components from the frequency spectrum of a noisy signal. Firstly, an autoregressive (AR) model has been utilized for simulating time series employed to observe eigenvalue distributions and frequency spectrum. Nevertheless, the approach has been tested on real-life speech data to prove the applicability of the proposed mechanism on spectral estimation. Overall, results on both simulated and real data confirm the acceptability of the proposed method. This study suggests that eigenvalue distribution can be a helpful tool for estimating the frequency response of an unknown time series. Since the autoregressive model can be used to model various real-life data analyses, this study on eigenvalue distribution and frequency spectrum can be utilized in those real-life data. This approach will help estimate frequency response and identify rhythmic components of an unknown time series based on eigenvalue distribution.","PeriodicalId":166450,"journal":{"name":"2021 24th International Conference on Computer and Information Technology (ICCIT)","volume":"28 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114180163","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Classification of Food Reviews from Bengali Text using LSTM 基于LSTM的孟加拉语食品评论分类
Pub Date : 2021-12-18 DOI: 10.1109/ICCIT54785.2021.9689847
Md. Muhaiminul Islam, Tazrina Haque Mohana, Lamia Rukhsara
People of this modern era are very much dependable on online reviews when it is the matter of purchasing any product. It is vital to bring out information from the huge amount of accessible text reviews. People of almost every age often visit restaurants. In today’s world food review is the fundamental requirement for visiting restaurants. But selecting a restaurant based on reviews is not quite an easy task. Deciding whether a food is worth having or not can be difficult. Several factors including the price, quality, taste, quantity can influence the actual worth of a food. From the perspective of a consumer, it is a dilemma to select a food appropriately. Food quality prediction can be a challenging task due to the high number of reviews that should be considered for the accurate prediction. Most people nowadays select restaurants based on their preferred food’s review. But the reviews present on the social platforms are mostly broad. People don’t find it useful to read the whole review. Therefore, a model which is capable of accepting reviews as input and is able to predict the food quality as output can become a great solution to this problem. So in this study, we have introduced a method which will be able to classify long Bengali food reviews into precise classes such as Good, Bad and Best using LSTM. The whole dataset which was used in our experiment has been collected from Facebook food review groups. Among them 80% was used for model training and 20% data used for the validation. Our model was able to classify 1000 Bengali review with 98% training and 80% validation accuracy.
这个现代时代的人们在购买任何产品时都非常依赖在线评论。从大量可访问的文本评论中获取信息至关重要。几乎每个年龄段的人都经常去餐馆。在当今世界,美食评论是参观餐馆的基本要求。但根据评论来选择餐厅并不是一件容易的事。决定一种食物是否值得拥有是很困难的。包括价格、质量、味道、数量在内的几个因素都会影响食物的实际价值。从消费者的角度来看,选择合适的食物是一个两难的选择。食品质量预测可能是一项具有挑战性的任务,因为需要考虑大量的评论才能进行准确的预测。现在大多数人根据他们喜欢的食物的评价来选择餐馆。但社交平台上的评论大多是广泛的。人们不认为阅读整个评论是有用的。因此,一个能够接受评论作为输入,并能够预测食品质量作为输出的模型可以成为解决这个问题的一个很好的方法。因此,在本研究中,我们引入了一种方法,该方法将能够使用LSTM将长孟加拉食品评论分类为精确的类,如Good, Bad和Best。在我们的实验中使用的整个数据集是从Facebook的食物评论群中收集的。其中80%用于模型训练,20%用于验证。我们的模型能够以98%的训练和80%的验证准确率对1000篇孟加拉语评论进行分类。
{"title":"Classification of Food Reviews from Bengali Text using LSTM","authors":"Md. Muhaiminul Islam, Tazrina Haque Mohana, Lamia Rukhsara","doi":"10.1109/ICCIT54785.2021.9689847","DOIUrl":"https://doi.org/10.1109/ICCIT54785.2021.9689847","url":null,"abstract":"People of this modern era are very much dependable on online reviews when it is the matter of purchasing any product. It is vital to bring out information from the huge amount of accessible text reviews. People of almost every age often visit restaurants. In today’s world food review is the fundamental requirement for visiting restaurants. But selecting a restaurant based on reviews is not quite an easy task. Deciding whether a food is worth having or not can be difficult. Several factors including the price, quality, taste, quantity can influence the actual worth of a food. From the perspective of a consumer, it is a dilemma to select a food appropriately. Food quality prediction can be a challenging task due to the high number of reviews that should be considered for the accurate prediction. Most people nowadays select restaurants based on their preferred food’s review. But the reviews present on the social platforms are mostly broad. People don’t find it useful to read the whole review. Therefore, a model which is capable of accepting reviews as input and is able to predict the food quality as output can become a great solution to this problem. So in this study, we have introduced a method which will be able to classify long Bengali food reviews into precise classes such as Good, Bad and Best using LSTM. The whole dataset which was used in our experiment has been collected from Facebook food review groups. Among them 80% was used for model training and 20% data used for the validation. Our model was able to classify 1000 Bengali review with 98% training and 80% validation accuracy.","PeriodicalId":166450,"journal":{"name":"2021 24th International Conference on Computer and Information Technology (ICCIT)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122581902","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Identifying Author in Bengali Literature by Bi-LSTM with Attention Mechanism 基于注意机制的Bi-LSTM识别孟加拉文学作者
Pub Date : 2021-12-18 DOI: 10.1109/ICCIT54785.2021.9689840
Ibrahim Al Azhar, Sohel Ahmed, Md Saiful Islam, Aisha Khatun
Authorship Attribution is the task of determining the author of an unknown text using one’s writing patterns. It is a well-established task for high-resource languages like English, but it is challenging for low-resource languages like Bengali. In this paper, we propose a Bi-directional Long Short Term Memory(Bi-LSTM) model with self-attention mechanism to address this problem. GloVe embedding vectors encode the semantic and syntactic knowledge of words, which are then fed into the Bi-LSTM models. Moreover, attention mechanism enhances the model’s ability to learn the complex linguistics patterns through learnable parameters, which gives lower weights to common words and higher weights to keywords that capture an author’s stylistic components. It improves performance extract contextual features. We evaluate our model on multiple datasets and experiment with various architectures. Our proposed model outperforms the state-of-the-art model by 12.14%-20.24% in the BAAD6 author dataset, 1.05% - 7.34% in the BAAD16 author dataset, with best performance accuracy of 97.99%. The experimental results demonstrate that the Bi-LSTM model’s attention mechanism notably boosts performance. (The source code are shared as free tools at https://github.com/IbrahimAlAzhar/AuthorshipAttribution)
作者归属是使用一个人的写作模式来确定未知文本的作者的任务。对于像英语这样资源丰富的语言来说,这是一个既定的任务,但对于像孟加拉语这样资源贫乏的语言来说,这是一个挑战。本文提出了一种具有自注意机制的双向长短期记忆模型来解决这一问题。GloVe嵌入向量对单词的语义和句法知识进行编码,然后将其输入到Bi-LSTM模型中。此外,注意机制通过可学习的参数增强了模型学习复杂语言模式的能力,对常用词赋予较低的权重,对捕捉作者文体成分的关键词赋予较高的权重。它提高了提取上下文特性的性能。我们在多个数据集上评估我们的模型,并在各种架构上进行实验。该模型在BAAD6作者数据集上的性能优于现有模型12.14% ~ 20.24%,在BAAD16作者数据集上的性能优于现有模型1.05% ~ 7.34%,最佳性能准确率为97.99%。实验结果表明,Bi-LSTM模型的注意机制显著提高了性能。(源代码作为免费工具在https://github.com/IbrahimAlAzhar/AuthorshipAttribution上共享)
{"title":"Identifying Author in Bengali Literature by Bi-LSTM with Attention Mechanism","authors":"Ibrahim Al Azhar, Sohel Ahmed, Md Saiful Islam, Aisha Khatun","doi":"10.1109/ICCIT54785.2021.9689840","DOIUrl":"https://doi.org/10.1109/ICCIT54785.2021.9689840","url":null,"abstract":"Authorship Attribution is the task of determining the author of an unknown text using one’s writing patterns. It is a well-established task for high-resource languages like English, but it is challenging for low-resource languages like Bengali. In this paper, we propose a Bi-directional Long Short Term Memory(Bi-LSTM) model with self-attention mechanism to address this problem. GloVe embedding vectors encode the semantic and syntactic knowledge of words, which are then fed into the Bi-LSTM models. Moreover, attention mechanism enhances the model’s ability to learn the complex linguistics patterns through learnable parameters, which gives lower weights to common words and higher weights to keywords that capture an author’s stylistic components. It improves performance extract contextual features. We evaluate our model on multiple datasets and experiment with various architectures. Our proposed model outperforms the state-of-the-art model by 12.14%-20.24% in the BAAD6 author dataset, 1.05% - 7.34% in the BAAD16 author dataset, with best performance accuracy of 97.99%. The experimental results demonstrate that the Bi-LSTM model’s attention mechanism notably boosts performance. (The source code are shared as free tools at https://github.com/IbrahimAlAzhar/AuthorshipAttribution)","PeriodicalId":166450,"journal":{"name":"2021 24th International Conference on Computer and Information Technology (ICCIT)","volume":"182 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127545219","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Exploring Spectral and Spatial Features Using a Hybrid Approach Combining Stacked AutoEncoder and a Novel Convolutional Neural Network for Hyperspectral Image Classification 基于堆叠自编码器和卷积神经网络的高光谱图像分类混合方法研究光谱和空间特征
Pub Date : 2021-12-18 DOI: 10.1109/ICCIT54785.2021.9689851
Md. Rakibul Haque, Azmain Yakin Srizon, Md. Al Mamun
With the introduction of high-resolution hyperspectral sensors, hyperspectral images have become one of the paramount mediums of collecting information from remote places. Owing to the enormous dimension of spectral bands and high correlation between the bands, proper classification of hyperspectral images suffers seriously. Furthermore, proper exploration of merged spectral and spatial features remains challenging for traditional approaches. Keeping the above challenges in mind, we have proposed a properly tuned Stacked AutoEncoder(SAE) and a novel Convolutional neural network (CNN) architecture that simultaneously considers the output of all the convolutional blocks. We have used a benchmark hyperspectral dataset called KSC center. Experimental results have shown that our method has achieved an average accuracy of 99.50%, surpassing other state-of-the-art approaches significantly.
随着高分辨率高光谱传感器的引入,高光谱图像已成为远程信息采集的重要媒介之一。由于光谱波段的维数巨大,波段之间的相关性高,给高光谱图像的分类带来了很大的困难。此外,对合并的光谱和空间特征的适当探索仍然是传统方法的挑战。考虑到上述挑战,我们提出了一种适当调整的堆叠自动编码器(SAE)和一种新的卷积神经网络(CNN)架构,该架构同时考虑所有卷积块的输出。我们使用了一个叫做KSC中心的基准高光谱数据集。实验结果表明,我们的方法达到了99.50%的平均准确率,大大超过了其他最先进的方法。
{"title":"Exploring Spectral and Spatial Features Using a Hybrid Approach Combining Stacked AutoEncoder and a Novel Convolutional Neural Network for Hyperspectral Image Classification","authors":"Md. Rakibul Haque, Azmain Yakin Srizon, Md. Al Mamun","doi":"10.1109/ICCIT54785.2021.9689851","DOIUrl":"https://doi.org/10.1109/ICCIT54785.2021.9689851","url":null,"abstract":"With the introduction of high-resolution hyperspectral sensors, hyperspectral images have become one of the paramount mediums of collecting information from remote places. Owing to the enormous dimension of spectral bands and high correlation between the bands, proper classification of hyperspectral images suffers seriously. Furthermore, proper exploration of merged spectral and spatial features remains challenging for traditional approaches. Keeping the above challenges in mind, we have proposed a properly tuned Stacked AutoEncoder(SAE) and a novel Convolutional neural network (CNN) architecture that simultaneously considers the output of all the convolutional blocks. We have used a benchmark hyperspectral dataset called KSC center. Experimental results have shown that our method has achieved an average accuracy of 99.50%, surpassing other state-of-the-art approaches significantly.","PeriodicalId":166450,"journal":{"name":"2021 24th International Conference on Computer and Information Technology (ICCIT)","volume":"53 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124107765","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Toxic Gas Sensor and Temperature Monitoring in Industries using Internet of Things (IoT) 使用物联网(IoT)的工业有毒气体传感器和温度监测
Pub Date : 2021-12-18 DOI: 10.1109/ICCIT54785.2021.9689802
Sayeda Islam Nahid, Mohammad Monirujjaman Khan
This paper suggests the design and execution of a Toxic gas sensor and temperature monitoring device using the Internet of Things (IoT). For many years, people have experienced severe health problems such as weariness, dizziness, shortness of breath, respiratory disorders, and even death as a result of hazardous gas exposure in the workplace. Working areas that employ a variety of chemicals, such as the textile and garment industry, and sites such as mines have poisonous fumes in the air that are detrimental to the human body. Furthermore, extremes in temperature can weaken both the body and the intellect, causing them to lose focus. As a result, it is critical to understand the to improve safety, reduce the amount of harmful gas in the surrounding air. This device detects dangerous chemicals in the air, including as methane, hydrogen, and carbon monoxide. It also keeps track of the ambient temperature. Excessive concentrations of poisonous gas in the air, as well as unusually hot or low temperatures, will set off the alarm and inform anyone nearby by audio and visual clues. The technology also refreshes the data on the web server and mobile app, allowing users to access it from any location in the world at any time.
本文提出了一种使用物联网(IoT)的有毒气体传感器和温度监测设备的设计和执行。多年来,由于在工作场所接触有害气体,人们经历了严重的健康问题,如疲劳、头晕、呼吸短促、呼吸系统疾病,甚至死亡。使用各种化学品的工作区域,如纺织和服装工业,以及矿山等场所的空气中有对人体有害的有毒烟雾。此外,极端的温度会削弱身体和智力,使他们失去注意力。因此,了解如何提高安全性,减少周围空气中有害气体的数量至关重要。这种装置可以检测空气中的危险化学物质,包括甲烷、氢和一氧化碳。它还能记录环境温度。空气中有毒气体浓度过高,以及异常高温或低温,都会触发警报,并通过声音和视觉线索通知附近的任何人。该技术还刷新了网络服务器和移动应用程序上的数据,允许用户从世界上任何地点随时访问这些数据。
{"title":"Toxic Gas Sensor and Temperature Monitoring in Industries using Internet of Things (IoT)","authors":"Sayeda Islam Nahid, Mohammad Monirujjaman Khan","doi":"10.1109/ICCIT54785.2021.9689802","DOIUrl":"https://doi.org/10.1109/ICCIT54785.2021.9689802","url":null,"abstract":"This paper suggests the design and execution of a Toxic gas sensor and temperature monitoring device using the Internet of Things (IoT). For many years, people have experienced severe health problems such as weariness, dizziness, shortness of breath, respiratory disorders, and even death as a result of hazardous gas exposure in the workplace. Working areas that employ a variety of chemicals, such as the textile and garment industry, and sites such as mines have poisonous fumes in the air that are detrimental to the human body. Furthermore, extremes in temperature can weaken both the body and the intellect, causing them to lose focus. As a result, it is critical to understand the to improve safety, reduce the amount of harmful gas in the surrounding air. This device detects dangerous chemicals in the air, including as methane, hydrogen, and carbon monoxide. It also keeps track of the ambient temperature. Excessive concentrations of poisonous gas in the air, as well as unusually hot or low temperatures, will set off the alarm and inform anyone nearby by audio and visual clues. The technology also refreshes the data on the web server and mobile app, allowing users to access it from any location in the world at any time.","PeriodicalId":166450,"journal":{"name":"2021 24th International Conference on Computer and Information Technology (ICCIT)","volume":"13 1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124275278","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2021 24th International Conference on Computer and Information Technology (ICCIT)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1