首页 > 最新文献

2022 25th International Conference on Computer and Information Technology (ICCIT)最新文献

英文 中文
Interpretable Garment Workers’ Productivity Prediction in Bangladesh Using Machine Learning Algorithms and Explainable AI 使用机器学习算法和可解释的人工智能预测孟加拉国服装工人的生产率
Pub Date : 2022-12-17 DOI: 10.1109/ICCIT57492.2022.10054863
Hasibul Hasan Sabuj, Nigar Sultana Nuha, Paul Richie Gomes, Aiman Lameesa, Md. Ashraful Alam
Bangladesh’s garment industry is widely recognized and plays a significant role in the current global market. The nation’s per capita income and citizens’ living standards have risen significantly with the noteworthy hard work performed by the employees in this industry. The garment sector is more efficient once the target production can be achieved without any difficulties. But a frequent issue that comprises within this industry is, often the actual garment producing productivity of the people working there do not reach the previously determined target-productivity. The business suffers a significant loss when the productivity gap appears in this process. This approach seeks to address this issue by prediction of the actual productivity of the workers. To attain this goal, a machine learning approach is suggested for the productivity prediction of the employees, after experimentation with five machine learning models. The proposed approach displays a reassuring level of prediction accuracy, with a minimalist MAE (Mean Absolute Error) of 0.072, which is less than the existing Deep Learning model with a MAE of 0.086. This indicates that, application of this process can play a vital role in setting an accurate target production which might lead to more profit and production in the sector. Also, this work contains an explainable AI technique named SHAP for interpreting the model in order to see further information within it.
孟加拉国的服装业得到广泛认可,在当前的全球市场上发挥着重要作用。这个行业的员工付出了值得注意的辛勤劳动,国家的人均收入和人民的生活水平有了显著的提高。一旦目标生产可以毫无困难地实现,服装行业的效率就会提高。但是这个行业中经常出现的一个问题是,在那里工作的人的实际服装生产生产率往往达不到先前确定的目标生产率。在此过程中出现生产力差距,企业将遭受重大损失。这种方法试图通过预测工人的实际生产率来解决这个问题。为了实现这一目标,在对五种机器学习模型进行实验后,提出了一种用于员工生产力预测的机器学习方法。所提出的方法显示出令人放心的预测精度水平,最低MAE(平均绝对误差)为0.072,低于现有深度学习模型的0.086。这表明,应用这一过程可以在设定准确的目标产量方面发挥至关重要的作用,这可能会导致该部门更多的利润和产量。此外,这项工作包含一种可解释的AI技术,名为SHAP,用于解释模型,以便查看其中的进一步信息。
{"title":"Interpretable Garment Workers’ Productivity Prediction in Bangladesh Using Machine Learning Algorithms and Explainable AI","authors":"Hasibul Hasan Sabuj, Nigar Sultana Nuha, Paul Richie Gomes, Aiman Lameesa, Md. Ashraful Alam","doi":"10.1109/ICCIT57492.2022.10054863","DOIUrl":"https://doi.org/10.1109/ICCIT57492.2022.10054863","url":null,"abstract":"Bangladesh’s garment industry is widely recognized and plays a significant role in the current global market. The nation’s per capita income and citizens’ living standards have risen significantly with the noteworthy hard work performed by the employees in this industry. The garment sector is more efficient once the target production can be achieved without any difficulties. But a frequent issue that comprises within this industry is, often the actual garment producing productivity of the people working there do not reach the previously determined target-productivity. The business suffers a significant loss when the productivity gap appears in this process. This approach seeks to address this issue by prediction of the actual productivity of the workers. To attain this goal, a machine learning approach is suggested for the productivity prediction of the employees, after experimentation with five machine learning models. The proposed approach displays a reassuring level of prediction accuracy, with a minimalist MAE (Mean Absolute Error) of 0.072, which is less than the existing Deep Learning model with a MAE of 0.086. This indicates that, application of this process can play a vital role in setting an accurate target production which might lead to more profit and production in the sector. Also, this work contains an explainable AI technique named SHAP for interpreting the model in order to see further information within it.","PeriodicalId":255498,"journal":{"name":"2022 25th International Conference on Computer and Information Technology (ICCIT)","volume":"90 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116648934","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Bangla Handwritten Digit Recognition using RNN-CNN Hybrid Approach 使用RNN-CNN混合方法的孟加拉语手写数字识别
Pub Date : 2022-12-17 DOI: 10.1109/ICCIT57492.2022.10055089
A. Hasib Uddin, Joygun Khatun, Mehera Afroz Meghna, Prince Mahmud
The automatic recognition of handwritten English material has seen a lot of progress. However, research on automatic Bangla handwriting numerals recognition is far behind. Even the most effective recognizers now in use do not produce an adequate performance for real-world applications. This paper suggested a strategy based on deep neural networks. In this paper, we have used the BanglaLekha-Isolated handwriting dataset along with ResNet50 and DensNet201 models for benchmarking process. Then we proposed two new models one is a Gated Recurrent Unit (GRU) based and another one is a Hybrid of Convolutional Neural Network (CNN) and Convolutional Long Short-term Memory (ConvLSTM). As for our proposed GRU model it performs closely to the DensNet201 and REsNet50 models while requiring very few parameters compared to these two models. On the other hand, our proposed Hybrid ConvLSTM model outperforms both of the aforementioned benchmarking models. Finally, we have developed a new Bangla Handwriting Numerical dataset containing a total of seven thousand training, one thousand validation, and two thousand test images. Our proposed best-performing model (Hybrid ConvLSTM) achieves 98.84% accuracy in the test data of our dataset while the GRU model gained 91.33% test accuracy without any help of image preprocessing steps.
手写英语材料的自动识别已经取得了很大的进展。然而,对孟加拉文手写数字自动识别的研究还远远落后。即使是目前使用的最有效的识别器也不能在实际应用中产生足够的性能。本文提出了一种基于深度神经网络的策略。在本文中,我们使用BanglaLekha-Isolated手写数据集以及ResNet50和DensNet201模型进行基准测试过程。然后我们提出了两个新的模型,一个是基于门控循环单元(GRU)的模型,另一个是卷积神经网络(CNN)和卷积长短期记忆(ConvLSTM)的混合模型。对于我们提出的GRU模型,它的性能与DensNet201和REsNet50模型非常接近,而与这两个模型相比,它需要的参数很少。另一方面,我们提出的混合ConvLSTM模型优于上述两种基准测试模型。最后,我们开发了一个新的孟加拉语手写数字数据集,其中总共包含7000个训练图像、1000个验证图像和2000个测试图像。我们提出的最佳模型(Hybrid ConvLSTM)在我们数据集的测试数据中达到了98.84%的准确率,而GRU模型在没有任何图像预处理步骤的情况下获得了91.33%的测试准确率。
{"title":"Bangla Handwritten Digit Recognition using RNN-CNN Hybrid Approach","authors":"A. Hasib Uddin, Joygun Khatun, Mehera Afroz Meghna, Prince Mahmud","doi":"10.1109/ICCIT57492.2022.10055089","DOIUrl":"https://doi.org/10.1109/ICCIT57492.2022.10055089","url":null,"abstract":"The automatic recognition of handwritten English material has seen a lot of progress. However, research on automatic Bangla handwriting numerals recognition is far behind. Even the most effective recognizers now in use do not produce an adequate performance for real-world applications. This paper suggested a strategy based on deep neural networks. In this paper, we have used the BanglaLekha-Isolated handwriting dataset along with ResNet50 and DensNet201 models for benchmarking process. Then we proposed two new models one is a Gated Recurrent Unit (GRU) based and another one is a Hybrid of Convolutional Neural Network (CNN) and Convolutional Long Short-term Memory (ConvLSTM). As for our proposed GRU model it performs closely to the DensNet201 and REsNet50 models while requiring very few parameters compared to these two models. On the other hand, our proposed Hybrid ConvLSTM model outperforms both of the aforementioned benchmarking models. Finally, we have developed a new Bangla Handwriting Numerical dataset containing a total of seven thousand training, one thousand validation, and two thousand test images. Our proposed best-performing model (Hybrid ConvLSTM) achieves 98.84% accuracy in the test data of our dataset while the GRU model gained 91.33% test accuracy without any help of image preprocessing steps.","PeriodicalId":255498,"journal":{"name":"2022 25th International Conference on Computer and Information Technology (ICCIT)","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125383159","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Continuous Sign Language Interpretation to Text Using Deep Learning Models 使用深度学习模型的连续手语文本解释
Pub Date : 2022-12-17 DOI: 10.1109/ICCIT57492.2022.10054721
Afridi Ibn Rahman, Zebel-E.-Noor Akhand, Tasin Al Nahian Khan, Anirudh Sarda, Subhi Bhuiyan, Mma Rakib, Zubayer Ahmed Fahim, Indronil Kundu
The COVID-19 pandemic has obligated people to adopt the virtual lifestyle. Currently, the use of videoconferencing to conduct business meetings is prevalent owing to the numerous benefits it presents. However, a large number of people with speech impediment find themselves handicapped to the new normal as they cannot communicate their ideas effectively, especially in fast paced meetings. Therefore, this paper aims to introduce an enriched dataset using an action recognition method with the most common phrases translated into American Sign Language (ASL) that are routinely used in professional meetings. It further proposes a sign language detecting and classifying model employing deep learning architectures, namely, CNN and LSTM. The performances of these models are analysed by employing different performance metrics like accuracy, recall, F1- Score and Precision. CNN and LSTM models yield an accuracy of 93.75% and 96.54% respectively, after being trained with the dataset introduced in this study. Therefore, the incorporation of the LSTM model into different cloud services, virtual private networks and softwares will allow people with speech impairment to use sign language, which will automatically be translated into captions using moving camera circumstances in real time. This will in turn equip other people with the tool to understand and grasp the message that is being conveyed and easily discuss and effectuate the ideas.
新冠肺炎疫情迫使人们采用虚拟生活方式。目前,使用视频会议进行商务会议是普遍的,因为它提供了许多好处。然而,大量有语言障碍的人发现自己无法适应新常态,因为他们无法有效地表达自己的想法,尤其是在快节奏的会议中。因此,本文旨在引入一个丰富的数据集,使用一种动作识别方法,将最常见的短语翻译成专业会议中经常使用的美国手语(ASL)。在此基础上,提出了一种基于CNN和LSTM深度学习的手语检测与分类模型。通过采用不同的性能指标,如准确率、召回率、F1- Score和精度,分析了这些模型的性能。CNN和LSTM模型使用本文引入的数据集进行训练后,准确率分别为93.75%和96.54%。因此,将LSTM模型整合到不同的云服务、虚拟专用网络和软件中,将允许有语言障碍的人使用手语,这些手语将通过移动的摄像机环境实时自动翻译成字幕。这将反过来为其他人提供理解和掌握所传达的信息的工具,并容易讨论和实现这些想法。
{"title":"Continuous Sign Language Interpretation to Text Using Deep Learning Models","authors":"Afridi Ibn Rahman, Zebel-E.-Noor Akhand, Tasin Al Nahian Khan, Anirudh Sarda, Subhi Bhuiyan, Mma Rakib, Zubayer Ahmed Fahim, Indronil Kundu","doi":"10.1109/ICCIT57492.2022.10054721","DOIUrl":"https://doi.org/10.1109/ICCIT57492.2022.10054721","url":null,"abstract":"The COVID-19 pandemic has obligated people to adopt the virtual lifestyle. Currently, the use of videoconferencing to conduct business meetings is prevalent owing to the numerous benefits it presents. However, a large number of people with speech impediment find themselves handicapped to the new normal as they cannot communicate their ideas effectively, especially in fast paced meetings. Therefore, this paper aims to introduce an enriched dataset using an action recognition method with the most common phrases translated into American Sign Language (ASL) that are routinely used in professional meetings. It further proposes a sign language detecting and classifying model employing deep learning architectures, namely, CNN and LSTM. The performances of these models are analysed by employing different performance metrics like accuracy, recall, F1- Score and Precision. CNN and LSTM models yield an accuracy of 93.75% and 96.54% respectively, after being trained with the dataset introduced in this study. Therefore, the incorporation of the LSTM model into different cloud services, virtual private networks and softwares will allow people with speech impairment to use sign language, which will automatically be translated into captions using moving camera circumstances in real time. This will in turn equip other people with the tool to understand and grasp the message that is being conveyed and easily discuss and effectuate the ideas.","PeriodicalId":255498,"journal":{"name":"2022 25th International Conference on Computer and Information Technology (ICCIT)","volume":"52 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126585367","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
SMOTE Based Credit Card Fraud Detection Using Convolutional Neural Network 基于SMOTE的卷积神经网络信用卡欺诈检测
Pub Date : 2022-12-17 DOI: 10.1109/ICCIT57492.2022.10054727
Md. Nawab Yousuf Ali, Taniya Kabir, Noushin Laila Raka, Sanzida Siddikha Toma, Md. Lizur Rahman, J. Ferdaus
Nowadays, fraud correlated with credit cards became very prevalent since a lot of people use credit cards for buying goods and services. Because of e-commerce and technological advancement, most transactions are happening online, which is increasing the risk of fraudulent transactions and resulting in huge losses financially. Therefore, an effective detection technique, as the quickest prediction option, should be developed to deter fraud from propagating. This paper targeted to develop a deep learning (DL)-based model on SMOTE oversampling technique to predict the fraudulent transactions of credit cards. The system used three popular DL algorithms: Artificial Neural Network (ANN), Convolutional Neural Network (CNN), and Long Short-Term Memory Recurrent Neural Network (LSTM RNN), and measured the best performer in terms of evaluation metrics. However, the results confirm that the CNN algorithm outperformed both ANN and LSTM RNN. Additionally, compared to previous studies, our CNN fraud detection program recorded high rates of accuracy in identifying fraudulent activity. The system achieved an accuracy of 99.97%, precision of 99.94%, recall of 99.99%, and F1-Score of 99.96%. This proposed scheme can help to reduce financial loss by detecting credit card scams or frauds globally.
如今,由于许多人使用信用卡购买商品和服务,与信用卡相关的欺诈行为变得非常普遍。由于电子商务和技术的进步,大多数交易都是在网上进行的,这增加了欺诈交易的风险,并造成了巨大的经济损失。因此,应该开发一种有效的检测技术,作为最快的预测选择,以阻止欺诈行为的传播。本文旨在开发一种基于SMOTE过采样技术的深度学习模型来预测信用卡欺诈交易。该系统使用了三种流行的深度学习算法:人工神经网络(ANN)、卷积神经网络(CNN)和长短期记忆递归神经网络(LSTM RNN),并根据评估指标衡量了表现最好的算法。然而,结果证实,CNN算法优于ANN和LSTM RNN。此外,与之前的研究相比,我们的CNN欺诈检测程序在识别欺诈活动方面的准确率很高。系统的准确率为99.97%,精密度为99.94%,召回率为99.99%,F1-Score为99.96%。这个提议的方案可以通过检测信用卡诈骗或全球欺诈来帮助减少经济损失。
{"title":"SMOTE Based Credit Card Fraud Detection Using Convolutional Neural Network","authors":"Md. Nawab Yousuf Ali, Taniya Kabir, Noushin Laila Raka, Sanzida Siddikha Toma, Md. Lizur Rahman, J. Ferdaus","doi":"10.1109/ICCIT57492.2022.10054727","DOIUrl":"https://doi.org/10.1109/ICCIT57492.2022.10054727","url":null,"abstract":"Nowadays, fraud correlated with credit cards became very prevalent since a lot of people use credit cards for buying goods and services. Because of e-commerce and technological advancement, most transactions are happening online, which is increasing the risk of fraudulent transactions and resulting in huge losses financially. Therefore, an effective detection technique, as the quickest prediction option, should be developed to deter fraud from propagating. This paper targeted to develop a deep learning (DL)-based model on SMOTE oversampling technique to predict the fraudulent transactions of credit cards. The system used three popular DL algorithms: Artificial Neural Network (ANN), Convolutional Neural Network (CNN), and Long Short-Term Memory Recurrent Neural Network (LSTM RNN), and measured the best performer in terms of evaluation metrics. However, the results confirm that the CNN algorithm outperformed both ANN and LSTM RNN. Additionally, compared to previous studies, our CNN fraud detection program recorded high rates of accuracy in identifying fraudulent activity. The system achieved an accuracy of 99.97%, precision of 99.94%, recall of 99.99%, and F1-Score of 99.96%. This proposed scheme can help to reduce financial loss by detecting credit card scams or frauds globally.","PeriodicalId":255498,"journal":{"name":"2022 25th International Conference on Computer and Information Technology (ICCIT)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125172527","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Medical Text Extraction and Classification from Prescription Images 处方图像的医学文本提取与分类
Pub Date : 2022-12-17 DOI: 10.1109/ICCIT57492.2022.10055123
Abdullah Mohammad Sakib, Bilkis Jamal Ferdosi, S. Jahan, Kashfia Jashim
The right to health is one of the fundamental human rights. Every state is obliged to provide healthcare facilities to its population. In Bangladesh, the government is working hard to provide a better healthcare system, though the country needs to go a long way to have a unified healthcare system. There is a lack of a proper referral system in the country, and proper diagnosis is hindered due to a patient’s lack of medical history. In this paper, we propose a system that helps the patient to create a medical history from images of the prescriptions. Our system extracts and classifies data from an unstructured Bangladeshi medical prescription that can be used to create a repository of medical history. The proposed method works in four phases: phase I text localization and extraction from the images of prescriptions, phase II - classification of the extracted images, phase III - image to text conversion using OCR, and phase IV - classification of the text in four categories symptoms, medicines, diagnostic tests, and others. For image classification, we use a very deep convolutional network, VGG-16 and for text classification, we use the Bidirectional Encoder Representations from Transformers (BERT) model. Performance evaluation of the proposed system is very promising and the system can be used in any country like Bangladesh to facilitate better treatment.
健康权是一项基本人权。每个州都有义务为其人民提供医疗保健设施。在孟加拉国,政府正在努力提供更好的医疗保健系统,尽管该国要建立统一的医疗保健系统还有很长的路要走。该国缺乏适当的转诊系统,由于患者缺乏病史,妨碍了适当的诊断。在本文中,我们提出了一个系统,帮助病人从处方的图像创建一个病史。我们的系统从非结构化的孟加拉国医疗处方中提取数据并对其进行分类,这些数据可用于创建病史存储库。提出的方法分四个阶段工作:第一阶段-从处方图像中进行文本定位和提取;第二阶段-对提取的图像进行分类;第三阶段-使用OCR进行图像到文本的转换;第四阶段-将文本分为症状、药物、诊断测试和其他四类。对于图像分类,我们使用非常深的卷积网络VGG-16,对于文本分类,我们使用来自变形金刚(BERT)模型的双向编码器表示。对拟议系统的绩效评估非常有希望,该系统可以在孟加拉国等任何国家使用,以促进更好的治疗。
{"title":"Medical Text Extraction and Classification from Prescription Images","authors":"Abdullah Mohammad Sakib, Bilkis Jamal Ferdosi, S. Jahan, Kashfia Jashim","doi":"10.1109/ICCIT57492.2022.10055123","DOIUrl":"https://doi.org/10.1109/ICCIT57492.2022.10055123","url":null,"abstract":"The right to health is one of the fundamental human rights. Every state is obliged to provide healthcare facilities to its population. In Bangladesh, the government is working hard to provide a better healthcare system, though the country needs to go a long way to have a unified healthcare system. There is a lack of a proper referral system in the country, and proper diagnosis is hindered due to a patient’s lack of medical history. In this paper, we propose a system that helps the patient to create a medical history from images of the prescriptions. Our system extracts and classifies data from an unstructured Bangladeshi medical prescription that can be used to create a repository of medical history. The proposed method works in four phases: phase I text localization and extraction from the images of prescriptions, phase II - classification of the extracted images, phase III - image to text conversion using OCR, and phase IV - classification of the text in four categories symptoms, medicines, diagnostic tests, and others. For image classification, we use a very deep convolutional network, VGG-16 and for text classification, we use the Bidirectional Encoder Representations from Transformers (BERT) model. Performance evaluation of the proposed system is very promising and the system can be used in any country like Bangladesh to facilitate better treatment.","PeriodicalId":255498,"journal":{"name":"2022 25th International Conference on Computer and Information Technology (ICCIT)","volume":"46 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123584448","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Bengali Crime News Classification Based on Newspaper Headlines using NLP 基于新闻标题的孟加拉语犯罪新闻分类
Pub Date : 2022-12-17 DOI: 10.1109/ICCIT57492.2022.10055391
N. Khan, Md Shamiul Islam, Fuad Chowdhury, Abdur Samad Siham, Nazmus Sakib
In our daily lives, newspapers and online news portals have become ubiquitous. These provide us with information on global events. Of all the news available in newspapers, crime news is the most significant. People read this kind of news with sincerity and considerable curiosity. We read a lot of Bangla newspapers and news sources, but we didn’t find any news on crime that was categorized. Perhaps categorizing the Bangla crime news would be helpful for the readers. Therefore, we decided to work on Bengali crime news classification, which will have a big influence in the Bengali community. However, categorizing crime news from daily newspaper headlines is not an easy task for a human. In this paper, we introduced a practical model to automatically annotate crime news from Bengali newspaper headlines in 6 predetermined crimes. In order to accomplish this goal, we have used TF-IDF for extracting features with 8 different machine learning and language classifier models (SVM, Decision Tree,Random Forest, LSTM, Bi-LSTM, BERT etc) and got best result by Sagor Sarkar’s Bangla-Bert-Base. The experimental result with 6293 training and 1574 testing samples shows 90.15% accuracy. This research output and dataset can be utilized by enthusiasts for further research purposes like subsetting crimes, crime status or judgment analysis etc. Our dataset will be available upon request @https://tinyurl.com/5n7wwaek.
在我们的日常生活中,报纸和在线新闻门户已经无处不在。这些为我们提供了有关全球事件的信息。在报纸上的所有新闻中,犯罪新闻是最重要的。人们怀着真诚和极大的好奇心读到这类新闻。我们阅读了很多孟加拉报纸和新闻来源,但我们没有发现任何关于犯罪的新闻被分类。或许将孟加拉的犯罪新闻分类会对读者有所帮助。因此,我们决定致力于孟加拉犯罪新闻分类,这将在孟加拉社区产生很大的影响。然而,对人类来说,从每日报纸标题中对犯罪新闻进行分类并不是一件容易的事。在本文中,我们介绍了一个实用的模型来自动标注6个预定犯罪的孟加拉语报纸头条新闻。为了实现这一目标,我们使用TF-IDF使用8种不同的机器学习和语言分类器模型(SVM, Decision Tree,Random Forest, LSTM, Bi-LSTM, BERT等)提取特征,并通过Sagor Sarkar的banga - BERT - base获得了最好的结果。6293个训练样本和1574个测试样本的实验结果表明,准确率为90.15%。这个研究成果和数据集可以被爱好者用于进一步的研究目的,如细分犯罪、犯罪状态或判断分析等。我们的数据集将在请求@https://tinyurl.com/5n7wwaek时提供。
{"title":"Bengali Crime News Classification Based on Newspaper Headlines using NLP","authors":"N. Khan, Md Shamiul Islam, Fuad Chowdhury, Abdur Samad Siham, Nazmus Sakib","doi":"10.1109/ICCIT57492.2022.10055391","DOIUrl":"https://doi.org/10.1109/ICCIT57492.2022.10055391","url":null,"abstract":"In our daily lives, newspapers and online news portals have become ubiquitous. These provide us with information on global events. Of all the news available in newspapers, crime news is the most significant. People read this kind of news with sincerity and considerable curiosity. We read a lot of Bangla newspapers and news sources, but we didn’t find any news on crime that was categorized. Perhaps categorizing the Bangla crime news would be helpful for the readers. Therefore, we decided to work on Bengali crime news classification, which will have a big influence in the Bengali community. However, categorizing crime news from daily newspaper headlines is not an easy task for a human. In this paper, we introduced a practical model to automatically annotate crime news from Bengali newspaper headlines in 6 predetermined crimes. In order to accomplish this goal, we have used TF-IDF for extracting features with 8 different machine learning and language classifier models (SVM, Decision Tree,Random Forest, LSTM, Bi-LSTM, BERT etc) and got best result by Sagor Sarkar’s Bangla-Bert-Base. The experimental result with 6293 training and 1574 testing samples shows 90.15% accuracy. This research output and dataset can be utilized by enthusiasts for further research purposes like subsetting crimes, crime status or judgment analysis etc. Our dataset will be available upon request @https://tinyurl.com/5n7wwaek.","PeriodicalId":255498,"journal":{"name":"2022 25th International Conference on Computer and Information Technology (ICCIT)","volume":"91 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125533576","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Comparative Analysis of Interpretable Mushroom Classification using Several Machine Learning Models 几种机器学习模型对可解释蘑菇分类的比较分析
Pub Date : 2022-12-17 DOI: 10.1109/ICCIT57492.2022.10055555
M. Ahmed, S. Afrose, Ashik Adnan, Nazifa Khanom, Md Sabbir Hossain, Md Humaion Kabir Mehedi, Annajiat Alim Rasel
An excellent substitute for red meat, mushrooms are a rich, calorie-efficient source of protein, fiber, and antioxidants. Mushrooms may also be rich sources of potent medications. Therefore, it’s important to classify edible and poisonous mushrooms. An interpretable system for the identification of mushrooms is being developed using machine learning methods and Explainable Artificial Intelligence (XAI) models. The Mushroom dataset from the UC Irvine Machine Learning Repository was the one utilized in this study. Among the six ML models, Decision Tree, Random Forest, and KNN performed flawlessly in this dataset, achieving 100% accuracy. Whereas, SVM had a 98% accuracy rate, compared to 95% for Logistic Regression and 93% for Naive Bayes. The two XAI models SHAP (SHapley Additive exPlanations) and LIME (Local Interpretable Model Agnostic Explanation) were used to interpret the top three ML models.
蘑菇是红肉的极好替代品,富含蛋白质、纤维和抗氧化剂。蘑菇也可能是有效药物的丰富来源。因此,对食用蘑菇和有毒蘑菇进行分类是很重要的。利用机器学习方法和可解释的人工智能(XAI)模型,正在开发一种用于识别蘑菇的可解释系统。来自加州大学欧文分校机器学习库的蘑菇数据集是本研究中使用的数据集。在六个ML模型中,决策树、随机森林和KNN在该数据集中表现完美,准确率达到100%。然而,SVM的准确率为98%,而逻辑回归的准确率为95%,朴素贝叶斯的准确率为93%。两个XAI模型SHAP (SHapley Additive explanatory)和LIME (Local Interpretable Model Agnostic Explanation)被用来解释前三个ML模型。
{"title":"Comparative Analysis of Interpretable Mushroom Classification using Several Machine Learning Models","authors":"M. Ahmed, S. Afrose, Ashik Adnan, Nazifa Khanom, Md Sabbir Hossain, Md Humaion Kabir Mehedi, Annajiat Alim Rasel","doi":"10.1109/ICCIT57492.2022.10055555","DOIUrl":"https://doi.org/10.1109/ICCIT57492.2022.10055555","url":null,"abstract":"An excellent substitute for red meat, mushrooms are a rich, calorie-efficient source of protein, fiber, and antioxidants. Mushrooms may also be rich sources of potent medications. Therefore, it’s important to classify edible and poisonous mushrooms. An interpretable system for the identification of mushrooms is being developed using machine learning methods and Explainable Artificial Intelligence (XAI) models. The Mushroom dataset from the UC Irvine Machine Learning Repository was the one utilized in this study. Among the six ML models, Decision Tree, Random Forest, and KNN performed flawlessly in this dataset, achieving 100% accuracy. Whereas, SVM had a 98% accuracy rate, compared to 95% for Logistic Regression and 93% for Naive Bayes. The two XAI models SHAP (SHapley Additive exPlanations) and LIME (Local Interpretable Model Agnostic Explanation) were used to interpret the top three ML models.","PeriodicalId":255498,"journal":{"name":"2022 25th International Conference on Computer and Information Technology (ICCIT)","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125665015","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Feature Fusion Based Effective Brain Tumor Detection Approach Using MRI 基于特征融合的MRI有效脑肿瘤检测方法
Pub Date : 2022-12-17 DOI: 10.1109/ICCIT57492.2022.10055136
Farjana Parvin, Md. Al Mamun
Early identification of brain tumors greatly influences the clinical diagnosis process of a brain tumor patient. Therefore, this study suggests a brain tumor detection approach that merges deep and shallow features extracted from the brain MRI images in order to distinguish between non-tumor and tumor classes. We combine some pre-trained deep CNN architectures and the concept of transfer learning in our proposed framework to obtain high-level features from magnetic resonance images. Following the extraction, a Support Vector Machine classifier with radial basis function was used to evaluate the deep features. A deep feature vector is then created by combining the best three deep features that perform well on the SVM classifier. Even though deep features are crucial for classification, as the network becomes deeper, some low-level features might be lost. Therefore, a shallow network was intended to learn low-level information from the brain MRI. Deep and shallow features are then merged to compensate for the information loss. The fused feature vector is then employed, in order to train a support vector machine classifier. The experimental results were obtained on a publicly available dataset. Our proposed framework has achieved a high accuracy of 92.48% (with a precision of 93.64%, recall of 94.55%, and f1-score of 93.97%). The results also showed that utilizing this feature fusion enhances the performance of the classification framework and these results ensure the hypothesis that features fusion enables the compensation of low-level information lost. Moreover, our classification approach outperformed others when compared to state-of-the-art studies.
脑肿瘤的早期诊断对脑肿瘤患者的临床诊断有很大的影响。因此,本研究提出了一种脑肿瘤检测方法,该方法将从脑MRI图像中提取的深层和浅层特征合并,以区分非肿瘤和肿瘤类别。我们在我们提出的框架中结合了一些预训练的深度CNN架构和迁移学习的概念,以从磁共振图像中获得高级特征。提取后,采用径向基支持向量机分类器对深度特征进行评价。然后通过组合在SVM分类器上表现良好的三个最佳深度特征来创建深度特征向量。尽管深度特征对分类至关重要,但随着网络变得更深,一些低级特征可能会丢失。因此,浅层网络旨在从大脑MRI中学习低级信息。然后将深特征和浅特征合并以补偿信息损失。然后使用融合的特征向量来训练支持向量机分类器。实验结果是在一个公开的数据集上获得的。我们提出的框架达到了92.48%的高准确率(精密度为93.64%,召回率为94.55%,f1分数为93.97%)。结果还表明,利用这种特征融合提高了分类框架的性能,这些结果验证了特征融合能够补偿低级信息丢失的假设。此外,与最先进的研究相比,我们的分类方法优于其他方法。
{"title":"Feature Fusion Based Effective Brain Tumor Detection Approach Using MRI","authors":"Farjana Parvin, Md. Al Mamun","doi":"10.1109/ICCIT57492.2022.10055136","DOIUrl":"https://doi.org/10.1109/ICCIT57492.2022.10055136","url":null,"abstract":"Early identification of brain tumors greatly influences the clinical diagnosis process of a brain tumor patient. Therefore, this study suggests a brain tumor detection approach that merges deep and shallow features extracted from the brain MRI images in order to distinguish between non-tumor and tumor classes. We combine some pre-trained deep CNN architectures and the concept of transfer learning in our proposed framework to obtain high-level features from magnetic resonance images. Following the extraction, a Support Vector Machine classifier with radial basis function was used to evaluate the deep features. A deep feature vector is then created by combining the best three deep features that perform well on the SVM classifier. Even though deep features are crucial for classification, as the network becomes deeper, some low-level features might be lost. Therefore, a shallow network was intended to learn low-level information from the brain MRI. Deep and shallow features are then merged to compensate for the information loss. The fused feature vector is then employed, in order to train a support vector machine classifier. The experimental results were obtained on a publicly available dataset. Our proposed framework has achieved a high accuracy of 92.48% (with a precision of 93.64%, recall of 94.55%, and f1-score of 93.97%). The results also showed that utilizing this feature fusion enhances the performance of the classification framework and these results ensure the hypothesis that features fusion enables the compensation of low-level information lost. Moreover, our classification approach outperformed others when compared to state-of-the-art studies.","PeriodicalId":255498,"journal":{"name":"2022 25th International Conference on Computer and Information Technology (ICCIT)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120947690","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A 2D Convolution Neural Network Based Method for Human Emotion Classification from Speech Signal 基于二维卷积神经网络的语音情感分类方法
Pub Date : 2022-12-17 DOI: 10.1109/ICCIT57492.2022.10054811
Rakhi Rani Paul, S. Paul, Md. Ekramul Hamid
recognizing emotions from speech signals is one of the active research fields in the area of human information processing as well as man-machine interaction. Different persons have different emotions and altogether different ways of expressing them. In this paper, a 2D Convolutional Neural Network (CNN) based method is presented for human emotion classification. We consider RAVDESS and SAVEE datasets to evaluate the performance of the model. Initially, Mel-frequency cepstral coefficients MFCC features are extracted from the speech signals which are used for the training purpose. Here, we consider only forty (40) cepstrum coefficients per frame. The proposed 2D CNN model is trained to classify seven different emotional states (neutral, calm, happy, sad, angry, scared, disgust, surprised). We achieve 89.86% overall accuracy from our proposed model for the RAVDESS dataset and 83.57% for the SAVEE dataset respectively. It is found that happy class is classified with an accuracy of 96% for the RAVDESS dataset and 92% for the SAVEE dataset. Lastly, the result of our proposed model is compared with the other recent existing works. The performance of our proposed model is good enough because it achieves better accuracy than other models. This work has many real-life applications such as man-machine interaction, auto supervision, auxiliary lie detection, the discovery of dissatisfaction with the client’s mode, detecting neurological disordered patients and so on.
从语音信号中识别情感是人类信息处理和人机交互领域的研究热点之一。不同的人有不同的情绪,表达方式也完全不同。提出了一种基于二维卷积神经网络(CNN)的人类情感分类方法。我们考虑RAVDESS和SAVEE数据集来评估模型的性能。首先,从用于训练目的的语音信号中提取mel频率倒谱系数MFCC特征。这里,我们每帧只考虑40个倒谱系数。所提出的二维CNN模型被训练来分类七种不同的情绪状态(中性、平静、快乐、悲伤、愤怒、害怕、厌恶、惊讶)。我们提出的模型在RAVDESS数据集和SAVEE数据集上的总体准确率分别达到89.86%和83.57%。研究发现,RAVDESS数据集的快乐类分类准确率为96%,SAVEE数据集的准确率为92%。最后,将该模型的结果与其他近期已有的研究结果进行了比较。我们提出的模型的性能足够好,因为它达到了比其他模型更好的精度。这项工作在现实生活中有许多应用,如人机交互、自动监控、辅助测谎、发现对客户模式的不满、检测神经障碍患者等。
{"title":"A 2D Convolution Neural Network Based Method for Human Emotion Classification from Speech Signal","authors":"Rakhi Rani Paul, S. Paul, Md. Ekramul Hamid","doi":"10.1109/ICCIT57492.2022.10054811","DOIUrl":"https://doi.org/10.1109/ICCIT57492.2022.10054811","url":null,"abstract":"recognizing emotions from speech signals is one of the active research fields in the area of human information processing as well as man-machine interaction. Different persons have different emotions and altogether different ways of expressing them. In this paper, a 2D Convolutional Neural Network (CNN) based method is presented for human emotion classification. We consider RAVDESS and SAVEE datasets to evaluate the performance of the model. Initially, Mel-frequency cepstral coefficients MFCC features are extracted from the speech signals which are used for the training purpose. Here, we consider only forty (40) cepstrum coefficients per frame. The proposed 2D CNN model is trained to classify seven different emotional states (neutral, calm, happy, sad, angry, scared, disgust, surprised). We achieve 89.86% overall accuracy from our proposed model for the RAVDESS dataset and 83.57% for the SAVEE dataset respectively. It is found that happy class is classified with an accuracy of 96% for the RAVDESS dataset and 92% for the SAVEE dataset. Lastly, the result of our proposed model is compared with the other recent existing works. The performance of our proposed model is good enough because it achieves better accuracy than other models. This work has many real-life applications such as man-machine interaction, auto supervision, auxiliary lie detection, the discovery of dissatisfaction with the client’s mode, detecting neurological disordered patients and so on.","PeriodicalId":255498,"journal":{"name":"2022 25th International Conference on Computer and Information Technology (ICCIT)","volume":"117 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133788134","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Single Line Outage Analysis on IEEE 39 Bus Network IEEE 39总线网络的单线中断分析
Pub Date : 2022-12-17 DOI: 10.1109/ICCIT57492.2022.10055232
Chokder Alamgir Bartend Russell, Shahriar Khan
With the recent blackouts in Texas, Manhattan, and NE US, the importance of analyzing outage is greater than ever. Contingency Analysis is helpful to increase the resiliency of power system by analyzing impact of different contingencies. The effect of single transmission line outage in a transmission network has been studied with IEEE 39 bus network. Each of the branches has been disconnected one at a time to find effect on generator constraints, voltage constraints of buses, transmission line loading and possibility of islanding. PSS®E Xplore 34 was used for the simulation. The results show that single transmission line outage may impact generator power factors, increase demand of reactive power from the generators, and overload other transmission lines. Transmission line outage may lead to several violations of system constraints leading to islanding. This study may help research on different impacts of single transmission line outage and improve power system resilience.
随着最近德克萨斯州、曼哈顿和东北部的停电,分析停电的重要性比以往任何时候都要大。应急分析通过分析不同突发事件的影响,有助于提高电力系统的应变能力。利用ieee39总线网络,研究了输电网络中单线停电的影响。每个支路一次断开一个,以找出对发电机约束、母线电压约束、传输线负载和孤岛可能性的影响。采用PSS®E Xplore 34进行模拟。结果表明,单线停电会影响发电机功率因数,增加发电机的无功需求,并使其他输电线路过载。输电线路停运可能导致多次违反系统约束而导致孤岛。该研究有助于研究输电线路单次停电的不同影响,提高电力系统的恢复能力。
{"title":"Single Line Outage Analysis on IEEE 39 Bus Network","authors":"Chokder Alamgir Bartend Russell, Shahriar Khan","doi":"10.1109/ICCIT57492.2022.10055232","DOIUrl":"https://doi.org/10.1109/ICCIT57492.2022.10055232","url":null,"abstract":"With the recent blackouts in Texas, Manhattan, and NE US, the importance of analyzing outage is greater than ever. Contingency Analysis is helpful to increase the resiliency of power system by analyzing impact of different contingencies. The effect of single transmission line outage in a transmission network has been studied with IEEE 39 bus network. Each of the branches has been disconnected one at a time to find effect on generator constraints, voltage constraints of buses, transmission line loading and possibility of islanding. PSS®E Xplore 34 was used for the simulation. The results show that single transmission line outage may impact generator power factors, increase demand of reactive power from the generators, and overload other transmission lines. Transmission line outage may lead to several violations of system constraints leading to islanding. This study may help research on different impacts of single transmission line outage and improve power system resilience.","PeriodicalId":255498,"journal":{"name":"2022 25th International Conference on Computer and Information Technology (ICCIT)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131067596","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2022 25th International Conference on Computer and Information Technology (ICCIT)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1