首页 > 最新文献

2023 International Conference on Computer Science, Information Technology and Engineering (ICCoSITE)最新文献

英文 中文
Classification of Histopathological Images of Colon Cancer Using Convolutional Neural Network Method 基于卷积神经网络的结肠癌组织病理图像分类
Yus Kelana, S. Rizal, Sofia Saidah
Colon cancer is cancer with the most deaths in Indonesian society. Detection of disease through histopathological images of colon cancer still uses manual methods with readings by doctors. So it is necessary to do a system to detect and classify colon cancer. This study aims to create a colon cancer classification system to reduce the time in classifying the categories of colon cancer. In this study, a classification system for colon cancer was created into two classes, namely adenocarcinomas and polyps. Colon cancer data used in this study is data obtained online through the Kaggle website which consists of 2000 histopathological images measuring 768 pixels in jpeg format. The system is built using the Convolutional Neural Network (CNN) method with the MobileNet architecture. The design of this system is made by analyzing parameters that affect system performance based on the influence of image size, optimizer, learning rate, activation function, and batch size. Parameters used in evaluating system performance are accuracy, precision, recall, and f1-score. The results of testing the system based on parameters obtained the best model with image size 224x224 pixels, Adam optimizer, learning rate 0.0001, sigmoid activation function, and batch size 40. The best results of the best model are 100% accuracy value, 100% precision value, 100% recall value, and 100% f1-score with a loss of 0.000135.
结肠癌是印尼社会中死亡人数最多的癌症。通过结肠癌的组织病理学图像检测疾病仍然使用人工方法和医生的读数。因此,有必要建立一套结肠癌的检测和分类系统。本研究旨在建立结肠癌分类系统,减少结肠癌分类的时间。本研究将结肠癌划分为腺癌和息肉两类。本研究使用的结肠癌数据是通过Kaggle网站在线获取的数据,该数据由2000张768像素的jpeg格式组织病理学图像组成。该系统采用基于MobileNet架构的卷积神经网络(CNN)方法构建。根据图像大小、优化器、学习率、激活函数和批处理大小对系统性能的影响,分析了影响系统性能的参数,进行了系统设计。用于评估系统性能的参数包括准确率、精密度、召回率和f1-score。基于参数对系统进行测试,得到图像尺寸为224x224像素,Adam优化器,学习率为0.0001,sigmoid激活函数,批量大小为40的最佳模型。最佳模型的最佳结果为100%准确率值、100%精度值、100%召回值和100% f1-score,损失为0.000135。
{"title":"Classification of Histopathological Images of Colon Cancer Using Convolutional Neural Network Method","authors":"Yus Kelana, S. Rizal, Sofia Saidah","doi":"10.1109/ICCoSITE57641.2023.10127795","DOIUrl":"https://doi.org/10.1109/ICCoSITE57641.2023.10127795","url":null,"abstract":"Colon cancer is cancer with the most deaths in Indonesian society. Detection of disease through histopathological images of colon cancer still uses manual methods with readings by doctors. So it is necessary to do a system to detect and classify colon cancer. This study aims to create a colon cancer classification system to reduce the time in classifying the categories of colon cancer. In this study, a classification system for colon cancer was created into two classes, namely adenocarcinomas and polyps. Colon cancer data used in this study is data obtained online through the Kaggle website which consists of 2000 histopathological images measuring 768 pixels in jpeg format. The system is built using the Convolutional Neural Network (CNN) method with the MobileNet architecture. The design of this system is made by analyzing parameters that affect system performance based on the influence of image size, optimizer, learning rate, activation function, and batch size. Parameters used in evaluating system performance are accuracy, precision, recall, and f1-score. The results of testing the system based on parameters obtained the best model with image size 224x224 pixels, Adam optimizer, learning rate 0.0001, sigmoid activation function, and batch size 40. The best results of the best model are 100% accuracy value, 100% precision value, 100% recall value, and 100% f1-score with a loss of 0.000135.","PeriodicalId":256184,"journal":{"name":"2023 International Conference on Computer Science, Information Technology and Engineering (ICCoSITE)","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122828838","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Implementation of Convolutional Neural Network for COVID19 Screening using X-Rays Images 卷积神经网络在covid - 19 x射线图像筛查中的实现
Mera Kartika Delimayanti, Anggi Mardiyono, Bambang Warsuta, Eka Suci Puspitaningrum, R. F. Naryanto, Agustien Naryaningsih
Various unwelcome conditions have existed since the introduction of the novel coronavirus illness (COVID-19). COVID-19 can cause fever, muscle soreness, shortness of breath, cough, headache, and other symptoms. Diagnosis at an early stage is a crucial aspect of successful treatment. Therefore, it is necessary to seek out alternate methods for COVID-19 detection. Among the existing imaging resources, X-ray images are generally accessible and inexpensive. Consequently, an alternate diagnostic tool for detecting COVID-19 instances is provided using available resources. In the first stages of COVID-19, X-rays detected the disease before it spread to the lungs and caused more damage. Machine learning models can help clinicians accomplish jobs more quickly and accurately. In addition to chest X-ray pictures and fundus images, deep learning algorithms have been used to diagnose illnesses. This research was conducted to classify the X-ray chest images in COVID-19 and normal cases based on the public datasets which were used. This analysis uses 5600 images from the accessible resources, and a Convolutional Neural Network (CNN) architecture with the VGG16 algorithm was employed to diagnose COVID19. VGG16 is object identification and classification method that can classify with greater precision than most other deep learning algorithms. Transfer learning and fine-tuning were employed to help for improving the performance. The results showed that the VGG16 network had an accuracy of 98.13%. This research has implications for the early detection of COVID-19 by using X-ray images. The experiment and analysis reveal our suggested method's promising and stable performance compared to the current standard.
自新型冠状病毒(COVID-19)传入以来,各种不受欢迎的情况一直存在。COVID-19可引起发烧、肌肉酸痛、呼吸短促、咳嗽、头痛和其他症状。早期诊断是成功治疗的一个关键方面。因此,有必要寻找新冠病毒检测的替代方法。在现有的成像资源中,x射线图像一般容易获得且价格低廉。因此,利用现有资源提供了用于检测COVID-19实例的替代诊断工具。在COVID-19的第一阶段,x射线在疾病扩散到肺部并造成更大损害之前检测到疾病。机器学习模型可以帮助临床医生更快、更准确地完成工作。除了胸部x光片和眼底图像外,深度学习算法还被用于诊断疾病。本研究基于所使用的公共数据集,对新冠肺炎患者和正常病例的胸部x线图像进行分类。该分析使用可访问资源中的5600张图像,并采用卷积神经网络(CNN)架构和VGG16算法对covid - 19进行诊断。VGG16是一种对象识别和分类方法,其分类精度高于大多数其他深度学习算法。采用迁移学习和微调来帮助提高性能。结果表明,VGG16网络的准确率为98.13%。这项研究对利用x射线图像早期发现COVID-19具有重要意义。实验和分析表明,与现行标准相比,该方法具有良好的稳定性。
{"title":"Implementation of Convolutional Neural Network for COVID19 Screening using X-Rays Images","authors":"Mera Kartika Delimayanti, Anggi Mardiyono, Bambang Warsuta, Eka Suci Puspitaningrum, R. F. Naryanto, Agustien Naryaningsih","doi":"10.1109/ICCoSITE57641.2023.10127845","DOIUrl":"https://doi.org/10.1109/ICCoSITE57641.2023.10127845","url":null,"abstract":"Various unwelcome conditions have existed since the introduction of the novel coronavirus illness (COVID-19). COVID-19 can cause fever, muscle soreness, shortness of breath, cough, headache, and other symptoms. Diagnosis at an early stage is a crucial aspect of successful treatment. Therefore, it is necessary to seek out alternate methods for COVID-19 detection. Among the existing imaging resources, X-ray images are generally accessible and inexpensive. Consequently, an alternate diagnostic tool for detecting COVID-19 instances is provided using available resources. In the first stages of COVID-19, X-rays detected the disease before it spread to the lungs and caused more damage. Machine learning models can help clinicians accomplish jobs more quickly and accurately. In addition to chest X-ray pictures and fundus images, deep learning algorithms have been used to diagnose illnesses. This research was conducted to classify the X-ray chest images in COVID-19 and normal cases based on the public datasets which were used. This analysis uses 5600 images from the accessible resources, and a Convolutional Neural Network (CNN) architecture with the VGG16 algorithm was employed to diagnose COVID19. VGG16 is object identification and classification method that can classify with greater precision than most other deep learning algorithms. Transfer learning and fine-tuning were employed to help for improving the performance. The results showed that the VGG16 network had an accuracy of 98.13%. This research has implications for the early detection of COVID-19 by using X-ray images. The experiment and analysis reveal our suggested method's promising and stable performance compared to the current standard.","PeriodicalId":256184,"journal":{"name":"2023 International Conference on Computer Science, Information Technology and Engineering (ICCoSITE)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125043116","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Novel Data Security Model of D2D Communication Using Blockchain for Disaster 基于区块链的D2D通信数据安全模型
Shakil Ahmed
Multiple natural disasters cause the loss of billions of dollars, resources, and human lives. Integrated with the latest emerging technologies like Device to Device (D2D) communication and the Internet of Things (IoT) plays a significant role in the current disaster management system by extending coverage area, processing a massive volume of data, minimising the data traffic load, as well as energy-efficient communication. However, several critical security issues still exist, which have attracted many researchers’ attention. Blockchain is one of the latest technologies that could solve D2D and IoT devices’ different security issues. This paper analysed and discussed various security issues for early and post-disaster communication systems. It proposed a Hyper ledger Fabric (HLF) Blockchain-based communication model for the disaster communication system to overcome security threats and enhance the response system by providing authenticated and authorised the participant node. This paper implemented the HLF Blockchain framework to validate the proposed model and measure the system performance matrices.
多重自然灾害造成数十亿美元、资源和生命的损失。与设备对设备(D2D)通信和物联网(IoT)等最新新兴技术相结合,通过扩展覆盖范围,处理大量数据,最大限度地减少数据流量负载以及节能通信,在当前的灾害管理系统中发挥着重要作用。然而,仍然存在一些关键的安全问题,引起了许多研究人员的注意。区块链是可以解决D2D和物联网设备不同安全问题的最新技术之一。本文分析和讨论了灾前和灾后通信系统的各种安全问题。提出了一种基于hyperledger Fabric (HLF)区块链的灾难通信系统通信模型,通过提供经过认证和授权的参与者节点,克服安全威胁,增强响应系统。本文实现了HLF区块链框架来验证所提出的模型并测量系统性能矩阵。
{"title":"A Novel Data Security Model of D2D Communication Using Blockchain for Disaster","authors":"Shakil Ahmed","doi":"10.1109/ICCoSITE57641.2023.10127771","DOIUrl":"https://doi.org/10.1109/ICCoSITE57641.2023.10127771","url":null,"abstract":"Multiple natural disasters cause the loss of billions of dollars, resources, and human lives. Integrated with the latest emerging technologies like Device to Device (D2D) communication and the Internet of Things (IoT) plays a significant role in the current disaster management system by extending coverage area, processing a massive volume of data, minimising the data traffic load, as well as energy-efficient communication. However, several critical security issues still exist, which have attracted many researchers’ attention. Blockchain is one of the latest technologies that could solve D2D and IoT devices’ different security issues. This paper analysed and discussed various security issues for early and post-disaster communication systems. It proposed a Hyper ledger Fabric (HLF) Blockchain-based communication model for the disaster communication system to overcome security threats and enhance the response system by providing authenticated and authorised the participant node. This paper implemented the HLF Blockchain framework to validate the proposed model and measure the system performance matrices.","PeriodicalId":256184,"journal":{"name":"2023 International Conference on Computer Science, Information Technology and Engineering (ICCoSITE)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131636703","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Prototype Design of Smart Diabetic Shoes with Lora Module Communication 基于Lora模块通信的糖尿病智能鞋原型设计
P. Purwono, A. Burhan, K. Nisa', Sony Kartika Wibisono, Iis Setiawan Mangkunegara, Pramesti Dewi, A. Ma’arif, Iswanto Suwarno
Diabetes is a disease that affects many people in the world. Diabetes mellitus is a type of metabolic disease in a person who suffers from blood glucose levels with extreme conditions, namely insufficient insulin production in the human body. Monitoring diabetes is an important concern for researchers because it can be useful for improving the quality of the nursing service system. One of the common conditions in diabetic patients is ulceration which is difficult to detect on time. Technology can minimize the total cost of monitoring chronic diseases such as Diabetes continuously and on time. This research focuses on solutions to produce IoT-based smart diabetic shoes that utilize pressure sensors and the temperature of the feet of people with diabetic feet. Smart diabetic shoes are made using the Lora E32 module to be applied to areas with poor internet connections. The results of the testing carried out for 60 seconds in this study succeeded in detecting the area of the foot that experienced the greatest pressure, which was located on the rear footrest with a portion of pressure of around 25 – 28% of the total body weight. The patient's foot temperature increases when the pedestal load is greater. The Lora E32 module also functions as a media transmitter and receiver at a distance of 2.2 km in sending sensor data.
糖尿病是一种影响世界上许多人的疾病。糖尿病是一种代谢性疾病,患者的血糖水平处于极端状态,即人体内胰岛素分泌不足。监测糖尿病是研究人员关注的一个重要问题,因为它可以帮助提高护理服务系统的质量。糖尿病患者的常见症状之一是溃疡,溃疡很难及时发现。技术可以将持续和及时监测糖尿病等慢性疾病的总成本降至最低。该研究的重点是利用压力传感器和糖尿病脚的温度,生产基于物联网的智能糖尿病鞋的解决方案。使用Lora E32模块制作的智能糖尿病鞋适用于互联网连接较差的地区。在这项研究中,60秒的测试结果成功地检测出了足部承受最大压力的区域,该区域位于后脚踏板上,压力约占总体重的25 - 28%。当基座负荷较大时,患者的足部温度升高。Lora E32模块还可以在2.2公里的距离上作为媒体发射器和接收器发送传感器数据。
{"title":"Prototype Design of Smart Diabetic Shoes with Lora Module Communication","authors":"P. Purwono, A. Burhan, K. Nisa', Sony Kartika Wibisono, Iis Setiawan Mangkunegara, Pramesti Dewi, A. Ma’arif, Iswanto Suwarno","doi":"10.1109/ICCoSITE57641.2023.10127713","DOIUrl":"https://doi.org/10.1109/ICCoSITE57641.2023.10127713","url":null,"abstract":"Diabetes is a disease that affects many people in the world. Diabetes mellitus is a type of metabolic disease in a person who suffers from blood glucose levels with extreme conditions, namely insufficient insulin production in the human body. Monitoring diabetes is an important concern for researchers because it can be useful for improving the quality of the nursing service system. One of the common conditions in diabetic patients is ulceration which is difficult to detect on time. Technology can minimize the total cost of monitoring chronic diseases such as Diabetes continuously and on time. This research focuses on solutions to produce IoT-based smart diabetic shoes that utilize pressure sensors and the temperature of the feet of people with diabetic feet. Smart diabetic shoes are made using the Lora E32 module to be applied to areas with poor internet connections. The results of the testing carried out for 60 seconds in this study succeeded in detecting the area of the foot that experienced the greatest pressure, which was located on the rear footrest with a portion of pressure of around 25 – 28% of the total body weight. The patient's foot temperature increases when the pedestal load is greater. The Lora E32 module also functions as a media transmitter and receiver at a distance of 2.2 km in sending sensor data.","PeriodicalId":256184,"journal":{"name":"2023 International Conference on Computer Science, Information Technology and Engineering (ICCoSITE)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131945391","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Precision Rice Vending Machine by Using Multiple Load Cell and IoT Based 使用多个称重传感器和物联网的精密大米自动售货机
H. Yuliandoko, Farisqi Panduardi, N. Lusi, Sapto Wibowo
Indonesia has the world's largest Muslim population, so the potential for Zakat, Infaq, and Sadaqah (ZIS) funding is enormous. The existence of ZIS in the current Covid-19 pandemic is of extreme importance, as issues such as economic crisis, social inequality, and the rise of dhuafa (poor) are becoming more and more visible. However, in the presence of coronavirus, ZIS has become more challenging to manage and distribute. Bans on gatherings, distancing rules, and other restrictions to prevent the spread of the coronavirus make ZIS more challenging to manage. Therefore, we need innovations to build bridges between people in need and supporters. The Internet of Things (IoT) is one system that can bridge this. In this system, IoT enables machine-to-machine communication and reduces human interaction. This research uses IoT to support Rice Vending Machine with high accuracy. The accuracy is achieved using five load cells and a good rice way-out valve mechanism. In previous machines using a single load cell or open the valve according to a certain time. Based on its test results analysis and MAPE values, we found the machine's accuracy facing obtained very well, and the website monitoring was also very effective. This research provides innovation in ZIS management and dramatically benefits society.
印尼是世界上穆斯林人口最多的国家,因此天课(Zakat)、Infaq和Sadaqah (ZIS)的资金潜力巨大。随着经济危机、社会不平等、穷人崛起等问题越来越明显,在当前新冠肺炎大流行中,ZIS的存在具有极其重要的意义。然而,在冠状病毒的存在下,ZIS的管理和传播变得更具挑战性。禁止集会、隔离规定和其他防止冠状病毒传播的限制措施使ZIS的管理更具挑战性。因此,我们需要创新,在需要帮助的人和支持者之间架起桥梁。物联网(IoT)是一个可以弥合这一鸿沟的系统。在这个系统中,物联网实现了机器对机器的通信,减少了人与人之间的互动。本研究利用物联网技术支持高精度的自动售货机。精度是实现使用五个称重传感器和一个良好的大米出口阀机构。在以前的机器中使用单个称重传感器或按一定时间打开阀门。通过对其测试结果的分析和MAPE值,我们发现机器的精度得到了很好的面对,网站监测也很有效。这项研究为ZIS管理提供了创新,并为社会带来了巨大的效益。
{"title":"Precision Rice Vending Machine by Using Multiple Load Cell and IoT Based","authors":"H. Yuliandoko, Farisqi Panduardi, N. Lusi, Sapto Wibowo","doi":"10.1109/ICCoSITE57641.2023.10127836","DOIUrl":"https://doi.org/10.1109/ICCoSITE57641.2023.10127836","url":null,"abstract":"Indonesia has the world's largest Muslim population, so the potential for Zakat, Infaq, and Sadaqah (ZIS) funding is enormous. The existence of ZIS in the current Covid-19 pandemic is of extreme importance, as issues such as economic crisis, social inequality, and the rise of dhuafa (poor) are becoming more and more visible. However, in the presence of coronavirus, ZIS has become more challenging to manage and distribute. Bans on gatherings, distancing rules, and other restrictions to prevent the spread of the coronavirus make ZIS more challenging to manage. Therefore, we need innovations to build bridges between people in need and supporters. The Internet of Things (IoT) is one system that can bridge this. In this system, IoT enables machine-to-machine communication and reduces human interaction. This research uses IoT to support Rice Vending Machine with high accuracy. The accuracy is achieved using five load cells and a good rice way-out valve mechanism. In previous machines using a single load cell or open the valve according to a certain time. Based on its test results analysis and MAPE values, we found the machine's accuracy facing obtained very well, and the website monitoring was also very effective. This research provides innovation in ZIS management and dramatically benefits society.","PeriodicalId":256184,"journal":{"name":"2023 International Conference on Computer Science, Information Technology and Engineering (ICCoSITE)","volume":"53 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129661646","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Public Sentiment Analysis of KOMINFO Data Leaking by Bjorka using Support Vector Machine 基于支持向量机的KOMINFO数据泄露事件舆情分析
Rayhan Sabian, Antok Supriyanto, Sulistiowati
An account with the name Bjorka claims to have obtained billions of SIM card registration data in the form of Identity Card and Family Card Nuber from the government database of the Ministry of Communication and Informatics (Kemkominfo), people start questioning the cybersecurity of the government database. The appearance of the Bjorka hacker caused various responses on Twitter, some supported Bjorka’s action and some disagree. Hence the need for sentiment analysis to determine public sentiment is more towards negative or positive, so the government can do evaluation as well as strategic planning to deal with future data leaking incidents. This study uses tweets that contain public responses to predict negative or positive sentiment using Support Vector Machine algorithm. From a total of 1017 public response data, have been found 97.35% (990 tweets) to have negative sentiment and 2.65% (27 tweets) have positive sentiment, so it can be known that public responses are towards negative about data leaking by Bjorka. In conclusion, education to the public about data leaks by Bjorka is not the main priority to do for the government. The government can focus more on dealing with other sectors such as improving the security of the data itself.
一个名为Bjorka的账户声称从通信和信息部(Kemkominfo)的政府数据库(Kemkominfo)中获取了数十亿张身份证和家庭卡号形式的SIM卡注册数据,人们开始质疑政府数据库的网络安全。Bjorka黑客的出现在推特上引起了各种各样的反应,一些人支持Bjorka的行为,一些人不同意。因此,需要进行情绪分析,以确定公众的情绪是消极的还是积极的,以便政府可以进行评估和战略规划,以应对未来的数据泄露事件。本研究使用包含公众回应的推文,使用支持向量机算法来预测消极或积极的情绪。从总共1017条公众回应数据中,我们发现97.35%(990条)的人有负面情绪,2.65%(27条)的人有正面情绪,因此可以知道公众对Bjorka泄露数据的反应是负面的。总之,对公众进行有关Bjorka数据泄露的教育并不是政府的首要任务。政府可以把更多精力放在处理其他领域,比如提高数据本身的安全性。
{"title":"Public Sentiment Analysis of KOMINFO Data Leaking by Bjorka using Support Vector Machine","authors":"Rayhan Sabian, Antok Supriyanto, Sulistiowati","doi":"10.1109/ICCoSITE57641.2023.10127745","DOIUrl":"https://doi.org/10.1109/ICCoSITE57641.2023.10127745","url":null,"abstract":"An account with the name Bjorka claims to have obtained billions of SIM card registration data in the form of Identity Card and Family Card Nuber from the government database of the Ministry of Communication and Informatics (Kemkominfo), people start questioning the cybersecurity of the government database. The appearance of the Bjorka hacker caused various responses on Twitter, some supported Bjorka’s action and some disagree. Hence the need for sentiment analysis to determine public sentiment is more towards negative or positive, so the government can do evaluation as well as strategic planning to deal with future data leaking incidents. This study uses tweets that contain public responses to predict negative or positive sentiment using Support Vector Machine algorithm. From a total of 1017 public response data, have been found 97.35% (990 tweets) to have negative sentiment and 2.65% (27 tweets) have positive sentiment, so it can be known that public responses are towards negative about data leaking by Bjorka. In conclusion, education to the public about data leaks by Bjorka is not the main priority to do for the government. The government can focus more on dealing with other sectors such as improving the security of the data itself.","PeriodicalId":256184,"journal":{"name":"2023 International Conference on Computer Science, Information Technology and Engineering (ICCoSITE)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134174715","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Robustly Optimized BERT using Random Oversampling for Analyzing Imbalanced Stock News Sentiment Data 基于随机过采样的稳健优化BERT分析不平衡股票新闻情绪数据
Salsabila Mazya Permataning Tyas, R. Sarno, Agus Tri Haryono, Kelly Rossa Sungkono
Stock news is one of the information sources that can used to monitor stock prices. The information from stock news usually contains positive and negative sentiments that can affect stock prices. Therefore, sentiment analysis is needed to process the sentiment of stock news. The stock news dataset is taken from Kaggle. From these data, there is an imbalanced class between positive and negative sentiment. This research proposed a method to solve the imbalance dataset with random oversampling which worked by randomly replicating several minority classes. This research presents several scenarios of pre-processing text with different stages, intending to get high accuracy. The classification method used in this paper is a robustly optimized Bidirectional Transformer Encoder Representation (RoBERTa). Besides that, this paper also compared with baseline of Machine Learning (ML) such as Multinomial Naïve Bayes, Bernoulli Naïve Bayes, Support Vector Machine, Random Forest Classifier, Logistic Regression and used two different text representation such as TF-IDF and Word2Vec. The best result in this research is obtained using RoBERTa method with the fourth scenario of pre-processing text, in which the stage of pre-processing in this scenario only removing hashtag, without removing punctuation, removing the number, converting number, stop word removal, and lemmatization. The performance result is 0.85 precision, 0,84 recall, 0,84 F1-score, and 86% for accuracy result.
股票新闻是可以用来监控股票价格的信息来源之一。来自股票新闻的信息通常包含积极和消极的情绪,这些情绪会影响股价。因此,需要情绪分析来处理股票新闻的情绪。股票新闻数据集取自Kaggle。从这些数据来看,积极情绪和消极情绪之间存在着不平衡的阶层。本研究提出了一种通过随机复制几个少数类来解决随机过采样不平衡数据集的方法。本研究提出了几种不同阶段的文本预处理方案,以期获得较高的准确率。本文使用的分类方法是一种鲁棒优化的双向变压器编码器表示(RoBERTa)。除此之外,本文还比较了多项Naïve贝叶斯、伯努利Naïve贝叶斯、支持向量机、随机森林分类器、逻辑回归等机器学习(ML)的基线,并使用了TF-IDF和Word2Vec两种不同的文本表示。在本研究中,使用RoBERTa方法预处理文本的第四个场景得到了最好的结果,该场景的预处理阶段只有去掉标签,没有去掉标点、去掉数字、转换数字、去掉停用词和按序排列。性能结果为精度0.85,召回率0.84,f1得分0.84,正确率86%。
{"title":"A Robustly Optimized BERT using Random Oversampling for Analyzing Imbalanced Stock News Sentiment Data","authors":"Salsabila Mazya Permataning Tyas, R. Sarno, Agus Tri Haryono, Kelly Rossa Sungkono","doi":"10.1109/ICCoSITE57641.2023.10127725","DOIUrl":"https://doi.org/10.1109/ICCoSITE57641.2023.10127725","url":null,"abstract":"Stock news is one of the information sources that can used to monitor stock prices. The information from stock news usually contains positive and negative sentiments that can affect stock prices. Therefore, sentiment analysis is needed to process the sentiment of stock news. The stock news dataset is taken from Kaggle. From these data, there is an imbalanced class between positive and negative sentiment. This research proposed a method to solve the imbalance dataset with random oversampling which worked by randomly replicating several minority classes. This research presents several scenarios of pre-processing text with different stages, intending to get high accuracy. The classification method used in this paper is a robustly optimized Bidirectional Transformer Encoder Representation (RoBERTa). Besides that, this paper also compared with baseline of Machine Learning (ML) such as Multinomial Naïve Bayes, Bernoulli Naïve Bayes, Support Vector Machine, Random Forest Classifier, Logistic Regression and used two different text representation such as TF-IDF and Word2Vec. The best result in this research is obtained using RoBERTa method with the fourth scenario of pre-processing text, in which the stage of pre-processing in this scenario only removing hashtag, without removing punctuation, removing the number, converting number, stop word removal, and lemmatization. The performance result is 0.85 precision, 0,84 recall, 0,84 F1-score, and 86% for accuracy result.","PeriodicalId":256184,"journal":{"name":"2023 International Conference on Computer Science, Information Technology and Engineering (ICCoSITE)","volume":"159 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132621508","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Revalidating the Encoder-Decoder Depths and Activation Function to Find Optimum Vanilla Transformer Model 重新验证编码器-解码器深度和激活函数以找到最佳香草变压器模型
Y. Heryadi, B. Wijanarko, Dina Fitria Murad, C. Tho, Kiyota Hashimoto
The transformer model has become a state-of-the-art model in Natural Language Processing. The initial transformer model, known as the vanilla transformer model, is designed to improve some prominent models in sequence modeling and transduction problems such as language modeling and machine translation. The initial transformer model has 6 stacks of identical encoder-decoder layers with an attention mechanism whose aim is to push limitations of common recurrent language models and encoder-decoder architectures. Its outstanding performance has inspired many researchers to extend the architecture to improve its performance and computation efficiency. Despite many extensions to the vanilla transformer, there is no clear explanation of the encoder-decoder set out depth in the vanilla transformer model. This paper presents exploration results on the effect of combination encoder-decoder layer depth and activation function in the feed-forward layer of the vanilla transformer model on its performance. The model is tested to address a downstream task: text translation from Bahasa Indonesia to the Sundanese language. Although the value difference is not significantly large, the empirical results show that the combination of depth = 2 with Sigmoid, Tanh, and ReLU activation function; and d = 6 with ReLU activation shows the highest average training accuracy. Interestingly, d = 6 and ReLU show the lowest average training and validation loss. However, statistically, there is no significant difference between depth and activation functions.
变压器模型已经成为自然语言处理中最先进的模型。最初的变压器模型,被称为香草变压器模型,旨在改进一些突出的模型在序列建模和转导问题,如语言建模和机器翻译。最初的转换器模型有6个相同的编码器-解码器层堆栈,具有注意机制,其目的是突破常见循环语言模型和编码器-解码器体系结构的限制。其出色的性能激发了许多研究人员对其进行扩展,以提高其性能和计算效率。尽管对vanilla transformer进行了许多扩展,但是对于vanilla transformer模型中的编码器-解码器设置深度并没有明确的解释。本文给出了香草变压器模型前馈层中组合编码器-解码器层深度和激活函数对其性能影响的探索结果。对该模型进行了测试,以解决下游任务:从印尼语到巽他语的文本翻译。虽然数值差异不是很大,但实证结果表明,深度= 2与Sigmoid、Tanh、ReLU激活函数的组合;和d = 6的ReLU激活显示最高的平均训练精度。有趣的是,d = 6和ReLU显示了最低的平均训练和验证损失。然而,在统计上,深度和激活函数之间没有显著差异。
{"title":"Revalidating the Encoder-Decoder Depths and Activation Function to Find Optimum Vanilla Transformer Model","authors":"Y. Heryadi, B. Wijanarko, Dina Fitria Murad, C. Tho, Kiyota Hashimoto","doi":"10.1109/ICCoSITE57641.2023.10127790","DOIUrl":"https://doi.org/10.1109/ICCoSITE57641.2023.10127790","url":null,"abstract":"The transformer model has become a state-of-the-art model in Natural Language Processing. The initial transformer model, known as the vanilla transformer model, is designed to improve some prominent models in sequence modeling and transduction problems such as language modeling and machine translation. The initial transformer model has 6 stacks of identical encoder-decoder layers with an attention mechanism whose aim is to push limitations of common recurrent language models and encoder-decoder architectures. Its outstanding performance has inspired many researchers to extend the architecture to improve its performance and computation efficiency. Despite many extensions to the vanilla transformer, there is no clear explanation of the encoder-decoder set out depth in the vanilla transformer model. This paper presents exploration results on the effect of combination encoder-decoder layer depth and activation function in the feed-forward layer of the vanilla transformer model on its performance. The model is tested to address a downstream task: text translation from Bahasa Indonesia to the Sundanese language. Although the value difference is not significantly large, the empirical results show that the combination of depth = 2 with Sigmoid, Tanh, and ReLU activation function; and d = 6 with ReLU activation shows the highest average training accuracy. Interestingly, d = 6 and ReLU show the lowest average training and validation loss. However, statistically, there is no significant difference between depth and activation functions.","PeriodicalId":256184,"journal":{"name":"2023 International Conference on Computer Science, Information Technology and Engineering (ICCoSITE)","volume":"397 ","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"113997160","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Study of EMG-based Mouse Clicks Type Detection 基于肌电图的鼠标点击类型检测研究
R. B. Widodo, Devina Trixie, W. Swastika
The operation of a computer required humans to use several parts of their body. However, there were some conditions where humans cannot operate computers correctly or in a normal position; examples of these conditions were accident victims and people with disabilities. Therefore, a system was needed to help make it easier for these people to operate the computer. This study developed a system that can classify click types using EMG sensors, the K-NN method, and the SVM method. EMG sensors helped take data in the form of signals from human muscle contractions which will later be classified into left-click and right-click. At the same time, it was useful for classifying these types of clicks for the K-NN and SVM methods. Data from EMG sensors were trained using the K-NN and SVM methods using 54 data sets in each class, namely left-click and right-click classes. The K-NN method was trained using k=3, 5, 7, 9, and 11. The SVM method used linear kernels, Radial Basis Function (RBF), polynomials, and sigmoids. After that, the accuracy values of the two methods will be compared. The study has successfully classified the types of clicks based on the input from the EMG sensor using the K-NN method with the highest accuracy results using k=3, which was 81.81%, and the SVM method using polynomial kernels which were 84.84%. The highest accuracy value was obtained by comparing the two methods, namely using the polynomial kernel SVM method. Adding datasets and conducting experiments using other methods as further comparisons can be used to improve system accuracy.
电脑的操作需要人类使用身体的几个部位。然而,在某些情况下,人类不能正确操作计算机或在正常位置;这些情况的例子是事故受害者和残疾人。因此,需要一个系统来帮助这些人更容易地操作计算机。本研究开发了一个可以使用肌电传感器、K-NN方法和支持向量机方法对点击类型进行分类的系统。肌电图传感器以人体肌肉收缩信号的形式获取数据,这些信号将被分为左键点击和右键点击。同时,这对于K-NN和SVM方法分类这些类型的点击是有用的。使用K-NN和SVM方法训练来自肌电传感器的数据,每个类(即左键和右键类)使用54个数据集。k - nn方法使用k=3、5、7、9和11进行训练。支持向量机方法使用线性核、径向基函数(RBF)、多项式和s型。然后比较两种方法的精度值。本研究利用k - nn方法对肌电传感器输入的点击类型进行了分类,其中k=3的准确率最高,为81.81%,使用多项式核的SVM方法准确率最高,为84.84%。通过比较两种方法,即采用多项式核支持向量机方法,获得了最高的精度值。添加数据集和使用其他方法进行实验作为进一步的比较,可以用来提高系统的准确性。
{"title":"Study of EMG-based Mouse Clicks Type Detection","authors":"R. B. Widodo, Devina Trixie, W. Swastika","doi":"10.1109/ICCoSITE57641.2023.10127673","DOIUrl":"https://doi.org/10.1109/ICCoSITE57641.2023.10127673","url":null,"abstract":"The operation of a computer required humans to use several parts of their body. However, there were some conditions where humans cannot operate computers correctly or in a normal position; examples of these conditions were accident victims and people with disabilities. Therefore, a system was needed to help make it easier for these people to operate the computer. This study developed a system that can classify click types using EMG sensors, the K-NN method, and the SVM method. EMG sensors helped take data in the form of signals from human muscle contractions which will later be classified into left-click and right-click. At the same time, it was useful for classifying these types of clicks for the K-NN and SVM methods. Data from EMG sensors were trained using the K-NN and SVM methods using 54 data sets in each class, namely left-click and right-click classes. The K-NN method was trained using k=3, 5, 7, 9, and 11. The SVM method used linear kernels, Radial Basis Function (RBF), polynomials, and sigmoids. After that, the accuracy values of the two methods will be compared. The study has successfully classified the types of clicks based on the input from the EMG sensor using the K-NN method with the highest accuracy results using k=3, which was 81.81%, and the SVM method using polynomial kernels which were 84.84%. The highest accuracy value was obtained by comparing the two methods, namely using the polynomial kernel SVM method. Adding datasets and conducting experiments using other methods as further comparisons can be used to improve system accuracy.","PeriodicalId":256184,"journal":{"name":"2023 International Conference on Computer Science, Information Technology and Engineering (ICCoSITE)","volume":"100 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114231399","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Identification of Rice Varieties and Cultivation Techniques based-on Hyperspectral Image using Multi-output Spectral Xception 基于多输出光谱异常的高光谱图像水稻品种识别及栽培技术
Shinta Aprilia Safitri, A. H. Saputro
The use of deep learning model with hyperspectral image had been developed as a food identification system. This method was known to have a high level of accuracy without damaging the test sample. However, most of the CNN models developed were only capable to identify single target. It was inefficient when used for multiple targets such as identification of rice quality, due to it represents by multiple parameters. The model must be trained separately for each target. In this study, we proposed a model called Multi-output Spectral Xception that could classify objects in multi-class multi-output problems with hyperspectral image input. The proposed model was built by replacing 2D convolution layer with 3D convolution layer. It effectively extracts the spectral and spatial features. The model was evaluated using Indonesian rice with eight varieties and two type of cultivation techniques. Performance evaluations were done by calculate its accuracy using the confusion matrix, then compared it with state-of-the-art models. The result showed that the proposed model achieved the best performance among the other models, which was 97,82% for its average accuracy score.
利用深度学习模型与高光谱图像相结合,开发了一种食品识别系统。这种方法在不损坏测试样品的情况下具有很高的准确性。然而,大多数开发的CNN模型只能识别单个目标。由于它是由多个参数表示的,因此在用于大米品质鉴定等多目标时效率不高。模型必须针对每个目标分别进行训练。在本研究中,我们提出了一个多输出光谱异常模型,该模型可以对输入高光谱图像的多类多输出问题中的目标进行分类。采用三维卷积层代替二维卷积层建立模型。它有效地提取了光谱和空间特征。该模型以印度尼西亚8个品种和2种栽培技术的水稻为研究对象进行了评价。通过使用混淆矩阵计算其精度来进行性能评估,然后将其与最先进的模型进行比较。结果表明,该模型在所有模型中表现最好,平均准确率为97.82%。
{"title":"Identification of Rice Varieties and Cultivation Techniques based-on Hyperspectral Image using Multi-output Spectral Xception","authors":"Shinta Aprilia Safitri, A. H. Saputro","doi":"10.1109/ICCoSITE57641.2023.10127747","DOIUrl":"https://doi.org/10.1109/ICCoSITE57641.2023.10127747","url":null,"abstract":"The use of deep learning model with hyperspectral image had been developed as a food identification system. This method was known to have a high level of accuracy without damaging the test sample. However, most of the CNN models developed were only capable to identify single target. It was inefficient when used for multiple targets such as identification of rice quality, due to it represents by multiple parameters. The model must be trained separately for each target. In this study, we proposed a model called Multi-output Spectral Xception that could classify objects in multi-class multi-output problems with hyperspectral image input. The proposed model was built by replacing 2D convolution layer with 3D convolution layer. It effectively extracts the spectral and spatial features. The model was evaluated using Indonesian rice with eight varieties and two type of cultivation techniques. Performance evaluations were done by calculate its accuracy using the confusion matrix, then compared it with state-of-the-art models. The result showed that the proposed model achieved the best performance among the other models, which was 97,82% for its average accuracy score.","PeriodicalId":256184,"journal":{"name":"2023 International Conference on Computer Science, Information Technology and Engineering (ICCoSITE)","volume":"24 3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116829915","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2023 International Conference on Computer Science, Information Technology and Engineering (ICCoSITE)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1