首页 > 最新文献

2022 14th International Conference on Knowledge and Smart Technology (KST)最新文献

英文 中文
SIREN! Detecting Burmese Hate Speech Comments on Social Media 警笛!在社交媒体上发现缅甸人的仇恨言论评论
Pub Date : 2022-01-26 DOI: 10.1109/KST53302.2022.9729075
Khin Me Me Chit, Yi Yi Chan Myae Win Shein, Wai Yan, A. Khine
Hate Speech on Social Media is definitely an evolving threat for every nation, especially for countries like Myanmar. Lack of media and digital literacy is playing a huge role in making people insult to each other or misallocating their stresses to others without physical encounter. Moreover, disingenuous politicians fuel online hate speech campaigns backstage of the elections by targeting different religions in the regard of heretics and using racialism. To emphasize this matter, we scraped over 16,000 social media comments from the most popular social media platform in Myanmar and performed hate-speech research using those samples. With the precise definition of a hate speech labelling guideline, annotation on the sample dataset was done systematically and efficiently. Experiments and evaluations were conducted using different linear and non-linear deep-learning classification models. Performances of the models are at the peak in Logistic Regression among linear models with 0.8974 AUC score and XLM-RoBERTa among deep learning models with 0.8958 AUC score on the test dataset. We observed that it is more advantageous to use linear models on our dataset since they achieved comparable results to the deep learning models and have much lower computational cost.
社交媒体上的仇恨言论对每个国家来说都是一个不断演变的威胁,尤其是对缅甸这样的国家。缺乏媒体和数字素养是导致人们在没有身体接触的情况下相互侮辱或将压力错误地分配给他人的重要原因。此外,不诚实的政客们在选举后台通过针对不同宗教的异端和种族主义来煽动网上仇恨言论运动。为了强调这一点,我们从缅甸最受欢迎的社交媒体平台上收集了1.6万多条社交媒体评论,并利用这些样本进行了仇恨言论研究。通过对仇恨言论标注准则的精确定义,系统高效地对样本数据集进行标注。使用不同的线性和非线性深度学习分类模型进行了实验和评估。在测试数据集上AUC分数为0.8974的线性模型和AUC分数为0.8958的深度学习模型中,XLM-RoBERTa的Logistic回归性能达到了顶峰。我们观察到,在我们的数据集上使用线性模型更有利,因为它们获得了与深度学习模型相当的结果,并且计算成本要低得多。
{"title":"SIREN! Detecting Burmese Hate Speech Comments on Social Media","authors":"Khin Me Me Chit, Yi Yi Chan Myae Win Shein, Wai Yan, A. Khine","doi":"10.1109/KST53302.2022.9729075","DOIUrl":"https://doi.org/10.1109/KST53302.2022.9729075","url":null,"abstract":"Hate Speech on Social Media is definitely an evolving threat for every nation, especially for countries like Myanmar. Lack of media and digital literacy is playing a huge role in making people insult to each other or misallocating their stresses to others without physical encounter. Moreover, disingenuous politicians fuel online hate speech campaigns backstage of the elections by targeting different religions in the regard of heretics and using racialism. To emphasize this matter, we scraped over 16,000 social media comments from the most popular social media platform in Myanmar and performed hate-speech research using those samples. With the precise definition of a hate speech labelling guideline, annotation on the sample dataset was done systematically and efficiently. Experiments and evaluations were conducted using different linear and non-linear deep-learning classification models. Performances of the models are at the peak in Logistic Regression among linear models with 0.8974 AUC score and XLM-RoBERTa among deep learning models with 0.8958 AUC score on the test dataset. We observed that it is more advantageous to use linear models on our dataset since they achieved comparable results to the deep learning models and have much lower computational cost.","PeriodicalId":433638,"journal":{"name":"2022 14th International Conference on Knowledge and Smart Technology (KST)","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-01-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123120903","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Neural Networks for Real-Time Digital Emulation of Guitar Speaker Cabinet Impulse Response 基于神经网络的吉他音箱箱体脉冲响应实时数字仿真
Pub Date : 2022-01-26 DOI: 10.1109/KST53302.2022.9727233
Tantep Sinjanakhom, S. Chivapreecha
This This paper presents a real-time signal processing system in which a neural network generates the impulse response (IR) of a Marshall 1960A guitar cabinet with 25W Celestion speakers based on user-specified parameters. The parameters include the microphone type, position of the speaker on which the microphone is mounted, distance between the microphone and the cabinet, and off-axis tilting angle. The trained model of neural network can generate the impulse response for a speaker cabinet, as well as the sound of settings not included in training set. Cross-correlation, error-to-signal ratio, power spectral density error, and magnitude-squared coherence were all utilized to assess the model's output. Mean Opinion Score listening tests were performed to determine the similarity of the convolved guitar signals. According to the results, the emulated cabinet sounds were perceived to be nearly identical to the original sounds. The performance of the real-time audio plugin implementation is proved to be computationally efficient. Because raw IR data for each microphone configuration does not need to be saved directly to the PC's memory, utilizing it in music production work can be more convenient, allowing the user to modify the parameters while hearing the differences without having to repeat the IR file loading procedure.
本文介绍了一种实时信号处理系统,该系统利用神经网络根据用户设定的参数产生带有25W Celestion扬声器的Marshall 1960A吉他箱的脉冲响应(IR)。参数包括麦克风类型、安装麦克风的扬声器位置、麦克风与机柜的距离、离轴倾斜度等。训练后的神经网络模型可以生成扬声器箱体的脉冲响应,以及未包含在训练集中的设置的声音。互相关、误差信号比、功率谱密度误差和幅度平方相干性都被用来评估模型的输出。平均意见得分听力测试进行,以确定卷积吉他信号的相似性。根据结果,模拟的橱柜声音被认为与原始声音几乎相同。该实时音频插件实现的性能被证明是计算效率高的。由于每个麦克风配置的原始IR数据不需要直接保存到PC的内存中,因此在音乐制作工作中利用它可以更方便,允许用户在听到差异的同时修改参数,而无需重复IR文件加载过程。
{"title":"Neural Networks for Real-Time Digital Emulation of Guitar Speaker Cabinet Impulse Response","authors":"Tantep Sinjanakhom, S. Chivapreecha","doi":"10.1109/KST53302.2022.9727233","DOIUrl":"https://doi.org/10.1109/KST53302.2022.9727233","url":null,"abstract":"This This paper presents a real-time signal processing system in which a neural network generates the impulse response (IR) of a Marshall 1960A guitar cabinet with 25W Celestion speakers based on user-specified parameters. The parameters include the microphone type, position of the speaker on which the microphone is mounted, distance between the microphone and the cabinet, and off-axis tilting angle. The trained model of neural network can generate the impulse response for a speaker cabinet, as well as the sound of settings not included in training set. Cross-correlation, error-to-signal ratio, power spectral density error, and magnitude-squared coherence were all utilized to assess the model's output. Mean Opinion Score listening tests were performed to determine the similarity of the convolved guitar signals. According to the results, the emulated cabinet sounds were perceived to be nearly identical to the original sounds. The performance of the real-time audio plugin implementation is proved to be computationally efficient. Because raw IR data for each microphone configuration does not need to be saved directly to the PC's memory, utilizing it in music production work can be more convenient, allowing the user to modify the parameters while hearing the differences without having to repeat the IR file loading procedure.","PeriodicalId":433638,"journal":{"name":"2022 14th International Conference on Knowledge and Smart Technology (KST)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-01-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126906713","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Electronic Nose for Analysis of Coffee Beans Obtained from Different Altitudes and Origin 不同海拔和产地咖啡豆的电子鼻分析
Pub Date : 2022-01-26 DOI: 10.1109/KST53302.2022.9729071
Wandee Aunsa-Ard, T. Kerdcharoen
The coffee industry is facing increasing challenges due to climate change, pests, diseases, which leads to the reduced production and negative impact on coffee qualities. Thus, quality assurance of coffee from production to roasting and brewing becomes more important, especially coffee flavor and aroma. This research aims to study the applicability of electronic nose (e-nose) and algorithm to detect coffee aroma obtained from different origins. The coffee beans used in this experiment were obtained from different areas in northern Thailand. These coffee beans have different growing conditions, altitude, processing and roasting condition. In this study, the three aspects of e-nose were investigated; (i) e-nose sensitivity to coffee odors, (ii) e-nose capability of correctly recognizing the detected odors and (iii) factors that influence coffee odors such as altitude, processing and roasting conditions. The e-nose system comprises of eight metal oxide semiconductor (MOX) gas sensors and in-house developed analysis software. Principal Component Analysis (PCA) is a classification algorithm for pattern recognition of different coffee aroma. Based on experimental results, the e-nose technology shows a capability to detect and distinguish the coffee odors caused by different altitude, processing and roasting process. E-nose is a suitable method for aroma detection in coffee industry to enhance the quality.
由于气候变化、害虫、疾病,咖啡行业面临着越来越多的挑战,导致产量下降,对咖啡品质产生负面影响。因此,咖啡从生产到烘焙、冲泡的质量保证就显得尤为重要,尤其是咖啡的风味和香气。本研究旨在研究电子鼻(e-nose)及其算法在不同产地咖啡香气检测中的适用性。实验中使用的咖啡豆来自泰国北部的不同地区。这些咖啡豆有不同的生长条件、海拔、加工和烘焙条件。本研究从三个方面对电子鼻进行了研究;(i)电子鼻对咖啡气味的敏感度,(ii)电子鼻正确识别检测到的气味的能力,以及(iii)影响咖啡气味的因素,如海拔高度、加工和烘焙条件。电子鼻系统由8个金属氧化物半导体(MOX)气体传感器和内部开发的分析软件组成。主成分分析(PCA)是一种用于咖啡香气模式识别的分类算法。实验结果表明,电子鼻技术能够检测和区分不同海拔、加工和烘焙过程产生的咖啡气味。在咖啡工业中,电子鼻是提高咖啡品质的一种合适的香气检测方法。
{"title":"Electronic Nose for Analysis of Coffee Beans Obtained from Different Altitudes and Origin","authors":"Wandee Aunsa-Ard, T. Kerdcharoen","doi":"10.1109/KST53302.2022.9729071","DOIUrl":"https://doi.org/10.1109/KST53302.2022.9729071","url":null,"abstract":"The coffee industry is facing increasing challenges due to climate change, pests, diseases, which leads to the reduced production and negative impact on coffee qualities. Thus, quality assurance of coffee from production to roasting and brewing becomes more important, especially coffee flavor and aroma. This research aims to study the applicability of electronic nose (e-nose) and algorithm to detect coffee aroma obtained from different origins. The coffee beans used in this experiment were obtained from different areas in northern Thailand. These coffee beans have different growing conditions, altitude, processing and roasting condition. In this study, the three aspects of e-nose were investigated; (i) e-nose sensitivity to coffee odors, (ii) e-nose capability of correctly recognizing the detected odors and (iii) factors that influence coffee odors such as altitude, processing and roasting conditions. The e-nose system comprises of eight metal oxide semiconductor (MOX) gas sensors and in-house developed analysis software. Principal Component Analysis (PCA) is a classification algorithm for pattern recognition of different coffee aroma. Based on experimental results, the e-nose technology shows a capability to detect and distinguish the coffee odors caused by different altitude, processing and roasting process. E-nose is a suitable method for aroma detection in coffee industry to enhance the quality.","PeriodicalId":433638,"journal":{"name":"2022 14th International Conference on Knowledge and Smart Technology (KST)","volume":"94 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-01-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122026471","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
GSAP: A Hybrid GRU and Self-Attention Based Model for Dual Medical NLP Tasks GSAP:基于GRU和自注意的双重医学NLP任务混合模型
Pub Date : 2022-01-26 DOI: 10.1109/KST53302.2022.9727234
Huey-Ing Liu, Meng-Wei Chen, Wei-Chun Kao, Yao-Wen Yeh, Cheng Yang
This paper proposes a hybrid Gated Recurrent Unit (GRU) and Self-Attention based model, named GSAP, for dual medical related NLP tasks. GSAP stacks three famous neural network units: GRU, self-attention and pooling of Convolutional Neural Network (CNN) to improve the accuracy. In the GSAP, GRU is first adopted to comprehend sentences. Second, the Self-Attention layer helps the model to focus on key points of inputs. Finally, the pooling layer eases the outfitting and upgrades the system accuracy. The proposed GSAP is applied to two different medical NLP tasks: medical QA matching and smoking status classification and demonstrates outstanding results. In the smoking prediction, GSAP obtains an accuracy around 80%. Regarding to the medical QA matching task, GSAP upgrades the accuracy up to around 90%.
本文提出了一种基于门控循环单元(GRU)和自注意的混合模型GSAP,用于双重医学相关的NLP任务。GSAP堆栈了三个著名的神经网络单元:GRU,自关注和卷积神经网络(CNN)的池化,以提高准确性。在GSAP中,首先采用GRU来理解句子。第二,自注意层帮助模型关注输入的关键点。最后,池化层简化了配置,提高了系统的精度。将该方法应用于两种不同的医学NLP任务:医学质量保证匹配和吸烟状态分类,并取得了显著的效果。在吸烟预测中,GSAP的准确率在80%左右。对于医疗QA匹配任务,GSAP将准确率提升到90%左右。
{"title":"GSAP: A Hybrid GRU and Self-Attention Based Model for Dual Medical NLP Tasks","authors":"Huey-Ing Liu, Meng-Wei Chen, Wei-Chun Kao, Yao-Wen Yeh, Cheng Yang","doi":"10.1109/KST53302.2022.9727234","DOIUrl":"https://doi.org/10.1109/KST53302.2022.9727234","url":null,"abstract":"This paper proposes a hybrid Gated Recurrent Unit (GRU) and Self-Attention based model, named GSAP, for dual medical related NLP tasks. GSAP stacks three famous neural network units: GRU, self-attention and pooling of Convolutional Neural Network (CNN) to improve the accuracy. In the GSAP, GRU is first adopted to comprehend sentences. Second, the Self-Attention layer helps the model to focus on key points of inputs. Finally, the pooling layer eases the outfitting and upgrades the system accuracy. The proposed GSAP is applied to two different medical NLP tasks: medical QA matching and smoking status classification and demonstrates outstanding results. In the smoking prediction, GSAP obtains an accuracy around 80%. Regarding to the medical QA matching task, GSAP upgrades the accuracy up to around 90%.","PeriodicalId":433638,"journal":{"name":"2022 14th International Conference on Knowledge and Smart Technology (KST)","volume":"223 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-01-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116164814","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
RaaS (Robot-as-a-Service) focusing on the human-robot collaboration in industrial sites RaaS (Robot-as-a-Service)专注于工业现场的人机协作
Pub Date : 2022-01-26 DOI: 10.1109/KST53302.2022.9729068
Hahyeon Kim, Chen Li
In this paper, we present the concept of a robot service that interacts with a human operator through speech. Service-oriented architectures and cloud computing are now the dominant computer paradigms. Research on using robots as a service (Robot-as-a-Service. RaaS) is a new trend based on the integration of robots and embedded devices and web and cloud computing. However, while the demand for RaaS utilization within the industry is high, it has not successfully acquired many users. One of the reasons is that the accuracy of the robot's recognition-judgment-action process does not reach a level that users can trust, and the other reason is that it is challenging to learn how to control the robot. Therefore, this study focused on services that allow users to control robots easily within industrial sites. Speech recognition was implemented using RESTful API and Server-Client communication, and a mobile manipulator robot, what we call Little-Helper (LH), is used for implementation. According to the human operators, communicate with the robot by voice speech, increasing collaboration efficiency and productivity of the industry is expected.
在本文中,我们提出了通过语音与人类操作员交互的机器人服务的概念。面向服务的体系结构和云计算现在是占主导地位的计算机范式。机器人即服务(Robot-as-a-Service)的研究。RaaS是一种基于机器人、嵌入式设备、网络和云计算集成的新趋势。然而,虽然行业内对RaaS利用率的需求很高,但它并没有成功地获得很多用户。一个原因是机器人的识别-判断-行动过程的准确性没有达到用户可以信任的水平,另一个原因是学习如何控制机器人是具有挑战性的。因此,这项研究的重点是让用户在工业现场轻松控制机器人的服务。语音识别是使用RESTful API和服务器-客户端通信实现的,并且使用了一个移动机械手,我们称之为Little-Helper (LH)。根据人类操作员的说法,通过语音与机器人进行交流,有望提高行业的协作效率和生产力。
{"title":"RaaS (Robot-as-a-Service) focusing on the human-robot collaboration in industrial sites","authors":"Hahyeon Kim, Chen Li","doi":"10.1109/KST53302.2022.9729068","DOIUrl":"https://doi.org/10.1109/KST53302.2022.9729068","url":null,"abstract":"In this paper, we present the concept of a robot service that interacts with a human operator through speech. Service-oriented architectures and cloud computing are now the dominant computer paradigms. Research on using robots as a service (Robot-as-a-Service. RaaS) is a new trend based on the integration of robots and embedded devices and web and cloud computing. However, while the demand for RaaS utilization within the industry is high, it has not successfully acquired many users. One of the reasons is that the accuracy of the robot's recognition-judgment-action process does not reach a level that users can trust, and the other reason is that it is challenging to learn how to control the robot. Therefore, this study focused on services that allow users to control robots easily within industrial sites. Speech recognition was implemented using RESTful API and Server-Client communication, and a mobile manipulator robot, what we call Little-Helper (LH), is used for implementation. According to the human operators, communicate with the robot by voice speech, increasing collaboration efficiency and productivity of the industry is expected.","PeriodicalId":433638,"journal":{"name":"2022 14th International Conference on Knowledge and Smart Technology (KST)","volume":"28 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-01-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134006789","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Comparison of Prediction Models for Road Deaths On Road Network 路网道路死亡预测模型的比较
Pub Date : 2022-01-26 DOI: 10.1109/KST53302.2022.9729086
Thaninthorn Whasphutthisit, Watchareewan Jitsakul
This paper presents to compare prediction models for road deaths on road network by data mining techniques. In this work, the classifier is selected from four prediction algorithms: Random Forest (RF), Support Vector Machine (SVM), K-Nearest Neighbor (KNN), and Neural Network (NN). The dead injured and dead people data in road accident data set of the Ministry of Transport, Thailand from January to April 2021. It has up to 8,560 records 46 attributes. This research has measured performance models with accuracy, precision, recall, and f-measure. The comparative results showed that the accuracy of RF is the most appropriate for predicting road deaths on road network with accuracy 89%, precision 0.86, recall 0.89, and f-measure 0.85.
本文采用数据挖掘技术对路网道路死亡预测模型进行了比较。在这项工作中,分类器从四种预测算法中选择:随机森林(RF),支持向量机(SVM), k -最近邻(KNN)和神经网络(NN)。泰国交通部2021年1 - 4月道路交通事故数据集中的死伤人数数据。它有多达8,560条记录46个属性。本研究用准确性、精密度、召回率和f-measure来衡量绩效模型。对比结果表明,RF预测路网道路死亡的准确度为89%,精密度为0.86,召回率为0.89,f-measure为0.85。
{"title":"Comparison of Prediction Models for Road Deaths On Road Network","authors":"Thaninthorn Whasphutthisit, Watchareewan Jitsakul","doi":"10.1109/KST53302.2022.9729086","DOIUrl":"https://doi.org/10.1109/KST53302.2022.9729086","url":null,"abstract":"This paper presents to compare prediction models for road deaths on road network by data mining techniques. In this work, the classifier is selected from four prediction algorithms: Random Forest (RF), Support Vector Machine (SVM), K-Nearest Neighbor (KNN), and Neural Network (NN). The dead injured and dead people data in road accident data set of the Ministry of Transport, Thailand from January to April 2021. It has up to 8,560 records 46 attributes. This research has measured performance models with accuracy, precision, recall, and f-measure. The comparative results showed that the accuracy of RF is the most appropriate for predicting road deaths on road network with accuracy 89%, precision 0.86, recall 0.89, and f-measure 0.85.","PeriodicalId":433638,"journal":{"name":"2022 14th International Conference on Knowledge and Smart Technology (KST)","volume":"428 12","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-01-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133941274","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Unsupervised and Ensemble-based Anomaly Detection Method for Network Security 基于集成的无监督网络安全异常检测方法
Pub Date : 2022-01-26 DOI: 10.1109/KST53302.2022.9729061
Donghun Yang, Myunggwon Hwang
Bigdata and IoT technologies are developing rapidly. Accordingly, consideration of network security is also emphasized, and efficient intrusion detection technology is required for detecting increasingly sophisticated network attacks. In this study, we propose an efficient network anomaly detection method based on ensemble and unsupervised learning. The proposed model is built by training an autoencoder, a representative unsupervised deep learning model, using only normal network traffic data. The anomaly score of the detection target data is derived by ensemble the reconstruction loss and the Mahalanobis distances for each layer output of the trained autoencoder. By applying a threshold to this score, network anomaly traffic can be efficiently detected. To evaluate the proposed model, we applied our method to UNSW-NB15 dataset. The results show that the overall performance of the proposed method is superior to those of the model using only the reconstruction loss of the autoencoder and the model applying the Mahalanobis distance to the raw data.
大数据、物联网技术快速发展。因此,对网络安全的考虑也日益受到重视,需要高效的入侵检测技术来检测日益复杂的网络攻击。在本研究中,我们提出了一种基于集成和无监督学习的高效网络异常检测方法。该模型是通过训练一个自编码器(一种代表性的无监督深度学习模型)来构建的,该模型仅使用正常的网络流量数据。将训练好的自编码器每层输出的重建损失和马氏距离综合起来,得到检测目标数据的异常分数。通过对该分数应用阈值,可以有效地检测网络异常流量。为了评估所提出的模型,我们将该方法应用于UNSW-NB15数据集。结果表明,该方法的总体性能优于只考虑自编码器重构损失的模型和对原始数据应用马氏距离的模型。
{"title":"Unsupervised and Ensemble-based Anomaly Detection Method for Network Security","authors":"Donghun Yang, Myunggwon Hwang","doi":"10.1109/KST53302.2022.9729061","DOIUrl":"https://doi.org/10.1109/KST53302.2022.9729061","url":null,"abstract":"Bigdata and IoT technologies are developing rapidly. Accordingly, consideration of network security is also emphasized, and efficient intrusion detection technology is required for detecting increasingly sophisticated network attacks. In this study, we propose an efficient network anomaly detection method based on ensemble and unsupervised learning. The proposed model is built by training an autoencoder, a representative unsupervised deep learning model, using only normal network traffic data. The anomaly score of the detection target data is derived by ensemble the reconstruction loss and the Mahalanobis distances for each layer output of the trained autoencoder. By applying a threshold to this score, network anomaly traffic can be efficiently detected. To evaluate the proposed model, we applied our method to UNSW-NB15 dataset. The results show that the overall performance of the proposed method is superior to those of the model using only the reconstruction loss of the autoencoder and the model applying the Mahalanobis distance to the raw data.","PeriodicalId":433638,"journal":{"name":"2022 14th International Conference on Knowledge and Smart Technology (KST)","volume":"127 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-01-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114276027","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Explainable Digital Currency Candlestick Pattern AI Learner 可解释的数字货币烛台模式人工智能学习者
Pub Date : 2022-01-26 DOI: 10.1109/KST53302.2022.9727231
Jun-Hao Chen, Cheng-Han Wu, Yun-Cheng Tsai, Samuel Yen-Chi Chen
More and more hedge funds have integrated AI techniques into the traditional trading strategy to speculate on Digital Currency. Among the conventional technical analysis, candlestick pattern recognition is a critical financial trading technique by visual judgment on graphical price movement. A model with high accuracy still can not meet the demand under the highly regulated financial industry that requires understanding the decision-making and quantifying the potential risk. Despite the deep convolutional neural networks (CNNs) have a significant performance. Especially in a highly speculative market, blindly trusting a black-box model will incur lots of troubles. Therefore, it is necessary to incorporate explainability into a DNN-based classic trading strategy, candlestick pattern recognition. It can make an acceptable justification for traders in the Digital Currency market. The paper exposes the black box and provides two algorithms as following. The first is an Adversarial Interpreter to explore the explainability. The second is an Adversarial Generator to enhance the model's explainability. To trust in the AI model and understand its judgment, the participant adopts powerful AI techniques to create more possibilities for AI in the Digital Currency market.
越来越多的对冲基金将人工智能技术融入到传统的交易策略中,投机数字货币。在传统的技术分析中,烛台模式识别是一项重要的金融交易技术,通过对图形价格运动进行视觉判断。在高度监管的金融行业中,一个高精度的模型仍然不能满足对决策的理解和对潜在风险的量化的需求。尽管深度卷积神经网络(cnn)有着显著的性能。特别是在高度投机的市场中,盲目相信黑盒模型会带来很多麻烦。因此,有必要将可解释性纳入基于dnn的经典交易策略,烛台模式识别。它可以为数字货币市场的交易者提供一个可接受的理由。本文揭示了黑盒,并提供了以下两种算法。第一种是对抗性解释器,探讨其可解释性。第二种是对抗生成器,以增强模型的可解释性。为了信任AI模型,理解其判断,参与者采用强大的AI技术,为AI在数字货币市场创造更多的可能性。
{"title":"Explainable Digital Currency Candlestick Pattern AI Learner","authors":"Jun-Hao Chen, Cheng-Han Wu, Yun-Cheng Tsai, Samuel Yen-Chi Chen","doi":"10.1109/KST53302.2022.9727231","DOIUrl":"https://doi.org/10.1109/KST53302.2022.9727231","url":null,"abstract":"More and more hedge funds have integrated AI techniques into the traditional trading strategy to speculate on Digital Currency. Among the conventional technical analysis, candlestick pattern recognition is a critical financial trading technique by visual judgment on graphical price movement. A model with high accuracy still can not meet the demand under the highly regulated financial industry that requires understanding the decision-making and quantifying the potential risk. Despite the deep convolutional neural networks (CNNs) have a significant performance. Especially in a highly speculative market, blindly trusting a black-box model will incur lots of troubles. Therefore, it is necessary to incorporate explainability into a DNN-based classic trading strategy, candlestick pattern recognition. It can make an acceptable justification for traders in the Digital Currency market. The paper exposes the black box and provides two algorithms as following. The first is an Adversarial Interpreter to explore the explainability. The second is an Adversarial Generator to enhance the model's explainability. To trust in the AI model and understand its judgment, the participant adopts powerful AI techniques to create more possibilities for AI in the Digital Currency market.","PeriodicalId":433638,"journal":{"name":"2022 14th International Conference on Knowledge and Smart Technology (KST)","volume":"73 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-01-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122607339","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Continuity of line detection methods based on the Radon transform 基于Radon变换的连续性线检测方法
Pub Date : 2022-01-26 DOI: 10.1109/KST53302.2022.9729056
Pat Vatiwutipong
The Radon transform is one of the most well-known tools for line detection methods. The drawback of the Radon transform is that it is not continuous. So slight change in an image may lead to a considerable difference in the detection line. This is undesirable. We solve this problem by the modified version of the Radon transform called the d-Radon transform. Several mathematical properties of this modified transform, especially a continuity property of line detection methods, were studied. We focus on a space of image containing several sizes of circles. A metric function on that space is proposed to measure the change of images. By this new transformation, continuity property is obtained.
Radon变换是最著名的直线检测方法之一。拉东变换的缺点是它不是连续的。因此,图像的微小变化可能导致检测线出现相当大的差异。这是不可取的。我们通过Radon变换的改进版本——d-Radon变换来解决这个问题。研究了该改进变换的几个数学性质,特别是直线检测方法的连续性。我们关注一个包含不同大小圆的图像空间。在该空间上提出了度量函数来度量图像的变化。通过这种新的变换,得到了连续性质。
{"title":"Continuity of line detection methods based on the Radon transform","authors":"Pat Vatiwutipong","doi":"10.1109/KST53302.2022.9729056","DOIUrl":"https://doi.org/10.1109/KST53302.2022.9729056","url":null,"abstract":"The Radon transform is one of the most well-known tools for line detection methods. The drawback of the Radon transform is that it is not continuous. So slight change in an image may lead to a considerable difference in the detection line. This is undesirable. We solve this problem by the modified version of the Radon transform called the d-Radon transform. Several mathematical properties of this modified transform, especially a continuity property of line detection methods, were studied. We focus on a space of image containing several sizes of circles. A metric function on that space is proposed to measure the change of images. By this new transformation, continuity property is obtained.","PeriodicalId":433638,"journal":{"name":"2022 14th International Conference on Knowledge and Smart Technology (KST)","volume":"180 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-01-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123008239","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Unsupervised concept identification from a large corpus of research documents 从大量研究文献的语料库中进行无监督概念识别
Pub Date : 2022-01-26 DOI: 10.1109/KST53302.2022.9729060
Watcharachat Plangsri, Nalina Phisanbut, P. Piamsa-nga
Research documents play a crucial role in data-driven research. Identifying concepts in a corpus of research documents can lead to a better understanding of the current stage of research. It can reveal fruitful concepts hidden inside the corpus. However, manually analyzing the corpus is laborious and inefficient. Automating the process is challenging due to the lack of background knowledge to fill the semantic gap that exists between humans and machines. To address this issue, we introduce a novel method that leverages information from an online open resource, namely Wikipedia, to build background knowledge automatically. An experiment on a set of 13,636 research documents shows that the framework can effectively and efficiently identify broad range of concepts within a large text corpus by exploiting only Wikipedia categories and documents' titles.
研究文件在数据驱动的研究中起着至关重要的作用。识别研究文件语料库中的概念可以更好地理解当前的研究阶段。它可以揭示隐藏在语料库中的富有成果的概念。然而,手工分析语料库是费力和低效的。由于缺乏背景知识来填补人与机器之间存在的语义差距,因此自动化过程具有挑战性。为了解决这个问题,我们引入了一种利用在线开放资源(即维基百科)的信息自动构建背景知识的新方法。一组13,636个研究文档的实验表明,该框架可以通过仅利用维基百科的分类和文档标题,有效地识别大型文本语料库中的广泛概念。
{"title":"Unsupervised concept identification from a large corpus of research documents","authors":"Watcharachat Plangsri, Nalina Phisanbut, P. Piamsa-nga","doi":"10.1109/KST53302.2022.9729060","DOIUrl":"https://doi.org/10.1109/KST53302.2022.9729060","url":null,"abstract":"Research documents play a crucial role in data-driven research. Identifying concepts in a corpus of research documents can lead to a better understanding of the current stage of research. It can reveal fruitful concepts hidden inside the corpus. However, manually analyzing the corpus is laborious and inefficient. Automating the process is challenging due to the lack of background knowledge to fill the semantic gap that exists between humans and machines. To address this issue, we introduce a novel method that leverages information from an online open resource, namely Wikipedia, to build background knowledge automatically. An experiment on a set of 13,636 research documents shows that the framework can effectively and efficiently identify broad range of concepts within a large text corpus by exploiting only Wikipedia categories and documents' titles.","PeriodicalId":433638,"journal":{"name":"2022 14th International Conference on Knowledge and Smart Technology (KST)","volume":"46 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-01-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114685097","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2022 14th International Conference on Knowledge and Smart Technology (KST)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1