首页 > 最新文献

2021 16th International Joint Symposium on Artificial Intelligence and Natural Language Processing (iSAI-NLP)最新文献

英文 中文
Personal Data Matters: New Opportunities from Lifelogs 个人数据问题:来自Lifelogs的新机遇
C. Gurrin
In this position paper, the motivation for, and early progress in lifelog analytics and retrieval are presented. Early progress in the field is reviewed with a specific focus on the challenge of developing personal Memex-style lifelog search engines.
在这篇立场文件中,介绍了生命日志分析和检索的动机和早期进展。回顾了该领域的早期进展,并特别关注开发个人memex风格的生活日志搜索引擎的挑战。
{"title":"Personal Data Matters: New Opportunities from Lifelogs","authors":"C. Gurrin","doi":"10.1109/iSAI-NLP54397.2021.9678155","DOIUrl":"https://doi.org/10.1109/iSAI-NLP54397.2021.9678155","url":null,"abstract":"In this position paper, the motivation for, and early progress in lifelog analytics and retrieval are presented. Early progress in the field is reviewed with a specific focus on the challenge of developing personal Memex-style lifelog search engines.","PeriodicalId":339826,"journal":{"name":"2021 16th International Joint Symposium on Artificial Intelligence and Natural Language Processing (iSAI-NLP)","volume":"38 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127130991","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Thai Language Tweet Emotion Prediction based on Use of Emojis 基于表情符号使用的泰语推文情感预测
S. Kongyoung, Kanokorn Trakultaweekoon, A. Rugchatjaroen
Thai Language can be handled/considered in the same group of Chinese and Japanese where no explicit spaces exist between words. This article presents a work on the emotional identification of tweets based on the use of emojis which focuses on a Thai language context. The use of emojis in user tweets indicates the writer’s emotions. The first phase of this study was to collect Thai tweets, clean them, and then to make a primary classification of the emojis into groups using K-mean clustering. These group clusters are used as target outputs for the prediction of emoji classes. It was found that 22 is the appropriate K for considering 70 emojis for a collected set of tweets. The corpus includes any level of Thai language usage, which means that the processed data can consist of suffixes, slang, and unknown word from tokenization process. The vector representation advances the unknown accent. In sum, this research created a corpus of short messages collected from Twitter which were grouped into 22 emoji-classes. The corpus includes 7,825,857 messages prepared for classification based on emotions by applying 2 biLSTM layers. A table of emojis is proposed based on Ekman’s six basic emotions: anger, disgust, fear, joy, sadness, and surprise were evaluated in both objective and subjective tests. The results show that word vectors work well for the classification of emotions through the use of emojis.
泰语可以在中文和日语的同一组中处理/考虑,单词之间没有明显的空格。本文介绍了一项基于表情符号使用的推文情感识别工作,重点关注泰语语境。用户在推特中使用表情符号表明了作者的情绪。这项研究的第一阶段是收集泰国的推文,清理它们,然后使用k -均值聚类对表情符号进行初步分类。这些组簇被用作预测表情符号类别的目标输出。研究发现,在一组推文中考虑70个表情符号时,22是合适的K。语料库包括任何级别的泰语用法,这意味着处理的数据可以由词缀、俚语和来自标记化过程的未知单词组成。向量表示推进未知重音。总之,本研究创建了一个从Twitter上收集的短信语料库,将其分为22个表情符号类别。该语料库包括7,825,857条消息,通过应用2个biLSTM层进行基于情绪的分类。根据Ekman的六种基本情绪:愤怒、厌恶、恐惧、喜悦、悲伤和惊讶,提出了表情符号表,并在客观和主观测试中进行了评估。结果表明,单词向量可以很好地通过使用表情符号对情绪进行分类。
{"title":"Thai Language Tweet Emotion Prediction based on Use of Emojis","authors":"S. Kongyoung, Kanokorn Trakultaweekoon, A. Rugchatjaroen","doi":"10.1109/iSAI-NLP54397.2021.9678160","DOIUrl":"https://doi.org/10.1109/iSAI-NLP54397.2021.9678160","url":null,"abstract":"Thai Language can be handled/considered in the same group of Chinese and Japanese where no explicit spaces exist between words. This article presents a work on the emotional identification of tweets based on the use of emojis which focuses on a Thai language context. The use of emojis in user tweets indicates the writer’s emotions. The first phase of this study was to collect Thai tweets, clean them, and then to make a primary classification of the emojis into groups using K-mean clustering. These group clusters are used as target outputs for the prediction of emoji classes. It was found that 22 is the appropriate K for considering 70 emojis for a collected set of tweets. The corpus includes any level of Thai language usage, which means that the processed data can consist of suffixes, slang, and unknown word from tokenization process. The vector representation advances the unknown accent. In sum, this research created a corpus of short messages collected from Twitter which were grouped into 22 emoji-classes. The corpus includes 7,825,857 messages prepared for classification based on emotions by applying 2 biLSTM layers. A table of emojis is proposed based on Ekman’s six basic emotions: anger, disgust, fear, joy, sadness, and surprise were evaluated in both objective and subjective tests. The results show that word vectors work well for the classification of emotions through the use of emojis.","PeriodicalId":339826,"journal":{"name":"2021 16th International Joint Symposium on Artificial Intelligence and Natural Language Processing (iSAI-NLP)","volume":"114 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123363258","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Breast Cancer Detection using IR-UWB with Deep Learning 基于深度学习的IR-UWB乳腺癌检测
Mawin Khumdee, Pongpol Assawaroongsakul, P. Phasukkit, Nongluck Houngkamhang
This paper proposes breast cancer positioning detection using the IR-UWB system with deep learning, which is an interesting alternative method. When compared to ultrasound, x-ray mammogram, and CT-scan, there are several advantages to using IR-UWB, including low cost, less energy required, less long-term effect, portability, and providing much more breast cancer screening access for patients. Nowadays, the IR-UWB system has many techniques for processing IR-UWB signals, and one of the most interesting technique is using deep learning. In this study, we collected data from nine IR-UWB antennas. Then, the prepared data is fed through Deep Neural Networks to find the hidden patterns of signal and predict the cancer position which are 16 of breast cancer positions and one of undetected, also known as 17 classes. The model gave an average accuracy up to 95.60%.
本文提出利用IR-UWB系统结合深度学习进行乳腺癌定位检测,这是一种有趣的替代方法。与超声波、x光乳房x线照片和ct扫描相比,使用IR-UWB有几个优点,包括成本低、所需能源少、长期影响小、便携性好,并为患者提供更多的乳腺癌筛查机会。目前,红外-超宽带系统有许多处理红外-超宽带信号的技术,其中最有趣的技术之一是使用深度学习。在这项研究中,我们收集了9个IR-UWB天线的数据。然后,将准备好的数据通过深度神经网络(Deep Neural Networks)进行输入,发现信号的隐藏模式,并预测出16个乳腺癌位置和1个未被发现的癌症位置,也称为17类。该模型的平均准确率高达95.60%。
{"title":"Breast Cancer Detection using IR-UWB with Deep Learning","authors":"Mawin Khumdee, Pongpol Assawaroongsakul, P. Phasukkit, Nongluck Houngkamhang","doi":"10.1109/iSAI-NLP54397.2021.9678158","DOIUrl":"https://doi.org/10.1109/iSAI-NLP54397.2021.9678158","url":null,"abstract":"This paper proposes breast cancer positioning detection using the IR-UWB system with deep learning, which is an interesting alternative method. When compared to ultrasound, x-ray mammogram, and CT-scan, there are several advantages to using IR-UWB, including low cost, less energy required, less long-term effect, portability, and providing much more breast cancer screening access for patients. Nowadays, the IR-UWB system has many techniques for processing IR-UWB signals, and one of the most interesting technique is using deep learning. In this study, we collected data from nine IR-UWB antennas. Then, the prepared data is fed through Deep Neural Networks to find the hidden patterns of signal and predict the cancer position which are 16 of breast cancer positions and one of undetected, also known as 17 classes. The model gave an average accuracy up to 95.60%.","PeriodicalId":339826,"journal":{"name":"2021 16th International Joint Symposium on Artificial Intelligence and Natural Language Processing (iSAI-NLP)","volume":"22 8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125686559","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
The Multiple Objectives Flexible Jobshop Scheduling Using Reinforcement Learning 基于强化学习的多目标柔性作业车间调度
Thanaphut Khuntiyaporn, Pokpong Songmuang, W. Limprasert
Jobshop Scheduling Problem is a classic complex problem in every field, such as education, business, and daily life. This problem has been changed due to the changing of problem space. For this reason, JSP problems are categorized into many different types, which consist of The General Jobshop Scheduling (GJSP), The Flexible Jobshop Scheduling (FJSP) and The Multiple-routes Jobshop Scheduling (MrJSP). However, most of the research that tries to solve the JSP problem has focused on the shortest makespan scheduling. Still, sometimes the minimum makespan can be led to very high operating costs, which have a significant impact on operating results. Therefore, the Multiple-objectives Flexible Jobshop Scheduling Problem (M-FJSP) become the focused problem in this research. The proposed method is a Reinforcement Learning Model (RL) with a Q-Learning algorithm. The experimental dataset uses data from the OR-Library, which is the collection for a variety of Operation Research (OR) problems. Our proposed models will be compared between the three different states definition in which we expect the meta-heuristic model will be the best performance model.
作业车间调度问题是一个经典的复杂问题,存在于教育、商业和日常生活等各个领域。由于问题空间的变化,这个问题也发生了变化。由于这个原因,JSP问题被分为许多不同的类型,包括通用作业车间调度(GJSP)、灵活作业车间调度(FJSP)和多路由作业车间调度(MrJSP)。然而,大多数试图解决JSP问题的研究都集中在最短完工时间调度上。然而,有时最小的最大作业时间可能会导致非常高的运营成本,从而对运营结果产生重大影响。因此,多目标柔性作业车间调度问题(M-FJSP)成为本文研究的重点问题。该方法是一种带有Q-Learning算法的强化学习模型(RL)。实验数据集使用OR库中的数据,该库是各种运筹学(OR)问题的集合。我们提出的模型将在三种不同的状态定义之间进行比较,我们预计元启发式模型将是最佳性能模型。
{"title":"The Multiple Objectives Flexible Jobshop Scheduling Using Reinforcement Learning","authors":"Thanaphut Khuntiyaporn, Pokpong Songmuang, W. Limprasert","doi":"10.1109/iSAI-NLP54397.2021.9678152","DOIUrl":"https://doi.org/10.1109/iSAI-NLP54397.2021.9678152","url":null,"abstract":"Jobshop Scheduling Problem is a classic complex problem in every field, such as education, business, and daily life. This problem has been changed due to the changing of problem space. For this reason, JSP problems are categorized into many different types, which consist of The General Jobshop Scheduling (GJSP), The Flexible Jobshop Scheduling (FJSP) and The Multiple-routes Jobshop Scheduling (MrJSP). However, most of the research that tries to solve the JSP problem has focused on the shortest makespan scheduling. Still, sometimes the minimum makespan can be led to very high operating costs, which have a significant impact on operating results. Therefore, the Multiple-objectives Flexible Jobshop Scheduling Problem (M-FJSP) become the focused problem in this research. The proposed method is a Reinforcement Learning Model (RL) with a Q-Learning algorithm. The experimental dataset uses data from the OR-Library, which is the collection for a variety of Operation Research (OR) problems. Our proposed models will be compared between the three different states definition in which we expect the meta-heuristic model will be the best performance model.","PeriodicalId":339826,"journal":{"name":"2021 16th International Joint Symposium on Artificial Intelligence and Natural Language Processing (iSAI-NLP)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129799686","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
The comparison of the proposed recommended system with actual data 提出的推荐系统与实际数据的比较
L. Kovavisaruch, T. Sanpechuda
Recommendation systems for the museum have been active in the past decade. It used to be a difficult task to make the personalized recommended list for museum-goer. However, with the current technology, research can provide the list for visitors via technology such as mobile applications. We have proposed a recommendation system based on social filtering and statistical methods in the previous paper. This paper applies the F1-score to evaluate our recommendation methods on the actual visitor loggers from Chao sampradaya national museum. We compare the social filtering method with the statistical method and benchmark with the random recommendation. In comparison, the statistical method gives the same result as social filtering when the time is limited. The longer time the visitor spends in the museum, the better result from the social filtering. However, in terms of calculation complexity, the statistical method outperforms social filtering.
博物馆的推荐系统在过去十年一直很活跃。过去,为参观博物馆的人制作个性化的推荐名单是一项艰巨的任务。然而,以目前的技术,研究可以通过诸如移动应用程序等技术为访问者提供列表。我们在之前的文章中提出了一种基于社会过滤和统计方法的推荐系统。本文运用f1分值对我们的推荐方法对Chao sampradaya国立博物馆的实际游客记录者进行了评价。将社会过滤方法与统计方法进行比较,将基准测试方法与随机推荐方法进行比较。相比之下,在时间有限的情况下,统计方法得到的结果与社会过滤相同。参观者在博物馆停留的时间越长,社会过滤效果越好。然而,在计算复杂度方面,统计方法优于社会过滤。
{"title":"The comparison of the proposed recommended system with actual data","authors":"L. Kovavisaruch, T. Sanpechuda","doi":"10.1109/iSAI-NLP54397.2021.9678151","DOIUrl":"https://doi.org/10.1109/iSAI-NLP54397.2021.9678151","url":null,"abstract":"Recommendation systems for the museum have been active in the past decade. It used to be a difficult task to make the personalized recommended list for museum-goer. However, with the current technology, research can provide the list for visitors via technology such as mobile applications. We have proposed a recommendation system based on social filtering and statistical methods in the previous paper. This paper applies the F1-score to evaluate our recommendation methods on the actual visitor loggers from Chao sampradaya national museum. We compare the social filtering method with the statistical method and benchmark with the random recommendation. In comparison, the statistical method gives the same result as social filtering when the time is limited. The longer time the visitor spends in the museum, the better result from the social filtering. However, in terms of calculation complexity, the statistical method outperforms social filtering.","PeriodicalId":339826,"journal":{"name":"2021 16th International Joint Symposium on Artificial Intelligence and Natural Language Processing (iSAI-NLP)","volume":"46 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116531961","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Low Computation and Real-Time Shoe Detection with Timestamp for Production Tracking in Shoe Manufacturing 基于时间戳的低计算实时鞋品检测技术在鞋品生产跟踪中的应用
Tith Vong, C. Jeenanunta, Apinun Tunpan, Nisit Sirimarnkit
Production planners could not get the update on the actual number of products in real-time. They do not realize the unmatched production until a few days later. Thus, the planners need to revise their production plan with reserve capacity for this unmatched production, and it causes manufacturing to waste a lot of time and money. The production outcome is usually manually counted at the end of the day and recorded on paper. This paper proposes an image processing system for counting products with a timestamp. The YOLOv4-tiny and DNN-OpenCV are utilized to detect an object. The detected object will be counted using the intersection detection and tesseract engine to extract time from the video. The object detection is trained using the 10 folds technique with 106 object photos. The proposed approach is tested with 8 videos for counting accuracy and timestamp accuracy. The testing result reveals that our proposed method achieves 100% of object counting and timestamp accuracy of 80 % compared with the manual counting with the timestamp. The proposed technique is suitable for counting objects with timestamps in real-time.
生产计划人员无法实时获得产品实际数量的最新信息。直到几天后,他们才意识到这是无与伦比的生产。因此,计划者需要修改他们的生产计划,保留这种不匹配的生产能力,这导致制造业浪费了大量的时间和金钱。生产结果通常在一天结束时手工计算并记录在纸上。提出了一种带时间戳的产品计数图像处理系统。YOLOv4-tiny和DNN-OpenCV用于检测目标。检测到的目标将被计数使用交集检测和tesseract引擎从视频中提取时间。使用10折叠技术对106张目标照片进行目标检测训练。用8个视频测试了该方法的计数精度和时间戳精度。测试结果表明,与使用时间戳进行人工计数相比,该方法的目标计数准确率达到100%,时间戳准确率达到80%。该方法适用于对带有时间戳的对象进行实时计数。
{"title":"The Low Computation and Real-Time Shoe Detection with Timestamp for Production Tracking in Shoe Manufacturing","authors":"Tith Vong, C. Jeenanunta, Apinun Tunpan, Nisit Sirimarnkit","doi":"10.1109/iSAI-NLP54397.2021.9678163","DOIUrl":"https://doi.org/10.1109/iSAI-NLP54397.2021.9678163","url":null,"abstract":"Production planners could not get the update on the actual number of products in real-time. They do not realize the unmatched production until a few days later. Thus, the planners need to revise their production plan with reserve capacity for this unmatched production, and it causes manufacturing to waste a lot of time and money. The production outcome is usually manually counted at the end of the day and recorded on paper. This paper proposes an image processing system for counting products with a timestamp. The YOLOv4-tiny and DNN-OpenCV are utilized to detect an object. The detected object will be counted using the intersection detection and tesseract engine to extract time from the video. The object detection is trained using the 10 folds technique with 106 object photos. The proposed approach is tested with 8 videos for counting accuracy and timestamp accuracy. The testing result reveals that our proposed method achieves 100% of object counting and timestamp accuracy of 80 % compared with the manual counting with the timestamp. The proposed technique is suitable for counting objects with timestamps in real-time.","PeriodicalId":339826,"journal":{"name":"2021 16th International Joint Symposium on Artificial Intelligence and Natural Language Processing (iSAI-NLP)","volume":"49 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126083483","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Bilingual IT Service Desk Ticket Classification Using Language Model Pre-training Techniques 基于语言模型预训练技术的双语IT服务台票证分类
Language model pre-training techniques have been successfully applied to several natural language processing and text-mining tasks. However, existing published studies regarding automatic IT service desk ticket categorization were mostly conducted using the traditional bag-of-words (BoW) model and focused on the tickets that contain only one language. Therefore, this paper presents an examination of applying the state-of-the-art language model pre-training approaches to automatically determine the service category of bilingual IT service desk tickets, particularly for those tickets that contain Thai and/or English texts. Three well-known algorithms, mBERT, ULMFiT, and XLM-R, are investigated in this study using an in-house real-world dataset. Three Ensemble methods with bag-of-words text representation are used as performance evaluation baselines. According to our experimental results, language model pre-training techniques are superior to the BoW-based Ensemble methods for bilingual IT ticket categorization tasks. XLM-R gives the highest overall performance at 87.02% accuracy and 86.96% F1-score on the test dataset, followed by ULMFiT, mBERT and Ensemble methods, respectively
语言模型预训练技术已经成功地应用于一些自然语言处理和文本挖掘任务中。然而,现有发表的关于IT服务台票证自动分类的研究大多是使用传统的词袋(BoW)模型进行的,并且主要关注只包含一种语言的票证。因此,本文提出了一项应用最先进的语言模型预训练方法来自动确定双语IT服务台票的服务类别的研究,特别是那些包含泰语和/或英语文本的票。本研究使用内部真实数据集对mBERT、ULMFiT和XLM-R这三种知名算法进行了研究。采用三种具有词袋文本表示的集成方法作为性能评估基准。根据实验结果,语言模型预训练技术在双语IT票据分类任务中优于基于bow的集成方法。在测试数据集上,XLM-R给出了最高的整体性能,准确率为87.02%,f1得分为86.96%,其次是ULMFiT、mBERT和Ensemble方法
{"title":"Bilingual IT Service Desk Ticket Classification Using Language Model Pre-training Techniques","authors":"","doi":"10.1109/iSAI-NLP54397.2021.9678179","DOIUrl":"https://doi.org/10.1109/iSAI-NLP54397.2021.9678179","url":null,"abstract":"Language model pre-training techniques have been successfully applied to several natural language processing and text-mining tasks. However, existing published studies regarding automatic IT service desk ticket categorization were mostly conducted using the traditional bag-of-words (BoW) model and focused on the tickets that contain only one language. Therefore, this paper presents an examination of applying the state-of-the-art language model pre-training approaches to automatically determine the service category of bilingual IT service desk tickets, particularly for those tickets that contain Thai and/or English texts. Three well-known algorithms, mBERT, ULMFiT, and XLM-R, are investigated in this study using an in-house real-world dataset. Three Ensemble methods with bag-of-words text representation are used as performance evaluation baselines. According to our experimental results, language model pre-training techniques are superior to the BoW-based Ensemble methods for bilingual IT ticket categorization tasks. XLM-R gives the highest overall performance at 87.02% accuracy and 86.96% F1-score on the test dataset, followed by ULMFiT, mBERT and Ensemble methods, respectively","PeriodicalId":339826,"journal":{"name":"2021 16th International Joint Symposium on Artificial Intelligence and Natural Language Processing (iSAI-NLP)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129352783","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Partial Facial Identification using Transfer Learning Technique 基于迁移学习技术的部分人脸识别
S. Watcharabutsarakham, S. Marukatat, Supphachoke Suntiwichaya, Chanchai Junlouchai
In today’s world, people go outside wearing a face mask, so face detection and face recognition models need to take this into account. Facial recognition has been researched widely with various algorithms. Since the coronavirus disease of 2019 (COVID-19) outbreak has spread across Thailand, our use of face recognition models has reminded people to wear a face mask. This is because when people go outside, they are likely to be exposed to facial image detection and classification methods which are used for authentication and authorization. In this paper, we use transfer learning such as YOLOv3 and training with public datasets and donation datasets. Our models can recognize faces with a 98.7% accuracy rate and identify faces including those with face masks-with a 92.7% accuracy rate.
在当今世界,人们出门都戴着口罩,所以人脸检测和人脸识别模型需要考虑到这一点。人脸识别已经得到了广泛的研究,使用了各种算法。自2019年冠状病毒病(COVID-19)疫情在泰国蔓延以来,我们使用人脸识别模型提醒人们戴口罩。这是因为当人们外出时,他们很可能会接触到用于身份验证和授权的面部图像检测和分类方法。在本文中,我们使用迁移学习,如YOLOv3,并使用公共数据集和捐赠数据集进行训练。我们的模型识别人脸的准确率为98.7%,识别包括戴口罩在内的人脸的准确率为92.7%。
{"title":"Partial Facial Identification using Transfer Learning Technique","authors":"S. Watcharabutsarakham, S. Marukatat, Supphachoke Suntiwichaya, Chanchai Junlouchai","doi":"10.1109/iSAI-NLP54397.2021.9678189","DOIUrl":"https://doi.org/10.1109/iSAI-NLP54397.2021.9678189","url":null,"abstract":"In today’s world, people go outside wearing a face mask, so face detection and face recognition models need to take this into account. Facial recognition has been researched widely with various algorithms. Since the coronavirus disease of 2019 (COVID-19) outbreak has spread across Thailand, our use of face recognition models has reminded people to wear a face mask. This is because when people go outside, they are likely to be exposed to facial image detection and classification methods which are used for authentication and authorization. In this paper, we use transfer learning such as YOLOv3 and training with public datasets and donation datasets. Our models can recognize faces with a 98.7% accuracy rate and identify faces including those with face masks-with a 92.7% accuracy rate.","PeriodicalId":339826,"journal":{"name":"2021 16th International Joint Symposium on Artificial Intelligence and Natural Language Processing (iSAI-NLP)","volume":"56 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128465523","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
KaleCare: Smart Farm for Kale with Pests Detection System using Machine Learning KaleCare:使用机器学习的甘蓝害虫检测系统的智能农场
Natthaphon Tachai, Perapat Yato, Teerachai Muangpan, Krittakom Srijiranon, Narissara Eiamkanitchat
Kale is a popular ingredient in Thai cuisine and can be grown year-round. However, kale requires particular care, especially pests. Therefore, this study applies the Internet of Things to propose the KaleCare, a smart farm management system for kale with four main functions including automatic watering based on weather forecasting, automatic fertilizing, reporting, and pest detection for cutworms, and aphids. There are three processes to create the pest classification models for pest detection function. Firstly, the raw images were applied to the GrabCut to remove the background. Secondly, data augmentation was applied to generate images due to the small amount of raw data. Finally, the modified GoogLeNet reduced the original GoogLeNet structure is proposed to classify both types of pests. The experimental results show that the proposed model outperforms with 0.8903 and 0.7959 in average classification rate and 0.886 and 0.7965 in average F1-score to classify cutworm and aphid, respectively.
羽衣甘蓝是泰国菜中很受欢迎的食材,全年都可以种植。然而,羽衣甘蓝需要特别照顾,尤其是害虫。因此,本研究运用物联网技术,提出了羽衣甘蓝智能农场管理系统KaleCare,该系统具有基于天气预报的自动浇水、自动施肥、自动报告、毛虫和蚜虫害虫检测等四大功能。创建害虫检测功能的害虫分类模型有三个过程。首先,将原始图像应用于GrabCut,去除背景;其次,由于原始数据较少,采用数据增强方法生成图像。最后,对原有的GoogLeNet结构进行了简化,提出了改进的GoogLeNet结构对两类害虫进行分类。实验结果表明,该模型对线虫和蚜虫的平均分类率分别为0.8903和0.7959,平均f1评分分别为0.886和0.7965。
{"title":"KaleCare: Smart Farm for Kale with Pests Detection System using Machine Learning","authors":"Natthaphon Tachai, Perapat Yato, Teerachai Muangpan, Krittakom Srijiranon, Narissara Eiamkanitchat","doi":"10.1109/iSAI-NLP54397.2021.9678178","DOIUrl":"https://doi.org/10.1109/iSAI-NLP54397.2021.9678178","url":null,"abstract":"Kale is a popular ingredient in Thai cuisine and can be grown year-round. However, kale requires particular care, especially pests. Therefore, this study applies the Internet of Things to propose the KaleCare, a smart farm management system for kale with four main functions including automatic watering based on weather forecasting, automatic fertilizing, reporting, and pest detection for cutworms, and aphids. There are three processes to create the pest classification models for pest detection function. Firstly, the raw images were applied to the GrabCut to remove the background. Secondly, data augmentation was applied to generate images due to the small amount of raw data. Finally, the modified GoogLeNet reduced the original GoogLeNet structure is proposed to classify both types of pests. The experimental results show that the proposed model outperforms with 0.8903 and 0.7959 in average classification rate and 0.886 and 0.7965 in average F1-score to classify cutworm and aphid, respectively.","PeriodicalId":339826,"journal":{"name":"2021 16th International Joint Symposium on Artificial Intelligence and Natural Language Processing (iSAI-NLP)","volume":"229 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116435030","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
LimeSoda: Dataset for Fake News Detection in Healthcare Domain LimeSoda:医疗保健领域假新闻检测数据集
Patomporn Payoungkhamdee, Peerachet Porkaew, Atthasith Sinthunyathum, Phattharaphon Songphum, Witsarut Kawidam, Wichayut Loha-Udom, P. Boonkwan, Vipas Sutantayawalee
In this paper, we present our Thai fake news dataset in the healthcare domain, LIMESODA, with the construction guideline. Each document in the dataset is classified as fact, fake, or undefined. Moreover, we also provide token-level annotations for validating classifier decisions. Five high-level annotation tags1 are 1) misleading headline 2) imposter 3) fabrication 4) false connection and 5) misleading content. We curate and manually annotated 7,191 documents with these tags. We evaluate our dataset with two deep learning approaches; RNN and Transformer baselines and analyzed token-level contributions to understand model behaviors. For the RNN model, we use the attention weights as token-level contributions. For Transformer models, we use the integrated gradient method at the embedding layers. We finally compared these token-level contributions with human annotations. Although our baseline models yield promising performances, we found that tokens that support model decisions are quite different from human annotation.
在本文中,我们提出了我们在医疗保健领域的泰国假新闻数据集,LIMESODA,以及构建指南。数据集中的每个文档被分类为事实、虚假或未定义。此外,我们还提供了用于验证分类器决策的令牌级注释。五个高级注释标签1)误导性标题2)冒名顶替者3)捏造4)虚假联系5)误导性内容。我们用这些标签整理和手动注释了7191个文档。我们用两种深度学习方法来评估我们的数据集;RNN和Transformer基线,并分析了令牌级别的贡献,以理解模型行为。对于RNN模型,我们使用注意力权重作为令牌级别的贡献。对于Transformer模型,我们在嵌入层上使用积分梯度方法。最后,我们将这些标记级贡献与人工注释进行了比较。尽管我们的基线模型产生了很好的性能,但我们发现支持模型决策的标记与人类注释有很大的不同。
{"title":"LimeSoda: Dataset for Fake News Detection in Healthcare Domain","authors":"Patomporn Payoungkhamdee, Peerachet Porkaew, Atthasith Sinthunyathum, Phattharaphon Songphum, Witsarut Kawidam, Wichayut Loha-Udom, P. Boonkwan, Vipas Sutantayawalee","doi":"10.1109/iSAI-NLP54397.2021.9678187","DOIUrl":"https://doi.org/10.1109/iSAI-NLP54397.2021.9678187","url":null,"abstract":"In this paper, we present our Thai fake news dataset in the healthcare domain, LIMESODA, with the construction guideline. Each document in the dataset is classified as fact, fake, or undefined. Moreover, we also provide token-level annotations for validating classifier decisions. Five high-level annotation tags1 are 1) misleading headline 2) imposter 3) fabrication 4) false connection and 5) misleading content. We curate and manually annotated 7,191 documents with these tags. We evaluate our dataset with two deep learning approaches; RNN and Transformer baselines and analyzed token-level contributions to understand model behaviors. For the RNN model, we use the attention weights as token-level contributions. For Transformer models, we use the integrated gradient method at the embedding layers. We finally compared these token-level contributions with human annotations. Although our baseline models yield promising performances, we found that tokens that support model decisions are quite different from human annotation.","PeriodicalId":339826,"journal":{"name":"2021 16th International Joint Symposium on Artificial Intelligence and Natural Language Processing (iSAI-NLP)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128986795","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
期刊
2021 16th International Joint Symposium on Artificial Intelligence and Natural Language Processing (iSAI-NLP)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1