首页 > 最新文献

2022 17th International Joint Symposium on Artificial Intelligence and Natural Language Processing (iSAI-NLP)最新文献

英文 中文
International Joint Conference 2022 2022年国际联席会议
{"title":"International Joint Conference 2022","authors":"","doi":"10.1109/isai-nlp56921.2022.9960258","DOIUrl":"https://doi.org/10.1109/isai-nlp56921.2022.9960258","url":null,"abstract":"","PeriodicalId":399019,"journal":{"name":"2022 17th International Joint Symposium on Artificial Intelligence and Natural Language Processing (iSAI-NLP)","volume":"42 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128425217","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Shrimp-growth Estimation Based on ResNeXt for an Automatic Feeding-tray Lifting System Used in Shrimp Farming 基于ResNeXt的对虾养殖自动投料盘提升系统对虾生长估算
Chanon Nontarit, T. Kondo, Warakorn Khamkaew, Jaroenmit Woradet, Jessada Karnjana
The shrimp agriculturists monitor shrimp growth by observing the size of shrimps in the feeding tray with the naked eye. This approach is time-consuming and needs experienced workers. This study proposes an automatic approach for estimating shrimp size using images. A mask region-based convolutional neural network with ResNeXt was trained to detect shrimps in an image. The detection model achieved an overall precision of 74.45%, recall of 72.20%, Fl score of 73.31 %, and AP of 69.04%. The two unique methods were proposed for estimating shrimp size. The first method achieved a mean absolute error of 0.30 cm and a mean absolute percentage error of 3.97%. The second method achieved a mean absolute error of 0.35 cm and a mean absolute percentage error of 4.59%. The proposed system achieved an automatic shrimp size estimation from the image and provided helpful information for agriculturists.
虾农通过肉眼观察饲养盘中虾的大小来监测虾的生长。这种方法耗时且需要有经验的工人。本研究提出了一种利用图像自动估计虾大小的方法。利用ResNeXt对基于掩模区域的卷积神经网络进行训练,检测图像中的虾类。该检测模型的总体准确率为74.45%,召回率为72.20%,Fl评分为73.31%,AP评分为69.04%。提出了两种独特的估算虾大小的方法。第一种方法的平均绝对误差为0.30 cm,平均绝对百分比误差为3.97%。第二种方法的平均绝对误差为0.35 cm,平均绝对百分比误差为4.59%。该系统实现了对虾大小的自动估计,为农学家提供了有用的信息。
{"title":"Shrimp-growth Estimation Based on ResNeXt for an Automatic Feeding-tray Lifting System Used in Shrimp Farming","authors":"Chanon Nontarit, T. Kondo, Warakorn Khamkaew, Jaroenmit Woradet, Jessada Karnjana","doi":"10.1109/iSAI-NLP56921.2022.9960243","DOIUrl":"https://doi.org/10.1109/iSAI-NLP56921.2022.9960243","url":null,"abstract":"The shrimp agriculturists monitor shrimp growth by observing the size of shrimps in the feeding tray with the naked eye. This approach is time-consuming and needs experienced workers. This study proposes an automatic approach for estimating shrimp size using images. A mask region-based convolutional neural network with ResNeXt was trained to detect shrimps in an image. The detection model achieved an overall precision of 74.45%, recall of 72.20%, Fl score of 73.31 %, and AP of 69.04%. The two unique methods were proposed for estimating shrimp size. The first method achieved a mean absolute error of 0.30 cm and a mean absolute percentage error of 3.97%. The second method achieved a mean absolute error of 0.35 cm and a mean absolute percentage error of 4.59%. The proposed system achieved an automatic shrimp size estimation from the image and provided helpful information for agriculturists.","PeriodicalId":399019,"journal":{"name":"2022 17th International Joint Symposium on Artificial Intelligence and Natural Language Processing (iSAI-NLP)","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128197532","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Design and Construct Quadcopter to Detect Wild Elephant to Alert 设计和建造四轴飞行器探测野象警报
Jiranuwat Piriyasupakij, Ratchada Prasitphan
The encroachment of wild elephants in farming areas and villagers' habitats has caused conflicts among communities resulting to violence in preventing and driving elephants out of the area. The organizers have seen the impact of this problem and aims to solve it by designing and building a prototype Quadcopter that can survey and detect these wild elephants to warn villages and prevent further damage. The model takes crisp image of the detected wild elephants at a distance of 1M to 80% and alerts the forest technicians via the application. The test results showed that there were problems in the controls for it requires GPS signal. In the future, improvements and fixes in various parts shall be made to further develop innovative approaches that solve real-world problems.
野生大象对农区和村民栖息地的侵犯引发了社区之间的冲突,导致防止和驱逐大象的暴力行为。组织者已经看到了这个问题的影响,他们的目标是通过设计和建造一个原型四轴飞行器来解决这个问题,这个飞行器可以调查和探测这些野生大象,向村庄发出警告,防止进一步的破坏。该模型对被探测到的野生大象在1M到80%的距离内进行清晰成像,并通过应用程序提醒森林技术人员。测试结果表明,由于需要GPS信号,控制存在问题。在未来,需要对各个部分进行改进和修复,以进一步开发解决现实问题的创新方法。
{"title":"Design and Construct Quadcopter to Detect Wild Elephant to Alert","authors":"Jiranuwat Piriyasupakij, Ratchada Prasitphan","doi":"10.1109/iSAI-NLP56921.2022.9960248","DOIUrl":"https://doi.org/10.1109/iSAI-NLP56921.2022.9960248","url":null,"abstract":"The encroachment of wild elephants in farming areas and villagers' habitats has caused conflicts among communities resulting to violence in preventing and driving elephants out of the area. The organizers have seen the impact of this problem and aims to solve it by designing and building a prototype Quadcopter that can survey and detect these wild elephants to warn villages and prevent further damage. The model takes crisp image of the detected wild elephants at a distance of 1M to 80% and alerts the forest technicians via the application. The test results showed that there were problems in the controls for it requires GPS signal. In the future, improvements and fixes in various parts shall be made to further develop innovative approaches that solve real-world problems.","PeriodicalId":399019,"journal":{"name":"2022 17th International Joint Symposium on Artificial Intelligence and Natural Language Processing (iSAI-NLP)","volume":"54 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126649816","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Construction of Thai WordNet through Translation Equivalence 通过翻译对等构建泰语词网
Dhanon Leenoi, Alongkorn Alongkornchai, Akkharawoot Takhom, P. Boonkwan, Thenchai Sunnithi
WordNet is a crucial language resource associated with artificial intelligence activities, for instance, constructing building models for advancement of computational linguistics and natural language processing, or representing statistical insights through knowledge graphs that emulate cognition and human understanding. Thai WordNet has been developed in many approaches, e.g., a merge approach in gold standard, and semi-auto construction with a bilingual dictionary. However, existing Thai WordNet is not easy to find words fit with the definition of synsets; and cover cultural gaps between the different languages of which needed to be aware. This paper presents a methodology of Translation Equivalence in order to construct Thai language resource, called LST22 Thai WordNet.
WordNet是与人工智能活动相关的重要语言资源,例如,构建用于推进计算语言学和自然语言处理的建筑模型,或通过模拟认知和人类理解的知识图表示统计见解。泰语WordNet有多种开发方法,例如,黄金标准的合并方法,以及使用双语词典的半自动构建。然而,现有的泰语WordNet并不容易找到符合同义词集定义的词;并涵盖不同语言之间需要注意的文化差异。本文提出了一种翻译对等的方法,用于构建泰语资源LST22泰语WordNet。
{"title":"A Construction of Thai WordNet through Translation Equivalence","authors":"Dhanon Leenoi, Alongkorn Alongkornchai, Akkharawoot Takhom, P. Boonkwan, Thenchai Sunnithi","doi":"10.1109/iSAI-NLP56921.2022.9960263","DOIUrl":"https://doi.org/10.1109/iSAI-NLP56921.2022.9960263","url":null,"abstract":"WordNet is a crucial language resource associated with artificial intelligence activities, for instance, constructing building models for advancement of computational linguistics and natural language processing, or representing statistical insights through knowledge graphs that emulate cognition and human understanding. Thai WordNet has been developed in many approaches, e.g., a merge approach in gold standard, and semi-auto construction with a bilingual dictionary. However, existing Thai WordNet is not easy to find words fit with the definition of synsets; and cover cultural gaps between the different languages of which needed to be aware. This paper presents a methodology of Translation Equivalence in order to construct Thai language resource, called LST22 Thai WordNet.","PeriodicalId":399019,"journal":{"name":"2022 17th International Joint Symposium on Artificial Intelligence and Natural Language Processing (iSAI-NLP)","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122996651","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Portfolio Optimization and Rebalancing with Transaction Cost: A Case Study in the Stock Exchange of Thailand 交易成本下的投资组合优化与再平衡:以泰国证券交易所为例
Apichat Chaweewanchon, Rujira Chaysiri
Portfolio optimization is one of the most intriguing topics in the field of finance. The purpose is to maximize return while minimizing risk. In this paper, we investigate the experimental performance of the classical Markowitz portfolio optimization with and without rebalancing based on the minimum risk in terms of portfolio return, portfolio risk, and Sharpe ratio, and compare the results to the experiments with transaction cost. The importance of this work stems from the fact that, while the MV model is extensively utilized, its use in the Thai stock market is limited. This analysis uses the historical close prices of 50 stocks from the Stock Exchange of Thailand 50 Index (SET50) between January 2018 and December 2021. The experiment showed that a portfolio with a rebalancing approach outperforms a portfolio without a rebalancing strategy.
投资组合优化是金融领域最有趣的话题之一。其目的是在风险最小化的同时实现收益最大化。本文从投资组合收益、投资组合风险和夏普比率三个方面考察了基于最小风险再平衡和不考虑最小风险再平衡的经典Markowitz投资组合优化的实验表现,并与考虑交易成本的实验结果进行了比较。这项工作的重要性源于这样一个事实,即虽然MV模型被广泛使用,但它在泰国股票市场中的使用是有限的。本分析使用泰国证券交易所50指数(SET50)在2018年1月至2021年12月期间的50只股票的历史收盘价。实验表明,采用再平衡策略的投资组合优于不采用再平衡策略的投资组合。
{"title":"Portfolio Optimization and Rebalancing with Transaction Cost: A Case Study in the Stock Exchange of Thailand","authors":"Apichat Chaweewanchon, Rujira Chaysiri","doi":"10.1109/iSAI-NLP56921.2022.9960260","DOIUrl":"https://doi.org/10.1109/iSAI-NLP56921.2022.9960260","url":null,"abstract":"Portfolio optimization is one of the most intriguing topics in the field of finance. The purpose is to maximize return while minimizing risk. In this paper, we investigate the experimental performance of the classical Markowitz portfolio optimization with and without rebalancing based on the minimum risk in terms of portfolio return, portfolio risk, and Sharpe ratio, and compare the results to the experiments with transaction cost. The importance of this work stems from the fact that, while the MV model is extensively utilized, its use in the Thai stock market is limited. This analysis uses the historical close prices of 50 stocks from the Stock Exchange of Thailand 50 Index (SET50) between January 2018 and December 2021. The experiment showed that a portfolio with a rebalancing approach outperforms a portfolio without a rebalancing strategy.","PeriodicalId":399019,"journal":{"name":"2022 17th International Joint Symposium on Artificial Intelligence and Natural Language Processing (iSAI-NLP)","volume":"116 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134406342","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
ISAI-NLP-AIOT 2022 Technical Oral Sessions ISAI-NLP-AIOT 2022技术口头会议
{"title":"ISAI-NLP-AIOT 2022 Technical Oral Sessions","authors":"","doi":"10.1109/isai-nlp56921.2022.9960253","DOIUrl":"https://doi.org/10.1109/isai-nlp56921.2022.9960253","url":null,"abstract":"","PeriodicalId":399019,"journal":{"name":"2022 17th International Joint Symposium on Artificial Intelligence and Natural Language Processing (iSAI-NLP)","volume":"70 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126763876","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Analysis of Acoustic Features for Attention Score in Thai MoCA Assessment 泰国语MoCA评价中注意评分的声学特征分析
Wirot Treemongkolchok, P. Punyabukkana, Dittaya Wanvarie, Ploy N. Pratanwanich
Screening tests like the Montreal Cognitive Assessment (MoCA) can help diagnose mild cognitive impairment (MCI). MoCA comprises subtests that span various cognitive domains. Numerous researchers attempt to detect MCI by employing speech-related features such as acoustic, linguistic, and prosodic features. However, the features can distinguish patients with MCI from healthy people but do not describe each patient's specific cognitive domain impairment. This study focuses on Digit Backward Span (DBS) and Digit Forward Span (DFS), subtests related to the cognitive attention domain in MoCA. We develop a model and identify the most relevant speech features for the domain from a recorded voice from these subtests in the Thai MoCA. We rank features by their importance and found that using a subset of important features has higher predictive power than using the entire feature set in impairment in the attention domain. The most important features in both tests are the median duration of voice and the duration of voice.
像蒙特利尔认知评估(MoCA)这样的筛选测试可以帮助诊断轻度认知障碍(MCI)。MoCA包括跨越不同认知领域的子测试。许多研究人员试图通过使用语音相关的特征,如声学、语言和韵律特征来检测轻度认知障碍。然而,这些特征可以将MCI患者与健康人区分开来,但不能描述每个患者的特定认知域损伤。本研究的重点是数字后向广度(DBS)和数字前向广度(DFS)这两个与认知注意领域相关的子测试。我们开发了一个模型,并从泰国MoCA的这些子测试中记录的语音中识别出该领域最相关的语音特征。我们根据特征的重要性对其进行排序,发现在注意力域使用重要特征的子集比使用整个特征集具有更高的预测能力。这两个测试中最重要的特征是声音的中位数持续时间和声音的持续时间。
{"title":"An Analysis of Acoustic Features for Attention Score in Thai MoCA Assessment","authors":"Wirot Treemongkolchok, P. Punyabukkana, Dittaya Wanvarie, Ploy N. Pratanwanich","doi":"10.1109/iSAI-NLP56921.2022.9960272","DOIUrl":"https://doi.org/10.1109/iSAI-NLP56921.2022.9960272","url":null,"abstract":"Screening tests like the Montreal Cognitive Assessment (MoCA) can help diagnose mild cognitive impairment (MCI). MoCA comprises subtests that span various cognitive domains. Numerous researchers attempt to detect MCI by employing speech-related features such as acoustic, linguistic, and prosodic features. However, the features can distinguish patients with MCI from healthy people but do not describe each patient's specific cognitive domain impairment. This study focuses on Digit Backward Span (DBS) and Digit Forward Span (DFS), subtests related to the cognitive attention domain in MoCA. We develop a model and identify the most relevant speech features for the domain from a recorded voice from these subtests in the Thai MoCA. We rank features by their importance and found that using a subset of important features has higher predictive power than using the entire feature set in impairment in the attention domain. The most important features in both tests are the median duration of voice and the duration of voice.","PeriodicalId":399019,"journal":{"name":"2022 17th International Joint Symposium on Artificial Intelligence and Natural Language Processing (iSAI-NLP)","volume":"328 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115965757","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Forex Price Movement Prediction Using Stacking Machine Learning Models 使用堆叠机器学习模型预测外汇价格走势
Thanapol Kurujitkosol, Akkharawoot Takhom, Sasiporn Usanavasin
Forex is an attractive choice for investors who admire any making profit challenges in the fluctuating market. But on the other hand, it means investors can lose money at the same time. Many investors look for ways to reduce the risks by finding price movement prediction tools. Therefore, this paper proposes the Stacking Machine Learning Models to predict the future price direction to help investors to decide and plan strategies. We experimented with comparing baseline models to evaluate the accuracy performance. In addition, we improve the accuracy performance using Technical Analysis and Fibonacci Retracements to gain an accuracy of 90%.
对于那些喜欢在波动的市场中赚取利润的投资者来说,外汇是一个有吸引力的选择。但另一方面,这也意味着投资者可能同时赔钱。许多投资者通过寻找价格走势预测工具来寻找降低风险的方法。因此,本文提出了堆叠机器学习模型来预测未来的价格方向,以帮助投资者决策和规划策略。我们通过比较基线模型来评估精度性能。此外,我们使用技术分析和斐波那契回调来提高准确度性能,以获得90%的准确度。
{"title":"Forex Price Movement Prediction Using Stacking Machine Learning Models","authors":"Thanapol Kurujitkosol, Akkharawoot Takhom, Sasiporn Usanavasin","doi":"10.1109/iSAI-NLP56921.2022.9960245","DOIUrl":"https://doi.org/10.1109/iSAI-NLP56921.2022.9960245","url":null,"abstract":"Forex is an attractive choice for investors who admire any making profit challenges in the fluctuating market. But on the other hand, it means investors can lose money at the same time. Many investors look for ways to reduce the risks by finding price movement prediction tools. Therefore, this paper proposes the Stacking Machine Learning Models to predict the future price direction to help investors to decide and plan strategies. We experimented with comparing baseline models to evaluate the accuracy performance. In addition, we improve the accuracy performance using Technical Analysis and Fibonacci Retracements to gain an accuracy of 90%.","PeriodicalId":399019,"journal":{"name":"2022 17th International Joint Symposium on Artificial Intelligence and Natural Language Processing (iSAI-NLP)","volume":"94 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131774143","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Real-time Multiple Analog Gauges Reader for an Autonomous Robot Application 用于自主机器人应用的实时多模拟仪表阅读器
Visarut Trairattanapa, Sasin Phimsiri, Chaitat Utintu, Riu Cherdchusakulcha, Teepakorn Tosawadi, Ek Thamwiwatthana, Suchat Tungjitnob, Peemapol Tangamonsiri, A. Takutruea, Apirat Keomeesuan, Tanapoom Jitnaknan, V. Suttichaya
With the development of robotic technology, au-tonomous robots have been extended to production industries to substitute manual tasks like routine operations. In the general manufacturer, analog gauges are the most commonly utilized and required operators for manual reading. Accordingly, an analog gauge reading can be considered a fundamental feature for the operator robots to be fully automated for inspection purposes. This paper presents the methods for reading multiple analog gauges automatically using a camera. The processing pipeline consists of two main stages: 1) gauge detector for extracting individual gauges and 2) gauge reader for estimating gauge values. For gauge detectors, we propose three different YOLOvS architecture sizes. The gauge readers are mainly categorized into computer-vision approach (CV), and deep learning regression approaches. The deep learning approaches consist of two CNN-based backbones, ResNetSO and EfficientNetV2BO, and one transformer-based SwinTransformer. Finally, we introduce the feasibility of the combination of each gauge detector and reader. As a result, the YOLOv5m detector with EfficientNetV2BO CNN backbone reader theoretically achieves the best performance but is not practical for industrial applications. In contrast, we introduce the YOLOv5m detector with the CV method as the most robust multiple gauge reader. As a result, it reaches the comparative performances to the EfficientNetV2BO backbone and is more compatible with robotic applications.
随着机器人技术的发展,自主机器人已经扩展到生产行业,以替代日常操作等人工任务。在一般制造商中,模拟仪表是最常用的,需要操作人员手动读数。因此,模拟仪表读数可以被认为是操作机器人完全自动化检测的基本特征。本文介绍了用摄像机自动读取多个模拟量规的方法。处理管道包括两个主要阶段:1)用于提取单个仪表的仪表检测器和2)用于估计仪表值的仪表读取器。对于测量探测器,我们提出了三种不同的yolov架构尺寸。测量阅读器主要分为计算机视觉方法(CV)和深度学习回归方法。深度学习方法包括两个基于cnn的主干,ResNetSO和EfficientNetV2BO,以及一个基于变压器的SwinTransformer。最后,介绍了各仪表检测器与读取器组合的可行性。因此,具有EfficientNetV2BO CNN骨干阅读器的YOLOv5m探测器在理论上达到了最佳性能,但在工业应用中并不实用。相比之下,我们介绍了具有CV方法的YOLOv5m检测器,作为最稳健的多仪表读取器。因此,它达到了与EfficientNetV2BO主干的比较性能,并且与机器人应用更加兼容。
{"title":"Real-time Multiple Analog Gauges Reader for an Autonomous Robot Application","authors":"Visarut Trairattanapa, Sasin Phimsiri, Chaitat Utintu, Riu Cherdchusakulcha, Teepakorn Tosawadi, Ek Thamwiwatthana, Suchat Tungjitnob, Peemapol Tangamonsiri, A. Takutruea, Apirat Keomeesuan, Tanapoom Jitnaknan, V. Suttichaya","doi":"10.1109/iSAI-NLP56921.2022.9960268","DOIUrl":"https://doi.org/10.1109/iSAI-NLP56921.2022.9960268","url":null,"abstract":"With the development of robotic technology, au-tonomous robots have been extended to production industries to substitute manual tasks like routine operations. In the general manufacturer, analog gauges are the most commonly utilized and required operators for manual reading. Accordingly, an analog gauge reading can be considered a fundamental feature for the operator robots to be fully automated for inspection purposes. This paper presents the methods for reading multiple analog gauges automatically using a camera. The processing pipeline consists of two main stages: 1) gauge detector for extracting individual gauges and 2) gauge reader for estimating gauge values. For gauge detectors, we propose three different YOLOvS architecture sizes. The gauge readers are mainly categorized into computer-vision approach (CV), and deep learning regression approaches. The deep learning approaches consist of two CNN-based backbones, ResNetSO and EfficientNetV2BO, and one transformer-based SwinTransformer. Finally, we introduce the feasibility of the combination of each gauge detector and reader. As a result, the YOLOv5m detector with EfficientNetV2BO CNN backbone reader theoretically achieves the best performance but is not practical for industrial applications. In contrast, we introduce the YOLOv5m detector with the CV method as the most robust multiple gauge reader. As a result, it reaches the comparative performances to the EfficientNetV2BO backbone and is more compatible with robotic applications.","PeriodicalId":399019,"journal":{"name":"2022 17th International Joint Symposium on Artificial Intelligence and Natural Language Processing (iSAI-NLP)","volume":"70 6","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120929117","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
iSAI-NLP-AIoT Organizing Committee iSAI-NLP-AIoT组委会
{"title":"iSAI-NLP-AIoT Organizing Committee","authors":"","doi":"10.1109/isai-nlp56921.2022.9960282","DOIUrl":"https://doi.org/10.1109/isai-nlp56921.2022.9960282","url":null,"abstract":"","PeriodicalId":399019,"journal":{"name":"2022 17th International Joint Symposium on Artificial Intelligence and Natural Language Processing (iSAI-NLP)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125923542","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2022 17th International Joint Symposium on Artificial Intelligence and Natural Language Processing (iSAI-NLP)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1