首页 > 最新文献

2021 16th International Joint Symposium on Artificial Intelligence and Natural Language Processing (iSAI-NLP)最新文献

英文 中文
Image Processing for Classification of Rice Varieties with Deep Convolutional Neural Networks 基于深度卷积神经网络的水稻品种分类图像处理
Mathuros Panmuang, Chonnikarn Rodmorn, Suriya Pinitkan
This research applied the Deep Convolutional Neural Networks and used the VGG16 model to screen rice varieties by images. The rice varieties selected in the experiment include five varieties: KorKhor 23, Suphanburi 1, Pathum Thani 1, Chainat 1, and Hom Mali Rice 105, totaling 1,500 images. The results of the experiments and model testing showed that the accuracy obtained by training the images of rice seeds is 85%, which is highly reliable. Therefore, the model was used to develop a website that can be accessed via web browsers and mobile apps where farmers or related persons can upload rice seed images to the system so that the system can predict what variety of rice it is and according to the testing of the system, it was found that it can make an accurate forecast of rice varieties.
本研究应用深度卷积神经网络,利用VGG16模型对水稻品种进行图像筛选。试验选用的水稻品种包括KorKhor 23、Suphanburi 1、Pathum Thani 1、Chainat 1和hommali rice 105 5个品种,共1500张图片。实验和模型测试结果表明,通过训练得到的水稻种子图像准确率为85%,具有较高的可靠性。因此,利用该模型开发了一个网站,可以通过浏览器和移动应用程序访问,农民或相关人员可以将水稻种子图像上传到系统中,系统可以预测水稻的品种,根据系统的测试,发现它可以准确预测水稻的品种。
{"title":"Image Processing for Classification of Rice Varieties with Deep Convolutional Neural Networks","authors":"Mathuros Panmuang, Chonnikarn Rodmorn, Suriya Pinitkan","doi":"10.1109/iSAI-NLP54397.2021.9678184","DOIUrl":"https://doi.org/10.1109/iSAI-NLP54397.2021.9678184","url":null,"abstract":"This research applied the Deep Convolutional Neural Networks and used the VGG16 model to screen rice varieties by images. The rice varieties selected in the experiment include five varieties: KorKhor 23, Suphanburi 1, Pathum Thani 1, Chainat 1, and Hom Mali Rice 105, totaling 1,500 images. The results of the experiments and model testing showed that the accuracy obtained by training the images of rice seeds is 85%, which is highly reliable. Therefore, the model was used to develop a website that can be accessed via web browsers and mobile apps where farmers or related persons can upload rice seed images to the system so that the system can predict what variety of rice it is and according to the testing of the system, it was found that it can make an accurate forecast of rice varieties.","PeriodicalId":339826,"journal":{"name":"2021 16th International Joint Symposium on Artificial Intelligence and Natural Language Processing (iSAI-NLP)","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116138875","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Robustness Improvement against G.726 Speech Codec for Semi-fragile Watermarking in Speech Signals with Singular Spectrum Analysis and Quantization Index Modulation 基于奇异谱分析和量化指标调制的语音信号半脆弱水印对G.726语音编解码器的鲁棒性改进
Norranat Songsriboonsit, Kasorn Galajit, Jessada Karnjana, W. Kongprawechnon, P. Aimmanee
Semi-fragile watermarking in speech signals is proposed to solve problems relating to unauthorized speech modification. However, previous methods are fragile against some non-malicious attacks or white noise with a high signal-to-noise ratio. This paper aims to solve this problem by proposing a new watermarking technique based on singular spectrum analysis and quantization index modulation. The singular spectrum analysis is used to extract singular values of segments of speech signals. A watermark bit is embedded into each frame by slightly modifying its singular values according to the quantization index modulation. The experimental results show that the sound quality of a watermarked signal is comparable to that of its original signal. The watermark-bit extraction precision is also similar to that of existing methods. However, the proposed method is robust against non-malicious attacks, such as G.726 speech codec and white noise with a high signal-to-noise ratio.
为了解决未经授权的语音修改问题,提出了语音信号中的半脆弱水印。然而,以往的方法在面对非恶意攻击或高信噪比的白噪声时很脆弱。为了解决这一问题,本文提出了一种基于奇异谱分析和量化指标调制的新型水印技术。奇异谱分析用于提取语音信号片段的奇异值。根据量化指标调制,将水印位的奇异值稍加修改,嵌入到每一帧中。实验结果表明,水印信号的音质与原始信号的音质相当。该方法的水印比特提取精度与现有方法相似。该方法对G.726语音编解码和白噪声等非恶意攻击具有较强的鲁棒性,信噪比较高。
{"title":"Robustness Improvement against G.726 Speech Codec for Semi-fragile Watermarking in Speech Signals with Singular Spectrum Analysis and Quantization Index Modulation","authors":"Norranat Songsriboonsit, Kasorn Galajit, Jessada Karnjana, W. Kongprawechnon, P. Aimmanee","doi":"10.1109/iSAI-NLP54397.2021.9678181","DOIUrl":"https://doi.org/10.1109/iSAI-NLP54397.2021.9678181","url":null,"abstract":"Semi-fragile watermarking in speech signals is proposed to solve problems relating to unauthorized speech modification. However, previous methods are fragile against some non-malicious attacks or white noise with a high signal-to-noise ratio. This paper aims to solve this problem by proposing a new watermarking technique based on singular spectrum analysis and quantization index modulation. The singular spectrum analysis is used to extract singular values of segments of speech signals. A watermark bit is embedded into each frame by slightly modifying its singular values according to the quantization index modulation. The experimental results show that the sound quality of a watermarked signal is comparable to that of its original signal. The watermark-bit extraction precision is also similar to that of existing methods. However, the proposed method is robust against non-malicious attacks, such as G.726 speech codec and white noise with a high signal-to-noise ratio.","PeriodicalId":339826,"journal":{"name":"2021 16th International Joint Symposium on Artificial Intelligence and Natural Language Processing (iSAI-NLP)","volume":"91 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122125983","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
sylbreak4all: Regular Expressions for Syllable Breaking of Nine Major Ethnic Languages of Myanmar sylbreak4all:缅甸九大民族语言拆音节规则表达式
Ye Kyaw Thu, Hlaing Myat New, Hninn Aye Thant, Hay Man Htun, H. Mon, May Myat Myat Khaing, Hsu Pan Oo, Pale Phyu, Nang Aeindray Kyaw, T. Oo, T. Oo, Thet Thet Zin, T. Oo
Unlike many other western languages, the Myanmar language uses a syllabic writing system and no space between words. Syllable segmentation is the necessary preprocess for natural language processing (NLP) tasks such as grapheme-to-phoneme (g2p) conversion, machine translation, romanization, and so on. In this study, sylbreak4all, a syllable segmentation tool, was developed for nine major ethnic languages of Myanmar, and they are Burmese, Shan, Pa’o, Pwo Kayin, S’gaw Kayin, Rakhine, Myeik, Dawei, and Mon by using regular expression (RE) patterns.
与许多其他西方语言不同,缅甸语使用音节书写系统,单词之间没有空格。音节分词是自然语言处理(NLP)任务(如字素到音素(g2p)转换、机器翻译、罗马化等)的必要预处理。本研究利用正则表达式(正则表达式)模式,对缅甸9个主要民族语言(缅甸语、掸邦语、帕奥语、普沃克语、S 'gaw克语、若开邦语、米耶克语、达维语和孟语)开发了音节分词工具sylbreak4all。
{"title":"sylbreak4all: Regular Expressions for Syllable Breaking of Nine Major Ethnic Languages of Myanmar","authors":"Ye Kyaw Thu, Hlaing Myat New, Hninn Aye Thant, Hay Man Htun, H. Mon, May Myat Myat Khaing, Hsu Pan Oo, Pale Phyu, Nang Aeindray Kyaw, T. Oo, T. Oo, Thet Thet Zin, T. Oo","doi":"10.1109/iSAI-NLP54397.2021.9678188","DOIUrl":"https://doi.org/10.1109/iSAI-NLP54397.2021.9678188","url":null,"abstract":"Unlike many other western languages, the Myanmar language uses a syllabic writing system and no space between words. Syllable segmentation is the necessary preprocess for natural language processing (NLP) tasks such as grapheme-to-phoneme (g2p) conversion, machine translation, romanization, and so on. In this study, sylbreak4all, a syllable segmentation tool, was developed for nine major ethnic languages of Myanmar, and they are Burmese, Shan, Pa’o, Pwo Kayin, S’gaw Kayin, Rakhine, Myeik, Dawei, and Mon by using regular expression (RE) patterns.","PeriodicalId":339826,"journal":{"name":"2021 16th International Joint Symposium on Artificial Intelligence and Natural Language Processing (iSAI-NLP)","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117229475","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Welcome Message from the General Co-Chair 联席主席的欢迎辞
{"title":"Welcome Message from the General Co-Chair","authors":"","doi":"10.1109/isai-nlp54397.2021.9678186","DOIUrl":"https://doi.org/10.1109/isai-nlp54397.2021.9678186","url":null,"abstract":"","PeriodicalId":339826,"journal":{"name":"2021 16th International Joint Symposium on Artificial Intelligence and Natural Language Processing (iSAI-NLP)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128085849","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep Learning-Based Acoustic Emission Scheme for Rail Crack Monitoring 基于深度学习的钢轨裂纹监测声发射方案
W. Suwansin, P. Phasukkit
This research proposes a single-sensor acoustic emission (AE) scheme for detection and localization of crack in steel rail (rail head, rail web, and rail foot) under load. In the operation, AE signals were captured by the AE sensor and converted into digital signal data by AE data acquisition module. The digital data were used total variation denoising (TVD) algorithm to remove ambient and wheel/rail contact noises, and the denoised data were processed and classified to localize cracks in the steel rail using a deep learning algorithmic model. The AE signals of pencil lead break at the head, web, and foot of steel rail were used to train (80 % of the input data) and test (20%) the algorithmic model. In training and testing the algorithm, the AE signals were divided into two groupings (150 and 300 AE signals) and the classification accuracy compared. The deep learning-based AE scheme was also implemented on-site to detect cracks in the steel rail. The total accuracy under the first and second groupings were 86.6 % and 96.6 %. The novelty of this research lies in the use of single AE sensor and AE signal-driven deep learning algorithm to detect and localize cracks in the steel rail, unlike conventional AE crack-localization technology which relies on two or more sensors and human interpretation.
本研究提出了一种单传感器声发射(AE)方案,用于钢轨(轨头、轨腹板和轨脚)在载荷作用下的裂纹检测和定位。在操作中,声发射信号由声发射传感器采集,通过声发射数据采集模块转换为数字信号数据。利用全变分去噪(TVD)算法去除环境噪声和轮轨接触噪声,利用深度学习算法模型对去噪后的数据进行处理和分类,定位钢轨裂纹。利用钢轨头部、腹板和脚处铅笔芯断裂的声发射信号对算法模型进行训练(80%的输入数据)和测试(20%)。在训练和测试算法时,将声发射信号分为两组(150和300个声发射信号),并对分类准确率进行比较。并在现场实施了基于深度学习的声发射方案来检测钢轨裂纹。第一组和第二组的总准确率分别为86.6%和96.6%。本研究的新颖之处在于使用单个声发射传感器和声发射信号驱动的深度学习算法来检测和定位钢轨中的裂纹,而不像传统的声发射裂纹定位技术依赖于两个或多个传感器和人工解释。
{"title":"Deep Learning-Based Acoustic Emission Scheme for Rail Crack Monitoring","authors":"W. Suwansin, P. Phasukkit","doi":"10.1109/iSAI-NLP54397.2021.9678162","DOIUrl":"https://doi.org/10.1109/iSAI-NLP54397.2021.9678162","url":null,"abstract":"This research proposes a single-sensor acoustic emission (AE) scheme for detection and localization of crack in steel rail (rail head, rail web, and rail foot) under load. In the operation, AE signals were captured by the AE sensor and converted into digital signal data by AE data acquisition module. The digital data were used total variation denoising (TVD) algorithm to remove ambient and wheel/rail contact noises, and the denoised data were processed and classified to localize cracks in the steel rail using a deep learning algorithmic model. The AE signals of pencil lead break at the head, web, and foot of steel rail were used to train (80 % of the input data) and test (20%) the algorithmic model. In training and testing the algorithm, the AE signals were divided into two groupings (150 and 300 AE signals) and the classification accuracy compared. The deep learning-based AE scheme was also implemented on-site to detect cracks in the steel rail. The total accuracy under the first and second groupings were 86.6 % and 96.6 %. The novelty of this research lies in the use of single AE sensor and AE signal-driven deep learning algorithm to detect and localize cracks in the steel rail, unlike conventional AE crack-localization technology which relies on two or more sensors and human interpretation.","PeriodicalId":339826,"journal":{"name":"2021 16th International Joint Symposium on Artificial Intelligence and Natural Language Processing (iSAI-NLP)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124402148","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Incorporation of Contextual Information into BERT for Dialog Act Classification in Japanese 基于BERT的日语对话行为分类研究
Shun Katada, Kiyoaki Shirai, S. Okada
Recently developed Bidirectional Encoder Representations from Transformers (BERT) outperforms the state-of-the-art in many natural language processing tasks in English. Although contextual information is known to be useful for dialog act classification, fine-tuning BERT with contextual information has not been investigated, especially in head final languages such as Japanese. This paper investigates whether BERT with contextual information performs well on dialog act classification in Japanese open-domain conversation. In our proposed model, not only the utterance itself but also the information about previous utterances and turn-taking are taken into account. Results of experiments on a Japanese dialog corpus showed that the incorporation of the contextual information improved the F1-score by 6.7 points.
最近开发的变形金刚双向编码器表示(BERT)在许多英语自然语言处理任务中表现优于最先进的技术。虽然上下文信息对对话行为分类很有用,但还没有研究过使用上下文信息对BERT进行微调,特别是在日语等头尾语言中。本文研究了基于上下文信息的BERT在日语开放域会话中的对话行为分类效果。在我们提出的模型中,不仅考虑了话语本身,还考虑了之前话语的信息和轮次。在日语对话语料库上的实验结果表明,语境信息的加入使f1得分提高了6.7分。
{"title":"Incorporation of Contextual Information into BERT for Dialog Act Classification in Japanese","authors":"Shun Katada, Kiyoaki Shirai, S. Okada","doi":"10.1109/iSAI-NLP54397.2021.9678172","DOIUrl":"https://doi.org/10.1109/iSAI-NLP54397.2021.9678172","url":null,"abstract":"Recently developed Bidirectional Encoder Representations from Transformers (BERT) outperforms the state-of-the-art in many natural language processing tasks in English. Although contextual information is known to be useful for dialog act classification, fine-tuning BERT with contextual information has not been investigated, especially in head final languages such as Japanese. This paper investigates whether BERT with contextual information performs well on dialog act classification in Japanese open-domain conversation. In our proposed model, not only the utterance itself but also the information about previous utterances and turn-taking are taken into account. Results of experiments on a Japanese dialog corpus showed that the incorporation of the contextual information improved the F1-score by 6.7 points.","PeriodicalId":339826,"journal":{"name":"2021 16th International Joint Symposium on Artificial Intelligence and Natural Language Processing (iSAI-NLP)","volume":"34 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125686408","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Using Citation Contexts in Scholarly Papers for Research Data Search 利用学术论文引文上下文进行研究数据检索
Research data have been increasingly published. A search function on a research data repository is crucial to improve the accessibility. Generally, the search is executed based on metadata written by the creators of the research data. However, such the metadata may not be sufficiently descriptive because there can exist features or usages that the creators did not expect. The information about features or usages generated by users might appear in citation contexts of research data in their scholarly papers. In this study, we set and discuss the hypothesis that citation contexts in scholarly papers are useful for research data search. First, we investigated whether adding citation contexts to the metadata can enrich information about the research data. Concretely, the existing metadata and the citation contexts were collected, and their overlap was examined. Furthermore, a retrieval experiment was conducted to confirm the effectiveness of the citation contexts. The results indicated the usefulness of the citation contexts in scholarly papers.
越来越多的研究数据被发表。研究数据存储库的搜索功能是提高可访问性的关键。通常,搜索是基于研究数据创建者编写的元数据执行的。然而,这样的元数据可能没有足够的描述性,因为可能存在创建者没有预料到的特性或用法。用户生成的有关特征或用法的信息可能出现在其学术论文的研究数据的引文上下文中。在本研究中,我们设定并讨论了学术论文中的引文上下文对研究数据检索有用的假设。首先,我们研究了在元数据中添加引文上下文是否可以丰富研究数据的信息。具体而言,收集现有元数据和引文上下文,并检查它们的重叠程度。此外,通过检索实验验证了引文上下文的有效性。结果表明引文上下文在学术论文中的有效性。
{"title":"Using Citation Contexts in Scholarly Papers for Research Data Search","authors":"","doi":"10.1109/iSAI-NLP54397.2021.9678165","DOIUrl":"https://doi.org/10.1109/iSAI-NLP54397.2021.9678165","url":null,"abstract":"Research data have been increasingly published. A search function on a research data repository is crucial to improve the accessibility. Generally, the search is executed based on metadata written by the creators of the research data. However, such the metadata may not be sufficiently descriptive because there can exist features or usages that the creators did not expect. The information about features or usages generated by users might appear in citation contexts of research data in their scholarly papers. In this study, we set and discuss the hypothesis that citation contexts in scholarly papers are useful for research data search. First, we investigated whether adding citation contexts to the metadata can enrich information about the research data. Concretely, the existing metadata and the citation contexts were collected, and their overlap was examined. Furthermore, a retrieval experiment was conducted to confirm the effectiveness of the citation contexts. The results indicated the usefulness of the citation contexts in scholarly papers.","PeriodicalId":339826,"journal":{"name":"2021 16th International Joint Symposium on Artificial Intelligence and Natural Language Processing (iSAI-NLP)","volume":"17 3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124914716","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Identification of Important Utterances in Narrative Speech Using Attentive Listening Responses 叙事性言语中重要话语的识别与注意倾听反应
Since narratives contain a variety of knowledge, it is useful to record their contents. It is difficult to read the transcription even if the narrative speech is transcribed directly because the transcription contains redundant contents. Thus, it is effective to summarize the narrative transcriptions. This paper proposes a method for identifying important utterances in narratives with the aim of summarizing them. The method uses not only the narrative by the speaker but also the attentive listening responses by the listeners for identifying the important utterances. Attentive listening responses are conversational responses to positively show that listeners are attentively listening to the narratives, e.g., back-channel feedbacks. The more important an utterance in a narrative is, the more likely the listeners seem to react to it. In this study, we focus on attentive listening responses as the listener’s reactions. We experimentally evaluated the effectiveness of using attentive listening responses in identifying important utterances in narratives.
因为叙事包含了各种各样的知识,所以记录它们的内容是有用的。即使直接抄写叙事性讲话,也很难读懂,因为抄写的内容是多余的。因此,对叙事抄本进行总结是有效的。本文提出了一种识别叙事中重要话语并对其进行总结的方法。该方法不仅利用说话人的叙述,而且利用听者的认真倾听反应来识别重要话语。注意倾听反应是积极表明听者正在专心倾听叙述的会话反应,如反向渠道反馈。叙述中一个话语越重要,听者似乎就越有可能对它做出反应。在本研究中,我们将注意力集中在倾听者的反应上。我们通过实验评估了使用专注倾听反应识别叙事中重要话语的有效性。
{"title":"Identification of Important Utterances in Narrative Speech Using Attentive Listening Responses","authors":"","doi":"10.1109/iSAI-NLP54397.2021.9678154","DOIUrl":"https://doi.org/10.1109/iSAI-NLP54397.2021.9678154","url":null,"abstract":"Since narratives contain a variety of knowledge, it is useful to record their contents. It is difficult to read the transcription even if the narrative speech is transcribed directly because the transcription contains redundant contents. Thus, it is effective to summarize the narrative transcriptions. This paper proposes a method for identifying important utterances in narratives with the aim of summarizing them. The method uses not only the narrative by the speaker but also the attentive listening responses by the listeners for identifying the important utterances. Attentive listening responses are conversational responses to positively show that listeners are attentively listening to the narratives, e.g., back-channel feedbacks. The more important an utterance in a narrative is, the more likely the listeners seem to react to it. In this study, we focus on attentive listening responses as the listener’s reactions. We experimentally evaluated the effectiveness of using attentive listening responses in identifying important utterances in narratives.","PeriodicalId":339826,"journal":{"name":"2021 16th International Joint Symposium on Artificial Intelligence and Natural Language Processing (iSAI-NLP)","volume":"46 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133227937","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep Learning-Based Human Recognition Through the Wall using UWB radar 基于深度学习的超宽带雷达穿墙人体识别
Pongpol Assawaroongsakul, Mawin Khumdee, P. Phasukkit, Nongluck Houngkamhang
Human activity detection in obscured or invisible area, for instance, human detection through the wall has become an interesting topic because it has potential for security, rescue, activity analysis application, etc. UWB radar, a detection system produces short radio frequency pulses and measures the reflected signals which UWB pulses have high spatial resolution and enable penetration in dielectric materials, was used to collect human activity through the wall signals at the frequency range of 3 GHz in this research. Subsequently, we applied signal data with the Deep Neural Network model to classify 5 classes of human activity including standing, walking, sitting, laying, and no-human gave the F1 score up to 96.94%.
模糊或不可见区域的人体活动检测,如穿墙检测,因其在安全、救援、活动分析等方面的应用潜力而成为一个有趣的话题。超宽带雷达是一种产生短射频脉冲并测量反射信号的探测系统,超宽带脉冲具有高空间分辨率和穿透介质材料的能力,本研究利用超宽带雷达在3ghz频率范围内通过墙壁信号采集人类活动。随后,我们将信号数据与Deep Neural Network模型结合,对站立、行走、坐着、躺着、无人等5类人体活动进行分类,F1得分高达96.94%。
{"title":"Deep Learning-Based Human Recognition Through the Wall using UWB radar","authors":"Pongpol Assawaroongsakul, Mawin Khumdee, P. Phasukkit, Nongluck Houngkamhang","doi":"10.1109/iSAI-NLP54397.2021.9678182","DOIUrl":"https://doi.org/10.1109/iSAI-NLP54397.2021.9678182","url":null,"abstract":"Human activity detection in obscured or invisible area, for instance, human detection through the wall has become an interesting topic because it has potential for security, rescue, activity analysis application, etc. UWB radar, a detection system produces short radio frequency pulses and measures the reflected signals which UWB pulses have high spatial resolution and enable penetration in dielectric materials, was used to collect human activity through the wall signals at the frequency range of 3 GHz in this research. Subsequently, we applied signal data with the Deep Neural Network model to classify 5 classes of human activity including standing, walking, sitting, laying, and no-human gave the F1 score up to 96.94%.","PeriodicalId":339826,"journal":{"name":"2021 16th International Joint Symposium on Artificial Intelligence and Natural Language Processing (iSAI-NLP)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131124675","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
HoogBERTa: Multi-task Sequence Labeling using Thai Pretrained Language Representation HoogBERTa:使用泰语预训练语言表示的多任务序列标记
Peerachet Porkaew, P. Boonkwan, T. Supnithi
Recently, pretrained language representations like BERT and RoBERTa have drawn more and more attention in NLP. In this work we propose a pretrained language representation for Thai language, which based on RoBERTa architecture. Our monolingual data used in the training are collected from publicly available resources including Wikipedia, OpenSubtitles, news and articles. Although the pretrained model can be fine-tuned for wide area of individual tasks, fine-tuning the model with multiple objectives also yields a surprisingly effective model. We evaluated the performance of our multi-task model on part-of-speech tagging, named entity recognition and clause boundary prediction. Our model achieves the comparable performance to strong single-task baselines. Our code and models are available at https://github.com/lstnlp/hoogberta.
近年来,BERT和RoBERTa等预训练语言表征在NLP领域受到越来越多的关注。在这项工作中,我们提出了一个基于RoBERTa架构的泰语预训练语言表示。我们在培训中使用的单语数据是从公开资源中收集的,包括维基百科、open字幕、新闻和文章。虽然预训练的模型可以针对单个任务的广泛区域进行微调,但对具有多个目标的模型进行微调也会产生令人惊讶的有效模型。我们评估了我们的多任务模型在词性标注、命名实体识别和子句边界预测方面的性能。我们的模型实现了与强大的单任务基线相当的性能。我们的代码和模型可在https://github.com/lstnlp/hoogberta上获得。
{"title":"HoogBERTa: Multi-task Sequence Labeling using Thai Pretrained Language Representation","authors":"Peerachet Porkaew, P. Boonkwan, T. Supnithi","doi":"10.1109/iSAI-NLP54397.2021.9678190","DOIUrl":"https://doi.org/10.1109/iSAI-NLP54397.2021.9678190","url":null,"abstract":"Recently, pretrained language representations like BERT and RoBERTa have drawn more and more attention in NLP. In this work we propose a pretrained language representation for Thai language, which based on RoBERTa architecture. Our monolingual data used in the training are collected from publicly available resources including Wikipedia, OpenSubtitles, news and articles. Although the pretrained model can be fine-tuned for wide area of individual tasks, fine-tuning the model with multiple objectives also yields a surprisingly effective model. We evaluated the performance of our multi-task model on part-of-speech tagging, named entity recognition and clause boundary prediction. Our model achieves the comparable performance to strong single-task baselines. Our code and models are available at https://github.com/lstnlp/hoogberta.","PeriodicalId":339826,"journal":{"name":"2021 16th International Joint Symposium on Artificial Intelligence and Natural Language Processing (iSAI-NLP)","volume":"35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132445199","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2021 16th International Joint Symposium on Artificial Intelligence and Natural Language Processing (iSAI-NLP)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1