首页 > 最新文献

2018 15th International Joint Conference on Computer Science and Software Engineering (JCSSE)最新文献

英文 中文
Integration of Gesture Control with Large Display Environments Using SAGE2 基于SAGE2的手势控制与大型显示环境的集成
Thanatorn Boonnak, V. Visoottiviseth, J. Haga, Dylan Kobayashi, J. Leigh
With the ever-increasing amount of information, data scientists continue to explore new technologies that will help them access and interact with data in a variety of domains and situations. One technology of particular interest is Scalable Resolution Shared Displays (SRSD) that use a web-based collaboration middleware called SAGE2. These display systems are ideal for exploring large data sets in data intensive applications; however, interacting with content on these largewalls intuitively and rapidly remains a challenge to be addressed. This work introduces a prototype user interface basedon a simple, hand gesture-based approach to control content in a SAGE2 workspace using the Leap Motion controller. Our implementation and preliminary testing of the interface demonstrates its potential as a more natural interaction modalitywhen exploring big data sets.
随着信息量的不断增加,数据科学家不断探索新技术,以帮助他们访问各种领域和情况下的数据并与之交互。一种特别有趣的技术是可伸缩分辨率共享显示(SRSD),它使用基于web的协作中间件SAGE2。这些显示系统非常适合在数据密集型应用中探索大型数据集;然而,与这些大墙上的内容进行直观和快速的交互仍然是一个有待解决的挑战。这项工作介绍了一个基于简单的、基于手势的方法的原型用户界面,该方法使用Leap Motion控制器来控制SAGE2工作空间中的内容。我们对界面的实现和初步测试表明,在探索大数据集时,它有可能成为一种更自然的交互方式。
{"title":"Integration of Gesture Control with Large Display Environments Using SAGE2","authors":"Thanatorn Boonnak, V. Visoottiviseth, J. Haga, Dylan Kobayashi, J. Leigh","doi":"10.1109/JCSSE.2018.8457323","DOIUrl":"https://doi.org/10.1109/JCSSE.2018.8457323","url":null,"abstract":"With the ever-increasing amount of information, data scientists continue to explore new technologies that will help them access and interact with data in a variety of domains and situations. One technology of particular interest is Scalable Resolution Shared Displays (SRSD) that use a web-based collaboration middleware called SAGE2. These display systems are ideal for exploring large data sets in data intensive applications; however, interacting with content on these largewalls intuitively and rapidly remains a challenge to be addressed. This work introduces a prototype user interface basedon a simple, hand gesture-based approach to control content in a SAGE2 workspace using the Leap Motion controller. Our implementation and preliminary testing of the interface demonstrates its potential as a more natural interaction modalitywhen exploring big data sets.","PeriodicalId":338973,"journal":{"name":"2018 15th International Joint Conference on Computer Science and Software Engineering (JCSSE)","volume":"222 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131670643","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Novel Automatic Sentiment Summarization from Aspect-based Customer Reviews 一种基于方面的顾客评论自动情感总结方法
T. A. Tran, Jarunee Duangsuwan, W. Wettayaprasit
Online reviews play an important role in helping companies or governments to improve product quality and services. However, these reviews are increasing day by day. It is difficult to go through the amount of these reviews and to summarize the important information manually. We proposed a novel Automatic Sentiment Summarization (ASS) system. This system has two phases. The first phase is the aspect-based representation used to represent ranked knowledge on aspect opinion calculated by using frequencies, polarity, and opinion strength. The second phase is the review summary generation used to automatically produce review summary by ranking aspect based on information of the aspect. The generated summary is more coherent by applying natural language generation technique. Furthermore, the proposed ASS system allows users to add new reviews in the same domain in order to update the generated summary. The experiments used the sentiment aspect dataset benchmarks such as customer product/service reviews for Canon, Nikon, and Laptop. The generated summaries from the proposed ASS system are well performed compared with other systems extractive summarization and abstractive summarization.
在线评论在帮助公司或政府提高产品质量和服务方面发挥着重要作用。然而,这些评论日益增多。手动浏览这些审查的数量并总结重要信息是很困难的。提出了一种新的自动情感摘要(ASS)系统。这个系统有两个阶段。第一阶段是基于方面的表示,用于表示通过使用频率、极性和意见强度计算的方面意见上的排名知识。第二阶段是评审摘要生成,用于根据方面的信息对方面进行排序,从而自动生成评审摘要。采用自然语言生成技术生成的摘要更加连贯。此外,建议的ASS系统允许用户在同一域中添加新的评论,以便更新生成的摘要。实验使用了情感方面数据集基准,如佳能、尼康和笔记本电脑的客户产品/服务评论。与其他系统的抽取摘要和抽象摘要相比,该系统生成的摘要具有良好的性能。
{"title":"A Novel Automatic Sentiment Summarization from Aspect-based Customer Reviews","authors":"T. A. Tran, Jarunee Duangsuwan, W. Wettayaprasit","doi":"10.1109/JCSSE.2018.8457346","DOIUrl":"https://doi.org/10.1109/JCSSE.2018.8457346","url":null,"abstract":"Online reviews play an important role in helping companies or governments to improve product quality and services. However, these reviews are increasing day by day. It is difficult to go through the amount of these reviews and to summarize the important information manually. We proposed a novel Automatic Sentiment Summarization (ASS) system. This system has two phases. The first phase is the aspect-based representation used to represent ranked knowledge on aspect opinion calculated by using frequencies, polarity, and opinion strength. The second phase is the review summary generation used to automatically produce review summary by ranking aspect based on information of the aspect. The generated summary is more coherent by applying natural language generation technique. Furthermore, the proposed ASS system allows users to add new reviews in the same domain in order to update the generated summary. The experiments used the sentiment aspect dataset benchmarks such as customer product/service reviews for Canon, Nikon, and Laptop. The generated summaries from the proposed ASS system are well performed compared with other systems extractive summarization and abstractive summarization.","PeriodicalId":338973,"journal":{"name":"2018 15th International Joint Conference on Computer Science and Software Engineering (JCSSE)","volume":"70 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114537835","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Classification of Dhamma Esan Characters By Transfer Learning of a Deep Neural Network 基于深度神经网络迁移学习的佛法峨山文字分类
Narit Hnoohom, Sumeth Yuenyong
We present an image classification of Dhamma Esan characters by fine-tuning the Inception V3 deep neural network trained on the ImageNet dataset. Dhamma Esan is a traditional alphabet used in the north-eastern region of Thailand, primarily written on Corypha leaves for the purpose of recording Buddhist scriptures. Preservation of these historical documents calls for the ability to classify the characters of the alphabet in order to facilitate digital indexing and searching, as well as assist anyone trying to read them. Our dataset consists of over 70,000 Dhamma Esan character images, much larger than any previous work. The result of ten-fold cross-validation showed that our model had 100% accuracy for four folds, and 99.99% for the other six folds. The previous best accuracy reported was 97.77%. We also developed a Dhamma Esan character classification web service where users can upload images of characters and get immediate classification results as well as mapping to the modern Thai alphabet.
我们通过微调在ImageNet数据集上训练的Inception V3深度神经网络,提出了一种Dhamma Esan字符的图像分类方法。Dhamma Esan是泰国东北部地区使用的一种传统字母,主要是为了记录佛教经文而写在香叶上。保存这些历史文献需要对字母表中的字符进行分类的能力,以便于数字索引和搜索,以及帮助任何人试图阅读它们。我们的数据集包含超过70,000个达摩依山的字符图像,比以前的任何工作都要大得多。十重交叉验证结果表明,模型对其中四重的准确率为100%,对另外六重的准确率为99.99%。此前报道的最佳准确率为97.77%。我们还开发了一个达摩依山文字分类网站服务,用户可以上传文字图像,并立即获得分类结果,以及映射到现代泰语字母表。
{"title":"Classification of Dhamma Esan Characters By Transfer Learning of a Deep Neural Network","authors":"Narit Hnoohom, Sumeth Yuenyong","doi":"10.1109/JCSSE.2018.8457361","DOIUrl":"https://doi.org/10.1109/JCSSE.2018.8457361","url":null,"abstract":"We present an image classification of Dhamma Esan characters by fine-tuning the Inception V3 deep neural network trained on the ImageNet dataset. Dhamma Esan is a traditional alphabet used in the north-eastern region of Thailand, primarily written on Corypha leaves for the purpose of recording Buddhist scriptures. Preservation of these historical documents calls for the ability to classify the characters of the alphabet in order to facilitate digital indexing and searching, as well as assist anyone trying to read them. Our dataset consists of over 70,000 Dhamma Esan character images, much larger than any previous work. The result of ten-fold cross-validation showed that our model had 100% accuracy for four folds, and 99.99% for the other six folds. The previous best accuracy reported was 97.77%. We also developed a Dhamma Esan character classification web service where users can upload images of characters and get immediate classification results as well as mapping to the modern Thai alphabet.","PeriodicalId":338973,"journal":{"name":"2018 15th International Joint Conference on Computer Science and Software Engineering (JCSSE)","volume":"59 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122612476","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Development of Low-Cost in-the-Ear EEG Prototype 低成本耳内脑电图样机的研制
Chanavit Athavipach, S. Pan-Ngum, P. Israsena
This study focused on building a low-cost wearable EEG device multiple hour usage. The device suitable for long period monitoring is in-the-ear EEG, which has desirable wearable characteristics. With electrode in an earbud, it is relatively simple to install and wear. The in-the-ear prototype in this study was built from earphone rubber as an earpiece and silver-adhesive fabric as electrodes. Raw materials cost 3 dollar per piece. The impedance measurement from in-the-ear EEG is comparable to those of commercial electrodes. Signal verifications were conducted by teeth clenching, ASSR, MMN, and correlation. The signal verification results show that there is a strong correlation between in-the-ear EEG and T7/T8 signals. (γ-coefficient = 0.912)
本研究的重点是构建一种低成本的可穿戴式多小时使用脑电图设备。适合长周期监测的装置是耳内脑电图,具有良好的可穿戴特性。电极在耳塞内,安装和佩戴相对简单。本研究中的耳内原型是用耳机橡胶作为耳塞,用银胶织物作为电极。原材料每件3美元。耳内脑电图的阻抗测量与商用电极相当。通过咬牙、ASSR、MMN、相关性进行信号验证。信号验证结果表明,耳内脑电信号与T7/T8信号具有较强的相关性。(γ-系数= 0.912)
{"title":"Development of Low-Cost in-the-Ear EEG Prototype","authors":"Chanavit Athavipach, S. Pan-Ngum, P. Israsena","doi":"10.1109/JCSSE.2018.8457324","DOIUrl":"https://doi.org/10.1109/JCSSE.2018.8457324","url":null,"abstract":"This study focused on building a low-cost wearable EEG device multiple hour usage. The device suitable for long period monitoring is in-the-ear EEG, which has desirable wearable characteristics. With electrode in an earbud, it is relatively simple to install and wear. The in-the-ear prototype in this study was built from earphone rubber as an earpiece and silver-adhesive fabric as electrodes. Raw materials cost 3 dollar per piece. The impedance measurement from in-the-ear EEG is comparable to those of commercial electrodes. Signal verifications were conducted by teeth clenching, ASSR, MMN, and correlation. The signal verification results show that there is a strong correlation between in-the-ear EEG and T7/T8 signals. (γ-coefficient = 0.912)","PeriodicalId":338973,"journal":{"name":"2018 15th International Joint Conference on Computer Science and Software Engineering (JCSSE)","volume":"101 8","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114120414","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Exploiting Building Blocks in Hard Problems with Modified Compact Genetic Algorithm 利用改进的紧凑遗传算法求解困难问题中的构建块
Kamonluk Suksen, P. Chongstitvatana
In Evolutionary Computation, good substructures that are combined into good solutions are called building blocks. In this context, building blocks are common structure of high- quality solutions. The compact genetic algorithm is an extension of the genetic algorithm that replaces the latter’s population of chromosomes with a probability distribution from which candidate solutions can be generated. This paper describes an algorithm that exploits building blocks with compact genetic algorithm in order to solve difficult optimization problems under the assumption that we have already known building blocks. The main idea is to update the probability vectors as a group of bits that represents building blocks thus avoiding the disruption of the building blocks. Comparisons of the new algorithm with a conventional compact genetic algorithm on trap-function and traveling salesman problems indicate the utility of the proposed algorithm. It is most effective when the problem instants have common structures that can be identify as building blocks.
在进化计算中,组合成好的解决方案的好的子结构被称为构建块。在这种情况下,构建块是高质量解决方案的常见结构。紧凑遗传算法是遗传算法的一种扩展,它将遗传算法的染色体种群替换为可生成候选解的概率分布。本文描述了一种基于紧凑遗传算法的构建块算法,在已知构建块的前提下求解复杂的优化问题。其主要思想是将概率向量更新为代表构建块的一组位,从而避免构建块的破坏。将该算法与传统的紧凑遗传算法在陷阱函数和旅行商问题上的比较表明了该算法的实用性。当问题瞬间具有可识别为构建块的共同结构时,它是最有效的。
{"title":"Exploiting Building Blocks in Hard Problems with Modified Compact Genetic Algorithm","authors":"Kamonluk Suksen, P. Chongstitvatana","doi":"10.1109/JCSSE.2018.8457386","DOIUrl":"https://doi.org/10.1109/JCSSE.2018.8457386","url":null,"abstract":"In Evolutionary Computation, good substructures that are combined into good solutions are called building blocks. In this context, building blocks are common structure of high- quality solutions. The compact genetic algorithm is an extension of the genetic algorithm that replaces the latter’s population of chromosomes with a probability distribution from which candidate solutions can be generated. This paper describes an algorithm that exploits building blocks with compact genetic algorithm in order to solve difficult optimization problems under the assumption that we have already known building blocks. The main idea is to update the probability vectors as a group of bits that represents building blocks thus avoiding the disruption of the building blocks. Comparisons of the new algorithm with a conventional compact genetic algorithm on trap-function and traveling salesman problems indicate the utility of the proposed algorithm. It is most effective when the problem instants have common structures that can be identify as building blocks.","PeriodicalId":338973,"journal":{"name":"2018 15th International Joint Conference on Computer Science and Software Engineering (JCSSE)","volume":"101 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126921376","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Bandwidth Reservation Approach to Improve Quality of Service in Software-Defined Networking: A Performance Analysis 提高软件定义网络服务质量的带宽预留方法:性能分析
Amirhossein Moravejosharieh, Michael J. Watts, Yu Song
Software-Defined Networking (SDN) is a new networking paradigm designed to resolve traditional IP network shortcomings by breaking the vertical integration of control and data planes. SDN separates the network control logic from underlying routers and switches and introduces the ability to program the network. Bandwidth reservation is an approach offered in SDN-enabled networks to guarantee relatively high Quality of Service for different types of media, e.g., video, audio or data. Although, this approach has been proven to be worthy of considering in SDN, there are still some concerns regarding its applicability in a relatively large networks. In this paper, we have evaluated the performance of bandwidth reservation approach in a relatively large-scaled SDN-enabled network in terms of its suitability when the number of users demanding for reserved bandwidth becomes larger. The obtained results from our simulation study show that bandwidth reservation can be beneficial only when the number of users asking for guaranteed bandwidth is relatively smaller than other users. Moreover, higher end-to-end QoS delivery can be achieved as an immediate outcome of deploying bandwidth reservation approach for a particular type of traffic flow, however, at the cost of incurring negative impact on other types of traffic flow in terms of achievable network throughput.
软件定义网络(SDN)是一种新的网络模式,它打破了控制平面和数据平面的垂直整合,解决了传统IP网络的缺点。SDN将网络控制逻辑从底层路由器和交换机中分离出来,并引入了对网络进行编程的能力。带宽预留是在支持sdn的网络中提供的一种方法,以保证对不同类型的媒体(例如视频、音频或数据)提供相对较高的服务质量。虽然这种方法在SDN中已经被证明是值得考虑的,但是在相对较大的网络中,它的适用性仍然存在一些问题。在本文中,我们评估了带宽保留方法在一个相对大规模的支持sdn的网络中,当需求保留带宽的用户数量增加时的适用性。仿真研究结果表明,只有当要求保证带宽的用户数量相对较少时,带宽预留才有好处。此外,为特定类型的流量部署带宽保留方法可以立即实现更高的端到端QoS交付,然而,其代价是在可实现的网络吞吐量方面对其他类型的流量产生负面影响。
{"title":"Bandwidth Reservation Approach to Improve Quality of Service in Software-Defined Networking: A Performance Analysis","authors":"Amirhossein Moravejosharieh, Michael J. Watts, Yu Song","doi":"10.1109/JCSSE.2018.8457339","DOIUrl":"https://doi.org/10.1109/JCSSE.2018.8457339","url":null,"abstract":"Software-Defined Networking (SDN) is a new networking paradigm designed to resolve traditional IP network shortcomings by breaking the vertical integration of control and data planes. SDN separates the network control logic from underlying routers and switches and introduces the ability to program the network. Bandwidth reservation is an approach offered in SDN-enabled networks to guarantee relatively high Quality of Service for different types of media, e.g., video, audio or data. Although, this approach has been proven to be worthy of considering in SDN, there are still some concerns regarding its applicability in a relatively large networks. In this paper, we have evaluated the performance of bandwidth reservation approach in a relatively large-scaled SDN-enabled network in terms of its suitability when the number of users demanding for reserved bandwidth becomes larger. The obtained results from our simulation study show that bandwidth reservation can be beneficial only when the number of users asking for guaranteed bandwidth is relatively smaller than other users. Moreover, higher end-to-end QoS delivery can be achieved as an immediate outcome of deploying bandwidth reservation approach for a particular type of traffic flow, however, at the cost of incurring negative impact on other types of traffic flow in terms of achievable network throughput.","PeriodicalId":338973,"journal":{"name":"2018 15th International Joint Conference on Computer Science and Software Engineering (JCSSE)","volume":"367 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121730262","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Transfer Learning for Leaf Classification with Convolutional Neural Networks 基于卷积神经网络的叶子分类迁移学习
H. Esmaeili, T. Phoka
Convolutional Neural Network (CNN) is taking a big role in image classification. B ut f ully t raining i mages by using CNN takes a plenty of time and uses a very large data set. This paper will focus on transfer learning, a technique that takes a pre-trained model e.g., Inception, Resnet or MobileNets models then retrains the model from the existing weights for a new classification p roblem. T he r etrain t echnique drastically decreases time spending in the training process and many fewer number of image data is required to yield high accuracy trained networks. This paper considers the problem of leaf image classification t hat t he e xisting a pproaches t ake m uch e ffort to choose various types of imagefeatures for classification. This also reflects p utting b iases b y c hoosing s ome f eatures a nd ignoring the other information in images. This paper will conduct the experiments in accuracy comparison between traditional leaf image classification using image processing techniques and CNN with transfer learning. The result will show that without much knowledge in image processing, the leaf image classification can be achieved with high accuracy using the transfer learning technique.
卷积神经网络(CNN)在图像分类中发挥着重要作用。但是完全使用CNN来训练i张图片需要花费大量的时间和使用非常大的数据集。本文将重点关注迁移学习,这是一种采用预训练模型(例如Inception, Resnet或MobileNets模型)的技术,然后根据现有的权重对模型进行重新训练,以解决新的分类问题。该技术大大减少了在训练过程中花费的时间,并且产生高精度训练网络所需的图像数据数量更少。本文考虑了树叶图像的分类问题,因为现有的方法都需要花费很大的精力来选择各种类型的图像特征进行分类。这也反映了通过选择图像中的某些特征而忽略图像中的其他信息来减少图像。本文将对采用图像处理技术的传统树叶图像分类与采用迁移学习的CNN进行准确率对比实验。结果表明,在不需要太多图像处理知识的情况下,利用迁移学习技术可以实现高精度的叶片图像分类。
{"title":"Transfer Learning for Leaf Classification with Convolutional Neural Networks","authors":"H. Esmaeili, T. Phoka","doi":"10.1109/JCSSE.2018.8457364","DOIUrl":"https://doi.org/10.1109/JCSSE.2018.8457364","url":null,"abstract":"Convolutional Neural Network (CNN) is taking a big role in image classification. B ut f ully t raining i mages by using CNN takes a plenty of time and uses a very large data set. This paper will focus on transfer learning, a technique that takes a pre-trained model e.g., Inception, Resnet or MobileNets models then retrains the model from the existing weights for a new classification p roblem. T he r etrain t echnique drastically decreases time spending in the training process and many fewer number of image data is required to yield high accuracy trained networks. This paper considers the problem of leaf image classification t hat t he e xisting a pproaches t ake m uch e ffort to choose various types of imagefeatures for classification. This also reflects p utting b iases b y c hoosing s ome f eatures a nd ignoring the other information in images. This paper will conduct the experiments in accuracy comparison between traditional leaf image classification using image processing techniques and CNN with transfer learning. The result will show that without much knowledge in image processing, the leaf image classification can be achieved with high accuracy using the transfer learning technique.","PeriodicalId":338973,"journal":{"name":"2018 15th International Joint Conference on Computer Science and Software Engineering (JCSSE)","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122001487","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
JCSSE 2018 Reviewers Page JCSSE 2018审稿人页面
{"title":"JCSSE 2018 Reviewers Page","authors":"","doi":"10.1109/jcsse.2018.8457374","DOIUrl":"https://doi.org/10.1109/jcsse.2018.8457374","url":null,"abstract":"","PeriodicalId":338973,"journal":{"name":"2018 15th International Joint Conference on Computer Science and Software Engineering (JCSSE)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123085084","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Comparative Study on Various Deep Learning Techniques for Thai NLP Lexical and Syntactic Tasks on Noisy Data 不同深度学习技术在泰国语NLP有噪声数据下词法和句法任务的比较研究
Amarin Jettakul, Chavisa Thamjarat, Kawin Liaowongphuthorn, Can Udomcharoenchaikit, P. Vateekul, P. Boonkwan
In Natural Language Processing (NLP), there are three fundamental tasks of NLP which are Tokenization being a part of a lexical level, Part-of-Speech tagging (POS) and Named-Entity-Recognition (NER) being parts of a syntactic level. Recently, there have been many deep learning researches showing their success in many domains. However, there has been no comparative study for Thai NLP to suggest the most suitable technique for each task yet. In this paper, we aim to provide a performance comparison among various deep learning-based techniques on three NLP tasks, and study the effect on synthesized OOV words and the OOV handling algorithm with Levenshtein distance had been provided due to the fact that most existing works relied on a set of vocabularies in the trained model and not being fit for noisy text in the real use case. Our three experiments were conducted on BEST 2010 I2R, a standard Thai NLP corpus on F1 measurement, with the different percentage of noises having been synthesized. Firstly, for Tokenization, the result shows that Synthai, a jointed bidirectional LSTM, has the best performance. Additionally, for POS, bi-directional LSTM with CRF has obtained the best performance. For NER, variational bi-directional LSTM with CRF has outperformed other methods. Finally, the effect of noises reduces the performance of all algorithms on these foundation tasks and the result shows that our OOV handling technique could improve the performance on noisy data.
在自然语言处理(NLP)中,有三个基本任务:词法层面的标记化、词性标注(POS)和句法层面的命名实体识别(NER)。近年来,深度学习的研究在许多领域都取得了成功。然而,目前还没有对泰国NLP进行比较研究,以建议最适合每个任务的技术。在本文中,我们旨在对基于深度学习的各种技术在三种NLP任务上的性能进行比较,并研究对合成OOV词的影响,以及由于现有的大多数工作依赖于训练模型中的一组词汇表而不适合实际用例中的噪声文本,因此提供了Levenshtein距离的OOV处理算法。我们的三个实验是在BEST 2010 I2R上进行的,这是一个标准的泰国NLP语料库,用于F1测量,合成了不同百分比的噪声。首先,在标记化方面,结果表明联合双向LSTM Synthai具有最好的性能。此外,在POS中,带CRF的双向LSTM获得了最好的性能。对于NER,基于CRF的变分双向LSTM优于其他方法。最后,噪声的影响降低了所有算法在这些基础任务上的性能,结果表明我们的OOV处理技术可以提高对噪声数据的性能。
{"title":"A Comparative Study on Various Deep Learning Techniques for Thai NLP Lexical and Syntactic Tasks on Noisy Data","authors":"Amarin Jettakul, Chavisa Thamjarat, Kawin Liaowongphuthorn, Can Udomcharoenchaikit, P. Vateekul, P. Boonkwan","doi":"10.1109/JCSSE.2018.8457368","DOIUrl":"https://doi.org/10.1109/JCSSE.2018.8457368","url":null,"abstract":"In Natural Language Processing (NLP), there are three fundamental tasks of NLP which are Tokenization being a part of a lexical level, Part-of-Speech tagging (POS) and Named-Entity-Recognition (NER) being parts of a syntactic level. Recently, there have been many deep learning researches showing their success in many domains. However, there has been no comparative study for Thai NLP to suggest the most suitable technique for each task yet. In this paper, we aim to provide a performance comparison among various deep learning-based techniques on three NLP tasks, and study the effect on synthesized OOV words and the OOV handling algorithm with Levenshtein distance had been provided due to the fact that most existing works relied on a set of vocabularies in the trained model and not being fit for noisy text in the real use case. Our three experiments were conducted on BEST 2010 I2R, a standard Thai NLP corpus on F1 measurement, with the different percentage of noises having been synthesized. Firstly, for Tokenization, the result shows that Synthai, a jointed bidirectional LSTM, has the best performance. Additionally, for POS, bi-directional LSTM with CRF has obtained the best performance. For NER, variational bi-directional LSTM with CRF has outperformed other methods. Finally, the effect of noises reduces the performance of all algorithms on these foundation tasks and the result shows that our OOV handling technique could improve the performance on noisy data.","PeriodicalId":338973,"journal":{"name":"2018 15th International Joint Conference on Computer Science and Software Engineering (JCSSE)","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115278721","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
An Approach to Bézier Curve Approximation by Circular Arcs 用圆弧逼近bsamzier曲线的一种方法
Taweechai Nuntawisuttiwong, N. Dejdumrong
This paper presents a method to approximate Béziercurves by a sequence of arc splines with inscribed regular polygon. The proposed algorithm uses the arc length approximation method in subdividing a Bézier curve into subcurves which have equal arc length. Each subcurve is interpolated with a line segment which is a side of the inscribed polygon of a curve. Curve segments are then clustered into a circular arc by evaluating interior angles of inscribed polygon. This method represents a Bézier curve with the minimum number of circular arcs and acceptable errors. The experimental results are provided the similarity of original curve and approximated arc spline. The approximated arc spline which is the result of proposed algorithm is compatible for vector and raster graphic format.
本文提出了一种用圆弧样条序列来近似bsamzier曲线的方法。该算法采用弧长近似法将bsamizier曲线细分为弧长相等的子曲线。每个子曲线都用线段插值,线段是曲线的内切多边形的一条边。然后通过计算内切多边形的内角,将曲线段聚类成圆弧。该方法表示具有最小圆弧数和可接受误差的bsamizier曲线。实验结果证明了原始曲线与近似弧样条曲线的相似性。该算法得到的近似弧样条曲线兼容矢量和栅格图形格式。
{"title":"An Approach to Bézier Curve Approximation by Circular Arcs","authors":"Taweechai Nuntawisuttiwong, N. Dejdumrong","doi":"10.1109/JCSSE.2018.8457180","DOIUrl":"https://doi.org/10.1109/JCSSE.2018.8457180","url":null,"abstract":"This paper presents a method to approximate Béziercurves by a sequence of arc splines with inscribed regular polygon. The proposed algorithm uses the arc length approximation method in subdividing a Bézier curve into subcurves which have equal arc length. Each subcurve is interpolated with a line segment which is a side of the inscribed polygon of a curve. Curve segments are then clustered into a circular arc by evaluating interior angles of inscribed polygon. This method represents a Bézier curve with the minimum number of circular arcs and acceptable errors. The experimental results are provided the similarity of original curve and approximated arc spline. The approximated arc spline which is the result of proposed algorithm is compatible for vector and raster graphic format.","PeriodicalId":338973,"journal":{"name":"2018 15th International Joint Conference on Computer Science and Software Engineering (JCSSE)","volume":"60 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124184988","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
2018 15th International Joint Conference on Computer Science and Software Engineering (JCSSE)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1