The Crowdsourcing Method to Normalize “Bahasa Alay”, a Case of Indonesian Corpus

Rianto, Achmad Benny Mutiara, Eri Prasetyo Wibowo, P. Insap Santosa
{"title":"The Crowdsourcing Method to Normalize “Bahasa Alay”, a Case of Indonesian Corpus","authors":"Rianto, Achmad Benny Mutiara, Eri Prasetyo Wibowo, P. Insap Santosa","doi":"10.1109/ICIC50835.2020.9288534","DOIUrl":null,"url":null,"abstract":"In verbal communication, people use sentences that can be classified into two categories, namely formal and non- formal. The former meets the grammatical standard as prescribed by linguistic rules of the language, while the latter deviates it. In daily communication, however, non-formal sentences are more intensively used because they are more practical and easier to understand. With this deviation, nonformal sentences cause problems in linguistic computation because most linguistic computations use formal languages that already have standard rules. This research aims to develop an Indonesian closed corpus related to airline ticket reservations. The data used to develop the corpus are taken from conversations between customer service staff and consumers in airline ticket reservations. This is a preliminary study to propose and develop a chatbot in airline ticket reservations. The result of this study is the Indonesian closed corpus related to airline ticket reservations to determine the right response for consumers.","PeriodicalId":413610,"journal":{"name":"2020 Fifth International Conference on Informatics and Computing (ICIC)","volume":"27 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 Fifth International Conference on Informatics and Computing (ICIC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICIC50835.2020.9288534","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

In verbal communication, people use sentences that can be classified into two categories, namely formal and non- formal. The former meets the grammatical standard as prescribed by linguistic rules of the language, while the latter deviates it. In daily communication, however, non-formal sentences are more intensively used because they are more practical and easier to understand. With this deviation, nonformal sentences cause problems in linguistic computation because most linguistic computations use formal languages that already have standard rules. This research aims to develop an Indonesian closed corpus related to airline ticket reservations. The data used to develop the corpus are taken from conversations between customer service staff and consumers in airline ticket reservations. This is a preliminary study to propose and develop a chatbot in airline ticket reservations. The result of this study is the Indonesian closed corpus related to airline ticket reservations to determine the right response for consumers.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
以印尼语语料库为例:“印尼语”规范化的众包方法
在言语交际中,人们使用的句子可以分为两类,即正式和非正式。前者符合语言规则所规定的语法标准,后者则偏离语言规则。然而,在日常交际中,非正式句的使用频率更高,因为它们更实用,更容易理解。由于这种偏差,非形式语句会在语言计算中引起问题,因为大多数语言计算使用已经具有标准规则的形式语言。本研究旨在开发与机票预订相关的印尼语封闭语料库。用于开发语料库的数据取自客户服务人员与机票预订消费者之间的对话。这是一个初步的研究,提出并开发一个聊天机器人在机票预订。本研究的结果是印度尼西亚封闭语料库相关的机票预订,以确定正确的回应,为消费者。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Task Design for Indonesian Cultural Heritage Data Collection with Crowdsourcing PenalViz: A Web-Based Visualization Tool for the Indonesian Penal Code Examining GOJEK Drivers' Loyalty: The Influence of GOJEK's Partnership Mechanism and Service Quality Modeling and Analysis of Three-Phase Active Power Filter Integrated Photovoltaic as a Reactive Power Compensator Using the Simulink Matlab Tool An Evaluation of Internet Addiction Test (IAT)
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1