Automatic analysis of caregiver input and child production

IF 0.4 0 LANGUAGE & LINGUISTICS Korean Linguistics Pub Date : 2022-09-30 DOI:10.1075/kl.20002.shi
Gyu-Ho Shin
{"title":"Automatic analysis of caregiver input and child production","authors":"Gyu-Ho Shin","doi":"10.1075/kl.20002.shi","DOIUrl":null,"url":null,"abstract":"\n The present study explores the applicability of Natural Language Processing (NLP) techniques to investigate child\n corpora in Korean. We employ caregiver input and child production data in the CHILDES database, currently the largest and\n open-access Korean child corpus data, and apply NLP techniques to the data in two ways: automatic Part-of-Speech tagging by\n adapting a machine learning algorithm, and (semi-)automatic extraction of constructional patterns expressing a transitive event\n (active transitive and suffixal passive). As the first empirical report on NLP-assisted analysis of Korean child corpora, this\n study is expected to reveal its advantages and drawbacks, thereby opening the window to furthering corpus-mediated research on\n child language development in Korean. Implications of this study’s findings will also contribute to research practice regarding\n developmental studies on Korean through child corpora, ensuring the reproducibility of procedures and results, which is often\n lacking in previous corpus-based research on child language development in Korean.","PeriodicalId":29725,"journal":{"name":"Korean Linguistics","volume":" ","pages":""},"PeriodicalIF":0.4000,"publicationDate":"2022-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Korean Linguistics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1075/kl.20002.shi","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"LANGUAGE & LINGUISTICS","Score":null,"Total":0}
引用次数: 2

Abstract

The present study explores the applicability of Natural Language Processing (NLP) techniques to investigate child corpora in Korean. We employ caregiver input and child production data in the CHILDES database, currently the largest and open-access Korean child corpus data, and apply NLP techniques to the data in two ways: automatic Part-of-Speech tagging by adapting a machine learning algorithm, and (semi-)automatic extraction of constructional patterns expressing a transitive event (active transitive and suffixal passive). As the first empirical report on NLP-assisted analysis of Korean child corpora, this study is expected to reveal its advantages and drawbacks, thereby opening the window to furthering corpus-mediated research on child language development in Korean. Implications of this study’s findings will also contribute to research practice regarding developmental studies on Korean through child corpora, ensuring the reproducibility of procedures and results, which is often lacking in previous corpus-based research on child language development in Korean.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
自动分析照顾者的输入和孩子的生产
本研究探讨自然语言处理(NLP)技术在韩语儿童语料库研究中的适用性。我们在CHILDES数据库(目前最大的开放访问韩语儿童语料库数据)中使用照顾者输入和儿童生产数据,并以两种方式对数据应用NLP技术:通过采用机器学习算法自动标记词性,以及(半)自动提取表达及物事件的结构模式(主动及物和后缀被动)。本研究是首个使用nlp辅助分析韩语儿童语料库的实证报告,希望能够揭示其优势和不足,从而为进一步开展语料库介导的韩语儿童语言发展研究打开一扇窗。本研究结果的启示也将有助于通过儿童语料库进行韩语发展研究的研究实践,确保程序和结果的可重复性,这在以往基于语料库的韩语儿童语言发展研究中经常缺乏。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
0.30
自引率
0.00%
发文量
0
期刊最新文献
On the theoretical basis of textbooks of Korean linguistics: for the students of the convergence era A study on Suffix Clipped Words in Korean: Focusing on the Clipping of Suffix Ending Syllable ‘i’ On the Education of Korean Linguistics in the Age of Convergence The Meaning structures in sino-korean word with Four Characters consisting of ‘之’: Focusing on the composition of ‘non-Predicative two-character word + ji(之) + single-character word’ Diachronic Changes in Kyorinsuji"s Spellings Published in the Early Meiji Era: Focusing on the Replacement of "hɐ" with "heo-"
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1