Morphological Analysis of Egyptian Children Corpus by KIDEVAL Program

H. Salama, S. Alansary, Amany Elshazly
{"title":"Morphological Analysis of Egyptian Children Corpus by KIDEVAL Program","authors":"H. Salama, S. Alansary, Amany Elshazly","doi":"10.1109/ESOLEC54569.2022.10009437","DOIUrl":null,"url":null,"abstract":"The aim of this study is to provide a morphological analysis of the Egyptian children corpus, which is a morphologically tagged and disambiguated in CHILDES. This allows the KIDEVAL program to be readily used on the corpus to address questions regarding the acquisition of Egyptian Arabic. KIDEVAL is one of the useful tools in CLAN program which has been particularly useful toolsets in the study of language acquisition in many languages. However, applications of corpus-based analyses to Egyptian children's language have not yet been conducted. This study describes how to use the KIDEVAL program for analyzing Egyptian children's language and study the development of word frequency patterns of parts of speech and order of development of grammatical morphemes in Egyptian Arabic. The output of morphological analysis enables researchers to study and answer many questions regarding the development of a grammatical morpheme in Egyptian Arabic, as well as a lot of questions that can readily be probed with KIDEVAL. The Egyptian Arabic corpus is downloaded from the Arabic part of the CHILDES database. It comprises 10transcripts from Egyptian-speaking children aged 1;7 to3;8 years, with a total of 25,645 words. The KIDEVAL program analysis profile for Egyptian Arabic children's corpus in this study reveals extensive and valuable analysis, displaying the number of occurrences of each part of speech for each child depends on his age which includes 54 categories and subcategories. The usage of the KIDEVAL tool is efficient because it reduces the time needed to label the corpus manually.","PeriodicalId":179850,"journal":{"name":"2022 20th International Conference on Language Engineering (ESOLEC)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2022-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 20th International Conference on Language Engineering (ESOLEC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ESOLEC54569.2022.10009437","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

The aim of this study is to provide a morphological analysis of the Egyptian children corpus, which is a morphologically tagged and disambiguated in CHILDES. This allows the KIDEVAL program to be readily used on the corpus to address questions regarding the acquisition of Egyptian Arabic. KIDEVAL is one of the useful tools in CLAN program which has been particularly useful toolsets in the study of language acquisition in many languages. However, applications of corpus-based analyses to Egyptian children's language have not yet been conducted. This study describes how to use the KIDEVAL program for analyzing Egyptian children's language and study the development of word frequency patterns of parts of speech and order of development of grammatical morphemes in Egyptian Arabic. The output of morphological analysis enables researchers to study and answer many questions regarding the development of a grammatical morpheme in Egyptian Arabic, as well as a lot of questions that can readily be probed with KIDEVAL. The Egyptian Arabic corpus is downloaded from the Arabic part of the CHILDES database. It comprises 10transcripts from Egyptian-speaking children aged 1;7 to3;8 years, with a total of 25,645 words. The KIDEVAL program analysis profile for Egyptian Arabic children's corpus in this study reveals extensive and valuable analysis, displaying the number of occurrences of each part of speech for each child depends on his age which includes 54 categories and subcategories. The usage of the KIDEVAL tool is efficient because it reduces the time needed to label the corpus manually.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
用kidval程序对埃及儿童语料库进行形态学分析
本研究的目的是提供一个形态学分析的埃及儿童语料库,这是一个形态标记和消除歧义在CHILDES。这使得KIDEVAL程序可以很容易地在语料库上使用,以解决有关埃及阿拉伯语获取的问题。KIDEVAL是CLAN程序中非常有用的工具之一,在许多语言的语言习得研究中一直是非常有用的工具集。然而,基于语料库的分析尚未应用于埃及儿童语言。本研究描述了如何使用KIDEVAL程序分析埃及儿童的语言,研究埃及阿拉伯语词性词频模式的发展和语法语素的发展顺序。形态分析的输出使研究人员能够研究和回答关于埃及阿拉伯语语法语素发展的许多问题,以及许多可以很容易地用KIDEVAL探索的问题。埃及阿拉伯语语料库是从CHILDES数据库的阿拉伯语部分下载的。它包括10份来自说埃及语的1、7到3、8岁儿童的成绩单,共计25,645个单词。本研究中埃及阿拉伯语儿童语料库的KIDEVAL程序分析概况揭示了广泛而有价值的分析,显示了每个儿童的每个词性的出现次数取决于他的年龄,其中包括54个类别和子类别。使用KIDEVAL工具是有效的,因为它减少了手动标记语料库所需的时间。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
A Novel Dataset for Known and Unknown Ancient Arabic Manuscripts Sentiment Analysis: Amazon Electronics Reviews Using BERT and Textblob Arabic Documents Layout Analysis (ADLA) using Fine-tuned Faster RCN Towards a Psycholinguistic Database of Arabic Neural Networks for Bilingual Machine Translation Model
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1