{"title":"建立阿拉伯语心理语言学数据库","authors":"N. Fathy, S. Alansary","doi":"10.1109/ESOLEC54569.2022.10009144","DOIUrl":null,"url":null,"abstract":"Psycholinguistic databases are indispensable resources for psycholinguistic and computational research. Many languages have such valuable resources, such as English, Croatian, Dutch, French, and Chinese. Unfortunately, Arabic doesn't have such databases. This research aims at introducing the guidelines for building a psycholinguistic database of Arabic. The database will be available in two phases: the first is a psycholinguistic phase in which subjective ratings are collected for several variables such as concreteness, imageability, subjective frequency, and number of meanings, the second is a computational phase in which ratings are stacked with other linguistic information obtained from corpora, such as root, stem, objective frequency, number of syllables, and word length. This phase is meant to provide an online searchable release that can be used by psycholinguists and computational linguists for building cognitive-based artificial intelligence models. This survey is meant to introduce the building process of the psycholinguistic phase in detail.","PeriodicalId":179850,"journal":{"name":"2022 20th International Conference on Language Engineering (ESOLEC)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2022-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Towards a Psycholinguistic Database of Arabic\",\"authors\":\"N. Fathy, S. Alansary\",\"doi\":\"10.1109/ESOLEC54569.2022.10009144\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Psycholinguistic databases are indispensable resources for psycholinguistic and computational research. Many languages have such valuable resources, such as English, Croatian, Dutch, French, and Chinese. Unfortunately, Arabic doesn't have such databases. This research aims at introducing the guidelines for building a psycholinguistic database of Arabic. The database will be available in two phases: the first is a psycholinguistic phase in which subjective ratings are collected for several variables such as concreteness, imageability, subjective frequency, and number of meanings, the second is a computational phase in which ratings are stacked with other linguistic information obtained from corpora, such as root, stem, objective frequency, number of syllables, and word length. This phase is meant to provide an online searchable release that can be used by psycholinguists and computational linguists for building cognitive-based artificial intelligence models. This survey is meant to introduce the building process of the psycholinguistic phase in detail.\",\"PeriodicalId\":179850,\"journal\":{\"name\":\"2022 20th International Conference on Language Engineering (ESOLEC)\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-10-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 20th International Conference on Language Engineering (ESOLEC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ESOLEC54569.2022.10009144\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 20th International Conference on Language Engineering (ESOLEC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ESOLEC54569.2022.10009144","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Psycholinguistic databases are indispensable resources for psycholinguistic and computational research. Many languages have such valuable resources, such as English, Croatian, Dutch, French, and Chinese. Unfortunately, Arabic doesn't have such databases. This research aims at introducing the guidelines for building a psycholinguistic database of Arabic. The database will be available in two phases: the first is a psycholinguistic phase in which subjective ratings are collected for several variables such as concreteness, imageability, subjective frequency, and number of meanings, the second is a computational phase in which ratings are stacked with other linguistic information obtained from corpora, such as root, stem, objective frequency, number of syllables, and word length. This phase is meant to provide an online searchable release that can be used by psycholinguists and computational linguists for building cognitive-based artificial intelligence models. This survey is meant to introduce the building process of the psycholinguistic phase in detail.