New verbs and dictionaries: A method for the automatic detection of neology in Spanish verbs
A. Castro, Rogelio Nazar, Irene Renau
International Journal of Lexicography (impact factor 0.8). Published 2021-09-07. DOI: 10.1093/ijl/ecab009 (https://doi.org/10.1093/ijl/ecab009)
The appearance of new verbs can be observed regularly, but verbs are not frequently investigated in neology, and they are difficult to detect automatically. In this study, a corpus-based method is proposed to detect Spanish verbs with a series of algorithms that analyse the morphology of regular verbs. The vocabulary was drawn from a large corpus and contrasted with a major dictionary of Spanish. Then, a series of filters were applied to distinguish between valid neologism candidates and spelling mistakes. Around 88% of the neologisms proposed by the method were correct and we estimate that the system detected 76% of the neologisms present in the corpus. This procedure can be included in the workflow of a lexicographic project as a regular part of the task, as a systematic way of collecting new verbs from the data and avoiding under-representation or bias.
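The pipeline described in the abstract (morphological analysis of regular verbs, comparison with a dictionary, then filtering of spelling mistakes) can be illustrated with a minimal sketch. This is not the authors' implementation: the ending table, the function names `candidate_lemmas` and `detect_neologisms`, and the `min_forms` threshold are all assumptions made for illustration, and only a few regular -ar forms are covered. The filter assumes that a genuine new verb tends to appear in several inflected forms, while a misspelling usually appears in only one.

```python
from collections import defaultdict

# Hypothetical, heavily simplified table of regular -ar endings
# (infinitive, present indicative, gerund, participle). A real system
# would cover all three conjugations and all tenses.
AR_ENDINGS = ("ar", "o", "as", "a", "amos", "áis", "an", "ando", "ado")


def candidate_lemmas(token):
    """Yield hypothetical -ar infinitives that could have produced `token`."""
    for ending in AR_ENDINGS:
        # Require a stem of at least three characters to limit noise.
        if token.endswith(ending) and len(token) > len(ending) + 2:
            yield token[: -len(ending)] + "ar"


def detect_neologisms(tokens, dictionary, min_forms=3):
    """Return candidate new verbs: lemmas absent from `dictionary` that are
    supported by at least `min_forms` distinct inflected forms in `tokens`."""
    forms_by_lemma = defaultdict(set)
    for token in tokens:
        token = token.lower()
        for lemma in candidate_lemmas(token):
            forms_by_lemma[lemma].add(token)
    return sorted(
        lemma
        for lemma, forms in forms_by_lemma.items()
        if lemma not in dictionary and len(forms) >= min_forms
    )


# Toy corpus: "tuitear" occurs in four forms, "tuitiar" is a one-off misspelling.
corpus_tokens = [
    "tuitear", "tuiteo", "tuiteando", "tuiteamos",
    "cantar", "canto", "cantando", "cantamos",
    "tuitiar",
]
known_verbs = {"cantar"}  # stand-in for the reference dictionary

print(detect_neologisms(corpus_tokens, known_verbs))  # ['tuitear']
```

In this toy run, "cantar" is discarded because it is in the dictionary, and "tuitiar" is discarded because only one form supports it. The reported figures (about 88% of proposed neologisms correct, about 76% of the corpus neologisms detected) correspond to the precision and recall of the real system, which this sketch does not attempt to reproduce.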
About the journal:
The International Journal of Lexicography was launched in 1988. Interdisciplinary as well as international, it is concerned with all aspects of lexicography, including issues of design, compilation and use, and with dictionaries of all languages, though the chief focus is on dictionaries of the major European languages - monolingual and bilingual, synchronic and diachronic, pedagogical and encyclopedic. The Journal recognizes the vital role of lexicographical theory and research, and of developments in related fields such as computational linguistics, and welcomes contributions in these areas.