PolylexFLE
Authors: A. Todirascu, T. François, Marion Cargill
Journal: ITL - International Journal of Applied Linguistics
Publication date: 2024-04-05
Publication type: Journal Article
DOI: 10.1075/itl.22031.tod (https://doi.org/10.1075/itl.22031.tod)
Citation count: 0
Abstract
Knowledge of multiword expressions (MWEs) is key to learning a foreign language, but teaching it remains hindered by the lack
of lists of expressions tied to pedagogical aims. In this paper, we present an extended version of the PolylexFLE database,
containing 4,525 French MWEs of three types: idioms, collocations, and fixed expressions. To propose
exercises that follow the difficulty scale of the Common European Framework of Reference for Languages (CEFR), we used a mixed approach
(manual and automatic) to annotate 1,186 expressions with CEFR levels. The paper focuses mostly on the automatic
procedure, which first identifies the expressions from the PolylexFLE database (and their variants) in a corpus of pedagogical texts
(with CEFR labels) using a pattern-based system. In a second step, the distribution of these expressions in this corpus is estimated and converted
into a single CEFR level. Finally, the proposed automatic approach is evaluated by 52 learners of French as a foreign language.
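
The two-step automatic procedure described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' system: the regular expression, the toy corpus, the per-1,000-token normalisation, and the "first attested level" rule used to collapse the distribution into one level are all assumptions made for illustration, since the abstract does not specify them.

```python
import re
from collections import Counter

CEFR_LEVELS = ["A1", "A2", "B1", "B2", "C1", "C2"]


def level_distribution(pattern, corpus):
    """Step 1 (assumed form): count pattern matches per CEFR level,
    normalised per 1,000 whitespace-separated tokens."""
    regex = re.compile(pattern, re.IGNORECASE)
    hits, tokens = Counter(), Counter()
    for level, text in corpus:
        hits[level] += len(regex.findall(text))
        tokens[level] += len(text.split())
    return {
        lvl: 1000 * hits[lvl] / tokens[lvl] if tokens[lvl] else 0.0
        for lvl in CEFR_LEVELS
    }


def first_attested_level(distribution, threshold=0.0):
    """Step 2 (hypothetical rule): return the lowest CEFR level whose
    normalised frequency exceeds the threshold, or None if there is none."""
    for lvl in CEFR_LEVELS:
        if distribution[lvl] > threshold:
            return lvl
    return None


# Hypothetical pattern for the idiom "poser un lapin" and simple variants
# (inflected verb, one optional inserted word, singular or plural noun).
POSER_UN_LAPIN = r"\bpos\w*\s+(?:\w+\s+)?(?:un|des)\s+lapins?\b"

# Toy corpus of (CEFR label, text) pairs, invented for this example.
toy_corpus = [
    ("A1", "Je mange une pomme ."),
    ("B1", "Il m'a posé un lapin hier soir ."),
    ("B2", "Elle pose souvent des lapins à ses amis ."),
]

dist = level_distribution(POSER_UN_LAPIN, toy_corpus)
print(dist)
print(first_attested_level(dist))  # B1 on this toy corpus
```

A real system would likely need lemmatisation or syntactic patterns rather than a surface regular expression, and a more robust rule than first attestation, for instance one that is less sensitive to a single stray occurrence in a low-level text.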