This article examines the status of constructed controlled terminologies from the perspective of the coverage of terms/concepts. To facilitate controlled authoring of Japanese texts of the municipal domain and promote machine translatability into English, we constructed terminologies in the following way: (1) Japanese-English term pairs are extracted from aligned texts; (2) term variations are controlled by defining preferred and proscribed terms for both languages. To assess the coverage of the constructed terminologies, we propose a quantitative extrapolation method that estimates the potential vocabulary size. The coverage estimations show that the coverage of terms for Japanese is higher than that for English by about 10%, which reflects the greater diversity of the translated English terms. The coverage of concepts reaches around 60% for both Japanese and English. The method also enables us to quantitatively estimate how much effort is needed to further increase the coverage.
{"title":"Building controlled bilingual terminologies for the municipal domain and evaluating them using a coverage estimation\u0000 approach","authors":"Rei Miyata, K. Kageura","doi":"10.1075/TERM.00017.MIY","DOIUrl":"https://doi.org/10.1075/TERM.00017.MIY","url":null,"abstract":"\u0000 This article examines the status of constructed controlled terminologies from the perspective of the coverage of terms/concepts. To\u0000 facilitate controlled authoring of Japanese texts of the municipal domain and promote machine translatability into English, we\u0000 constructed terminologies in the following way: (1) Japanese-English term pairs are extracted from aligned texts; (2) term\u0000 variations are controlled by defining preferred and proscribed terms for both languages. To assess the coverage of the constructed\u0000 terminologies, we propose a quantitative extrapolation method that estimates the potential vocabulary size. The coverage\u0000 estimations show that the coverage of terms for Japanese is higher than that for English by about 10%, which\u0000 reflects the greater diversity of the translated English terms. The coverage of concepts reaches around 60% for\u0000 both Japanese and English. The method also enables us to quantitatively estimate how much effort is needed to further increase the\u0000 coverage.","PeriodicalId":44429,"journal":{"name":"Terminology","volume":null,"pages":null},"PeriodicalIF":0.8,"publicationDate":"2018-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46955730","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}