Computational Approach for Processing of Control Engineer Text: Applications for Corpus Lexicography
J. Vitayapirak, P. Ratiroch-anant
2006 IEEE Conference on Cybernetics and Intelligent Systems, 2006-06-07
DOI: 10.1109/ICCIS.2006.252266 (https://doi.org/10.1109/ICCIS.2006.252266)
Citation count: 1
Abstract
This research project reflects an increased awareness of the need to improve the flow of information on control engineering technology and of the current lack of a bilingualized (English-Thai-English) dictionary in this field. The central aim of the project is to develop a corpus of control engineering text. It starts with a survey of users' needs in control engineering English in three areas (control systems, automation and instrumentation), with texts extracted from textbooks and journals. The corpus comprises 2,141,293 words (tokens) of running text, and its linguistic data provides insights into the sublanguage of control engineering. The corpus was then analysed with a concordance program, the WordSmith Tools package, to produce frequency lists of individual words and collocations. The corpus findings (word frequencies, corpus evidence on word combinations, and typical usage shown in the concordance) are used to identify technical terms and to develop 665 entries for a bilingualized learners' dictionary of control engineering.
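The three kinds of corpus evidence named in the abstract (a word frequency list, word combinations, and concordance lines) can be illustrated with a minimal Python sketch. This is not the WordSmith Tools workflow used in the study: it relies only on the Python standard library, the function names `tokenize`, `frequency_list`, `bigram_counts` and `concordance` are hypothetical, the two sample sentences are invented for illustration, and adjacent word pairs stand in as a simple assumption for collocations.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and return its word tokens (hyphenated words stay whole)."""
    return re.findall(r"[a-z]+(?:-[a-z]+)*", text.lower())


def frequency_list(tokens):
    """Return (word, count) pairs sorted from most to least frequent."""
    return Counter(tokens).most_common()


def bigram_counts(tokens):
    """Count adjacent word pairs, a crude stand-in for collocation candidates."""
    return Counter(zip(tokens, tokens[1:]))


def concordance(tokens, keyword, width=3):
    """Return keyword-in-context lines with `width` tokens on each side."""
    lines = []
    for i, tok in enumerate(tokens):
        if tok == keyword:
            left = " ".join(tokens[max(0, i - width):i])
            right = " ".join(tokens[i + 1:i + 1 + width])
            lines.append(f"{left} [{tok}] {right}".strip())
    return lines


# Invented sample text; the real corpus holds 2,141,293 tokens.
sample = (
    "The feedback controller adjusts the control signal. "
    "A feedback controller reduces the steady-state error."
)
tokens = tokenize(sample)

print(frequency_list(tokens)[:3])
print(bigram_counts(tokens).most_common(2))
for line in concordance(tokens, "controller", width=2):
    print(line)
```

On a real corpus, recurring pairs such as "feedback controller" would surface as candidate technical terms, and the concordance lines would supply the typical usage that a dictionary entry draws on. Raw adjacent-pair counts favour frequent function words, so a fuller analysis would normally filter stop words or apply an association measure.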