Pub Date: 2019-12-10 DOI: 10.26615/978-2-9701095-6-3_013
Ángela Luque Giráldez, Míriam Seghiri Domínguez
Title: Extracción Terminológica Basada en Corpus Para la Traducción de Fichas Técnicas de Impresoras 3D
In: Proceedings of the Third International Conference, Europhras 2019, Computational and Corpus-Based Phraseology
Pub Date: 2019-09-25 DOI: 10.26615/978-2-9701095-6-3_001
M. Albano, Rosanna Badalamenti
Phraseology in Learner Language: The Case of French Idioms and Collocations Translated by Italian speaking Adult Learners. Tradulex
Title: Phraseology in Learner Language: The Case of French Idioms and Collocations Translated by Italian-speaking Adult Learners
Pub Date: 2019-09-25 DOI: 10.26615/978-2-9701095-6-3_014
I. Ivanković, Goranka Blagus Bartolec
Title: Corpus Analysis of Complex Names with Common Nouns in Croatian
Pub Date: 2019-09-25 DOI: 10.26615/978-2-9701095-6-3_008
S. Cataldo
Title: Procesos de Reconocimiento e Interpretación de Unidades Fraseológicas Metafóricas y Factores Influyentes
Pub Date: 2019-09-25 DOI: 10.26615/978-2-9701095-6-3_010
F. Gobbo
This paper presents the application of constructive adpositional grammars (CxAdGrams) to phraseological units, through the special case study of Esperanto. Constructive linguistics is an approach to human language analysis that considers constructions, themselves being paradigms of language-in-use, as the first units. Unlike other constructional approaches, constructive linguistics applies formalisms in understanding linguistic phenomena. The adpositional paradigm, the most developed formalism in constructive linguistics, is both understandable by humans and machine-readable. The term ‘constructive’ should also be understood in formal terms, as the adpositional paradigm is based on constructive mathematics, and in particular on topos theory. From a theoretical perspective, CxAdGrams describe human languages in terms of constructions, described as adpositional trees (in short, adtrees). This paper aims to explain why such an interpretation of constructions in terms of adtrees can be useful for a deeper understanding of phraseology. Esperanto is the case study chosen to give an empirical base to CxAdGrams. In particular, we illustrate the problematisation of its phraseology as well as the advantages of Esperanto in setting up workable prototypes in a short time.
Title: Constructive Linguistics for Computational Phraseology: the Esperanto Case
Pub Date: 2019-09-25 DOI: 10.26615/978-2-9701095-6-3_021
Gökhan Doğru
Title: Automatic Term Extraction from Turkish to English Medical Corpus
Pub Date: 2019-09-25 DOI: 10.26615/978-2-9701095-6-3_022
Jack Halpern
A basic assumption in bilingual lexicography and machine translation (MT) is that the linguistic units of one language correspond to those of another language. But even in close language pairs, such as Spanish and English, there are numerous exceptions, while in some language pairs, such as English and Japanese, cross-linguistic lexical anisomorphism is so great that it becomes literally impossible to map certain words and phrases across these languages. This is especially true of linguistic units that consist of multiple components, or multiword units (MWUs). The recognition and accurate translation of MWUs play a critical role in enhancing the quality of machine translation [9]. In spite of the recent advances in MT based on neural networks (NMT), MWUs still present major challenges to MT technology. This paper discusses the fundamental principles for identifying and selecting MWUs for inclusion in bilingual dictionaries, both for humans and for MT systems (MT lexicons). It attempts to define the various subtypes of MWU based on lexicographic principles derived from extensive experience in bilingual lexicography, especially the compilation of a large-scale full-form lexicon for Spanish-English MT. It also introduces some large-scale resources designed to significantly enhance the translation accuracy of multiword proper nouns.
Title: Lexicographic Criteria for Selecting Multiword Units for MT Lexicons
Pub Date: 2019-09-25 DOI: 10.26615/978-2-9701095-6-3_009
Ana Galvão, J. Baptista, N. Mamede
Processing verbal idioms is a challenging task for Natural Language Processing systems because they are syntactically analysable strings, with a well-formed structure identical to that of distributionally free sentences, but whose meaning is for the most part non-compositional. This paper presents recent advances in processing European Portuguese verbal idioms. From a lexicon-grammar matrix containing more than 2,500 verbal idioms and more than 100 (structural, distributional and transformational) properties, parsing rules are automatically generated within the framework of a rule-based incremental parser. They are then integrated in STRING, a fully-fledged natural language processing system for Portuguese. The system now identifies not only the idioms’ base forms, but also the sentences resulting from some productive and very general transformations (passive, pronominalisation) admitted by some of these idioms. Other improvements include a newly developed lexicon-grammar validator, a new generation module for transformations’ examples, and a new, more granular evaluation module. An intrinsic evaluation achieves an overall recall of 92.5%.
Title: Processing European Portuguese Verbal Idioms: From the Lexicon-Grammar to a Rule-based Parser
Pub Date: 2019-09-25 DOI: 10.26615/978-2-9701095-6-3_016
C. Nigrelli
Title: On the Impact of (Il)literacy on L2 Italian Acquisition of Unaccompanied Foreign Minors
Pub Date: 2019-09-25 DOI: 10.26615/978-2-9701095-6-3_006
Goranka Blagus Bartolec, I. Ivanković
Title: Orthography in Practice: Corpus-based Verification of Writing Ktetics in MWUs in Croatian