Linguamatica最新文献

英文中文

PoNTE: apontando para corpos de aprendizes de tradução avançados 桥梁:针对高级翻译学习者群体

IF 0.6 Q4 LINGUISTICS

Linguamatica

Pub Date : 2014-07-31 DOI: 10.21814/LM.6.1.171

Diana Santos

E possivel ensinar usando os materiais criados pelos proprios alunos, e ao mesmo tempo anota-los para obter mais material que fique publico para mais professores e estudantes? E possivel desenvolver o DISPARA, inicialmente concebido para disponibilizar corpos de traducao "tradicionais", de forma a conter mais um nivel de anotacao de "critica de traducao"? O projeto PoNTE pretende ser uma abordagem inicial a estas duas questoes. Neste artigo, descrevo o tipo de comentarios e anotacao critica que seria desejavel ter codificado num corpo deste genero, a sua primeira implementacao e estudos realizados, e os problemas tecnicos que ainda se poem na gestao de um corpo sempre crescente.

是否有可能使用学生自己创建的材料进行教学，同时将它们写下来，以便为更多的教师和学生提供更多的材料?是否有可能开发射，最初的设计是提供“传统”翻译体，以包含另一个级别的“翻译批评”注释?桥梁项目旨在作为解决这两个问题的初步方法。在这篇文章中，我描述了在这类机构中编码的评论和关键注释的类型，它的第一次实现和研究，以及在管理一个不断增长的机构中仍然存在的技术问题。

引用次数: 2

Realização de Previsões com Conteúdos Textuais em Português 用葡萄牙语文本内容进行预测

IF 0.6 Q4 LINGUISTICS

Linguamatica

Pub Date : 2014-07-31 DOI: 10.21814/LM.6.1.174

Indira Gandi Mascarenhas de Brito, Bruno Martins

A previsao de quantidades do mundo real com base em informacao textual atraiu recentemente um interesse significativo, embora os estudos anteriores se tenham concentrado em aplicacoes que envolvem apenas textos em ingles. Este artigo apresenta um estudo experimental sobre a realizacao de previsoes com base em textos em portugues, envolvendo o uso de documentos associados a tres dominios distintos. Relatamos experiencias utilizando diferentes tipos de modelos de regressao, usando esquemas de ponderacao para as carateristicas descritivas do atual estado da arte, e usando carateristicas descritivas derivadas de representacoes para as palavras baseadas no agrupamento automatico das mesmas. Atraves de experiencias, demonstramos que modelos de regressao usando a informacao textual atingem melhores resultados, quando comparados com abordagens simples tais como realizar as previsoes com base no valor medio dos dados de treino. Demonstramos ainda que as representacoes de documentos mais ricas (e.g., usando o algoritmo de Brown para o agrupamento automatico de palavras, e o esquema de ponderacao das carateristicas denominado Delta-TF-IDF) resultam em ligeiras melhorias no desempenho.

基于文本信息预测真实世界的数量最近引起了人们的极大兴趣，尽管以前的研究集中在只涉及英语文本的应用上。本文提出了一项基于葡萄牙语文本进行预测的实验研究，涉及使用与三个不同领域相关的文件。我们报告了使用不同类型的回归模型的经验，对当前技术状态的描述特征使用加权方案，并使用基于自动聚类的单词表征衍生的描述特征。通过实验证明，使用文本信息的回归模型与基于训练数据平均值进行预测等简单方法相比，效果更好。我们还证明了更丰富的文档表示(例如，使用Brown算法进行自动单词聚类，以及称为Delta-TF-IDF的特征加权方案)会导致性能的轻微改善。

引用次数: 0

Usando Grades de Entidades na Análise Automática de Coerência Local em Textos Científicos 在科学文本局部一致性自动分析中的实体网格

IF 0.6 Q4 LINGUISTICS

Linguamatica

Pub Date : 2014-07-31 DOI: 10.21814/LM.6.1.176

Alison Rafael Polpeta Freitas, V. D. Feltrim

EnglishIn this paper we investigate the applicability of Barzilay and Lapata’s (2008) entity-grid model in the evaluation of local coherence in scientific abstracts written in Portuguese. More specifically, we focused on assessing whether such model could be employed in the implementation of a classifier capable of detecting linearity breaks that affect text coherence. Our experimental results are close to those of the original entity-grid model for English and very similar to the results reported by related works for other languages. In experiments with scientific abstracts, results are close to those obtained by human judges, showing that the entity-grid model can be used in the investigated context. portuguesEste artigo apresenta os resultados de uma investigacao acerca da aplicabilidade do modelo grade de entidades proposto por Barzilay e Lapata (2008) na avaliacao de coeencia local em resumos cientificos escritos em portugues. Mais especificamente, se buscou avaliar se tal modelo poderia ser empregado na implementacao de um classificador capaz de detectar quebras de linearidade que afetam a coerencia dos resumos. Os resultados experimentais se mostraram proximos aos do modelo original para a lingua inglesa e semelhantes aos relatados por trabalhos relacionados para outras linguas. Nos experimentos com resumos cientificos, os resultados foram proximos ao obtido por juizes humanos, mostrando que o modelo grade de entidades tem potencial para ser aplicado no contexto investigado.

在本文中，我们研究了Barzilay和Lapata(2008)的实体网格模型在评估葡萄牙语科学摘要局部一致性中的适用性。更具体地说，我们专注于评估这种模型是否可以用于实现能够检测影响文本连贯性的线性中断的分类器。我们的实验结果与英语实体网格模型的原始结果接近，与其他语言的相关研究结果非常相似。在科学摘要的实验中，结果与人类判断的结果接近，表明实体网格模型可以在研究的背景下使用。本文介绍了葡萄牙的研究成果，并提出了一种新的研究方法，即在葡萄牙的研究成果中采用了一种新的方法，即在巴西和拉帕塔(2008)提出了一种新的方法，即在葡萄牙的研究成果中采用一种新的方法。具体地说，本文提出了一种基于数据集的数据集模型，实现了基于数据集的数据集分类能力和基于线性数据集的数据集检测能力。实验结果表明，最接近的近义词与原语的近义词相比，更接近的近义词与原语的近义词相比，更接近的近义词与原语的近义词相比，更接近的近义词与原语的近义词相比。号experimentos com resumos cientificos、os resultados有孔虫下月的ao obtido为什么juizes humanos, mostrando, o莫德罗年级de entidades tem潜在对位ser aplicado没有contexto investigado。

{"title":"Usando Grades de Entidades na Análise Automática de Coerência Local em Textos Científicos","authors":"Alison Rafael Polpeta Freitas, V. D. Feltrim","doi":"10.21814/LM.6.1.176","DOIUrl":"https://doi.org/10.21814/LM.6.1.176","url":null,"abstract":"EnglishIn this paper we investigate the applicability of Barzilay and Lapata’s (2008) entity-grid model in the evaluation of local coherence in scientific abstracts written in Portuguese. More specifically, we focused on assessing whether such model could be employed in the implementation of a classifier capable of detecting linearity breaks that affect text coherence. Our experimental results are close to those of the original entity-grid model for English and very similar to the results reported by related works for other languages. In experiments with scientific abstracts, results are close to those obtained by human judges, showing that the entity-grid model can be used in the investigated context. portuguesEste artigo apresenta os resultados de uma investigacao acerca da aplicabilidade do modelo grade de entidades proposto por Barzilay e Lapata (2008) na avaliacao de coeencia local em resumos cientificos escritos em portugues. Mais especificamente, se buscou avaliar se tal modelo poderia ser empregado na implementacao de um classificador capaz de detectar quebras de linearidade que afetam a coerencia dos resumos. Os resultados experimentais se mostraram proximos aos do modelo original para a lingua inglesa e semelhantes aos relatados por trabalhos relacionados para outras linguas. Nos experimentos com resumos cientificos, os resultados foram proximos ao obtido por juizes humanos, mostrando que o modelo grade de entidades tem potencial para ser aplicado no contexto investigado.","PeriodicalId":41819,"journal":{"name":"Linguamatica","volume":"6 1","pages":"29-40"},"PeriodicalIF":0.6,"publicationDate":"2014-07-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"68370649","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

NERP-CRF: uma ferramenta para o reconhecimento de entidades nomeadas por meio de Conditional Random Fields NERP-CRF:一种通过条件随机字段识别命名实体的工具

IF 0.6 Q4 LINGUISTICS

Linguamatica

Pub Date : 2014-07-31 DOI: 10.21814/LM.6.1.177

Daniela O. F. do Amaral, Renata Vieira

Conditional Random Fields (CRF) e um metodo probabilistico de predicao estruturada que tem sido amplamente aplicado em diversas areas, tais como a de Processamento da Linguagem Natural (PLN), incluindo o Reconhecimento de Entidades Nomeadas (REN), visao computacional e bioinformatica. Nesse sentido, propoe-se a realizacao da tarefa de REN aplicando o metodo CRF e, sequencialmente, e feita uma avaliacao do seu desempenho com base no corpus do HAREM. Conclui-se que, nos testes realizados, o sistema NERP-CRF obteve os melhores resultados de Precisao quando comparado com os sistemas avaliados no mesmo corpus, com plenas condicoes de ser um sistema competitivo e eficaz.

条件随机数域(CRF)是一种概率结构谓词方法，已广泛应用于自然语言处理(nlp)、命名实体识别(REN)、计算视觉和生物信息学等领域。在此意义上，提出了应用CRF方法执行REN任务，并基于后宫语料库依次对REN任务的表现进行评价。我们的结论是，在测试中，NERP-CRF系统与在同一语料库中评估的系统相比，获得了最好的精度结果，具有竞争力和有效性。

引用次数: 18

La subjetivización del de que en el español de Colombia 在哥伦比亚的西班牙语中，主体性

IF 0.6 Q4 LINGUISTICS

Linguamatica

Pub Date : 2013-12-31 DOI: 10.21814/lm.5.2.158

Matías Guzmán Naranjo

This paper deals with the phenomenon known as dequeismo in Spanish, in particular with the semantic differences between canonical sentences with que and dequeista sentences. I analyze previous corpus studies of dequeismo, test their predictions with an independent corpus of spoken Colombian Spanish, and finally carry out a logistic regression to test new possible parameters that might influence speaker's choice. The result is that most previous accounts of dequeismo are not consistent with the corpus used for this study, and that dequeismo seems to be influenced by the discourse context (real or non real situations ), and speaker's subjectivity.

本文研究了西班牙语中的“脱句”现象，重点讨论了典型句中“脱句”和“脱句”的语义差异。我分析了以前关于dequeismo的语料库研究，用一个独立的哥伦比亚西班牙语口语语料库测试了他们的预测，最后进行了逻辑回归，以测试可能影响说话者选择的新参数。结果是，大多数先前关于脱句的描述与本研究使用的语料库不一致，而且脱句似乎受到话语语境(真实或非真实情景)和说话人主体性的影响。

引用次数: 0

Hacia un modelo computacional unificado del lenguaje natural 走向统一的自然语言计算模型

IF 0.6 Q4 LINGUISTICS

Linguamatica

Pub Date : 2013-12-31 DOI: 10.21814/LM.5.2.161

B. González

?Que tipo de formalismo debe utilizarse para representar el lenguaje natural? Es necesario un formalismo capaz de describir adecuadamente todas las secuencias de las lenguas naturales. Pero, ademas, en la medida de lo posible, debe ser un formalismo sencillo, de un coste computacional reducido. Esta pregunta ha generado mucha controversia entre las principales escuelas generativas: la Gramatica Transformacional y las Gramaticas de Unificacion. En este articulo se defiende que, pese a las diferencias existentes, en ultima instancia, tales escuelas formalizan el lenguaje humano mediante un mismo tipo de formalismo bien definido: lo que Noam Chomsky llamo lenguaje independiente del contexto. Bajo el prisma de este articulo, la Linguistica actual esta en condiciones de ofrecer un modelo computacional unificado del lenguaje natural.

应该用什么样的形式主义来表示自然语言?需要一种能够充分描述自然语言所有序列的形式主义。但同时，在可能的情况下，它应该是一种简单的形式主义，降低计算成本。这个问题在主要的生成学派中引起了很多争议:转化语法和统一语法。在这篇文章中，我们提出了一种理论，认为语言是一种独立于上下文的语言，尽管存在差异，但最终这些学校通过一种定义明确的形式主义将人类语言形式化。从本文的角度来看，目前的语言学能够提供一个统一的自然语言计算模型。

引用次数: 2

Testuen sinplifikazio automatikoa: arloaren egungo egoera 自动文本简化：图标的当前位置

IF 0.6 Q4 LINGUISTICS

Linguamatica

Pub Date : 2013-12-31 DOI: 10.21814/LM.5.2.163

Itziar Gonzalez-Dios, M.ª Jesús Aranzabe, A. D. D. Ilarraza

The aim of this paper is to give an overview of the state-of-art in automatic text simplification. To that end, we present the systems and prototypes according to the language the are built for, their target audience and the type of simplification (syntactic, lexical or both) they perform. Moreover, we expound the different evaluation methods that have been carried out with these systems and the resources and tools developed so far.

本文的目的是概述自动文本简化的最新进展。为此，我们根据所构建的语言、目标受众和它们所执行的简化类型(语法、词汇或两者)来呈现系统和原型。此外，我们还阐述了利用这些系统进行的不同评估方法以及迄今为止开发的资源和工具。

引用次数: 6

Desenvolvimento de um recurso léxico com papéis semânticos para o português 为葡萄牙语开发具有语义角色的词汇资源

IF 0.6 Q4 LINGUISTICS

Linguamatica

Pub Date : 2013-12-31 DOI: 10.21814/LM.5.2.167

Leonardo Zilio, Carlos Ramisch, M. J. B. Finatto

The objectives of this study are as follows: to present a methodology for the development of a lexical resource with semantic information; to compare semantic roles in specialized and non-specialized language; and to observe the semantic role labeling (SRL) made by a group of annotators. Two experiments revolving around SRL in Portuguese were developed: a comparison between data in specialized and non-specialized language corpora; and an annotation evaluation for verifying the agreement among multiple annotators for the task of SRL. As for results, a qualitative difference between the corpora was observed, and the most prominent feature was the omission of agents in specialized texts. There was little agreement among annotators, which points toward the necessity of more training, or a simplification of the task, which does not seem to be possible at this stage of development.

本研究的目的是:提出一种具有语义信息的词汇资源开发方法;比较专门化语言和非专门化语言的语义角色;观察一组注释者所做的语义角色标注(SRL)。围绕葡萄牙语的SRL进行了两个实验:专业语料库和非专业语料库数据的比较;以及用于验证SRL任务的多个注释者之间的一致性的注释评估。从结果来看，两种语料库之间存在着质的差异，最突出的特征是在专业文本中省略了代理。注释者之间几乎没有达成一致意见，这表明有必要进行更多的培训，或者简化任务，这在目前的发展阶段似乎是不可能的。

引用次数: 2

Imaxin|software - 16 Anos Desenvolvendo Aplicações No Campo Do Processamento Da Linguagem Natural Multilingue Imaxin |软件- 16年多语言开发领域自然语言处理应用程序

IF 0.6 Q4 LINGUISTICS

Linguamatica

Pub Date : 2013-12-31 DOI: 10.21814/LM.5.2.170

José Ramom Pichel Campos, D. Rey, Antonio Fernández Cabezas, Luz Castro Peña

imaxin|software e uma empresa criada em 1997 por quatro titulados em engenharia informatica com o objetivo de desenvolver videojogos multimedia educativos e processamento da linguagem natural. 16 anos depois tem desenvolvido recursos, ferramentas, aplicacoes multilingues para diferentes linguas: Portugues (Galiza, Portugal, Brasil, etc.), Espanhol (Espanha, Argentina, Mexico, etc.), Ingles, Catalao, Frances. Neste artigo redigido em portugues-galego faremos uma descricao daqueles principais fitos deste percurso tecnologico e humano.

imaxin |软件和工程公司成立于1997年由四个能力的其目的为教育游戏的开发多媒体和自然语言处理。16年后，它为不同的语言开发了资源、工具和多语言应用程序:葡萄牙语(加利西亚、葡萄牙、巴西等)、西班牙语(西班牙、阿根廷、墨西哥等)、英语、加泰罗尼亚语、法语。在这篇用葡萄牙语-加利西亚语写的文章中，我们将描述这条技术和人类道路的主要目标。

引用次数: 0

Hacia un tratamiento computacional del Aktionsart 对Aktionsart的计算处理

IF 0.6 Q4 LINGUISTICS

Linguamatica

Pub Date : 2013-12-31 DOI: 10.21814/LM.5.2.162

Juan Aparicio, I. Castellón, Marta Coll-Florit

In the area of Natural Language Processing (NLP), when creating intelligent applications, semantic processing is essential. However, research currently being conducted in NLP is still far from achieving deep levels of understanding of language. The main goal of our research is the representation of Aktionsart (how the event expressed by a verb is construed as unfolding over time). One of the basic difficulties presented by the semantic processing of language is establishing classes, mainly due to the gradual nature of meaning, and the high incidence of context in the interpretation of the different units. In this work we focus on the presentation of the lexical aspectual classes of our proposal. The total number of defined classes is classified into two groups, simple classes: states, processes and points, the combination of which gives rise to the complex classes: culminations, accomplishments and graduals. This presentation will take place both from the theoretical point of view, and its computational implementation.

在自然语言处理(NLP)领域，在创建智能应用程序时，语义处理是必不可少的。然而，目前在NLP领域进行的研究还远远没有达到对语言的深度理解。我们研究的主要目标是动作的表征(动词表达的事件如何随着时间的推移而展开)。语言语义处理的基本困难之一是建立类，这主要是由于意义的渐进性质，以及在解释不同单位时语境的高度发生率。在这项工作中，我们重点介绍了我们的建议的词汇方面类。定义的类的总数分为两组，简单类:状态、过程和点，它们的组合产生了复杂类:高潮、成就和渐进。本报告将从理论角度和计算实现两方面进行。

引用次数: 2

首页上一页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Linguamatica

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀