The phenotypic complexity of Autism Spectrum Disorder motivates the application of modern computational methods to large collections of observational data, both for improved clinical diagnosis and for better scientific understanding. We have begun to create a corpus of annotated language samples relevant to this research, and we plan to join with other researchers in pooling and publishing such resources on a large scale. The goal of this paper is to present some initial explorations to illustrate the opportunities that such datasets will afford.
BLANC is a link-based coreference evaluation metric for measuring the quality of coreference systems on gold mentions. This paper extends the original BLANC ("BLANC-gold" henceforth) to system mentions, removing the gold mention assumption. The proposed BLANC falls back seamlessly to the original one if system mentions are identical to gold mentions, and it is shown to strongly correlate with existing metrics on the 2011 and 2012 CoNLL data.
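As a rough illustration of how such a link-based score is computed (a minimal sketch, not the official reference scorer, and without the special handling the full definition prescribes for empty link sets), BLANC averages the link-level F-scores over coreference and non-coreference links derived from the key and system partitions:

    # Minimal sketch of an extended, link-based BLANC computation; not the
    # official scorer, and boundary cases (empty link sets) are not handled
    # the way the full definition prescribes.
    from itertools import combinations

    def link_sets(partition):
        """partition: list of sets of mention ids. Returns (coref, non-coref) link sets."""
        mentions = [m for entity in partition for m in entity]
        coref = {frozenset(p) for entity in partition for p in combinations(entity, 2)}
        all_pairs = {frozenset(p) for p in combinations(mentions, 2)}
        return coref, all_pairs - coref

    def link_f1(key_links, sys_links):
        common = len(key_links & sys_links)
        p = common / len(sys_links) if sys_links else 0.0
        r = common / len(key_links) if key_links else 0.0
        return 2 * p * r / (p + r) if p + r else 0.0

    def blanc(key, system):
        coref_k, noncoref_k = link_sets(key)
        coref_s, noncoref_s = link_sets(system)
        return (link_f1(coref_k, coref_s) + link_f1(noncoref_k, noncoref_s)) / 2.0

    # Example: the system merges the gold entities {1, 2} and {3} into one entity.
    print(blanc([{1, 2}, {3}], [{1, 2, 3}]))   # 0.25

In this formulation the gold-mention assumption disappears, because the link sets are built independently from whatever mentions each partition contains; when the system mentions equal the gold mentions, the computation coincides with the original BLANC.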
The definitions of two coreference scoring metrics, B3 and CEAF, are underspecified with respect to predicted, as opposed to key (or gold), mentions. Several variations have been proposed that manipulate either or both of the key and predicted mentions in order to get a one-to-one mapping. On the other hand, the metric BLANC was, until recently, limited to scoring partitions of key mentions. In this paper, we (i) argue that mention manipulation for scoring predicted mentions is unnecessary, and potentially harmful as it could produce unintuitive results; (ii) illustrate the application of all these measures to scoring predicted mentions; (iii) make available an open-source, thoroughly tested reference implementation of the main coreference evaluation measures; and (iv) rescore the results of the CoNLL-2011/2012 shared task systems with this implementation. This will help the community accurately measure and compare new end-to-end coreference resolution algorithms.
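To make point (i) concrete, a minimal B3 computation can score key and predicted mentions directly, with no manipulation of either side: a predicted mention with no gold counterpart simply earns zero precision credit, and an unmatched key mention earns zero recall credit. The sketch below is a simplification for illustration only; the open-source reference implementation mentioned above should be used for actual scoring.

    # Simplified B3 (B-cubed) sketch over possibly different key and response
    # mention sets; for illustration only, not the reference implementation.
    def b_cubed(key, response):
        """key, response: lists of sets of mention ids (one set per entity)."""
        key_of = {m: entity for entity in key for m in entity}
        resp_of = {m: entity for entity in response for m in entity}

        def averaged_overlap(own, other):
            # For each mention on this side, the fraction of its own entity that
            # the other side also groups with it (zero if the mention is absent there).
            total = sum(len(entity & other.get(m, set())) / len(entity)
                        for m, entity in own.items())
            return total / len(own) if own else 0.0

        recall = averaged_overlap(key_of, resp_of)
        precision = averaged_overlap(resp_of, key_of)
        f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
        return precision, recall, f1

    # The twinless predicted mention 4 lowers precision but leaves recall intact.
    print(b_cubed([{1, 2}, {3}], [{1, 2, 4}, {3}]))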
Vector space models (VSMs) represent word meanings as points in a high-dimensional space. VSMs are typically created from large text corpora, and so represent word semantics as observed in text. We present a new algorithm (JNNSE) that can incorporate a measure of semantics not previously used to create VSMs: brain activation data recorded while people read words. The resulting model takes advantage of the complementary strengths and weaknesses of corpus and brain activation data to give a more complete representation of semantics. Evaluations show that the model 1) matches a behavioral measure of semantics more closely, 2) can be used to predict corpus data for unseen words, and 3) has predictive power that generalizes across brain imaging technologies and across subjects. We believe that the model is thus a more faithful representation of mental vocabularies.
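As a purely illustrative sketch of the underlying joint-factorization idea (this is not the authors' exact JNNSE objective or optimizer, and the sparsity constraint is omitted), one can factor a corpus co-occurrence matrix and a brain-activation matrix through a single shared non-negative word representation, so that both data sources shape the same latent space:

    # Highly simplified sketch of joint non-negative factorization of corpus and
    # brain data (NOT the exact JNNSE objective; sparsity is omitted). Both data
    # sources are reconstructed through one shared word representation A.
    import numpy as np

    def joint_nmf(X, Y, n_brain, k=50, iters=500, eps=1e-9, seed=0):
        """X: (n_words x corpus_dims); Y: (n_brain x voxel_dims), aligned with the
        first n_brain rows of X. Returns shared embeddings A and both dictionaries."""
        rng = np.random.default_rng(seed)
        n = X.shape[0]
        A = rng.random((n, k)) + eps
        Dt = rng.random((k, X.shape[1])) + eps   # corpus "dictionary"
        Db = rng.random((k, Y.shape[1])) + eps   # brain "dictionary"
        for _ in range(iters):
            # Multiplicative (Lee-Seung style) updates keep all factors non-negative.
            num = X @ Dt.T
            den = A @ (Dt @ Dt.T)
            num[:n_brain] += Y @ Db.T
            den[:n_brain] += A[:n_brain] @ (Db @ Db.T)
            A *= num / (den + eps)
            Dt *= (A.T @ X) / (A.T @ A @ Dt + eps)
            Db *= (A[:n_brain].T @ Y) / (A[:n_brain].T @ A[:n_brain] @ Db + eps)
        return A, Dt, Db

    # Toy shapes: 200 words with 300 corpus features; brain data for the first 50.
    X = np.random.rand(200, 300)
    Y = np.random.rand(50, 120)
    A, Dt, Db = joint_nmf(X, Y, n_brain=50)

In this toy setup only the first n_brain rows are constrained by both loss terms; the dictionaries they induce are shared with all other rows, which is how a joint representation can transfer information between the two data sources.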
SPARQL queries have become the standard for querying linked open data knowledge bases, but SPARQL query construction can be challenging and time-consuming even for experts. SPARQL query generation from natural language questions is an attractive modality for interfacing with LOD. However, how to evaluate such systems remains a largely open research question. This paper presents some issues that arise in SPARQL query generation from natural language, a test suite for evaluating performance with respect to these issues, and a case study in evaluating a system for SPARQL query generation from natural language questions.
Electronic health records (EHRs) contain important clinical information about patients. Some of these data are in the form of free text and require preprocessing before they can be used in automated systems. Efficient and effective use of these data could be vital to the speed and quality of health care. As a case study, we analyzed classification of CT imaging reports into binary categories. In addition to regular text classification, we utilized topic modeling of the entire dataset in various ways. Topic modeling of the corpus provides interpretable themes that exist in these reports. Representing reports according to their topic distributions is more compact than a bag-of-words representation and can be processed faster than raw text in subsequent automated processes. A binary topic model was also built as an unsupervised classification approach, with the assumption that each topic corresponds to a class. Finally, an aggregate topic classifier was built in which reports are classified based on a single discriminative topic determined from the training dataset. Our proposed topic-based classifier system is shown to be competitive with existing text classification techniques and provides a more efficient and interpretable representation.
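For orientation, the topic-distribution representation described above can be approximated with standard off-the-shelf components (a generic sketch, not the exact models, preprocessing, or hyperparameters used in this study): fit a topic model on the report text, represent each report by its topic proportions, and train an ordinary classifier on that compact representation.

    # Generic sketch: represent CT reports by LDA topic proportions and classify.
    # Not the study's exact pipeline; the report texts and labels below are toy
    # stand-ins, and a realistic setting would use many more topics and documents.
    from sklearn.feature_extraction.text import CountVectorizer
    from sklearn.decomposition import LatentDirichletAllocation
    from sklearn.linear_model import LogisticRegression
    from sklearn.pipeline import make_pipeline

    reports = [
        "no acute intracranial abnormality",
        "acute infarct in the left mca territory",
        "unremarkable ct of the head without contrast",
        "large acute hemorrhage with midline shift",
    ]
    labels = [0, 1, 0, 1]   # hypothetical binary outcome per report

    pipeline = make_pipeline(
        CountVectorizer(stop_words="english"),
        LatentDirichletAllocation(n_components=2, random_state=0),  # topic proportions
        LogisticRegression(),   # classifier over the compact topic features
    )
    pipeline.fit(reports, labels)
    print(pipeline.predict(["follow-up shows an acute hemorrhage"]))

The binary topic model and the aggregate topic classifier described above work from the same kind of topic proportions, either treating each topic as a class or keying on a single discriminative topic chosen from the training data.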

