首页 > 最新文献

Proceedings of the conference. Association for Computational Linguistics. North American Chapter. Meeting最新文献

英文 中文
EMR Coding with Semi-Parametric Multi-Head Matching Networks. 半参数多头匹配网络的EMR编码。
Anthony Rios, Ramakanth Kavuluru

Coding EMRs with diagnosis and procedure codes is an indispensable task for billing, secondary data analyses, and monitoring health trends. Both speed and accuracy of coding are critical. While coding errors could lead to more patient-side financial burden and mis-interpretation of a patient's well-being, timely coding is also needed to avoid backlogs and additional costs for the healthcare facility. In this paper, we present a new neural network architecture that combines ideas from few-shot learning matching networks, multi-label loss functions, and convolutional neural networks for text classification to significantly outperform other state-of-the-art models. Our evaluations are conducted using a well known deidentified EMR dataset (MIMIC) with a variety of multi-label performance measures.

使用诊断和程序代码对电子病历进行编码是计费、辅助数据分析和监测健康趋势不可或缺的任务。编码的速度和准确性都至关重要。虽然编码错误可能导致更多的患者方面的经济负担和对患者健康状况的错误解释,但也需要及时编码,以避免积压和医疗机构的额外成本。在本文中,我们提出了一种新的神经网络架构,它结合了来自少量学习匹配网络、多标签损失函数和用于文本分类的卷积神经网络的思想,显著优于其他最先进的模型。我们的评估是使用一个众所周知的未识别EMR数据集(MIMIC)进行的,其中包含各种多标签性能测量。
{"title":"EMR Coding with Semi-Parametric Multi-Head Matching Networks.","authors":"Anthony Rios,&nbsp;Ramakanth Kavuluru","doi":"10.18653/v1/N18-1189","DOIUrl":"https://doi.org/10.18653/v1/N18-1189","url":null,"abstract":"<p><p>Coding EMRs with diagnosis and procedure codes is an indispensable task for billing, secondary data analyses, and monitoring health trends. Both speed and accuracy of coding are critical. While coding errors could lead to more patient-side financial burden and mis-interpretation of a patient's well-being, timely coding is also needed to avoid backlogs and additional costs for the healthcare facility. In this paper, we present a new neural network architecture that combines ideas from few-shot learning matching networks, multi-label loss functions, and convolutional neural networks for text classification to significantly outperform other state-of-the-art models. Our evaluations are conducted using a well known deidentified EMR dataset (MIMIC) with a variety of multi-label performance measures.</p>","PeriodicalId":74542,"journal":{"name":"Proceedings of the conference. Association for Computational Linguistics. North American Chapter. Meeting","volume":"2018 ","pages":"2081-2091"},"PeriodicalIF":0.0,"publicationDate":"2018-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6105925/pdf/nihms-985153.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"36432294","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 23
A Multi-Context Character Prediction Model for a Brain-Computer Interface. 脑机接口的多上下文字符预测模型。
Shiran Dudy, Steven Bedrick, Shaobin Xu, David A Smith

Brain-computer interfaces and other augmentative and alternative communication devices introduce language-modeing challenges distinct from other character-entry methods. In particular, the acquired signal of the EEG (electroencephalogram) signal is noisier, which, in turn, makes the user intent harder to decipher. In order to adapt to this condition, we propose to maintain ambiguous history for every time step, and to employ, apart from the character language model, word information to produce a more robust prediction system. We present preliminary results that compare this proposed Online-Context Language Model (OCLM) to current algorithms that are used in this type of setting. Evaluations on both perplexity and predictive accuracy demonstrate promising results when dealing with ambiguous histories in order to provide to the front end a distribution of the next character the user might type.

脑机接口和其他增强和替代通信设备引入了不同于其他字符输入方法的语言建模挑战。特别是,采集到的EEG(脑电图)信号噪声更大,这反过来又使用户意图更难被破译。为了适应这种情况,我们建议在每个时间步保持模糊历史,并且除了使用字符语言模型外,还使用单词信息来产生更稳健的预测系统。我们提出了初步的结果,将这个提议的在线上下文语言模型(OCLM)与当前在这种类型的设置中使用的算法进行比较。在处理模棱两可的历史时,为了向前端提供用户可能键入的下一个字符的分布,对困惑度和预测准确性的评估显示了有希望的结果。
{"title":"A Multi-Context Character Prediction Model for a Brain-Computer Interface.","authors":"Shiran Dudy,&nbsp;Steven Bedrick,&nbsp;Shaobin Xu,&nbsp;David A Smith","doi":"10.18653/v1/w18-1210","DOIUrl":"https://doi.org/10.18653/v1/w18-1210","url":null,"abstract":"<p><p>Brain-computer interfaces and other augmentative and alternative communication devices introduce language-modeing challenges distinct from other character-entry methods. In particular, the acquired signal of the EEG (electroencephalogram) signal is noisier, which, in turn, makes the user intent harder to decipher. In order to adapt to this condition, we propose to maintain ambiguous history for every time step, and to employ, apart from the character language model, word information to produce a more robust prediction system. We present preliminary results that compare this proposed Online-Context Language Model (OCLM) to current algorithms that are used in this type of setting. Evaluations on both perplexity and predictive accuracy demonstrate promising results when dealing with ambiguous histories in order to provide to the front end a distribution of the next character the user might type.</p>","PeriodicalId":74542,"journal":{"name":"Proceedings of the conference. Association for Computational Linguistics. North American Chapter. Meeting","volume":"2018 ","pages":"72-77"},"PeriodicalIF":0.0,"publicationDate":"2018-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8087439/pdf/nihms-1001613.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"38861816","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Syntactic Patterns Improve Information Extraction for Medical Search 语法模式改进医学搜索的信息提取
Roma Patel, Yinfei Yang, I. Marshall, A. Nenkova, Byron C. Wallace
Medical professionals search the published literature by specifying the type of patients, the medical intervention(s) and the outcome measure(s) of interest. In this paper we demonstrate how features encoding syntactic patterns improve the performance of state-of-the-art sequence tagging models (both neural and linear) for information extraction of these medically relevant categories. We present an analysis of the type of patterns exploited and of the semantic space induced for these, i.e., the distributed representations learned for identified multi-token patterns. We show that these learned representations differ substantially from those of the constituent unigrams, suggesting that the patterns capture contextual information that is otherwise lost.
医学专业人员通过指定患者类型、医疗干预和感兴趣的结果测量来搜索已发表的文献。在本文中,我们展示了特征编码语法模式如何提高最先进的序列标记模型(神经和线性)的性能,用于这些医学相关类别的信息提取。我们分析了利用的模式类型和为这些模式诱导的语义空间,即为已识别的多标记模式学习的分布式表示。我们发现,这些学习到的表征与那些组成单字图的表征有很大的不同,这表明这些模式捕捉了否则会丢失的上下文信息。
{"title":"Syntactic Patterns Improve Information Extraction for Medical Search","authors":"Roma Patel, Yinfei Yang, I. Marshall, A. Nenkova, Byron C. Wallace","doi":"10.18653/v1/N18-2060","DOIUrl":"https://doi.org/10.18653/v1/N18-2060","url":null,"abstract":"Medical professionals search the published literature by specifying the type of patients, the medical intervention(s) and the outcome measure(s) of interest. In this paper we demonstrate how features encoding syntactic patterns improve the performance of state-of-the-art sequence tagging models (both neural and linear) for information extraction of these medically relevant categories. We present an analysis of the type of patterns exploited and of the semantic space induced for these, i.e., the distributed representations learned for identified multi-token patterns. We show that these learned representations differ substantially from those of the constituent unigrams, suggesting that the patterns capture contextual information that is otherwise lost.","PeriodicalId":74542,"journal":{"name":"Proceedings of the conference. Association for Computational Linguistics. North American Chapter. Meeting","volume":"18 1","pages":"371-377"},"PeriodicalIF":0.0,"publicationDate":"2018-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74621974","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
Bidirectional RNN for Medical Event Detection in Electronic Health Records 基于双向RNN的电子病历医疗事件检测
Abhyuday N. Jagannatha, Hong Yu
Sequence labeling for extraction of medical events and their attributes from unstructured text in Electronic Health Record (EHR) notes is a key step towards semantic understanding of EHRs. It has important applications in health informatics including pharmacovigilance and drug surveillance. The state of the art supervised machine learning models in this domain are based on Conditional Random Fields (CRFs) with features calculated from fixed context windows. In this application, we explored recurrent neural network frameworks and show that they significantly out-performed the CRF models.
从电子健康记录(EHR)笔记的非结构化文本中提取医疗事件及其属性的序列标记是实现电子健康记录语义理解的关键一步。它在包括药物警戒和药物监测在内的卫生信息学中有着重要的应用。该领域最先进的监督机器学习模型是基于条件随机场(CRFs)的,其特征是从固定的上下文窗口计算出来的。在这个应用中,我们探索了循环神经网络框架,并表明它们明显优于CRF模型。
{"title":"Bidirectional RNN for Medical Event Detection in Electronic Health Records","authors":"Abhyuday N. Jagannatha, Hong Yu","doi":"10.18653/v1/N16-1056","DOIUrl":"https://doi.org/10.18653/v1/N16-1056","url":null,"abstract":"Sequence labeling for extraction of medical events and their attributes from unstructured text in Electronic Health Record (EHR) notes is a key step towards semantic understanding of EHRs. It has important applications in health informatics including pharmacovigilance and drug surveillance. The state of the art supervised machine learning models in this domain are based on Conditional Random Fields (CRFs) with features calculated from fixed context windows. In this application, we explored recurrent neural network frameworks and show that they significantly out-performed the CRF models.","PeriodicalId":74542,"journal":{"name":"Proceedings of the conference. Association for Computational Linguistics. North American Chapter. Meeting","volume":"40 1","pages":"473-482"},"PeriodicalIF":0.0,"publicationDate":"2016-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82718166","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 271
Similarity Measures for Quantifying Restrictive and Repetitive Behavior in Conversations of Autistic Children. 孤独症儿童对话中限制性和重复性行为量化的相似性测量。
Masoud Rouhizadeh, Richard Sproat, Jan van Santen

Restrictive and repetitive behavior (RRB) is a core symptom of autism spectrum disorder (ASD) and are manifest in language. Based on this, we expect children with autism to talk about fewer topics, and more repeatedly, during their conversations. We thus hypothesize a higher semantic overlap ratio between dialogue turns in children with ASD compared to those with typical development (TD). Participants of this study include children ages 4-8, 44 with TD and 25 with ASD without language impairment. We apply several semantic similarity metrics to the children's dialogue turns in semi-structured conversations with examiners. We find that children with ASD have significantly more semantically overlapping turns than children with TD, across different turn intervals. These results support our hypothesis, and could provide a convenient and robust ASD-specific behavioral marker.

限制性重复行为(RRB)是自闭症谱系障碍(ASD)的核心症状之一,主要表现在语言上。基于此,我们期望自闭症儿童在对话中谈论更少的话题,更多的重复。因此,我们假设与典型发育(TD)儿童相比,ASD儿童对话回合之间的语义重叠率更高。这项研究的参与者包括4-8岁的儿童,44岁患有TD, 25岁患有ASD,没有语言障碍。我们将几个语义相似度指标应用于儿童与考官的半结构化对话中的对话回合。我们发现,在不同的转弯间隔中,ASD儿童的语义重叠转弯明显多于TD儿童。这些结果支持了我们的假设,并可以提供一个方便和强大的asd特异性行为标记。
{"title":"Similarity Measures for Quantifying Restrictive and Repetitive Behavior in Conversations of Autistic Children.","authors":"Masoud Rouhizadeh,&nbsp;Richard Sproat,&nbsp;Jan van Santen","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Restrictive and repetitive behavior (RRB) is a core symptom of autism spectrum disorder (ASD) and are manifest in language. Based on this, we expect children with autism to talk about fewer topics, and more repeatedly, during their conversations. We thus hypothesize a higher semantic overlap ratio between dialogue turns in children with ASD compared to those with typical development (TD). Participants of this study include children ages 4-8, 44 with TD and 25 with ASD without language impairment. We apply several semantic similarity metrics to the children's dialogue turns in semi-structured conversations with examiners. We find that children with ASD have significantly more semantically overlapping turns than children with TD, across different turn intervals. These results support our hypothesis, and could provide a convenient and robust ASD-specific behavioral marker.</p>","PeriodicalId":74542,"journal":{"name":"Proceedings of the conference. Association for Computational Linguistics. North American Chapter. Meeting","volume":"2015 ","pages":"117-123"},"PeriodicalIF":0.0,"publicationDate":"2015-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5499994/pdf/nihms707956.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"35155830","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Automated morphological analysis of clinical language samples. 临床语言样本的自动形态学分析。
Kyle Gorman, Steven Bedrick, Géza Kiss, Eric Morley, Rosemary Ingham, Metrah Mohammad, Katina Papadakis, Jan P H van Santen

Quantitative analysis of clinical language samples is a powerful tool for assessing and screening developmental language impairments, but requires extensive manual transcription, annotation, and calculation, resulting in error-prone results and clinical underutilization. We describe a system that performs automated morphological analysis needed to calculate statistics such as the mean length of utterance in morphemes (MLUM), so that these statistics can be computed directly from orthographic transcripts. Estimates of MLUM computed by this system are closely comparable to those produced by manual annotation. Our system can be used in conjunction with other automated annotation techniques, such as maze detection. This work represents an important first step towards increased automation of language sample analysis, and towards attendant benefits of automation, including clinical greater utilization and reduced variability in care delivery.

临床语言样本的定量分析是评估和筛查发育性语言障碍的有力工具,但需要大量的人工转录、注释和计算,导致结果容易出错和临床未充分利用。我们描述了一个执行自动形态学分析所需的系统,以计算统计数据,如语素中的平均话语长度(MLUM),以便这些统计数据可以直接从正字法转录本中计算出来。该系统计算的MLUM估计值与人工标注的估计值接近。我们的系统可以与其他自动标注技术结合使用,比如迷宫检测。这项工作是朝着提高语言样本分析自动化迈出的重要的第一步,也是朝着自动化带来的好处迈出的重要的第一步,包括临床更大的利用率和减少医疗服务的可变性。
{"title":"Automated morphological analysis of clinical language samples.","authors":"Kyle Gorman,&nbsp;Steven Bedrick,&nbsp;Géza Kiss,&nbsp;Eric Morley,&nbsp;Rosemary Ingham,&nbsp;Metrah Mohammad,&nbsp;Katina Papadakis,&nbsp;Jan P H van Santen","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Quantitative analysis of clinical language samples is a powerful tool for assessing and screening developmental language impairments, but requires extensive manual transcription, annotation, and calculation, resulting in error-prone results and clinical underutilization. We describe a system that performs automated morphological analysis needed to calculate statistics such as the mean length of utterance in morphemes (MLUM), so that these statistics can be computed directly from orthographic transcripts. Estimates of MLUM computed by this system are closely comparable to those produced by manual annotation. Our system can be used in conjunction with other automated annotation techniques, such as maze detection. This work represents an important first step towards increased automation of language sample analysis, and towards attendant benefits of automation, including clinical greater utilization and reduced variability in care delivery.</p>","PeriodicalId":74542,"journal":{"name":"Proceedings of the conference. Association for Computational Linguistics. North American Chapter. Meeting","volume":"2015 ","pages":"108-116"},"PeriodicalIF":0.0,"publicationDate":"2015-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5499995/pdf/nihms707958.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"35154924","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Automated morphological analysis of clinical language samples 临床语言样本的自动形态学分析
Kyle Gorman, Steven Bedrick, G. Kiss, E. Morley, Rosemary Ingham, Metrah Mohammed, Katina Papadakis, J. V. Santen
Quantitative analysis of clinical language samples is a powerful tool for assessing and screening developmental language impairments, but requires extensive manual transcription, annotation, and calculation, resulting in error-prone results and clinical underutilization. We describe a system that performs automated morphological analysis needed to calculate statistics such as the mean length of utterance in morphemes (MLUM), so that these statistics can be computed directly from orthographic transcripts. Estimates of MLUM computed by this system are closely comparable to those produced by manual annotation. Our system can be used in conjunction with other automated annotation techniques, such as maze detection. This work represents an important first step towards increased automation of language sample analysis, and towards attendant benefits of automation, including clinical greater utilization and reduced variability in care delivery.
临床语言样本的定量分析是评估和筛查发育性语言障碍的有力工具,但需要大量的人工转录、注释和计算,导致结果容易出错和临床未充分利用。我们描述了一个执行自动形态学分析所需的系统,以计算统计数据,如语素中的平均话语长度(MLUM),以便这些统计数据可以直接从正字法转录本中计算出来。该系统计算的MLUM估计值与人工标注的估计值接近。我们的系统可以与其他自动标注技术结合使用,比如迷宫检测。这项工作是朝着提高语言样本分析自动化迈出的重要的第一步,也是朝着自动化带来的好处迈出的重要的第一步,包括临床更大的利用率和减少医疗服务的可变性。
{"title":"Automated morphological analysis of clinical language samples","authors":"Kyle Gorman, Steven Bedrick, G. Kiss, E. Morley, Rosemary Ingham, Metrah Mohammed, Katina Papadakis, J. V. Santen","doi":"10.3115/v1/W15-1213","DOIUrl":"https://doi.org/10.3115/v1/W15-1213","url":null,"abstract":"Quantitative analysis of clinical language samples is a powerful tool for assessing and screening developmental language impairments, but requires extensive manual transcription, annotation, and calculation, resulting in error-prone results and clinical underutilization. We describe a system that performs automated morphological analysis needed to calculate statistics such as the mean length of utterance in morphemes (MLUM), so that these statistics can be computed directly from orthographic transcripts. Estimates of MLUM computed by this system are closely comparable to those produced by manual annotation. Our system can be used in conjunction with other automated annotation techniques, such as maze detection. This work represents an important first step towards increased automation of language sample analysis, and towards attendant benefits of automation, including clinical greater utilization and reduced variability in care delivery.","PeriodicalId":74542,"journal":{"name":"Proceedings of the conference. Association for Computational Linguistics. North American Chapter. Meeting","volume":"81 1","pages":"108-116"},"PeriodicalIF":0.0,"publicationDate":"2015-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85409295","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Similarity Measures for Quantifying Restrictive and Repetitive Behavior in Conversations of Autistic Children 孤独症儿童对话中限制性和重复性行为量化的相似性测量
Masoud Rouhizadeh, R. Sproat, J. Santen
Restrictive and repetitive behavior (RRB) is a core symptom of autism spectrum disorder (ASD) and are manifest in language. Based on this, we expect children with autism to talk about fewer topics, and more repeatedly, during their conversations. We thus hypothesize a higher semantic overlap ratio between dialogue turns in children with ASD compared to those with typical development (TD). Participants of this study include children ages 4-8, 44 with TD and 25 with ASD without language impairment. We apply several semantic similarity metrics to the children's dialogue turns in semi-structured conversations with examiners. We find that children with ASD have significantly more semantically overlapping turns than children with TD, across different turn intervals. These results support our hypothesis, and could provide a convenient and robust ASD-specific behavioral marker.
限制性重复行为(RRB)是自闭症谱系障碍(ASD)的核心症状之一,主要表现在语言上。基于此,我们期望自闭症儿童在对话中谈论更少的话题,更多的重复。因此,我们假设与典型发育(TD)儿童相比,ASD儿童对话回合之间的语义重叠率更高。这项研究的参与者包括4-8岁的儿童,44岁患有TD, 25岁患有ASD,没有语言障碍。我们将几个语义相似度指标应用于儿童与考官的半结构化对话中的对话回合。我们发现,在不同的转弯间隔中,ASD儿童的语义重叠转弯明显多于TD儿童。这些结果支持了我们的假设,并可以提供一个方便和强大的asd特异性行为标记。
{"title":"Similarity Measures for Quantifying Restrictive and Repetitive Behavior in Conversations of Autistic Children","authors":"Masoud Rouhizadeh, R. Sproat, J. Santen","doi":"10.3115/v1/W15-1214","DOIUrl":"https://doi.org/10.3115/v1/W15-1214","url":null,"abstract":"Restrictive and repetitive behavior (RRB) is a core symptom of autism spectrum disorder (ASD) and are manifest in language. Based on this, we expect children with autism to talk about fewer topics, and more repeatedly, during their conversations. We thus hypothesize a higher semantic overlap ratio between dialogue turns in children with ASD compared to those with typical development (TD). Participants of this study include children ages 4-8, 44 with TD and 25 with ASD without language impairment. We apply several semantic similarity metrics to the children's dialogue turns in semi-structured conversations with examiners. We find that children with ASD have significantly more semantically overlapping turns than children with TD, across different turn intervals. These results support our hypothesis, and could provide a convenient and robust ASD-specific behavioral marker.","PeriodicalId":74542,"journal":{"name":"Proceedings of the conference. Association for Computational Linguistics. North American Chapter. Meeting","volume":"53 1","pages":"117-123"},"PeriodicalIF":0.0,"publicationDate":"2015-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73291386","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
Anafora: A Web-based General Purpose Annotation Tool. 一个基于web的通用注释工具。
Wei-Te Chen, Will Styler

Anafora is a newly-developed open source web-based text annotation tool built to be lightweight, flexible, easy to use and capable of annotating with a variety of schemas, simple and complex. Anafora allows secure web-based annotation of any plaintext file with both spanned (e.g. named entity or markable) and relation annotations, as well as adjudication for both types of annotation. Anafora offers automatic set assignment and progress-tracking, centralized and human-editable XML annotation schemas, and file-based storage and organization of data in a human-readable single-file XML format.

ananfora是一个新开发的基于web的开源文本注释工具,它轻量级、灵活、易于使用,并且能够使用各种简单和复杂的模式进行注释。ananfora允许对任何明文文件进行安全的基于web的注释,包括跨(例如,命名实体或可标记)和关系注释,以及对这两种类型的注释的裁决。ananfora提供了自动设置分配和进度跟踪,集中的和人类可编辑的XML注释模式,以及以人类可读的单文件XML格式的基于文件的数据存储和组织。
{"title":"Anafora: A Web-based General Purpose Annotation Tool.","authors":"Wei-Te Chen,&nbsp;Will Styler","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Anafora is a newly-developed open source web-based text annotation tool built to be lightweight, flexible, easy to use and capable of annotating with a variety of schemas, simple and complex. Anafora allows secure web-based annotation of any plaintext file with both spanned (e.g. named entity or markable) and relation annotations, as well as adjudication for both types of annotation. Anafora offers automatic set assignment and progress-tracking, centralized and human-editable XML annotation schemas, and file-based storage and organization of data in a human-readable single-file XML format.</p>","PeriodicalId":74542,"journal":{"name":"Proceedings of the conference. Association for Computational Linguistics. North American Chapter. Meeting","volume":"2013 ","pages":"14-19"},"PeriodicalIF":0.0,"publicationDate":"2013-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5657237/pdf/nihms619402.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"35649498","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Distributional semantic models for the evaluation of disordered language. 无序语言评价的分布语义模型。
Masoud Rouhizadeh, Emily Prud'hommeaux, Brian Roark, Jan van Santen

Atypical semantic and pragmatic expression is frequently reported in the language of children with autism. Although this atypicality often manifests itself in the use of unusual or unexpected words and phrases, the rate of use of such unexpected words is rarely directly measured or quantified. In this paper, we use distributional semantic models to automatically identify unexpected words in narrative retellings by children with autism. The classification of unexpected words is sufficiently accurate to distinguish the retellings of children with autism from those with typical development. These techniques demonstrate the potential of applying automated language analysis techniques to clinically elicited language data for diagnostic purposes.

非典型语义和语用表达在自闭症儿童的语言中经常被报道。尽管这种非典型性经常表现在使用不寻常或意想不到的单词和短语上,但这些意想不到的单词的使用频率很少被直接测量或量化。本文采用分布语义模型对自闭症儿童复述中的意外词进行自动识别。对意外单词的分类足够准确,可以区分自闭症儿童和正常发育儿童的复述。这些技术展示了将自动语言分析技术应用于临床提取的语言数据以用于诊断目的的潜力。
{"title":"Distributional semantic models for the evaluation of disordered language.","authors":"Masoud Rouhizadeh,&nbsp;Emily Prud'hommeaux,&nbsp;Brian Roark,&nbsp;Jan van Santen","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Atypical semantic and pragmatic expression is frequently reported in the language of children with autism. Although this atypicality often manifests itself in the use of unusual or unexpected words and phrases, the rate of use of such unexpected words is rarely directly measured or quantified. In this paper, we use distributional semantic models to automatically identify unexpected words in narrative retellings by children with autism. The classification of unexpected words is sufficiently accurate to distinguish the retellings of children with autism from those with typical development. These techniques demonstrate the potential of applying automated language analysis techniques to clinically elicited language data for diagnostic purposes.</p>","PeriodicalId":74542,"journal":{"name":"Proceedings of the conference. Association for Computational Linguistics. North American Chapter. Meeting","volume":"2013 ","pages":"709-714"},"PeriodicalIF":0.0,"publicationDate":"2013-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4237315/pdf/nihms504421.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"32833488","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Proceedings of the conference. Association for Computational Linguistics. North American Chapter. Meeting
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1