Proceedings of the Second Workshop on Economics and Natural Language Processing最新文献

英文中文

Complaint Analysis and Classification for Economic and Food Safety 经济与食品安全投诉分析与分类

Proceedings of the Second Workshop on Economics and Natural Language Processing

Pub Date : 2019-11-01 DOI: 10.18653/v1/D19-5107

João Filgueiras, Luís Barbosa, Gil Rocha, Henrique Lopes Cardoso, Luís Paulo Reis, J. Machado, Ana Maria Oliveira

Governmental institutions are employing artificial intelligence techniques to deal with their specific problems and exploit their huge amounts of both structured and unstructured information. In particular, natural language processing and machine learning techniques are being used to process citizen feedback. In this paper, we report on the use of such techniques for analyzing and classifying complaints, in the context of the Portuguese Economic and Food Safety Authority. Grounded in its operational process, we address three different classification problems: target economic activity, implied infraction severity level, and institutional competence. We show promising results obtained using feature-based approaches and traditional classifiers, with accuracy scores above 70%, and analyze the shortcomings of our current results and avenues for further improvement, taking into account the intended use of our classifiers in helping human officers to cope with thousands of yearly complaints.

政府机构正在利用人工智能技术来处理他们的具体问题，并利用他们大量的结构化和非结构化信息。特别是，自然语言处理和机器学习技术被用于处理公民反馈。在本文中，我们报告了在葡萄牙经济和食品安全局的背景下使用这种技术来分析和分类投诉。在其操作过程的基础上，我们解决了三个不同的分类问题:目标经济活动、隐含的违规严重程度和机构能力。我们展示了使用基于特征的方法和传统分类器获得的有希望的结果，准确率超过70%，并分析了我们目前结果的缺点和进一步改进的途径，考虑到我们的分类器在帮助人类官员处理每年成千上万的投诉方面的预期用途。

引用次数: 9

Forecasting Firm Material Events from 8-K Reports 从8-K报告预测公司重大事件

Proceedings of the Second Workshop on Economics and Natural Language Processing

Pub Date : 2019-11-01 DOI: 10.18653/v1/D19-5104

Shuang (Sophie) Zhai, Zhu Zhang

In this paper, we show deep learning models can be used to forecast firm material event sequences based on the contents in the company’s 8-K Current Reports. Specifically, we exploit state-of-the-art neural architectures, including sequence-to-sequence (Seq2Seq) architecture and attention mechanisms, in the model. Our 8K-powered deep learning model demonstrates promising performance in forecasting firm future event sequences. The model is poised to benefit various stakeholders, including management and investors, by facilitating risk management and decision making.

在本文中，我们展示了深度学习模型可用于基于公司8-K当前报告中的内容预测公司重大事件序列。具体来说，我们在模型中利用了最先进的神经架构，包括序列到序列(Seq2Seq)架构和注意力机制。我们的8k动力深度学习模型在预测公司未来事件序列方面表现出色。通过促进风险管理和决策，该模型将使包括管理层和投资者在内的各种利益相关者受益。

引用次数: 8

Extracting Complex Relations from Banking Documents 从银行文件中提取复杂关系

Proceedings of the Second Workshop on Economics and Natural Language Processing

Pub Date : 1900-01-01 DOI: 10.18653/v1/D19-5101

Berke Oral, Erdem Emekligil, S. Arslan, Gülşen Eryiğit

In order to automate banking processes (e.g. payments, money transfers, foreign trade), we need to extract banking transactions from different types of mediums such as faxes, e-mails, and scanners. Banking orders may be considered as complex documents since they contain quite complex relations compared to traditional datasets used in relation extraction research. In this paper, we present our method to extract intersentential, nested and complex relations from banking orders, and introduce a relation extraction method based on maximal clique factorization technique. We demonstrate 11% error reduction over previous methods.

为了自动化银行流程(例如支付、汇款、对外贸易)，我们需要从不同类型的媒介(如传真、电子邮件和扫描仪)中提取银行交易。与关系提取研究中使用的传统数据集相比，银行订单可能被认为是复杂的文档，因为它们包含相当复杂的关系。本文提出了从银行订单中提取间隔关系、嵌套关系和复杂关系的方法，并介绍了一种基于极大团分解技术的关系提取方法。我们证明，与以前的方法相比，误差降低了11%。

引用次数: 7

Annotation Process for the Dialog Act Classification of a Taglish E-commerce Q&A Corpus 英语电子商务问答语料库对话行为分类的标注过程

Proceedings of the Second Workshop on Economics and Natural Language Processing

Pub Date : 1900-01-01 DOI: 10.18653/v1/D19-5108

Jared Rivera, Jan Caleb Oliver Pensica, Jolene Valenzuela, Alfonso Secuya, C. Cheng

With conversational agents or chatbots making up in quantity of replies rather than quality, the need to identify user intent has become a main concern to improve these agents. Dialog act (DA) classification tackles this concern, and while existing studies have already addressed DA classification in general contexts, no training corpora in the context of e-commerce is available to the public. This research addressed the said insufficiency by building a text-based corpus of 7,265 posts from the question and answer section of products on Lazada Philippines. The SWBD-DAMSL tagset for DA classification was modified to 28 tags fitting the categories applicable to e-commerce conversations. The posts were annotated manually by three (3) human annotators and preprocessing techniques decreased the vocabulary size from 6,340 to 1,134. After analysis, the corpus was composed dominantly of single-label posts, with 34% of the corpus having multiple intent tags. The annotated corpus allowed insights toward the structure of posts created with single to multiple intents.

由于会话代理或聊天机器人的回复数量大于质量，识别用户意图的需求已成为改进这些代理的主要关注点。对话行为(DA)分类解决了这一问题，虽然现有的研究已经解决了一般情况下的DA分类问题，但没有电子商务背景下的培训语料库可供公众使用。本研究通过建立一个基于文本的语料库，解决了上述不足，该语料库来自Lazada菲律宾产品问答部分的7,265个帖子。用于数据数据分类的SWBD-DAMSL标记集被修改为28个标记，适合适用于电子商务对话的类别。这些帖子由3名人工注释者手工注释，预处理技术将词汇量从6340个减少到1134个。经过分析，语料库主要由单标签帖子组成，其中34%的语料库具有多个意图标签。带注释的语料库允许深入了解由单个或多个意图创建的帖子的结构。

{"title":"Annotation Process for the Dialog Act Classification of a Taglish E-commerce Q&A Corpus","authors":"Jared Rivera, Jan Caleb Oliver Pensica, Jolene Valenzuela, Alfonso Secuya, C. Cheng","doi":"10.18653/v1/D19-5108","DOIUrl":"https://doi.org/10.18653/v1/D19-5108","url":null,"abstract":"With conversational agents or chatbots making up in quantity of replies rather than quality, the need to identify user intent has become a main concern to improve these agents. Dialog act (DA) classification tackles this concern, and while existing studies have already addressed DA classification in general contexts, no training corpora in the context of e-commerce is available to the public. This research addressed the said insufficiency by building a text-based corpus of 7,265 posts from the question and answer section of products on Lazada Philippines. The SWBD-DAMSL tagset for DA classification was modified to 28 tags fitting the categories applicable to e-commerce conversations. The posts were annotated manually by three (3) human annotators and preprocessing techniques decreased the vocabulary size from 6,340 to 1,134. After analysis, the corpus was composed dominantly of single-label posts, with 34% of the corpus having multiple intent tags. The annotated corpus allowed insights toward the structure of posts created with single to multiple intents.","PeriodicalId":119881,"journal":{"name":"Proceedings of the Second Workshop on Economics and Natural Language Processing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129806346","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Proceedings of the Second Workshop on Economics and Natural Language Processing

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀