Proceedings of the ACM Symposium on Document Engineering 2018最新文献

英文中文

Proceedings of the ACM Symposium on Document Engineering 2018 2018年ACM文献工程研讨会论文集

Proceedings of the ACM Symposium on Document Engineering 2018

Pub Date : 2018-08-28 DOI: 10.1145/3209280

引用次数: 0

Choosing Math Features for BM25 Ranking with Tangent-L 用切线- l选择BM25排序的数学特征

Proceedings of the ACM Symposium on Document Engineering 2018

Pub Date : 2018-08-28 DOI: 10.1145/3209280.3209527

Dallas J. Fraser, Andrew Kane, Frank Wm. Tompa

Combining text and mathematics when searching in a corpus with extensive mathematical notation remains an open problem. Recent results for Tangent-3 on the math and text retrieval task at NTCIR-12, for example, have room for improvement, even though formula retrieval appeared to be fairly successful. This paper explores how to adapt the state-of-the-art BM25 text ranking method to work well when searching for math together with text. Following the approach proposed for the Tangent math search system, we use symbol layout trees to represent math formulae. We extract features from the symbol layout trees to serve as search terms to be ranked using BM25 and then explore the effects on retrieval performance of various classes of features. Based on the results, we recommend which features can be used effectively in a conventional text-based retrieval engine. We validate our overall approach using a NTCIR-12 math and text benchmark.

在具有广泛数学符号的语料库中进行搜索时，将文本和数学结合起来仍然是一个悬而未决的问题。例如，在ntcirr -12中，Tangent-3在数学和文本检索任务上的最新结果有改进的空间，尽管公式检索似乎相当成功。本文探讨了如何适应最先进的BM25文本排序方法，使其在搜索数学和文本时能够很好地工作。在此基础上，我们使用符号布局树来表示数学公式。我们从符号布局树中提取特征作为搜索项，使用BM25进行排序，然后探讨不同类别的特征对检索性能的影响。根据结果，我们推荐哪些特征可以在传统的基于文本的检索引擎中有效地使用。我们使用ntir -12数学和文本基准来验证我们的整体方法。

引用次数: 19

STEVE 史蒂夫

Proceedings of the ACM Symposium on Document Engineering 2018

Pub Date : 2018-08-28 DOI: 10.1145/3209280.3209521

Douglas Paulo De Mattos, Débora C. Muchaluat-Saade

This paper proposes an interactive multimedia authoring tool called STEVE (Spatio-Temporal View Editor) and a new multimedia model called SIMM (Simple Interactive Multimedia Model). STEVE aims at allowing users with no knowledge of multimedia authoring languages and models to create hypermedia applications for web and digital TV systems in a user-friendly way. Compared with existing multimedia authoring tools, STEVE is the unique tool that allows ordinary users to export hypermedia applications to HTML5 and NCL documents. STEVE uses an event-based temporal synchronization model called SIMM that exactly fits its needs. SIMM provides high-level temporal, spatial and interactivity relations to make authoring with STEVE easier. Usability tests show that, according to users, STEVE allowed them to create multimedia applications and export them as HTML5 and NCL documents in a few minutes without programming.

引用次数: 12

Exploiting patterns and templates for technical documentation 利用技术文档的模式和模板

Proceedings of the ACM Symposium on Document Engineering 2018

Pub Date : 2018-08-28 DOI: 10.1145/3209280.3209537

A. Caponi, A. Iorio, F. Vitali, Paolo Alberti, M. Scatá

There are several domains in which the documents are made of reusable pieces. Template languages have been widely studied by the document engineering community to deal with common structures and textual fragments. Though, templating mechanisms are often hidden in mainstream word-precessors and even unknown by common users. This paper presents a pattern-based language for templates, serialized in HTML and exploited in a user-friendly WYSIWYG editor for writing technical documentation. We discuss the deployment of the editor by an engineering company in the railway domain, as well as some generalized lessons learned about templates.

有几个域中的文档是由可重用的部分组成的。模板语言已被文档工程界广泛研究，用于处理通用结构和文本片段。但是，模板机制通常隐藏在主流的word前身中，甚至不为普通用户所知。本文提出了一种基于模式的模板语言，在HTML中序列化，并在用户友好的所见即所得编辑器中开发，用于编写技术文档。我们讨论了一家工程公司在铁路领域中对编辑器的部署，以及一些关于模板的一般经验教训。

引用次数: 5

diffi 曲折

Proceedings of the ACM Symposium on Document Engineering 2018

Pub Date : 2018-08-28 DOI: 10.1145/3209280.3229084

Gioele Barabucci

diffi (diff improved) is a comparison tool whose primary goal is to describe the differences between the content of two documents regardless of their formats. diffi examines the stacks of abstraction levels of the two documents to be compared, finds which levels can be compared, selects one or more appropriate comparison algorithms and calculates the delta(s) between the two documents. Finally, the deltas are serialized using the extended unified patch format, an extension of the common unified patch format. The produced deltas describe the differences between all the comparable levels of the inputs documents. Users and developers of patch visualization tools have, thus, the choice to focus on their preferred level of abstraction.

引用次数: 1

Identifying the Relative Importance of Customer Issues on Product Ratings through Machine Learning 通过机器学习识别客户问题对产品评级的相对重要性

Proceedings of the ACM Symposium on Document Engineering 2018

Pub Date : 2018-08-28 DOI: 10.1145/3209280.3229113

Himanshu Tiwari, Shameed Sait, Md Imbesat Hassan Rizvi, Niranjan Damera-Venkata

Millions of customer reviews for products are available online across hundreds of different websites. These reviews have a tremendous influence on the purchase decision of new customers and in creating a positive brand image. Understanding which of the product issues are critical in determining the product ratings is crucial for marketing teams. We have developed a solution which can derive deep insights from customer reviews which goes significantly beyond keyword based analysis. Our solution can identify key customer issues voiced in the reviews and the impact of each of these on the final rating that a customer gives the product. This insight is very actionable as it helps identify which customer concerns are responsible for bad ratings of products.

数以百万计的客户对产品的评论可以在数百个不同的网站上在线获得。这些评论对新顾客的购买决定和建立积极的品牌形象有巨大的影响。了解哪些产品问题对确定产品评级至关重要，这对营销团队至关重要。我们已经开发了一种解决方案，可以从客户评论中获得深刻的见解，这大大超出了基于关键字的分析。我们的解决方案可以识别评论中提出的关键客户问题，以及每个问题对客户对产品的最终评级的影响。这种洞察力非常具有可操作性，因为它有助于确定哪些客户关注的问题导致了产品的不良评级。

引用次数: 2

Integrating Global Attention for Pairwise Text Comparison 整合全局注意力的文本两两比较

Proceedings of the ACM Symposium on Document Engineering 2018

Pub Date : 2018-08-28 DOI: 10.1145/3209280.3229119

Jie Mei, Xiang Jiang, Aminul Islam, A. Mohammad, E. Milios

Attention guides computation to focus on important parts of the input data. For pairwise input, existing attention approaches tend to bias towards trivial repetitions (e.g. punctuations and stop words) between two texts, and thus failed to contribute reasonable guidance to model predictions. As a remedy, we suggest taking into account the corpus-level information via global-aware attention. In this paper, we propose an attention mechanism that makes use of intratext, inter-text and global contextual information. We undertake an ablation study on paraphrase identification, and demonstrate that the proposed attention mechanism can obviate the downsides of trivial repetitions and provide interpretable word weightings.

注意力引导计算集中在输入数据的重要部分。对于两两输入，现有的注意方法往往倾向于两个文本之间的琐碎重复(例如标点和停顿词)，因此无法为模型预测提供合理的指导。作为补救措施，我们建议通过全局感知注意力来考虑语料库级别的信息。本文提出了一种利用语篇内、语篇间和全局语境信息的注意机制。我们对释义识别进行了消融研究，并证明了所提出的注意机制可以消除琐碎重复的缺点，并提供可解释的单词权重。

引用次数: 0

Visual Text Analytics: Techniques for Linguistic Information Visualization 视觉文本分析:语言信息可视化技术

Proceedings of the ACM Symposium on Document Engineering 2018

Pub Date : 2018-08-28 DOI: 10.1145/3209280.3232795

Mennatallah El-Assady

Visual Text Analytics has been an active area of interdisciplinary research (http://textvis.lnu.se/). This interactive tutorial is designed to give attendees an introduction to the area of information visualization, with a focus on linguistic visualization. After an introduction to the basic principles of information visualization and visual analytics, this tutorial will give an overview of the broad spectrum of linguistic and text visualization techniques, as well as their application areas [3]. This will be followed by a hands-on session that will allow participants to design their own visualizations using tools (e.g., Tableau), libraries (e.g., d3.js), or applying sketching techniques [4]. Some sample datasets will be provided by the instructor. Besides general techniques, special access will be provided to use the VisArgue framework [1] for the analysis of selected datasets.

可视化文本分析一直是一个活跃的跨学科研究领域(http://textvis.lnu.se/)。本互动式教程旨在向与会者介绍信息可视化领域，重点是语言可视化。在介绍了信息可视化和可视化分析的基本原理之后，本教程将对广泛的语言和文本可视化技术及其应用领域进行概述[3]。接下来将是一个动手环节，允许参与者使用工具(例如Tableau)，库(例如d3.js)或应用素描技术设计自己的可视化[4]。一些样本数据集将由讲师提供。除了一般技术外，还将提供使用VisArgue框架[1]分析选定数据集的特殊访问权限。

引用次数: 0

SlideDiff

Proceedings of the ACM Symposium on Document Engineering 2018

Pub Date : 2018-08-28 DOI: 10.1145/3209280.3229107

Laurent Denoue, S. Carter, M. Cooper

SlideDiff is a system that automatically creates an animated rendering of textual and media differences between two versions of a slide presentation. While previous work focused on either textual or image data, SlideDiff integrates both text and media changes, as well as their interactions, for example when adding an image forces nearby text boxes to shrink. Given two versions of a slide (not the full history of edits), SlideDiff detects the textual and image differences, and then animates the changes by mimicking what a user might have done, such as moving the cursor, typing text, resizing image boxes, adding images. This editing metaphor is well known to most users, helping them better understand what has changed, and fosters a sense of connection between remote workers, derived from communicating both the revision process as well as its results. After detection of text and image differences, the animations are rendered in HTML and CSS, including mouse cursor motion, text and image box selection and resizing, text deletion and insertion with its cursor. We discuss strategies for animating changes, in particular the importance of starting with large changes and finishing with smaller edits, and provide details of the implementation using modern HTML and CSS.

{"title":"SlideDiff","authors":"Laurent Denoue, S. Carter, M. Cooper","doi":"10.1145/3209280.3229107","DOIUrl":"https://doi.org/10.1145/3209280.3229107","url":null,"abstract":"SlideDiff is a system that automatically creates an animated rendering of textual and media differences between two versions of a slide presentation. While previous work focused on either textual or image data, SlideDiff integrates both text and media changes, as well as their interactions, for example when adding an image forces nearby text boxes to shrink. Given two versions of a slide (not the full history of edits), SlideDiff detects the textual and image differences, and then animates the changes by mimicking what a user might have done, such as moving the cursor, typing text, resizing image boxes, adding images. This editing metaphor is well known to most users, helping them better understand what has changed, and fosters a sense of connection between remote workers, derived from communicating both the revision process as well as its results. After detection of text and image differences, the animations are rendered in HTML and CSS, including mouse cursor motion, text and image box selection and resizing, text deletion and insertion with its cursor. We discuss strategies for animating changes, in particular the importance of starting with large changes and finishing with smaller edits, and provide details of the implementation using modern HTML and CSS.","PeriodicalId":234145,"journal":{"name":"Proceedings of the ACM Symposium on Document Engineering 2018","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114479435","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 3

SwiftLaTeX SwiftLaTeX

Proceedings of the ACM Symposium on Document Engineering 2018

Pub Date : 2018-08-28 DOI: 10.1145/3209280.3209522

Elliott Wen, Gerald Weber

The text processing tool LATEX has prevailed as a standard in many fields of exact sciences; it is evident that LATEX is likely to be here to stay. From that perspective, it is important to explore what are the best possible ways to support the author in efficiently editing documents. There have been several approaches that provide graphical editing support for LATEX. We argue that a true WYSIWYG (What You See Is What You Get) approach is a justified requirement for future systems and we present here the first cloud-based true WYSIWYG editor. This allows the author to edit the document in its print form directly in a web-based PDF viewer. Building such a system creates unique challenges compared to existing approaches. We identify these challenges and name workable solutions. We also provide a usability evaluation of the new system. In short our finding is that editing LATEX directly in the PDF view is possible for a wide range of edits and valuable for many major user groups and use cases; hence it is a fair requirement for future top-of-the-line LATEX editors.

引用次数: 4

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Proceedings of the ACM Symposium on Document Engineering 2018

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀