
Proceedings of the 2015 ACM Symposium on Document Engineering: Latest Articles

Similarity-Based Support for Text Reuse in Technical Writing
Pub Date : 2015-09-08 DOI: 10.1145/2682571.2797068
Axel J. Soto, A. Mohammad, Andrew Albert, Aminul Islam, E. Milios, Michael Doyle, R. Minghim, Maria Cristina Ferreira de Oliveira
Technical writing in professional environments, such as user manual authoring for new products, is a task that relies heavily on reuse of content. Therefore, technical content is typically created following a strategy where modular units of text have references to each other. One of the main challenges faced by technical authors is to avoid duplicating existing content, as this adds unnecessary effort, generates undesirable inconsistencies, and dramatically increases maintenance and translation costs. However, there are few computational tools available to support this activity. This paper investigates the use of different similarity methods for the task of identification of reuse opportunities in technical writing. We evaluated our results using existing ground truth as well as feedback from technical authors. Finally, we also propose a tool that combines text similarity algorithms with interactive visualizations to aid authors in understanding differences in a collection of topics and identifying reuse opportunities.
Citations: 10
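The abstract above does not say which similarity methods were compared. As a rough illustration of the general idea only, the following sketch scores pairs of topics with bag-of-words cosine similarity and lists the pairs above a threshold as reuse candidates. The function names, the tokenizer, and the 0.8 threshold are assumptions made for this example, not the authors' method.

```python
import math
import re
from collections import Counter


def tokenize(text):
    # Lowercase the text and keep runs of letters and digits as tokens.
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine_similarity(a, b):
    # Cosine of the angle between the two bag-of-words count vectors.
    va, vb = Counter(tokenize(a)), Counter(tokenize(b))
    dot = sum(va[t] * vb[t] for t in va.keys() & vb.keys())
    norm_a = math.sqrt(sum(c * c for c in va.values()))
    norm_b = math.sqrt(sum(c * c for c in vb.values()))
    norm = norm_a * norm_b
    return dot / norm if norm else 0.0


def reuse_candidates(topics, threshold=0.8):
    # topics maps a (hypothetical) topic id to its text.
    # Returns (id_a, id_b, score) for pairs at or above the threshold,
    # highest score first.
    ids = sorted(topics)
    pairs = []
    for i, a in enumerate(ids):
        for b in ids[i + 1:]:
            score = cosine_similarity(topics[a], topics[b])
            if score >= threshold:
                pairs.append((a, b, score))
    return sorted(pairs, key=lambda p: -p[2])
```

Comparing every pair is quadratic in the number of topics, which is acceptable for a small collection; a larger one would likely need an index or a blocking step first.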
Exploring Scholarly Papers Through Citations
Pub Date : 2015-09-08 DOI: 10.1145/2682571.2797065
A. Iorio, Raffaele Giannella, Francesco Poggi, S. Peroni, F. Vitali
Bibliographies are fundamental components of academic papers and both the scientific research and its evaluation are fundamentally organized around the correct examination and classification of scientific bibliographies. Currently, most digital libraries publish bibliographic information about their content for free, and many include the citations (outgoing and in some cases even incoming) to the papers they manage. Unfortunately, no sophistication is spent for these lists: monolithic pieces of text where it is even difficult to tell automatically the authors, the title and publication details, and where users are provided with no mechanisms to filter and access full context of each citation. For instance, there is no way to know in which sentence a work was cited (the citation context) and why (the citation function). In this paper we introduce a novel environment for navigating, filtering and making sense of citations. The interface, called BEX, exploits data freely available in a Linked Open Dataset about scholarly papers; end-user testing proved its efficacy and usability.
Citations: 8
Hiding Information in Multiple Level-line Moirés
Pub Date : 2015-09-08 DOI: 10.1145/2682571.2797078
T. Walger, R. Hersch
Secure documents often comprise an information layer that is hard to reproduce. Moiré techniques for the prevention of counterfeiting rely on the superposition of an array of transparent lines or microlenses on top of a base layer containing hidden information. Level-line moirés consist of shapes that appear to be beating upon relative translation of a revealing grating on top of a base, in which the desired information is encoded. Usually, the base only contains the information corresponding to one moiré. In order to increase the difficulty of counterfeiting, we use tessellations to incorporate two or more moirés within the same layer. With the method we propose, the information corresponding to up to seven level-line moirés can be embedded within a single base layer. The moirés are recovered with a revealer printed on a transparency or with an array of cylindrical lenses. This method is general and can be extended to other fabrication technologies.
Citations: 6
Session details: Document Understanding
Pub Date : 2015-09-08 DOI: 10.1145/3256808
P. King
Citations: 0
Fine Grained Access of Interactive Personal Health Records
Pub Date : 2015-09-08 DOI: 10.1145/2682571.2797098
H. Balinsky, Nassir Mohammad
Electronic Personal Healthcare Records (PHRs) provide the means for individuals to hold, update and share their medical information in a digitally accessible form. However, the sensitive nature of healthcare information and the functional limitations of PHRs have resulted in their acceptance remaining relatively low. This is primarily due to fears of security and privacy in the current central-authority-based technologies on offer. In order to alleviate these concerns, whilst maintaining security, ease of access and distribution, we propose a PHR format that utilizes and extends a secure composite document format, Publicly Posted Composite Documents [1], originally designed for cross-organizational business workflows. The proposed PHR ensures data is always encrypted whilst traversing non-secure channels, with fine-grained access control built in to enable multiple people to have differential access to the same PHR. End-to-end encryption using Password Key Derivation Functions ensures no central authority is required to have access to plaintext data or decryption keys. This allows safe cooperation with Cloud Service Providers (CSPs) who act as the primary storage and vehicle by which PHRs can be shared. Our PHRs are designed to be partially downloaded and exported on request, and to gather PHR formatted data securely from an ecosystem of healthcare devices.
Citations: 3
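The abstract above mentions password-based key derivation without naming a specific function or its parameters. Purely as an illustration of the concept, the sketch below derives a key from a password with PBKDF2-HMAC-SHA256 from the Python standard library. The choice of PBKDF2, the iteration count, the salt size, and the helper names are all assumptions for this example and are not taken from the paper.

```python
import hashlib
import hmac
import os


def new_salt(size=16):
    # Random per-record salt; it is stored with the record and is not secret.
    return os.urandom(size)


def derive_key(password, salt, iterations=200_000, length=32):
    # PBKDF2-HMAC-SHA256 stretches a password into a fixed-length key.
    # The iteration count here is an illustrative value, not a recommendation.
    return hashlib.pbkdf2_hmac(
        "sha256", password.encode("utf-8"), salt, iterations, dklen=length
    )


def same_key(a, b):
    # Constant-time comparison, so timing does not reveal where keys differ.
    return hmac.compare_digest(a, b)
```

Because the key is recomputed from the password and the stored salt on the client, a storage provider that holds only the salt and the ciphertext would not need to see the key, which is the property the abstract attributes to its design.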
Session details: Knowledge Extraction
Pub Date : 2015-09-08 DOI: 10.1145/3256802
S. Simske
Citations: 0
Generating Abstractive Summaries from Meeting Transcripts
Pub Date : 2015-09-08 DOI: 10.1145/2682571.2797061
Siddhartha Banerjee, P. Mitra, Kazunari Sugiyama
Summaries of meetings are very important as they convey the essential content of discussions in a concise form. Both participants and non-participants are interested in the summaries of meetings to plan for their future work. Generally, it is time-consuming to read and understand whole documents. Therefore, summaries play an important role as the readers are interested in only the important context of discussions. In this work, we address the task of meeting document summarization. Automatic summarization systems on meeting conversations developed so far have been primarily extractive, resulting in unacceptable summaries that are hard to read. The extracted utterances contain disfluencies that affect the quality of the extractive summaries. To make summaries much more readable, we propose an approach to generating abstractive summaries by fusing important content from several utterances. We first separate meeting transcripts into various topic segments, and then identify the important utterances in each segment using a supervised learning approach. The important utterances are then combined together to generate a one-sentence summary. In the text generation step, the dependency parses of the utterances in each segment are combined together to create a directed graph. The most informative and well-formed sub-graph obtained by integer linear programming (ILP) is selected to generate a one-sentence summary for each topic segment. The ILP formulation reduces disfluencies by leveraging grammatical relations that are more prominent in non-conversational style of text, and therefore generates summaries that are comparable to human-written abstractive summaries. Experimental results show that our method can generate more informative summaries than the baselines. In addition, readability assessments by human judges as well as log-likelihood estimates obtained from the dependency parser show that our generated summaries are significantly readable and well-formed.
Citations: 11
Towards Mobile OCR: How to Take a Good Picture of a Document Without Sight
Pub Date : 2015-09-08 DOI: 10.1145/2682571.2797066
M. Cutter, R. Manduchi
The advent of mobile OCR (optical character recognition) applications on regular smartphones holds great promise for enabling blind people to access printed information. Unfortunately, these systems suffer from a problem: in order for OCR output to be meaningful, a well-framed image of the document needs to be taken, something that is difficult to do without sight. This contribution presents an experimental investigation of how blind people position and orient a camera phone while acquiring document images. We developed experimental software to investigate if verbal guidance aids in the acquisition of OCR-readable images without sight. We report on our participants' feedback and performance before and after assistance from our software.
Citations: 16
Spatio-temporal Validation of Multimedia Documents
Pub Date : 2015-09-08 DOI: 10.1145/2682571.2797060
J. Santos, Christiano Braga, D. Muchaluat-Saade, C. Roisin, Nabil Layaïda
A multimedia document authoring system should provide analysis and validation tools that help authors find and correct mistakes before document deployment. Although very useful, multimedia validation tools are not often provided. Spatial validation of multimedia documents may be performed over the initial position of media items before presentation starts. However, such an approach does not lead to ideal results when media item placement changes over time. Some document authoring languages allow the definition of spatio-temporal relationships among media items and they can be moved or resized during runtime. Current validation approaches do not verify dynamic spatio-temporal relationships. This paper presents a novel approach for spatio-temporal validation of multimedia documents. We model the document state, extending the Simple Hypermedia Model (SHM), comprising media item positioning during the whole document presentation. Mappings between document states represent time lapse or user interaction. We also define a set of atomic formulas upon which the author's expectations related to the spatio-temporal layout can be described and analyzed.
Citations: 10
An Approach for Designing Proofreading Views in Publishing Chains
Pub Date : 2015-09-08 DOI: 10.1145/2682571.2797096
Léonard Dumas, Stéphane Crozat, B. Bachimont, Sylvain Spinelli
Documentary production often involves a revising process in which documents need to be proofread. This important task faces new challenges when dealing with digital documents. Indeed, three features of digital writing are problematic: (1) documents evolve very frequently and cannot be proofread each time as a whole, (2) interactions provided by hypertexts make the task less efficient and (3) document repurposing increases the views of content to proofread. As an advanced digital writing technology, XML publishing chains are a relevant framework for studying proofreading of digital documents. This paper argues the need for proofreading views, which enable the comparison of two versions of the document based on a diff algorithm. It also proposes a design approach based on case studies within Scenari publishing chains, involving the annotation of content and the validation of modifications.
Citations: 1
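The abstract above argues for proofreading views built on a diff between two versions of a document, but it does not specify the diff algorithm or the document model. As a loose illustration only, the following sketch uses Python's standard `difflib.SequenceMatcher` on lists of paragraphs to keep just the passages that changed, which is the part a proofreader would need to re-read. Treating a document as a flat list of strings and the function name `changed_passages` are assumptions for this example; the XML publishing chains discussed in the paper would need a structure-aware comparison.

```python
import difflib


def changed_passages(old, new):
    # old and new are lists of paragraphs (strings) from two versions.
    # Returns (tag, old_part, new_part) for every non-equal region, where
    # tag is one of "replace", "delete", or "insert".
    matcher = difflib.SequenceMatcher(a=old, b=new, autojunk=False)
    changes = []
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag != "equal":
            changes.append((tag, old[i1:i2], new[j1:j2]))
    return changes
```

A view built on this output could show only the listed regions with their old and new text side by side, so that unchanged content does not have to be proofread again.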