In this paper, we present a cloud-based collaborative authoring tool called Creaza VideoCloud. This authoring tool offers an extensive set of features for document-based video authoring in the cloud.
{"title":"A cloud-based and social authoring tool for video","authors":"Naimdjon Takhirov, F. Duchateau","doi":"10.1145/2034691.2034705","DOIUrl":"https://doi.org/10.1145/2034691.2034705","url":null,"abstract":"In this paper, we present a cloud-based collaborative authoring tool called Creaza VideoCloud. This authoring tool offers an extensive set of features for document-based video authoring in the cloud.","PeriodicalId":91385,"journal":{"name":"Proceedings of the ACM Symposium on Document Engineering. ACM Symposium on Document Engineering","volume":"404 ","pages":"65-68"},"PeriodicalIF":0.0,"publicationDate":"2011-09-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"91510908","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citizen-focused documents in Public Administration devote considerable effort to the expression of conditions. These conditions are commonly expressed as statements of eligibility requirements for the programs being described, but they manifest themselves in other places as well, such as in feedback to readers in tailored informational brochures and as input fields on program application forms. This paper discusses how administrative conditions can be represented in a manner that supports both the eligibility reasoning required for the generation of citizen-tailored documents and also the automated generation of condition expressions in a variety of forms. The paper pays particular attention to the question of how a generation mechanism can allow authors to override the default forms of automated expression when necessary. The discussion is based on a prototype tailored delivery application whose knowledge base is implemented in OWL DL and whose output is constructed using Myriad, a platform for tailored document planning and formatting.
{"title":"Expressing conditions in tailored brochures for public administration","authors":"N. Colineau, Cécile Paris, Keith Vander Linden","doi":"10.1145/2034691.2034736","DOIUrl":"https://doi.org/10.1145/2034691.2034736","url":null,"abstract":"Citizen-focused documents in Public Administration devote considerable effort to the expression of conditions. These conditions are commonly expressed as statements of eligibility requirements for the programs being described, but they manifest themselves in other places as well, such as in feedback to readers in tailored informational brochures and as input fields on program application forms. This paper discusses how administrative conditions can be represented in a manner that supports both the eligibility reasoning required for the generation of citizen-tailored documents and also the automated generation of condition expressions in a variety of forms. The paper pays particular attention to the question of how a generation mechanism can allow authors to override the default forms of automated expression when necessary. The discussion is based on a prototype tailored delivery application whose knowledge base is implemented in OWL DL and whose output is constructed using Myriad, a platform for tailored document planning and formatting.","PeriodicalId":91385,"journal":{"name":"Proceedings of the ACM Symposium on Document Engineering. ACM Symposium on Document Engineering","volume":"9 1","pages":"209-218"},"PeriodicalIF":0.0,"publicationDate":"2011-09-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87883566","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Nowadays, the amount of information accessible on the web is huge. Although web users today expect a more integrated way to access information on the web, it is still rather difficult to "integrate" information from different web sites since most web pages are authored in HTML format, which is actually a presentation-oriented language and is usually considered unstructured. Today, there are many research works aiming at extracting information from web pages. Existing works typically transform the extracting results into structured or semi-structured data formats, thus other applications can further process the results to discover more useful information. Nevertheless, the unstructured nature of HTML makes the transformation process complex and can hardly be widely adopted. In this paper, an annotation-based HTML-to-XML ransformation technology is proposed. The mechanism is developed with both usability and simplicity in mind. With the proposed mechanism, ordinary web site developers simply add annotations to their web pages. Annotated web pages can then be processed by our software libraries and transformed into XML documents, which are machine-understandable. Software agents thus can be developed based on our technology.
{"title":"Developer-friendly annotation-based HTML-to-XML transformation technology","authors":"Lendle Tseng","doi":"10.1145/2034691.2034707","DOIUrl":"https://doi.org/10.1145/2034691.2034707","url":null,"abstract":"Nowadays, the amount of information accessible on the web is huge. Although web users today expect a more integrated way to access information on the web, it is still rather difficult to \"integrate\" information from different web sites since most web pages are authored in HTML format, which is actually a presentation-oriented language and is usually considered unstructured. Today, there are many research works aiming at extracting information from web pages. Existing works typically transform the extracting results into structured or semi-structured data formats, thus other applications can further process the results to discover more useful information. Nevertheless, the unstructured nature of HTML makes the transformation process complex and can hardly be widely adopted. In this paper, an annotation-based HTML-to-XML ransformation technology is proposed. The mechanism is developed with both usability and simplicity in mind. With the proposed mechanism, ordinary web site developers simply add annotations to their web pages. Annotated web pages can then be processed by our software libraries and transformed into XML documents, which are machine-understandable. Software agents thus can be developed based on our technology.","PeriodicalId":91385,"journal":{"name":"Proceedings of the ACM Symposium on Document Engineering. ACM Symposium on Document Engineering","volume":"27 5 1","pages":"73-76"},"PeriodicalIF":0.0,"publicationDate":"2011-09-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89686448","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Ildus Ahmadullin, J. Allebach, Niranjan Damera-Venkata, Jian Fan, S. Lee, Qian Lin, Jerry Liu, Eamonn O'Brien-Strain
Managing large document databases has become an important task. Being able to automatically compare document layouts and classify and search documents with respect to their visual appearance proves to be desirable in many applications. We propose a new algorithm that approximates a metric function between documents based on their visual similarity. The comparison is based only on the visual appearance of the document without taking into consideration its text content. We measure the similarity of single page documents with respect to distance functions between three document components: background, text, and saliency. Each document component is represented as a Gaussian mixture distribution; and distances between the components of different documents are calculated as an approximation of the Hellinger distance between corresponding distributions. Since the Hellinger distance obeys the triangle inequality, it proves to be favorable in the task of nearest neighbor search in a document database. Thus, the computation required to find similar documents in a document database can be significantly reduced.
{"title":"Document visual similarity measure for document search","authors":"Ildus Ahmadullin, J. Allebach, Niranjan Damera-Venkata, Jian Fan, S. Lee, Qian Lin, Jerry Liu, Eamonn O'Brien-Strain","doi":"10.1145/2034691.2034722","DOIUrl":"https://doi.org/10.1145/2034691.2034722","url":null,"abstract":"Managing large document databases has become an important task. Being able to automatically compare document layouts and classify and search documents with respect to their visual appearance proves to be desirable in many applications. We propose a new algorithm that approximates a metric function between documents based on their visual similarity. The comparison is based only on the visual appearance of the document without taking into consideration its text content. We measure the similarity of single page documents with respect to distance functions between three document components: background, text, and saliency. Each document component is represented as a Gaussian mixture distribution; and distances between the components of different documents are calculated as an approximation of the Hellinger distance between corresponding distributions. Since the Hellinger distance obeys the triangle inequality, it proves to be favorable in the task of nearest neighbor search in a document database. Thus, the computation required to find similar documents in a document database can be significantly reduced.","PeriodicalId":91385,"journal":{"name":"Proceedings of the ACM Symposium on Document Engineering. ACM Symposium on Document Engineering","volume":"76 1","pages":"139-142"},"PeriodicalIF":0.0,"publicationDate":"2011-09-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76084136","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Many Web-based geographic applications have been developed in various domains, such as tourism, education, surveillance and military. However, developing such applications is a cumbersome task because it requires several types of components (e.g., maps, contents, indexing services, databases) that have to be assembled together. Hence, developers have to deal with different technologies and application behavior models. In order to create Web-based geographic applications and overcome these design problems, we propose a framework composed of three complementary tasks: identifying some desired data, building the graphical layout organization and defining potential user interactions. According to this framework, we have specified a unified model and we have encoded it using Semantic Web technologies, such as RDF. Through a prototype named WINDMash, we have implemented some tools that instantiate our model and automatically generate concrete Internet geographic applications that can be executed on Web browsers.
{"title":"A framework with tools for designing web-based geographic applications","authors":"The Nhan Luong, Sébastien Laborie, T. Nodenot","doi":"10.1145/2034691.2034699","DOIUrl":"https://doi.org/10.1145/2034691.2034699","url":null,"abstract":"Many Web-based geographic applications have been developed in various domains, such as tourism, education, surveillance and military. However, developing such applications is a cumbersome task because it requires several types of components (e.g., maps, contents, indexing services, databases) that have to be assembled together. Hence, developers have to deal with different technologies and application behavior models. In order to create Web-based geographic applications and overcome these design problems, we propose a framework composed of three complementary tasks: identifying some desired data, building the graphical layout organization and defining potential user interactions. According to this framework, we have specified a unified model and we have encoded it using Semantic Web technologies, such as RDF. Through a prototype named WINDMash, we have implemented some tools that instantiate our model and automatically generate concrete Internet geographic applications that can be executed on Web browsers.","PeriodicalId":91385,"journal":{"name":"Proceedings of the ACM Symposium on Document Engineering. ACM Symposium on Document Engineering","volume":"10 27","pages":"33-42"},"PeriodicalIF":0.0,"publicationDate":"2011-09-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"91506601","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
This workshop will explore a new Google technology involving documents and programming (not yet unannounced by the paper submission deadline).
本次研讨会将探讨一项涉及文档和编程的谷歌新技术(论文提交截止日期前尚未公布)。
{"title":"Google mystery workshop","authors":"John Day-Richter","doi":"10.1145/2034691.2034750","DOIUrl":"https://doi.org/10.1145/2034691.2034750","url":null,"abstract":"This workshop will explore a new Google technology involving documents and programming (not yet unannounced by the paper submission deadline).","PeriodicalId":91385,"journal":{"name":"Proceedings of the ACM Symposium on Document Engineering. ACM Symposium on Document Engineering","volume":"140 1","pages":"281-282"},"PeriodicalIF":0.0,"publicationDate":"2011-09-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73282860","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Niranjan Damera-Venkata, José Bento, Eamonn O'Brien-Strain
We present a new paradigm for automated document composition based on a generative, unified probabilistic document model (PDM) that models document composition. The model formally incorporates key design variables such as content pagination, relative arrangement possibilities for page elements and possible page edits. These design choices are modeled jointly as coupled random variables (a Bayesian Network) with uncertainty modeled by their probability distributions. The overall joint probability distribution for the network assigns higher probability to good design choices. Given this model, we show that the general document layout problem can be reduced to probabilistic inference over the Bayesian network. We show that the inference task may be accomplished efficiently, scaling linearly with the content in the best case. We provide a useful specialization of the general model and use it to illustrate the advantages of soft probabilistic encodings over hard one-way constraints in specifying design aesthetics.
{"title":"Probabilistic document model for automated document composition","authors":"Niranjan Damera-Venkata, José Bento, Eamonn O'Brien-Strain","doi":"10.1145/2034691.2034695","DOIUrl":"https://doi.org/10.1145/2034691.2034695","url":null,"abstract":"We present a new paradigm for automated document composition based on a generative, unified probabilistic document model (PDM) that models document composition. The model formally incorporates key design variables such as content pagination, relative arrangement possibilities for page elements and possible page edits. These design choices are modeled jointly as coupled random variables (a Bayesian Network) with uncertainty modeled by their probability distributions. The overall joint probability distribution for the network assigns higher probability to good design choices. Given this model, we show that the general document layout problem can be reduced to probabilistic inference over the Bayesian network. We show that the inference task may be accomplished efficiently, scaling linearly with the content in the best case. We provide a useful specialization of the general model and use it to illustrate the advantages of soft probabilistic encodings over hard one-way constraints in specifying design aesthetics.","PeriodicalId":91385,"journal":{"name":"Proceedings of the ACM Symposium on Document Engineering. ACM Symposium on Document Engineering","volume":"62 1","pages":"3-12"},"PeriodicalIF":0.0,"publicationDate":"2011-09-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89839408","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
At the microscopic level, printing on a substrate exhibits imperfections that can be used as a unique identifier for labels, documents and other printed items. In previous work, we have demonstrated using these minute imperfections around a simple forensic mark such as a single printed character for robust authentication of the character with a low cost (and mobile) system. This approach allows for product authentication even when there is only minimal printing (e.g. on a small label or medallion), supporting a variety of secure document workflows. In this paper, we present an investigation on the influence that the substrate type has on the imperfections of the printing process that are used to derive the character 'signature'. We also make a comparison between two printing processes, dry electro photographic process (laser) and (thermal) inkjet. Understanding the sensitivity of our methods to these factors is important so that we know the limitations of the approach for document forensics.
{"title":"A study of the interaction of paper substrates on printed forensic imaging","authors":"Guy B. Adams, S. Pollard, S. Simske","doi":"10.1145/2034691.2034743","DOIUrl":"https://doi.org/10.1145/2034691.2034743","url":null,"abstract":"At the microscopic level, printing on a substrate exhibits imperfections that can be used as a unique identifier for labels, documents and other printed items. In previous work, we have demonstrated using these minute imperfections around a simple forensic mark such as a single printed character for robust authentication of the character with a low cost (and mobile) system. This approach allows for product authentication even when there is only minimal printing (e.g. on a small label or medallion), supporting a variety of secure document workflows. In this paper, we present an investigation on the influence that the substrate type has on the imperfections of the printing process that are used to derive the character 'signature'. We also make a comparison between two printing processes, dry electro photographic process (laser) and (thermal) inkjet. Understanding the sensitivity of our methods to these factors is important so that we know the limitations of the approach for document forensics.","PeriodicalId":91385,"journal":{"name":"Proceedings of the ACM Symposium on Document Engineering. ACM Symposium on Document Engineering","volume":"331 1","pages":"263-266"},"PeriodicalIF":0.0,"publicationDate":"2011-09-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77143418","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
R. Piccoli, Rodrigo Chamun, Nicole Carrion Cogo, J. B. Oliveira, I. Manssour
Marketing flyers, greeting cards, brochures and similar materials are expensive to produce, since these documents need to be personalized and typically require a graphic design professional to create. Either authoring tools are too complex to use or a predefined set of fixed templates is available, which can be restrictive and difficult to produce the desired results. Thus, simpler design tools are a compelling need for small businesses and consumers. This paper describes an interactive authoring method for creating free-form documents based on a force-directed approach, traditionally applied for graph layout problems. This is used for automatically distributing and manipulating images, text and decorative elements on a page, according to forces modeled after physical laws. Such approach can be used for enabling easy authoring of personalized brochures, photo albums, calendars, greeting cards and other free-form documents. A prototype has been developed for evaluation purposes, and is briefly described in this paper. Evaluation results are presented as well, showing that users enjoy the experience of designing a page by interacting with it, and that end results can be satisfactory.
{"title":"A novel physics-based interaction model for free document layout","authors":"R. Piccoli, Rodrigo Chamun, Nicole Carrion Cogo, J. B. Oliveira, I. Manssour","doi":"10.1145/2034691.2034725","DOIUrl":"https://doi.org/10.1145/2034691.2034725","url":null,"abstract":"Marketing flyers, greeting cards, brochures and similar materials are expensive to produce, since these documents need to be personalized and typically require a graphic design professional to create. Either authoring tools are too complex to use or a predefined set of fixed templates is available, which can be restrictive and difficult to produce the desired results. Thus, simpler design tools are a compelling need for small businesses and consumers. This paper describes an interactive authoring method for creating free-form documents based on a force-directed approach, traditionally applied for graph layout problems. This is used for automatically distributing and manipulating images, text and decorative elements on a page, according to forces modeled after physical laws. Such approach can be used for enabling easy authoring of personalized brochures, photo albums, calendars, greeting cards and other free-form documents. A prototype has been developed for evaluation purposes, and is briefly described in this paper. Evaluation results are presented as well, showing that users enjoy the experience of designing a page by interacting with it, and that end results can be satisfactory.","PeriodicalId":91385,"journal":{"name":"Proceedings of the ACM Symposium on Document Engineering. ACM Symposium on Document Engineering","volume":"7 1","pages":"153-162"},"PeriodicalIF":0.0,"publicationDate":"2011-09-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82211129","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Michael Nebeling, Fabrice Matulic, Lucas Streit, M. Norrie
Despite the fact that average screen size and resolution have dramatically increased, many of today's web sites still do not scale well in larger viewing contexts. The upcoming HTML5 and CSS3 standards propose features that can be used to build more flexible web page layouts, but their potential to accommodate a wider range of display environments is currently relatively unexplored. We examine the proposed standards to identify the most promising features and report on experiments with a number of adaptive layout mechanisms that support the required forms of adaptation to take advantage of greater screen real estates, such as automated scaling of text and media. Special attention is given to the effective use of multi-column layout, a brand new feature for web design that contributes to optimising the space occupied by text, but at the same time still poses problems in predominantly continuous vertical-scrolling browsing behaviours. The proposed solutions were integrated in a flexible layout template that was then applied to an existing news web site and tested on users to identify the adaptive features that best support reading comfort and efficiency.
{"title":"Adaptive layout template for effective web content presentation in large-screen contexts","authors":"Michael Nebeling, Fabrice Matulic, Lucas Streit, M. Norrie","doi":"10.1145/2034691.2034737","DOIUrl":"https://doi.org/10.1145/2034691.2034737","url":null,"abstract":"Despite the fact that average screen size and resolution have dramatically increased, many of today's web sites still do not scale well in larger viewing contexts. The upcoming HTML5 and CSS3 standards propose features that can be used to build more flexible web page layouts, but their potential to accommodate a wider range of display environments is currently relatively unexplored. We examine the proposed standards to identify the most promising features and report on experiments with a number of adaptive layout mechanisms that support the required forms of adaptation to take advantage of greater screen real estates, such as automated scaling of text and media. Special attention is given to the effective use of multi-column layout, a brand new feature for web design that contributes to optimising the space occupied by text, but at the same time still poses problems in predominantly continuous vertical-scrolling browsing behaviours. The proposed solutions were integrated in a flexible layout template that was then applied to an existing news web site and tested on users to identify the adaptive features that best support reading comfort and efficiency.","PeriodicalId":91385,"journal":{"name":"Proceedings of the ACM Symposium on Document Engineering. ACM Symposium on Document Engineering","volume":"35 1","pages":"219-228"},"PeriodicalIF":0.0,"publicationDate":"2011-09-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81467742","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}