This paper describes research on the automatic generation of hypermedia or Web-based presentations for semi-structured data resulting from ad-hoc queries. We identify how different aspects of adaptation, such as personalization and customization, influence the generation process. We address important aspects of the software that facilitates the generation process.
{"title":"Automatic hypermedia generation for ad hoc queries on semi-structured data","authors":"G. Houben, P. D. Bra","doi":"10.1145/336597.336675","DOIUrl":"https://doi.org/10.1145/336597.336675","url":null,"abstract":"This paper describes research on the automatic generation of hypermedia or Web-based presentations for semi-structured data resulting from ad-hoc queries. We identify how different aspects of adaptation, such as personalization and customization, influence the generation process. We address important aspects of the software that facilitates the generation process.","PeriodicalId":42447,"journal":{"name":"Digital Library Perspectives","volume":null,"pages":null},"PeriodicalIF":1.7,"publicationDate":"2000-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73334641","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
A preliminary study was conducted to help understand the purpose of digital libraries (DLs) and to investigate whether meaningful results could be obtained from small user studies of digital libraries. Results stress the importance of mental models, and of "traditional" library support.
{"title":"Purpose and usability of digital libraries","authors":"Y. Theng, N. Mohd-Nasir, H. Thimbleby","doi":"10.1145/336597.336674","DOIUrl":"https://doi.org/10.1145/336597.336674","url":null,"abstract":"A preliminary study was conducted to help understand the purpose of digital libraries (DLs) and to investigate whether meaningful results could be obtained from small user studies of digital libraries. Results stress the importance of mental models, and of \"traditional\" library support.","PeriodicalId":42447,"journal":{"name":"Digital Library Perspectives","volume":null,"pages":null},"PeriodicalIF":1.7,"publicationDate":"2000-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84173951","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The automatic document metadata extraction process is animportant task in a world where thousands of documents are just one``click'' away. Thus, powerful indices are necessary to support effective retrieval. The upcoming XML standard represents an important step in this direction as itssemistructuredrepresentation conveys document metadata together with the text of the document. For example, retrieval of scientific papers by authors or affiliations would be a straightforward tasks if papers were stored in XML.Unfortunately, today, the largest majority of documents on the web are available in forms that do not carryadditional semantics. Converting existing documents to a semistructured representation is time consuming and no automatic process can be easily applied. In this paper we discuss a system, based on a novel spatial/visualknowledge principle, for extracting metadata from scientific papers storedas PostScript files. Our system embeds the general knowledge about the graphical layout of a scientific paper to guide the metadata extraction process. Our system can effectively assist the automatic index creation for digital libraries.
{"title":"Knowledge-based metadata extraction from PostScript files","authors":"G. Giuffrida, E. C. Shek, Jihoon Yang","doi":"10.1145/336597.336639","DOIUrl":"https://doi.org/10.1145/336597.336639","url":null,"abstract":"The automatic document metadata extraction process is animportant task in a world where thousands of documents are just one``click'' away. Thus, powerful indices are necessary to support effective retrieval. The upcoming XML standard represents an important step in this direction as itssemistructuredrepresentation conveys document metadata together with the text of the document. For example, retrieval of scientific papers by authors or affiliations would be a straightforward tasks if papers were stored in XML.Unfortunately, today, the largest majority of documents on the web are available in forms that do not carryadditional semantics. Converting existing documents to a semistructured representation is time consuming and no automatic process can be easily applied. In this paper we discuss a system, based on a novel spatial/visualknowledge principle, for extracting metadata from scientific papers storedas PostScript files. Our system embeds the general knowledge about the graphical layout of a scientific paper to guide the metadata extraction process. Our system can effectively assist the automatic index creation for digital libraries.","PeriodicalId":42447,"journal":{"name":"Digital Library Perspectives","volume":null,"pages":null},"PeriodicalIF":1.7,"publicationDate":"2000-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84354303","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Digital libraries challenge humanists and other academics to rethink the relationship between technology and their work. At the Perseus Project, we have seen the rise of a new combination of skills. The “Corpus Editor” manages a collection of materials that are thematically coherent and focused but are too large to be managed soley with the labor-intensive techniques of traditional editing. The corpus editor must possess a degree of domain specific knowledge and technical expertise that virtually no established graduate training provides. This new position poses a challenge to humanists as they train and support members of the field pursuing new, but necessary tasks.
{"title":"New technology and new roles: the need for “corpus editors”","authors":"G. Crane, Jeffrey A. Rydberg-Cox","doi":"10.1145/336597.336686","DOIUrl":"https://doi.org/10.1145/336597.336686","url":null,"abstract":"Digital libraries challenge humanists and other academics to rethink the relationship between technology and their work. At the Perseus Project, we have seen the rise of a new combination of skills. The “Corpus Editor” manages a collection of materials that are thematically coherent and focused but are too large to be managed soley with the labor-intensive techniques of traditional editing. The corpus editor must possess a degree of domain specific knowledge and technical expertise that virtually no established graduate training provides. This new position poses a challenge to humanists as they train and support members of the field pursuing new, but necessary tasks.","PeriodicalId":42447,"journal":{"name":"Digital Library Perspectives","volume":null,"pages":null},"PeriodicalIF":1.7,"publicationDate":"2000-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1145/336597.336686","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"72458069","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Smaller scholarly projects represent a sector of the digital library which makes an important contribution to research on specialized text encoding, retrieval, and interface design for scholarly audiences. Such projects, however, face funding and publication challenges which threaten their long-term survival. The Women Writers Project is using an unusual tier-based licensing model to make the transition from grant-funded research project to independent financial viability.
{"title":"A licensing model for scholarly textbases","authors":"Julia Flanders, E. Mylonas","doi":"10.1145/336597.336690","DOIUrl":"https://doi.org/10.1145/336597.336690","url":null,"abstract":"Smaller scholarly projects represent a sector of the digital library which makes an important contribution to research on specialized text encoding, retrieval, and interface design for scholarly audiences. Such projects, however, face funding and publication challenges which threaten their long-term survival. The Women Writers Project is using an unusual tier-based licensing model to make the transition from grant-funded research project to independent financial viability.","PeriodicalId":42447,"journal":{"name":"Digital Library Perspectives","volume":null,"pages":null},"PeriodicalIF":1.7,"publicationDate":"2000-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88606070","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
A new model has been developed for the standardization of names in bibliographic databases. This paper describes the model and its implementation and also compares it with an existing model. The results show that the new model will revolutionize name authority control and will also improve on the existing NACO model. A prototype that was developed also indicates the technical feasibility of the model's implementation.
{"title":"Revolutionizing name authority control","authors":"M. Snyman, M. J. V. Rensburg","doi":"10.1145/336597.336660","DOIUrl":"https://doi.org/10.1145/336597.336660","url":null,"abstract":"A new model has been developed for the standardization of names in bibliographic databases. This paper describes the model and its implementation and also compares it with an existing model. The results show that the new model will revolutionize name authority control and will also improve on the existing NACO model. A prototype that was developed also indicates the technical feasibility of the model's implementation.","PeriodicalId":42447,"journal":{"name":"Digital Library Perspectives","volume":null,"pages":null},"PeriodicalIF":1.7,"publicationDate":"2000-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80365231","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
R. Wesley, D. Greenstein, David Millman, M. Tibbetts, G. Zick
This panel will address some of the practical issues of implementing digital libraries. Everyone seems to agree that digital libraries are in their infancy. Many of us watch in amazement as new developments occur. In the year 2000 building a production digital library means analyzing tradeoffs between stability, innovation, and costs/benefits. In this panel we will address some of the more interesting issues of digital library implementation. One of the critical challenges is the economics of digital libraries. Institutions are forming consortia out of necessity and this raises issues of trust, cooperation and commitment. In order for institutions to deliver web-based resources effectively sophisticated cross-organizational access management tools are needed to authenticate and authorize. Institutions are developing new tools and exporting them from research departments into libraries. How well does this collaboration work? The above issues are only a sampling of the challenges this panel will explore.
{"title":"Implementing digital libraries (panel session)","authors":"R. Wesley, D. Greenstein, David Millman, M. Tibbetts, G. Zick","doi":"10.1145/336597.336699","DOIUrl":"https://doi.org/10.1145/336597.336699","url":null,"abstract":"This panel will address some of the practical issues of implementing digital libraries. Everyone seems to agree that digital libraries are in their infancy. Many of us watch in amazement as new developments occur. In the year 2000 building a production digital library means analyzing tradeoffs between stability, innovation, and costs/benefits. In this panel we will address some of the more interesting issues of digital library implementation.\u0000One of the critical challenges is the economics of digital libraries. Institutions are forming consortia out of necessity and this raises issues of trust, cooperation and commitment. \u0000In order for institutions to deliver web-based resources effectively sophisticated cross-organizational access management tools are needed to authenticate and authorize.\u0000Institutions are developing new tools and exporting them from research departments into libraries. How well does this collaboration work?\u0000The above issues are only a sampling of the challenges this panel will explore.","PeriodicalId":42447,"journal":{"name":"Digital Library Perspectives","volume":null,"pages":null},"PeriodicalIF":1.7,"publicationDate":"2000-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82416260","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
This paper describes the Greenstone digital library software, a comprehensive, open-source system for the construction and presentation of information collections. Collections built with Greenstone offer effective full-text searching and metadata-based browsing facilities that are attractive and easy to use. Moreover, they are easily maintainable and can be augmented and rebuilt entirely automatically. The system is extensible: software "plugins" accommodate different document and metadata types.
{"title":"Greenstone: a comprehensive open-source digital library software system","authors":"I. Witten, S. Boddie, D. Bainbridge, R. McNab","doi":"10.1145/336597.336650","DOIUrl":"https://doi.org/10.1145/336597.336650","url":null,"abstract":"This paper describes the Greenstone digital library software, a comprehensive, open-source system for the construction and presentation of information collections. Collections built with Greenstone offer effective full-text searching and metadata-based browsing facilities that are attractive and easy to use. Moreover, they are easily maintainable and can be augmented and rebuilt entirely automatically. The system is extensible: software \"plugins\" accommodate different document and metadata types.","PeriodicalId":42447,"journal":{"name":"Digital Library Perspectives","volume":null,"pages":null},"PeriodicalIF":1.7,"publicationDate":"2000-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76333827","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
In this paper we present a 2x3 factorial design study evaluating the limits and differences on the behavior of 10 users when searching in a virtual reality representation that mimics the arrangement of a traditional library. The focus of this study was the effect of clustering techniques and query highlighting on search strategy users develop in the virtual environment, and whether position or spatial arrangement influenced user behavior. We found several particularities that can be attributed to the differences in the VR environment.This study's results identify: 1) the need of co-designing both spatial arrangement and interaction method; 2) adifficulty novice users faced when using clusters to identify commontopics; 3) the influence of position and distance on users' selection of collection items to inspect; and 4) that users did not search until found the best match, but only until they found a satisfactory match.
{"title":"A study of user behavior in an immersive virtual environment for digital libraries","authors":"F. D. Neves, E. Fox","doi":"10.1145/336597.336648","DOIUrl":"https://doi.org/10.1145/336597.336648","url":null,"abstract":"In this paper we present a 2x3 factorial design study evaluating the limits and differences on the behavior of 10 users when searching in a virtual reality representation that mimics the arrangement of a traditional library. The focus of this study was the effect of clustering techniques and query highlighting on search strategy users develop in the virtual environment, and whether position or spatial arrangement influenced user behavior. We found several particularities that can be attributed to the differences in the VR environment.This study's results identify: 1) the need of co-designing both spatial arrangement and interaction method; 2) adifficulty novice users faced when using clusters to identify commontopics; 3) the influence of position and distance on users' selection of collection items to inspect; and 4) that users did not search until found the best match, but only until they found a satisfactory match.","PeriodicalId":42447,"journal":{"name":"Digital Library Perspectives","volume":null,"pages":null},"PeriodicalIF":1.7,"publicationDate":"2000-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"90556034","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
In this paper we introduce the MatchDetectReveal(MDR) system, which is capable of identifying overlapping and plagiarised documents. Each component of the system is briefly described. The matching-engine component uses a modified suffix tree representation, which is able to identify the exact overlapping chunks and its performance is also presented.
{"title":"Document overlap detection system for distributed digital libraries","authors":"K. Monostori, A. Zaslavsky, H. Schmidt","doi":"10.1145/336597.336667","DOIUrl":"https://doi.org/10.1145/336597.336667","url":null,"abstract":"In this paper we introduce the MatchDetectReveal(MDR) system, which is capable of identifying overlapping and plagiarised documents. Each component of the system is briefly described. The matching-engine component uses a modified suffix tree representation, which is able to identify the exact overlapping chunks and its performance is also presented.","PeriodicalId":42447,"journal":{"name":"Digital Library Perspectives","volume":null,"pages":null},"PeriodicalIF":1.7,"publicationDate":"2000-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"90823865","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}