Pub Date : 1995-09-11DOI: 10.1109/MASS.1995.528219
R. Grossman, X. Qin, W. Xu, H. Hulen, T. Tyler
Requirements for a high-performance, scalable digital library of multimedia data are presented together with a layered architecture for a system that addresses the requirements. The approach is to view digital data as persistent collections of complex objects and to use lightweight object management to manage this data. To scale as the amount of data increases, the object management component is layered over a storage management component. The storage management component supports hierarchical storage, third-party data transfer and parallel input-output. Several issues that arise from the interface between the storage management and object management components are discussed. The authors have developed a prototype of a digital library using this design. Two key components of the prototype are AIM Net and HPSS. AIM Net is a persistent object manager and is a product of Oak Park Research. HPSS is the High Performance Storage System, developed by a collaboration including IBM Government Systems and several national labs.
提出了对高性能、可扩展的多媒体数据数字库的需求,以及满足这些需求的系统的分层体系结构。该方法将数字数据视为复杂对象的持久集合,并使用轻量级对象管理来管理这些数据。为了随着数据量的增加而扩展,对象管理组件在存储管理组件之上分层。存储管理组件支持分层存储、第三方数据传输和并行输入输出。讨论了存储管理和对象管理组件之间的接口产生的几个问题。作者利用这种设计开发了一个数字图书馆的原型。原型的两个关键组件是AIM网和HPSS。AIM Net是一个持久对象管理器,是Oak Park Research的一个产品。HPSS是高性能存储系统,由IBM政府系统和几个国家实验室合作开发。
{"title":"An architecture for a scalable, high-performance digital library","authors":"R. Grossman, X. Qin, W. Xu, H. Hulen, T. Tyler","doi":"10.1109/MASS.1995.528219","DOIUrl":"https://doi.org/10.1109/MASS.1995.528219","url":null,"abstract":"Requirements for a high-performance, scalable digital library of multimedia data are presented together with a layered architecture for a system that addresses the requirements. The approach is to view digital data as persistent collections of complex objects and to use lightweight object management to manage this data. To scale as the amount of data increases, the object management component is layered over a storage management component. The storage management component supports hierarchical storage, third-party data transfer and parallel input-output. Several issues that arise from the interface between the storage management and object management components are discussed. The authors have developed a prototype of a digital library using this design. Two key components of the prototype are AIM Net and HPSS. AIM Net is a persistent object manager and is a product of Oak Park Research. HPSS is the High Performance Storage System, developed by a collaboration including IBM Government Systems and several national labs.","PeriodicalId":345074,"journal":{"name":"Proceedings of IEEE 14th Symposium on Mass Storage Systems","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1995-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121824259","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 1995-09-11DOI: 10.1109/MASS.1995.528239
J. Berry
The digital revolution is now underway. The use of binary zeros and ones to store data is increasing at a steady rate. They may represent text, images, pictures, sounds, maps, books, music, instructions, programs, or just about anything else which can be represented digitally. As the sizes of the digital data holdings have continued to grow, so too has the need to provide meaningful access to this data. There are a number of efforts now underway to provide such access. In most cases the efforts have been domain specific and progress in one area has been hard to replicate in a different domain. Part of this difficulty has been the lack of a general set of concepts and vocabulary that are sufficiently broad enough to bridge the gaps. The paper presents a general taxonomy of knowledge that is independent of subject matter domain. It begins with knowledge as the most general class and then proceeds to subdivide knowledge into its constituent parts: factual knowledge, procedural knowledge, and judgmental knowledge. Definitions of each type of knowledge are given along with examples sufficient to understand each subclass. A vocabulary is introduced that provides a means to discuss the topic in a manner independent of a specific problem domain. Understanding of the differences between different types or classes of knowledge is necessary if a person or an organization is to begin to build systems that acquire, organize, store, and retrieve various types of knowledge. The paper concludes with a discussion of some tools that are currently available to assist in the building and maintaining of a knowledge resource.
{"title":"A framework for understanding large scale digital storage systems","authors":"J. Berry","doi":"10.1109/MASS.1995.528239","DOIUrl":"https://doi.org/10.1109/MASS.1995.528239","url":null,"abstract":"The digital revolution is now underway. The use of binary zeros and ones to store data is increasing at a steady rate. They may represent text, images, pictures, sounds, maps, books, music, instructions, programs, or just about anything else which can be represented digitally. As the sizes of the digital data holdings have continued to grow, so too has the need to provide meaningful access to this data. There are a number of efforts now underway to provide such access. In most cases the efforts have been domain specific and progress in one area has been hard to replicate in a different domain. Part of this difficulty has been the lack of a general set of concepts and vocabulary that are sufficiently broad enough to bridge the gaps. The paper presents a general taxonomy of knowledge that is independent of subject matter domain. It begins with knowledge as the most general class and then proceeds to subdivide knowledge into its constituent parts: factual knowledge, procedural knowledge, and judgmental knowledge. Definitions of each type of knowledge are given along with examples sufficient to understand each subclass. A vocabulary is introduced that provides a means to discuss the topic in a manner independent of a specific problem domain. Understanding of the differences between different types or classes of knowledge is necessary if a person or an organization is to begin to build systems that acquire, organize, store, and retrieve various types of knowledge. The paper concludes with a discussion of some tools that are currently available to assist in the building and maintaining of a knowledge resource.","PeriodicalId":345074,"journal":{"name":"Proceedings of IEEE 14th Symposium on Mass Storage Systems","volume":"32 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1995-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123746866","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 1995-04-01DOI: 10.1109/MASS.1995.528214
R. Watson, R. Coyne
Datasets up to terabyte size and petabyte total capacities have created a serious imbalance between I/O and storage-system performance and system functionality. One promising approach is the use of parallel data-transfer techniques for client access to storage, peripheral-to-peripheral transfers, and remote file transfers. This paper describes the parallel I/O architecture and mechanisms, parallel transport protocol (PTP), parallel FTP, and parallel client application programming interface (API) used by the high-performance storage system (HPSS). Parallel storage integration issues with a local parallel file system are also discussed.
{"title":"The parallel I/O architecture of the high-performance storage system (HPSS)","authors":"R. Watson, R. Coyne","doi":"10.1109/MASS.1995.528214","DOIUrl":"https://doi.org/10.1109/MASS.1995.528214","url":null,"abstract":"Datasets up to terabyte size and petabyte total capacities have created a serious imbalance between I/O and storage-system performance and system functionality. One promising approach is the use of parallel data-transfer techniques for client access to storage, peripheral-to-peripheral transfers, and remote file transfers. This paper describes the parallel I/O architecture and mechanisms, parallel transport protocol (PTP), parallel FTP, and parallel client application programming interface (API) used by the high-performance storage system (HPSS). Parallel storage integration issues with a local parallel file system are also discussed.","PeriodicalId":345074,"journal":{"name":"Proceedings of IEEE 14th Symposium on Mass Storage Systems","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1995-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130403697","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}