2003 Joint Conference on Digital Libraries, 2003. Proceedings.最新文献

英文中文

SCENS: a system for the mediated sharing of sensitive data SCENS:敏感数据的中介共享系统

2003 Joint Conference on Digital Libraries, 2003. Proceedings.

Pub Date : 2003-05-27 DOI: 10.1109/JCDL.2003.1204875

S. Ye, F. Makedon, T. Steinberg, Li Shen, J. Ford, Yuhang Wang, Yan Zhao, S. Kapidakis

We introduce SCENS, a secure content exchange negotiation system suitable for the exchange of private digital data that reside in distributed digital repositories. SCENS is an open negotiation system with flexibility, security and scalability. SCENS is currently being designed to support data sharing in scientific research, by providing incentives and goals specific to a research community. However, it can easily be extended to apply to other communities, such as government, commercial and other types of exchanges. It is a trusted third party software infrastructure enabling independent entities to interact and conduct multiple forms of negotiation.

本文介绍了一种安全的内容交换协商系统SCENS，它适用于分布式数字存储库中私有数字数据的交换。SCENS是一个具有灵活性、安全性和可扩展性的开放式协商系统。目前正在设计SCENS，通过提供特定于研究社区的激励措施和目标，支持科学研究中的数据共享。然而，它可以很容易地扩展到适用于其他社区，如政府、商业和其他类型的交易所。它是一个可信的第三方软件基础设施，使独立实体能够进行交互并进行多种形式的协商。

引用次数: 19

Utility of an OAI service provider search portal OAI服务提供者搜索门户的实用程序

2003 Joint Conference on Digital Libraries, 2003. Proceedings.

Pub Date : 2003-05-27 DOI: 10.1109/JCDL.2003.1204880

Sarah L. Shreeves, Christine M. Kirkham, J. Kaczmarek, Timothy W. Cole

The Open Archives Initiative (OAI) Protocol for Metadata Harvesting (PMH) facilitates efficient interoperability between digital collections, in particular by enabling service providers to construct, with relatively modest effort, search portals that present aggregated metadata to specific communities. We describe the experiences of the University of Illinois at Urbana-Champaign Library as an OAI service provider. We discuss the creation of a search portal to an aggregation of metadata describing cultural heritage resources. We examine several key challenges posed by the aggregated metadata and present preliminary findings of a pilot study of the utility of the portal for a specific community (student teachers). We also comment briefly on the potential for using text analysis tools to uncover themes and relationships within the aggregated metadata.

开放档案倡议(OAI)元数据收集协议(PMH)促进了数字馆藏之间的有效互操作性，特别是通过使服务提供商能够以相对适度的努力构建搜索门户，将聚合的元数据呈现给特定社区。我们描述了伊利诺伊大学厄巴纳-香槟分校图书馆作为OAI服务提供者的经验。我们将讨论创建一个搜索门户，以聚合描述文化遗产资源的元数据。我们研究了聚合元数据带来的几个关键挑战，并介绍了针对特定社区(学生教师)的门户实用性试点研究的初步结果。我们还简要介绍了使用文本分析工具在聚合元数据中发现主题和关系的可能性。

引用次数: 4

Assembling and enriching digital library collections 整合和丰富数字图书馆馆藏

2003 Joint Conference on Digital Libraries, 2003. Proceedings.

Pub Date : 2003-05-27 DOI: 10.1109/JCDL.2003.1204885

D. Bainbridge, John Thompson, I. Witten

People who create digital libraries need to gather together the raw material, add metadata as necessary, and design and build new collections. We set out the requirements for these tasks and describe a new tool that supports them interactively, making it easy for users to create their own collections from electronic files of all types. The process involves selecting documents for inclusion, coming up with a suitable metadata set, assigning metadata to each document or group of documents, designing the form of the collection in terms of document formats, searchable indexes, and browsing facilities, building the necessary indexes and data structures, and putting the collection in place for others to use. Moreover, different situations require different workflows, and the system must be flexible enough to cope with these demands. Although the tool is specific to the Greenstone digital library software, the underlying ideas should prove useful in more general contexts.

创建数字图书馆的人需要收集原始资料，必要时添加元数据，并设计和构建新的馆藏。我们列出了这些任务的要求，并描述了一个支持这些任务的交互式新工具，使用户可以轻松地从所有类型的电子文件中创建自己的集合。这个过程包括选择要包含的文档，提出合适的元数据集，为每个文档或文档组分配元数据，根据文档格式、可搜索索引和浏览工具设计集合的形式，构建必要的索引和数据结构，并将集合放置在适当的位置以供其他人使用。此外，不同的情况需要不同的工作流，系统必须足够灵活以应对这些需求。虽然这个工具是专门针对Greenstone数字图书馆软件的，但其潜在的思想应该在更普遍的情况下证明是有用的。

引用次数: 55

CephSchool: a pedagogic portal for teaching biological principles with cephalopod molluscs 头足类软体动物生物学原理教学门户网站

2003 Joint Conference on Digital Libraries, 2003. Proceedings.

Pub Date : 2003-05-27 DOI: 10.1109/JCDL.2003.1204920

J. Wood, Caitlin M. H. Shaw

CephSchool is based on CephBase and takes the information present in CephBase's digital libraries and redirects it towards students and teachers. CephSchool is organized into eight arms and contains information about cephalopods, discussion topics, teacher support, and student assessment techniques. These provide an accurate and inquiry base-learning environment for students to learn basic biological concepts using cephalopods as the subject organism by giving them a dynamic Web page that is updated, as new information is made available.

CephSchool是基于CephBase的，它采用CephBase数字图书馆中的信息，并将其重新定向给学生和教师。cepphschool分为八个部分，包含有关头足类动物、讨论主题、教师支持和学生评估技术的信息。这些网站为学生提供了一个准确的、探究性的学习环境，让他们以头足类动物为主题，学习基本的生物学概念，并为他们提供了一个动态的网页，随着新信息的出现而更新。

引用次数: 0

Learning digital library technology across borders 跨国界学习数字图书馆技术

2003 Joint Conference on Digital Libraries, 2003. Proceedings.

Pub Date : 2003-05-27 DOI: 10.1109/JCDL.2003.1204859

S. Southwick, R. Southwick

We describe the background context and initial findings from an ongoing case study of an electronic theses and dissertations (ETD) digital library (DL) project in Brazil. The specific focus of the case study centers on the activities of a Brazilian government agency acting as a mediator between software developers - primarily academic institutions in the United States-and university clients in Brazil. We highlight the loosely integrated nature of the DL technology, and the uncertain relationship between developers and users in terms of support. These circumstances reinforce a view of technology transfer as a process of organizational learning. As a consequence, the mediating institution in the study is viewed as assuming multiple roles in advancing the project.

我们描述了巴西电子论文和学位论文(ETD)数字图书馆(DL)项目正在进行的案例研究的背景和初步发现。案例研究的具体焦点集中在巴西政府机构作为软件开发人员(主要是美国的学术机构)和巴西的大学客户之间的调解人的活动上。我们强调了深度学习技术的松散集成性质，以及开发人员和用户之间在支持方面的不确定关系。这些情况加强了技术转让是一个组织学习过程的观点。因此，研究中的中介机构被视为在推进项目中承担多重角色。

引用次数: 5

Correcting broken characters in the recognition of historical printed documents 历史印刷文献识别中的断字纠错

2003 Joint Conference on Digital Libraries, 2003. Proceedings.

Pub Date : 2003-05-27 DOI: 10.1109/JCDL.2003.1204889

M. Droettboom

We present a new technique for dealing with broken characters, one of the major challenges in the optical character recognition (OCR) of degraded historical printed documents. A technique based on graph combinatorics is used to rejoin the appropriate connected components. It has been applied to real data with successful results.

本文提出了一种处理破碎字符的新技术，这是退化历史印刷文献光学字符识别(OCR)的主要挑战之一。使用基于图组合的技术来重新连接适当的连接组件。该方法已应用于实际数据，取得了良好的效果。

引用次数: 47

Protein association discovery in biomedical literature 生物医学文献中蛋白质关联的发现

2003 Joint Conference on Digital Libraries, 2003. Proceedings.

Pub Date : 2003-05-27 DOI: 10.1109/JCDL.2003.1204848

Yueyu Fu, Javed Mostafa, Kazuhiro Seki

Protein association discovery can directly contribute toward developing protein pathways; hence it is a significant problem in bioinformatics. LUCAS (Library of User-Oriented Concepts for Access Services) was designed to automatically extract and determine associations among proteins from biomedical literature. Such a tool has notable potential to automate database construction in biomedicine, instead of relying on experts' analysis. We report on the mechanisms for automatically generating clusters of proteins. A formal evaluation of the system, based on a subset of 2000 MEDLINE titles and abstracts, has been conducted against Swiss-Prot database in which the associations among concepts are entered by experts manually.

蛋白质关联的发现可以直接促进蛋白质通路的发展;因此，它是生物信息学中的一个重要问题。LUCAS(面向用户的访问服务概念库)设计用于从生物医学文献中自动提取和确定蛋白质之间的关联。该工具具有显著的潜力，可以实现生物医学数据库的自动化建设，而不是依赖于专家的分析。我们报告了自动生成蛋白质簇的机制。基于2000个MEDLINE标题和摘要的子集，对该系统进行了正式评估，并对Swiss-Prot数据库进行了评估，其中概念之间的关联由专家手动输入。

引用次数: 10

An XQuery engine for digital library systems 用于数字图书馆系统的XQuery引擎

2003 Joint Conference on Digital Libraries, 2003. Proceedings.

Pub Date : 2003-05-27 DOI: 10.1109/JCDL.2003.1204919

Ji-Hoon Kang, Chul-Soo Kim, Eun-Jeong Ko

A standard query language is very helpful for interoperability among digital library systems over the Internet. We propose an XQuery engine that can be used as an XQuery processing module in a digital library system that supports XML documents. We assume generic digital library system architecture. It consists of four modules: a user interface, an XQuery engine, an information retrieval engine, and an XML repository. The XQuery engine parses an input XQuery and constructs a syntax tree for the query.

标准的查询语言对Internet上数字图书馆系统之间的互操作性非常有帮助。我们提出了一个可以在支持XML文档的数字图书馆系统中用作XQuery处理模块的XQuery引擎。我们假设通用的数字图书馆系统架构。它由四个模块组成:用户界面、XQuery引擎、信息检索引擎和XML存储库。XQuery引擎解析输入的XQuery并为该查询构造语法树。

引用次数: 17

Automatic document metadata extraction using support vector machines 使用支持向量机自动提取文档元数据

2003 Joint Conference on Digital Libraries, 2003. Proceedings.

Pub Date : 2003-05-27 DOI: 10.1109/JCDL.2003.1204842

Hui Han, C. Lee Giles, Eren Manavoglu, H. Zha, Zhenyue Zhang, E. Fox

Automatic metadata generation provides scalability and usability for digital libraries and their collections. Machine learning methods offer robust and adaptable automatic metadata extraction. We describe a support vector machine classification-based method for metadata extraction from header part of research papers and show that it outperforms other machine learning methods on the same task. The method first classifies each line of the header into one or more of 15 classes. An iterative convergence procedure is then used to improve the line classification by using the predicted class labels of its neighbor lines in the previous round. Further metadata extraction is done by seeking the best chunk boundaries of each line. We found that discovery and use of the structural patterns of the data and domain based word clustering can improve the metadata extraction performance. An appropriate feature normalization also greatly improves the classification performance. Our metadata extraction method was originally designed to improve the metadata extraction quality of the digital libraries Citeseer [S. Lawrence et al., (1999)] and EbizSearch [Y. Petinot et al., (2003)]. We believe it can be generalized to other digital libraries.

自动生成元数据为数字图书馆及其馆藏提供了可伸缩性和可用性。机器学习方法提供鲁棒性和适应性强的自动元数据提取。我们描述了一种基于支持向量机分类的方法，用于从研究论文的标题部分提取元数据，并表明它在相同的任务上优于其他机器学习方法。该方法首先将标题的每行分类为15个类中的一个或多个。然后使用迭代收敛过程，利用前一轮预测的相邻线的类别标签来改进线的分类。进一步的元数据提取是通过寻找每行的最佳块边界来完成的。我们发现发现和使用数据的结构模式和基于领域的词聚类可以提高元数据提取的性能。适当的特征归一化也可以大大提高分类性能。我们的元数据提取方法最初是为了提高数字图书馆Citeseer [S]的元数据提取质量而设计的。劳伦斯等人，(1999)]和EbizSearch [j]。Petinot et al.，(2003)]。我们相信它可以推广到其他数字图书馆。

{"title":"Automatic document metadata extraction using support vector machines","authors":"Hui Han, C. Lee Giles, Eren Manavoglu, H. Zha, Zhenyue Zhang, E. Fox","doi":"10.1109/JCDL.2003.1204842","DOIUrl":"https://doi.org/10.1109/JCDL.2003.1204842","url":null,"abstract":"Automatic metadata generation provides scalability and usability for digital libraries and their collections. Machine learning methods offer robust and adaptable automatic metadata extraction. We describe a support vector machine classification-based method for metadata extraction from header part of research papers and show that it outperforms other machine learning methods on the same task. The method first classifies each line of the header into one or more of 15 classes. An iterative convergence procedure is then used to improve the line classification by using the predicted class labels of its neighbor lines in the previous round. Further metadata extraction is done by seeking the best chunk boundaries of each line. We found that discovery and use of the structural patterns of the data and domain based word clustering can improve the metadata extraction performance. An appropriate feature normalization also greatly improves the classification performance. Our metadata extraction method was originally designed to improve the metadata extraction quality of the digital libraries Citeseer [S. Lawrence et al., (1999)] and EbizSearch [Y. Petinot et al., (2003)]. We believe it can be generalized to other digital libraries.","PeriodicalId":248854,"journal":{"name":"2003 Joint Conference on Digital Libraries, 2003. Proceedings.","volume":"125 4","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120853091","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 339

How to turn the page [digital libraries] 如何翻开新的一页[数码图书馆]

2003 Joint Conference on Digital Libraries, 2003. Proceedings.

Pub Date : 2003-05-27 DOI: 10.1109/JCDL.2003.1204862

Yi-Chun Chu, I. Witten, R. Lobb, D. Bainbridge

Can digital libraries provide a reading experience that more closely resembles a real book than a scrolled or paginated electronic display? We describe a prototype page turning system that realistically animates full three-dimensional page-turns. The dynamic behavior is generated by a mass-spring model defined on a rectangular grid of particles. The prototype takes a PDF or e-book file, renders it into a sequence of PNG images representing individual pages, and animates the page-turns under user control. The simulation behaves fairly naturally, although more computer graphics work is required to perfect it.

数字图书馆能否提供一种更接近真实书籍的阅读体验，而不是滚动或分页的电子显示屏?我们描述了一个原型翻页系统，实际动画全三维翻页。动力学行为由定义在矩形粒子网格上的质量-弹簧模型生成。原型采用PDF或电子书文件，将其呈现为代表各个页面的PNG图像序列，并在用户控制下使翻页动画化。虽然需要更多的计算机图形工作来完善它，但模拟表现得相当自然。

引用次数: 22

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

2003 Joint Conference on Digital Libraries, 2003. Proceedings.

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀