2016 IEEE/ACM Joint Conference on Digital Libraries (JCDL)最新文献

英文中文

Joint workshop on bibliometric-enhanced information retrieval and natural language processing for digital libraries (BIRNDL 2016) 数字图书馆文献计量学增强信息检索和自然语言处理联合研讨会(BIRNDL 2016)

2016 IEEE/ACM Joint Conference on Digital Libraries (JCDL)

Pub Date : 2017-06-08 DOI: 10.1145/3077136.3084370

Muthu Kumar Chandrasekaran, Kokil Jaidka, Philipp Mayr

The large scale of scholarly publications poses a challenge for scholars in information-seeking and sensemaking. Bibliometric, information retrieval (IR), text mining and NLP techniques could help in these activities, but are not yet widely used in digital libraries. This workshop is intended to stimulate IR researchers and digital library professionals to elaborate on new approaches in natural language processing, information retrieval, scientometric and recommendation techniques which can advance the state-of-the-art in scholarly document understanding, analysis and retrieval at scale.

学术出版物的规模之大，对学者的信息获取和意义建构提出了挑战。文献计量学、信息检索(IR)、文本挖掘和自然语言处理技术可以帮助这些活动，但尚未广泛应用于数字图书馆。本次研讨会旨在激发IR研究人员和数字图书馆专业人员详细阐述自然语言处理、信息检索、科学计量学和推荐技术方面的新方法，这些技术可以推动学术文献理解、分析和大规模检索的最新技术。

引用次数: 10

Desiderata for exploratory search interfaces to Web archives in support of scholarly activities 需要为支持学术活动的Web档案提供探索性搜索接口

2016 IEEE/ACM Joint Conference on Digital Libraries (JCDL)

Pub Date : 2016-06-19 DOI: 10.1145/2910896.2910912

Andrew N. Jackson, Jimmy J. Lin, Ian Milligan, Nick Ruest

Web archiving initiatives around the world capture ephemeral web content to preserve our collective digital memory. In this paper, we describe initial experiences in providing an exploratory search interface to web archives for humanities scholars and social scientists. We describe our initial implementation and discuss our findings in terms of desiderata for such a system. It is clear that the standard organization of a search engine results page (SERP), consisting of an ordered list of hits, is inadequate to support the needs of scholars. Shneiderman's mantra for visual information seeking (“overview first, zoom and filter, then details-on-demand”) provides a nice organizing principle for interface design, to which we propose an addendum: “Make everything transparent”. We elaborate on this by highlighting the importance of the temporal dimension of web pages as well as issues surrounding metadata and veracity.

世界各地的网络存档计划捕获短暂的网络内容，以保存我们的集体数字记忆。在本文中，我们描述了为人文学者和社会科学家提供网络档案探索性搜索界面的初步经验。我们描述了我们的初步实施，并讨论了我们的发现，对这样一个系统的期望。很明显，搜索引擎结果页面(SERP)的标准组织，由一个有序的点击列表组成，不足以支持学者的需求。Shneiderman关于视觉信息搜索的口头禅(“先概述，放大和过滤，然后按需细节”)为界面设计提供了一个很好的组织原则，对此我们提出一个补充:“让一切都透明”。我们通过强调网页时间维度的重要性以及围绕元数据和准确性的问题来详细阐述这一点。

引用次数: 29

Open datasets for evaluating the interpretation of bibliographic records 用于评估书目记录解释的开放数据集

2016 IEEE/ACM Joint Conference on Digital Libraries (JCDL)

Pub Date : 2016-06-19 DOI: 10.1145/2910896.2925457

Joffrey Decourselle, F. Duchateau, Trond Aalberg, Naimdjon Takhirov, Nicolas Lumineau

The transformation of legacy MARC catalogs to FRBR catalogs (FRBRization) is a complex and important challenge for libraries. Although many FRBRization tools have provided experimental validation, it is difficult to evaluate and compare these systems on a fair basis due to a lack of common datasets. This poster presents two public datasets (T42 and BIB-RCAT) intended to support the validation of the FRBRization process.

将传统MARC编目转换为FRBR编目(FRBRization)是图书馆面临的一项复杂而重要的挑战。尽管许多FRBRization工具已经提供了实验验证，但由于缺乏通用数据集，很难在公平的基础上评估和比较这些系统。这张海报展示了两个公共数据集(T42和BIB-RCAT)，旨在支持FRBRization过程的验证。

引用次数: 4

BIBSURF — Discover bibliographic entities by searching for units of interest, ranking and filtering BIBSURF -发现书目实体通过搜索感兴趣的单位，排名和过滤

2016 IEEE/ACM Joint Conference on Digital Libraries (JCDL)

Pub Date : 2016-06-19 DOI: 10.1145/2910896.2925434

Trond Aalberg, Tanja Mercun, M. Zumer

BIBSURF is a system demonstrating search, ranking and filtering of bibliographic RDF data that is organized in form of entities representing intellectual endeavor at different levels of abstraction: item, manifestation, expression, work.

BIBSURF是一个展示书目RDF数据的搜索、排序和过滤的系统，这些数据以实体的形式组织起来，表示在不同抽象层次上的智力努力:条目、表现、表达、工作。

引用次数: 3

Visualizing published metadata in large aggregations 可视化大型聚合中发布的元数据

2016 IEEE/ACM Joint Conference on Digital Libraries (JCDL)

Pub Date : 2016-06-19 DOI: 10.1145/2910896.2925451

Unmil Karadkar, Geoffrey A. Potter, Shengwei Wang

Large metadata aggregations provide access to documents held by multiple cultural heritage (CH) institutions. As CH institutions encode their metadata using different schemas and follow different data standards, aggregators must process the received data before making it available through a unified portal. Staff members at the contributing CH institutions don't receive feedback regarding the quality of the provided or the processed data. We are developing mechanisms that enable staff at the CH institutions to understand the effectiveness of their metadata with a goal of improving the visibility of their items in these large portals such as the Digital Public Library of America. This poster will present a classification of the DPLA metadata application profile highlighting compliance levels as well as a visualization framework for presenting the compliance of an institution's data with the DPLA data model.

大型元数据聚合提供了对多个文化遗产(CH)机构持有的文件的访问。由于CH机构使用不同的模式对其元数据进行编码，并遵循不同的数据标准，因此聚合器必须先处理接收到的数据，然后才能通过统一门户提供。提供数据的卫生保健机构的工作人员没有收到关于所提供或处理数据质量的反馈。我们正在开发机制，使卫生保健机构的工作人员能够了解他们的元数据的有效性，目标是提高他们的项目在这些大型门户网站(如美国数字公共图书馆)上的可见性。这张海报将介绍DPLA元数据应用程序概要的分类，突出符合级别，以及一个可视化框架，用于显示机构数据与DPLA数据模型的遵从性。

引用次数: 0

Using co-authorship networks for author name disambiguation 使用合作作者网络消除作者姓名歧义

2016 IEEE/ACM Joint Conference on Digital Libraries (JCDL)

Pub Date : 2016-06-19 DOI: 10.1145/2910896.2925461

Fakhri Momeni, Philipp Mayr

With the increasing size of digital libraries (DLs) it has become a challenge to identify author names correctly and assign publications to them. The situation becomes more critical when different persons share the same name (homonym problem) or when the names of authors are presented in several different ways (synonym problem). This paper focuses on homonym names in the computer science bibliography DBLP. The goal of this study is to implement and evaluate a method which uses co-authorship networks in order to disambiguate homonym names, especially common names. The results show that the implemented method has a good performance and can be used for author name disambiguation of sparse bibliographic records.

随着数字图书馆规模的不断扩大，正确识别作者姓名并为其分配出版物已成为一个挑战。当不同的人使用相同的名字(同音问题)或当作者的名字以不同的方式呈现(同义问题)时，情况变得更加严重。本文主要研究计算机科学书目DBLP中的同音名问题。本研究的目的是实现和评估一种使用合作作者网络来消除同音名歧义的方法，特别是常见的名字。结果表明，所实现的方法具有良好的性能，可用于稀疏书目记录的作者姓名消歧。

引用次数: 14

Enhancing scholarly use of digital libraries: A comparative survey and review of bibliographic metadata ontologies 加强数字图书馆的学术应用:书目元数据本体的比较调查与回顾

2016 IEEE/ACM Joint Conference on Digital Libraries (JCDL)

Pub Date : 2016-06-19 DOI: 10.1145/2910896.2910903

Jacob Jett, Terhi Nurmikko-Fuller, Timothy W. Cole, Kevin R. Page, J. S. Downie

The HathiTrust Research Center (HTRC) is engaged in the development of tools that will give scholars the ability to analyze the HathiTrust digital library's 14 million volume corpus. A cornerstone of the HTRC's digital infrastructure is the workset - a kind of scholar-built research collection intended for use with the HTRC's analytics platform. Because more than 66% of the digital corpus is subject to copyright restrictions, scholarly users remain dependent upon the descriptive accounts provided by traditional metadata records in order to identify and gather together bibliographic resources for analysis. This paper compares the MADSRDF/MODSRDF, Bibframe, schema.org, BIBO, and FaBiO ontologies by assessing their suitability for employment by the HTRC to meet scholars' needs. These include distinguishing among multiple versions of the same work; representing the complex historical and physical relationships among those versions; and identifying and providing access to finer grained bibliographic entities, e.g., poems, chapters, sections, and even smaller segments of content.

HathiTrust研究中心(HTRC)致力于开发工具，使学者能够分析HathiTrust数字图书馆的1400万卷语料库。HTRC数字基础设施的基石是工作集，这是一种学者构建的研究集合，旨在与HTRC的分析平台一起使用。由于超过66%的数字语料库受到版权限制，学术用户仍然依赖传统元数据记录提供的描述性描述，以便识别和收集书目资源进行分析。本文比较了MADSRDF/MODSRDF、Bibframe、schema.org、BIBO和FaBiO本体，评估了它们是否适合HTRC使用，以满足学者的需求。这包括区分同一作品的多个版本;代表了这些版本之间复杂的历史和物理关系;识别并提供对更细粒度的书目实体的访问，例如诗歌、章节、章节，甚至更小的内容片段。

{"title":"Enhancing scholarly use of digital libraries: A comparative survey and review of bibliographic metadata ontologies","authors":"Jacob Jett, Terhi Nurmikko-Fuller, Timothy W. Cole, Kevin R. Page, J. S. Downie","doi":"10.1145/2910896.2910903","DOIUrl":"https://doi.org/10.1145/2910896.2910903","url":null,"abstract":"The HathiTrust Research Center (HTRC) is engaged in the development of tools that will give scholars the ability to analyze the HathiTrust digital library's 14 million volume corpus. A cornerstone of the HTRC's digital infrastructure is the workset - a kind of scholar-built research collection intended for use with the HTRC's analytics platform. Because more than 66% of the digital corpus is subject to copyright restrictions, scholarly users remain dependent upon the descriptive accounts provided by traditional metadata records in order to identify and gather together bibliographic resources for analysis. This paper compares the MADSRDF/MODSRDF, Bibframe, schema.org, BIBO, and FaBiO ontologies by assessing their suitability for employment by the HTRC to meet scholars' needs. These include distinguishing among multiple versions of the same work; representing the complex historical and physical relationships among those versions; and identifying and providing access to finer grained bibliographic entities, e.g., poems, chapters, sections, and even smaller segments of content.","PeriodicalId":109613,"journal":{"name":"2016 IEEE/ACM Joint Conference on Digital Libraries (JCDL)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-06-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116941128","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 15

Future digital libraries: Research and responsibilities 未来的数字图书馆:研究与责任

2016 IEEE/ACM Joint Conference on Digital Libraries (JCDL)

Pub Date : 2016-06-19 DOI: 10.1145/2910896.2926740

M. Zemankova

Summary form only given. In October 1991 the National Science Foundation (NSF) sponsored a workshop to examine the role of the Information Retrieval research community in the emerging environment of Internet, high performance text processing capabilities and ever-increasing volumes of digitized documents. Ed Fox, Michael Lesk and Michael McGill drafted a White Paper, calling for a National Electronic Science, Engineering, and Technology Library. The term “Digital Library” was adopted and for follow-up workshops with the goal to identify research directions, leading to National Science Foundation (NSF)/Defense Advanced Research Projects Agency (DARPA)/National Aeronautics and Space Administration (NASA) Research in Digital Libraries Initiative announced in late 1993. Now, in 2016, 25 years after the first workshop, 15 years after the Joint Conference on Digital Libraries has been established, and many initiatives and developments around the world, what is the state of Digital Libraries? What items should be in digital libraries, who should their custodians, how can the items be organized to support knowledge discovery, how can the contents be safeguarded and preserved? Ebla, Syria (2500 B.C.-2250 B.C.) constitutes the oldest organized library of tables yet discovered. What will the archeologists discover in year 4400 about the world, politics, economies, technologies, science, climate, species, health, food, culture, art, entertainment and everyday life through the ages? The talk will examine what we can do to support innovative research and design and implementation of lasting, informative Digital Libraries that will promote global goals of knowledge discovery and international understanding and personal needs to organize and selectively share important facts, creations, and memories.

只提供摘要形式。1991年10月，美国国家科学基金会(NSF)赞助了一个研讨会，探讨信息检索研究界在互联网、高性能文本处理能力和不断增长的数字化文档的新兴环境中的作用。Ed Fox, Michael Lesk和Michael McGill起草了一份白皮书，呼吁建立一个国家电子科学、工程和技术图书馆。术语“数字图书馆”被采用，并为后续研讨会的目标是确定研究方向，导致美国国家科学基金会(NSF)/国防高级研究计划局(DARPA)/美国国家航空航天局(NASA)在1993年底宣布的数字图书馆倡议研究。现在，2016年，在第一次研讨会召开25年后，在数字图书馆联合会议成立15年后，在世界各地有许多倡议和发展，数字图书馆的状况如何?数字图书馆中应该有哪些项目?谁应该保管这些项目?如何组织这些项目以支持知识发现?如何保护和保存内容?叙利亚的埃布拉(公元前2500年-公元前2250年)构成了迄今为止发现的最古老的有组织的表格图书馆。在公元4400年，考古学家们将会在世界、政治、经济、技术、科学、气候、物种、健康、食物、文化、艺术、娱乐和日常生活中发现什么?这次演讲将探讨我们可以做些什么来支持创新研究、设计和实施持久的、信息丰富的数字图书馆，这些图书馆将促进知识发现的全球目标、国际理解和个人需要，以组织和有选择地分享重要的事实、创作和记忆。

{"title":"Future digital libraries: Research and responsibilities","authors":"M. Zemankova","doi":"10.1145/2910896.2926740","DOIUrl":"https://doi.org/10.1145/2910896.2926740","url":null,"abstract":"Summary form only given. In October 1991 the National Science Foundation (NSF) sponsored a workshop to examine the role of the Information Retrieval research community in the emerging environment of Internet, high performance text processing capabilities and ever-increasing volumes of digitized documents. Ed Fox, Michael Lesk and Michael McGill drafted a White Paper, calling for a National Electronic Science, Engineering, and Technology Library. The term “Digital Library” was adopted and for follow-up workshops with the goal to identify research directions, leading to National Science Foundation (NSF)/Defense Advanced Research Projects Agency (DARPA)/National Aeronautics and Space Administration (NASA) Research in Digital Libraries Initiative announced in late 1993. Now, in 2016, 25 years after the first workshop, 15 years after the Joint Conference on Digital Libraries has been established, and many initiatives and developments around the world, what is the state of Digital Libraries? What items should be in digital libraries, who should their custodians, how can the items be organized to support knowledge discovery, how can the contents be safeguarded and preserved? Ebla, Syria (2500 B.C.-2250 B.C.) constitutes the oldest organized library of tables yet discovered. What will the archeologists discover in year 4400 about the world, politics, economies, technologies, science, climate, species, health, food, culture, art, entertainment and everyday life through the ages? The talk will examine what we can do to support innovative research and design and implementation of lasting, informative Digital Libraries that will promote global goals of knowledge discovery and international understanding and personal needs to organize and selectively share important facts, creations, and memories.","PeriodicalId":109613,"journal":{"name":"2016 IEEE/ACM Joint Conference on Digital Libraries (JCDL)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-06-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117234665","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

Towards better understanding of academic search 更好地理解学术搜索

2016 IEEE/ACM Joint Conference on Digital Libraries (JCDL)

Pub Date : 2016-06-19 DOI: 10.1145/2910896.2910922

Madian Khabsa, Zhaohui Wu, C. Lee Giles

Academics have relied heavily on search engines to identify and locate research manuscripts that are related to their research areas. Many of the early information retrieval systems and technologies were developed while catering for librarians to help them sift through books and proceedings, followed by recent online academic search engines such as Google Scholar and Microsoft Academic Search. In spite of their popularity among academics and importance to academia, the usage, query behaviors, and retrieval models for academic search engines have not been well studied. To this end, we study the distribution of queries that are received by an academic search engine. Furthermore, we delve deeper into academic search queries and classify them into navigational and informational queries. This work introduces a definition for navigational queries in academic search engines under which a query is considered navigational if the user is searching for a specific paper or document. We describe multiple facets of navigational academic queries, and introduce a machine learning approach with a set of features to identify such queries.

学者们严重依赖搜索引擎来识别和定位与他们的研究领域相关的研究手稿。许多早期的信息检索系统和技术都是为了满足图书馆员的需求而开发的，以帮助他们筛选书籍和会议记录，其次是最近的在线学术搜索引擎，如谷歌学术和微软学术搜索。尽管学术搜索引擎受到学术界的广泛欢迎和重视，但学术界对其使用、查询行为和检索模型的研究还不够深入。为此，我们研究了学术搜索引擎收到的查询的分布。此外，我们更深入地研究了学术搜索查询，并将其分为导航查询和信息查询。这项工作介绍了学术搜索引擎中导航查询的定义，在该定义下，如果用户正在搜索特定的论文或文档，则查询被认为是导航的。我们描述了导航学术查询的多个方面，并引入了一种带有一组特征的机器学习方法来识别此类查询。

引用次数: 25

Mining advisor-advisee relationships in scholarly big data: A deep learning approach 在学术大数据中挖掘顾问与被顾问的关系:一种深度学习方法

2016 IEEE/ACM Joint Conference on Digital Libraries (JCDL)

Pub Date : 2016-06-19 DOI: 10.1145/2910896.2925435

Wei Wang, Jiaying Liu, Shuo Yu, Chenxin Zhang, Zhenzhen Xu, Feng Xia

Mining advisor-advisee relationships can benefit many interesting applications such as advisor recommendation and protege performance analysis. Based on the hypothesis that, advisor-advisee relationships among researchers are hidden in scholarly big data, we propose in this work a deep learning based advisor-advisee relationship identification method which considers the personal properties and network characteristics with a stacked autoencoder model. To the best of our knowledge, this is the first time that a deep learning model is utilized to represent coauthor network features for relationships identification. Moreover, experiments demonstrate that the proposed method has better performance compared with other state-of-the-art methods.

挖掘顾问-被顾问关系可以使许多有趣的应用程序受益，例如顾问推荐和protege性能分析。基于学术大数据中隐含科研人员导师关系的假设，本文提出了一种基于深度学习的导师关系识别方法，该方法考虑了个人属性和网络特征，采用堆叠自编码器模型。据我们所知，这是第一次使用深度学习模型来表示共同作者网络特征以进行关系识别。实验结果表明，该方法与现有方法相比具有更好的性能。

引用次数: 14

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

2016 IEEE/ACM Joint Conference on Digital Libraries (JCDL)

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀