首页 > 最新文献

International Journal on Digital Libraries最新文献

英文 中文
A discovery system for narrative query graphs: entity-interaction-aware document retrieval. 叙述性查询图的发现系统:实体交互感知文档检索。
IF 1.5 Q1 Social Sciences Pub Date : 2023-04-24 DOI: 10.1007/s00799-023-00356-3
Hermann Kroll, Jan Pirklbauer, Jan-Christoph Kalo, Morris Kunz, Johannes Ruthmann, Wolf-Tilo Balke

Finding relevant publications in the scientific domain can be quite tedious: Accessing large-scale document collections often means to formulate an initial keyword-based query followed by many refinements to retrieve a sufficiently complete, yet manageable set of documents to satisfy one's information need. Since keyword-based search limits researchers to formulating their information needs as a set of unconnected keywords, retrieval systems try to guess each user's intent. In contrast, distilling short narratives of the searchers' information needs into simple, yet precise entity-interaction graph patterns provides all information needed for a precise search. As an additional benefit, such graph patterns may also feature variable nodes to flexibly allow for different substitutions of entities taking a specified role. An evaluation over the PubMed document collection quantifies the gains in precision for our novel entity-interaction-aware search. Moreover, we perform expert interviews and a questionnaire to verify the usefulness of our system in practice. This paper extends our previous work by giving a comprehensive overview about the discovery system to realize narrative query graph retrieval.

在科学领域寻找相关出版物可能相当乏味:访问大型文档集通常意味着制定一个基于关键字的初始查询,然后进行许多改进,以检索一组足够完整但可管理的文档,以满足信息需求。由于基于关键字的搜索限制了研究人员将他们的信息需求表述为一组不相连的关键字,检索系统试图猜测每个用户的意图。相反,将搜索者信息需求的简短叙述提炼成简单而精确的实体交互图模式,可以提供精确搜索所需的所有信息。作为额外的好处,这样的图模式还可以以可变节点为特征,以灵活地允许对承担特定角色的实体进行不同的替换。对PubMed文档集的评估量化了我们新的实体交互感知搜索的精度增益。此外,我们还进行了专家访谈和问卷调查,以验证我们的系统在实践中的有用性。本文对实现叙述性查询图检索的发现系统进行了全面的概述,从而扩展了我们以前的工作。
{"title":"A discovery system for narrative query graphs: entity-interaction-aware document retrieval.","authors":"Hermann Kroll,&nbsp;Jan Pirklbauer,&nbsp;Jan-Christoph Kalo,&nbsp;Morris Kunz,&nbsp;Johannes Ruthmann,&nbsp;Wolf-Tilo Balke","doi":"10.1007/s00799-023-00356-3","DOIUrl":"10.1007/s00799-023-00356-3","url":null,"abstract":"<p><p>Finding relevant publications in the scientific domain can be quite tedious: Accessing large-scale document collections often means to formulate an initial keyword-based query followed by many refinements to retrieve a <i>sufficiently complete, yet manageable</i> set of documents to satisfy one's information need. Since keyword-based search limits researchers to formulating their information needs as a set of unconnected keywords, retrieval systems try to guess each user's intent. In contrast, distilling short narratives of the searchers' information needs into simple, yet precise entity-interaction graph patterns provides all information needed for a precise search. As an additional benefit, such graph patterns may also feature variable nodes to flexibly allow for different substitutions of entities taking a specified role. An evaluation over the PubMed document collection quantifies the gains in precision for our novel entity-interaction-aware search. Moreover, we perform expert interviews and a questionnaire to verify the usefulness of our system in practice. This paper extends our previous work by giving a comprehensive overview about the discovery system to realize narrative query graph retrieval.</p>","PeriodicalId":44974,"journal":{"name":"International Journal on Digital Libraries","volume":null,"pages":null},"PeriodicalIF":1.5,"publicationDate":"2023-04-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10123011/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10092914","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Approximate nearest neighbor for long document relationship labeling in digital libraries 数字图书馆长文档关系标注的近似最近邻算法
IF 1.5 Q1 Social Sciences Pub Date : 2023-04-15 DOI: 10.1007/s00799-023-00354-5
Peter Organisciak, Benjamin M. Schmidt, M. Durward
{"title":"Approximate nearest neighbor for long document relationship labeling in digital libraries","authors":"Peter Organisciak, Benjamin M. Schmidt, M. Durward","doi":"10.1007/s00799-023-00354-5","DOIUrl":"https://doi.org/10.1007/s00799-023-00354-5","url":null,"abstract":"","PeriodicalId":44974,"journal":{"name":"International Journal on Digital Libraries","volume":null,"pages":null},"PeriodicalIF":1.5,"publicationDate":"2023-04-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77031104","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Referencing behaviours across disciplines: publication types and common metadata for defining bibliographic references 跨学科引用行为:用于定义书目引用的出版物类型和通用元数据
IF 1.5 Q1 Social Sciences Pub Date : 2023-03-27 DOI: 10.1007/s00799-023-00351-8
Erika Alves dos Santos, S. Peroni, M. L. Mucheroni
{"title":"Referencing behaviours across disciplines: publication types and common metadata for defining bibliographic references","authors":"Erika Alves dos Santos, S. Peroni, M. L. Mucheroni","doi":"10.1007/s00799-023-00351-8","DOIUrl":"https://doi.org/10.1007/s00799-023-00351-8","url":null,"abstract":"","PeriodicalId":44974,"journal":{"name":"International Journal on Digital Libraries","volume":null,"pages":null},"PeriodicalIF":1.5,"publicationDate":"2023-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80516342","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Scientific document processing: challenges for modern learning methods. 科学文档处理:现代学习方法面临的挑战。
IF 1.5 Q1 Social Sciences Pub Date : 2023-03-24 DOI: 10.1007/s00799-023-00352-7
Abhinav Ramesh Kashyap, Yajing Yang, Min-Yen Kan

Neural network models enjoy success on language tasks related to Web documents, including news and Wikipedia articles. However, the characteristics of scientific publications pose specific challenges that have yet to be satisfactorily addressed: the discourse structure of scientific documents crucial in scholarly document processing (SDP) tasks, the interconnected nature of scientific documents, and their multimodal nature. We survey modern neural network learning methods that tackle these challenges: those that can model discourse structure and their interconnectivity and use their multimodal nature. We also highlight efforts to collect large-scale datasets and tools developed to enable effective deep learning deployment for SDP. We conclude with a discussion on upcoming trends and recommend future directions for pursuing neural natural language processing approaches for SDP.

神经网络模型在与网络文档相关的语言任务上取得了成功,包括新闻和维基百科文章。然而,科学出版物的特点带来了尚未令人满意地解决的具体挑战:在学术文献处理(SDP)任务中至关重要的科学文献的话语结构、科学文献的相互联系性质及其多模式性质。我们调查了应对这些挑战的现代神经网络学习方法:那些能够建模话语结构及其相互关联性并利用其多模态性质的方法。我们还强调了收集大规模数据集的努力,以及为实现SDP的有效深度学习部署而开发的工具。最后,我们讨论了即将到来的趋势,并为SDP的神经自然语言处理方法提出了未来的发展方向。
{"title":"Scientific document processing: challenges for modern learning methods.","authors":"Abhinav Ramesh Kashyap,&nbsp;Yajing Yang,&nbsp;Min-Yen Kan","doi":"10.1007/s00799-023-00352-7","DOIUrl":"10.1007/s00799-023-00352-7","url":null,"abstract":"<p><p>Neural network models enjoy success on language tasks related to Web documents, including news and Wikipedia articles. However, the characteristics of scientific publications pose specific challenges that have yet to be satisfactorily addressed: the discourse structure of scientific documents crucial in scholarly document processing (SDP) tasks, the interconnected nature of scientific documents, and their multimodal nature. We survey modern neural network learning methods that tackle these challenges: those that can model discourse structure and their interconnectivity and use their multimodal nature. We also highlight efforts to collect large-scale datasets and tools developed to enable effective deep learning deployment for SDP. We conclude with a discussion on upcoming trends and recommend future directions for pursuing neural natural language processing approaches for SDP.</p>","PeriodicalId":44974,"journal":{"name":"International Journal on Digital Libraries","volume":null,"pages":null},"PeriodicalIF":1.5,"publicationDate":"2023-03-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10036973/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9770420","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
The digitization of historical astrophysical literature with highly localized figures and figure captions 具有高度本地化图形和图形说明的历史天体物理文献的数字化
Q1 Social Sciences Pub Date : 2023-03-22 DOI: 10.1007/s00799-023-00350-9
Jill P. Naiman, Peter K. G. Williams, Alyssa Goodman
Scientific articles published prior to the “age of digitization” in the late 1990s contain figures which are “trapped” within their scanned pages. While progress to extract figures and their captions has been made, there is currently no robust method for this process. We present a YOLO-based method for use on scanned pages, after they have been processed with Optical character recognition (OCR), which uses both grayscale and OCR features. We focus our efforts on translating the intersection-over-union (IOU) metric from the field of object detection to document layout analysis and quantify “high localization” levels as an IOU of 0.9. When applied to the astrophysics literature holdings of the NASA astrophysics data system, we find F1 scores of 90.9% (92.2%) for figures (figure captions) with the IOU cut-off of 0.9 which is a significant improvement over other state-of-the-art methods.
在20世纪90年代末“数字化时代”之前发表的科学文章中,有一些数字被“困”在扫描页面中。虽然在提取数字及其说明文字方面取得了进展,但目前尚无可靠的方法来处理这一过程。我们提出了一种基于yolo的方法,用于扫描页面,在使用光学字符识别(OCR)处理后,该方法同时使用灰度和OCR特征。我们专注于将目标检测领域的交叉-超联合(IOU)度量转换为文件布局分析,并将“高本地化”水平量化为IOU为0.9。当应用于NASA天体物理数据系统的天体物理文献时,我们发现数字(图注)的F1得分为90.9% (92.2%),IOU截止值为0.9,这比其他最先进的方法有了显着提高。
{"title":"The digitization of historical astrophysical literature with highly localized figures and figure captions","authors":"Jill P. Naiman, Peter K. G. Williams, Alyssa Goodman","doi":"10.1007/s00799-023-00350-9","DOIUrl":"https://doi.org/10.1007/s00799-023-00350-9","url":null,"abstract":"Scientific articles published prior to the “age of digitization” in the late 1990s contain figures which are “trapped” within their scanned pages. While progress to extract figures and their captions has been made, there is currently no robust method for this process. We present a YOLO-based method for use on scanned pages, after they have been processed with Optical character recognition (OCR), which uses both grayscale and OCR features. We focus our efforts on translating the intersection-over-union (IOU) metric from the field of object detection to document layout analysis and quantify “high localization” levels as an IOU of 0.9. When applied to the astrophysics literature holdings of the NASA astrophysics data system, we find F1 scores of 90.9% (92.2%) for figures (figure captions) with the IOU cut-off of 0.9 which is a significant improvement over other state-of-the-art methods.","PeriodicalId":44974,"journal":{"name":"International Journal on Digital Libraries","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-03-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136173999","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Beyond translation: engaging with foreign languages in a digital library 超越翻译:在数字图书馆中学习外语
IF 1.5 Q1 Social Sciences Pub Date : 2023-03-19 DOI: 10.1007/s00799-023-00349-2
G. Crane, Alison Babeu, Lisa M. Cerrato, Amelia Parrish, Carolina Penagos, Farnoosh Shamsian, James Tauber, Jake Wegner
{"title":"Beyond translation: engaging with foreign languages in a digital library","authors":"G. Crane, Alison Babeu, Lisa M. Cerrato, Amelia Parrish, Carolina Penagos, Farnoosh Shamsian, James Tauber, Jake Wegner","doi":"10.1007/s00799-023-00349-2","DOIUrl":"https://doi.org/10.1007/s00799-023-00349-2","url":null,"abstract":"","PeriodicalId":44974,"journal":{"name":"International Journal on Digital Libraries","volume":null,"pages":null},"PeriodicalIF":1.5,"publicationDate":"2023-03-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77985407","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
CH-Bench: a user-oriented benchmark for systems for efficient distant reading (design, performance, and insights) CH-Bench:面向用户的系统基准,用于高效远程读取(设计,性能和见解)
IF 1.5 Q1 Social Sciences Pub Date : 2023-03-15 DOI: 10.1007/s00799-023-00347-4
Jens Willkomm, Markus Raster, Martin Schäler, Klemens Böhm
{"title":"CH-Bench: a user-oriented benchmark for systems for efficient distant reading (design, performance, and insights)","authors":"Jens Willkomm, Markus Raster, Martin Schäler, Klemens Böhm","doi":"10.1007/s00799-023-00347-4","DOIUrl":"https://doi.org/10.1007/s00799-023-00347-4","url":null,"abstract":"","PeriodicalId":44974,"journal":{"name":"International Journal on Digital Libraries","volume":null,"pages":null},"PeriodicalIF":1.5,"publicationDate":"2023-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"91108945","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
DeepMetaGen: an unsupervised deep neural approach to generate template-based meta-reviews leveraging on aspect category and sentiment analysis from peer reviews DeepMetaGen:一种无监督深度神经方法,用于生成基于模板的元评论,利用来自同行评论的方面类别和情感分析
IF 1.5 Q1 Social Sciences Pub Date : 2023-03-10 DOI: 10.1007/s00799-023-00348-3
Sandeep Kumar, Tirthankar Ghosal, Asif Ekbal
{"title":"DeepMetaGen: an unsupervised deep neural approach to generate template-based meta-reviews leveraging on aspect category and sentiment analysis from peer reviews","authors":"Sandeep Kumar, Tirthankar Ghosal, Asif Ekbal","doi":"10.1007/s00799-023-00348-3","DOIUrl":"https://doi.org/10.1007/s00799-023-00348-3","url":null,"abstract":"","PeriodicalId":44974,"journal":{"name":"International Journal on Digital Libraries","volume":null,"pages":null},"PeriodicalIF":1.5,"publicationDate":"2023-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84105181","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Implications of an ecospatial indigenous perspective on digital information organization and access 生态空间土著视角对数字信息组织和获取的影响
IF 1.5 Q1 Social Sciences Pub Date : 2023-03-07 DOI: 10.1007/s00799-023-00353-6
Sebastian Mukumbira, H. Winschiers-Theophilus
{"title":"Implications of an ecospatial indigenous perspective on digital information organization and access","authors":"Sebastian Mukumbira, H. Winschiers-Theophilus","doi":"10.1007/s00799-023-00353-6","DOIUrl":"https://doi.org/10.1007/s00799-023-00353-6","url":null,"abstract":"","PeriodicalId":44974,"journal":{"name":"International Journal on Digital Libraries","volume":null,"pages":null},"PeriodicalIF":1.5,"publicationDate":"2023-03-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86469923","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Transliterating Latin to Amharic scripts using user-defined rules and character mappings 使用用户定义的规则和字符映射将拉丁语音译为阿姆哈拉语
IF 1.5 Q1 Social Sciences Pub Date : 2023-03-01 DOI: 10.1007/s00799-023-00346-5
Z. Abebaw, A. Rauber, Solomon Atnafu
{"title":"Transliterating Latin to Amharic scripts using user-defined rules and character mappings","authors":"Z. Abebaw, A. Rauber, Solomon Atnafu","doi":"10.1007/s00799-023-00346-5","DOIUrl":"https://doi.org/10.1007/s00799-023-00346-5","url":null,"abstract":"","PeriodicalId":44974,"journal":{"name":"International Journal on Digital Libraries","volume":null,"pages":null},"PeriodicalIF":1.5,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77494404","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
International Journal on Digital Libraries
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1