首页 > 最新文献

Electron. Libr.最新文献

英文 中文
Towards better library services: an investigation of factors affecting tourists' satisfaction with "library + cultural tourism" 走向更好的图书馆服务:影响游客“图书馆+文化旅游”满意度的因素调查
Pub Date : 2021-10-06 DOI: 10.1108/el-03-2021-0070
Y. Pan, Lia H. Sun, H. Yang, Jianming Zheng
Purpose“Library + cultural tourism” (LCT) is a new direction for the sustainable development of libraries, but few scholars have researched it from a tourist perspective. The purpose of this paper is to identify what factors affect tourist satisfaction (TS) with LCT, reveal the interaction among these factors and provide strategies for better library services.Design/methodology/approachThis study collected 5,308 comments on Tianjin Binhai Library from three popular online travel communities. Grounded theory was adopted to identify what factors affect TS with LCT and reveal the interaction among these factors.FindingsThe results indicated that TS with LCT was affected by complex factors related to tour plans, tour expectations, cultural characteristics, environment, support services and emotions. Cultural characteristics, environment and support services were impacted by tour plans and tour expectations, which directly or indirectly affected TS mediated by emotions. TS further influenced their expectation of their next tour.Originality/valueThis paper uncovered critical problems with LCT using a case study of Tianjin Binhai Library. The results provide a reference for library practitioners to develop better library services for tourists and regular users/readers.
“图书馆+文化旅游”(LCT)是图书馆可持续发展的新方向,但很少有学者从旅游的角度进行研究。本文的目的是找出影响游客满意度的因素,揭示这些因素之间的相互作用,并提供更好的图书馆服务策略。设计/方法/方法本研究收集了来自三个热门在线旅游社区对天津滨海图书馆的5308条评论。本研究采用扎根理论分析影响LCT与TS的因素,并揭示这些因素之间的相互作用。结果表明:旅游计划、旅游期望、文化特征、环境、支持服务和情绪等因素对旅游伴游的影响是复杂的。旅游计划和旅游期望对旅游文化特征、旅游环境和旅游支持服务有直接或间接的影响。TS进一步影响了他们对下一次巡演的期待。本文以天津滨海图书馆为例,揭示了LCT存在的关键问题。研究结果可为图书馆从业者提供参考,以更好地为游客和普通用户/读者提供图书馆服务。
{"title":"Towards better library services: an investigation of factors affecting tourists' satisfaction with \"library + cultural tourism\"","authors":"Y. Pan, Lia H. Sun, H. Yang, Jianming Zheng","doi":"10.1108/el-03-2021-0070","DOIUrl":"https://doi.org/10.1108/el-03-2021-0070","url":null,"abstract":"\u0000Purpose\u0000“Library + cultural tourism” (LCT) is a new direction for the sustainable development of libraries, but few scholars have researched it from a tourist perspective. The purpose of this paper is to identify what factors affect tourist satisfaction (TS) with LCT, reveal the interaction among these factors and provide strategies for better library services.\u0000\u0000\u0000Design/methodology/approach\u0000This study collected 5,308 comments on Tianjin Binhai Library from three popular online travel communities. Grounded theory was adopted to identify what factors affect TS with LCT and reveal the interaction among these factors.\u0000\u0000\u0000Findings\u0000The results indicated that TS with LCT was affected by complex factors related to tour plans, tour expectations, cultural characteristics, environment, support services and emotions. Cultural characteristics, environment and support services were impacted by tour plans and tour expectations, which directly or indirectly affected TS mediated by emotions. TS further influenced their expectation of their next tour.\u0000\u0000\u0000Originality/value\u0000This paper uncovered critical problems with LCT using a case study of Tianjin Binhai Library. The results provide a reference for library practitioners to develop better library services for tourists and regular users/readers.\u0000","PeriodicalId":330882,"journal":{"name":"Electron. Libr.","volume":"223 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-10-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115490516","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Research on influencing factors of information diffusion in online social networks under different themes 不同主题下网络社交网络信息扩散影响因素研究
Pub Date : 2021-10-06 DOI: 10.1108/el-12-2020-0329
Ling Zhang, D. Li, Robert J. Boncella
PurposeThis paper aims to study the factors influencing online social network (OSN) information diffusion under different themes helps to understand information diffusion in general.Design/methodology/approachThis study collects data from the Web of Science, use the strategic consulting intelligent support system for word frequency analysis and use keyword clustering to classify themes, then research information themes as influencing factors of OSN information diffusion.FindingsFive themes of “natural disaster”, “political event”, “product marketing”, “sport and entertainment” and “health-disease” have been identified. It is found that the research objects, research methods and research theories used by scholars under different themes have different focuses, and the factors affecting information diffusion are different.Research limitations/implicationsThe limitation of this paper is that it only focuses on five typical themes, and there may be more themes.Practical implicationsThe research helps other scholars to conduct in-depth research on the diffusion of OSN information under different topics and focus on the content of the research on OSN information diffusion under different topics.Social implicationsThe research helps other scholars to conduct in-depth research on the diffusion of social network information under different topics, so as to better understand and predict the law of information diffusion.Originality/valueThe research summarizes the research on information diffusion in OSNs from the theme level and analyses the key points and theories and further enriches the research system on information diffusion in OSNs.
目的研究不同主题下网络社交网络(online social network, OSN)信息扩散的影响因素,有助于从总体上理解信息扩散。设计/方法/方法本研究从Web of Science收集数据,利用战略咨询智能支持系统进行词频分析,利用关键词聚类对主题进行分类,研究信息主题作为OSN信息扩散的影响因素。调查结果确定了五个主题:“自然灾害”、“政治事件”、“产品营销”、“体育和娱乐”以及“健康-疾病”。研究发现,不同主题下学者的研究对象、研究方法和研究理论的侧重点不同,影响信息扩散的因素也不同。研究局限性/启示本文的局限性在于只关注了五个典型的主题,可能还会有更多的主题。实践意义本研究有助于其他学者对不同主题下的OSN信息扩散进行深入研究,关注不同主题下OSN信息扩散研究的内容。社会意义本研究有助于其他学者对不同主题下的社会网络信息扩散进行深入研究,从而更好地理解和预测信息扩散规律。本研究从主题层面对网络环境信息扩散的研究进行了总结,分析了研究的重点和理论,进一步丰富了网络环境信息扩散的研究体系。
{"title":"Research on influencing factors of information diffusion in online social networks under different themes","authors":"Ling Zhang, D. Li, Robert J. Boncella","doi":"10.1108/el-12-2020-0329","DOIUrl":"https://doi.org/10.1108/el-12-2020-0329","url":null,"abstract":"\u0000Purpose\u0000This paper aims to study the factors influencing online social network (OSN) information diffusion under different themes helps to understand information diffusion in general.\u0000\u0000\u0000Design/methodology/approach\u0000This study collects data from the Web of Science, use the strategic consulting intelligent support system for word frequency analysis and use keyword clustering to classify themes, then research information themes as influencing factors of OSN information diffusion.\u0000\u0000\u0000Findings\u0000Five themes of “natural disaster”, “political event”, “product marketing”, “sport and entertainment” and “health-disease” have been identified. It is found that the research objects, research methods and research theories used by scholars under different themes have different focuses, and the factors affecting information diffusion are different.\u0000\u0000\u0000Research limitations/implications\u0000The limitation of this paper is that it only focuses on five typical themes, and there may be more themes.\u0000\u0000\u0000Practical implications\u0000The research helps other scholars to conduct in-depth research on the diffusion of OSN information under different topics and focus on the content of the research on OSN information diffusion under different topics.\u0000\u0000\u0000Social implications\u0000The research helps other scholars to conduct in-depth research on the diffusion of social network information under different topics, so as to better understand and predict the law of information diffusion.\u0000\u0000\u0000Originality/value\u0000The research summarizes the research on information diffusion in OSNs from the theme level and analyses the key points and theories and further enriches the research system on information diffusion in OSNs.\u0000","PeriodicalId":330882,"journal":{"name":"Electron. Libr.","volume":"55 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-10-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125738951","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
A new subject-based retrieval and search result visualization approach for scientific digital libraries 科学数字图书馆基于主题的检索与搜索结果可视化新方法
Pub Date : 2021-09-22 DOI: 10.1108/el-08-2020-0243
Sayed Mahmood Bakhshayesh, A. Ahmadi, Azadeh Mohebi
PurposeMany search engines in digital libraries are restricted to the terms presented in users’ queries. When users cannot represent their information needs in terms of keywords in a query, the search engine fails to provide appropriate results. In addition, most search engines do not have the ability to visualize search results for users to help them in their information journey. The purpose of this paper is to develop a new approach for search result visualization in digital libraries. The visualization approach enables subject-based visualization of search results and search queries.Design/methodology/approachTo enable subject-based visualization of search results in digital libraries, new subject-based document retrieval is proposed in which each document is represented as a vector of subjects as well. Then, using a vector space model for information retrieval, along with the subject-based vector, related documents to the user’s query are retrieved, whilst each document is visualized through a ring chart, showing the inherent subjects within each document and the query.FindingsThe proposed subject-based retrieval and visualization approach is evaluated from various perspectives to amplify the impact of the visualization approach from users’ opinions. Users have evaluated the performance of the proposed subject-based retrieval and search result visualization, whilst 67% of users prefer subject-based document retrieval and 80% of them believe that the proposed visualization approach is practical.Research limitations/implicationsThis research has provided a subject-based representation scheme for search result visualization in a digital library. The implication of this research can be viewed from two perspectives. First, the subject-based retrieval approach provides an opportunity for the users to understand their information needs, beyond the explicit terms in the query, leading to results, which are semantically relevant to the query. Second, the simple subject-based visualization scheme, helps users to explore the results easily, whilst allowing them to build their knowledge experience.Originality/valueA new vectorized subject-based representation of documents and queries is proposed. This representation determines the semantic and subject-based relationship between a given query and documents within a digital scientific library. In addition, it also provides a subject-based representation of the retrieved documents through which users can track the semantic relationship between the query and retrieve documents, visually.
目的数字图书馆中的许多搜索引擎仅限于用户查询中呈现的术语。当用户不能在查询中用关键字表示他们的信息需求时,搜索引擎就不能提供合适的结果。此外,大多数搜索引擎不具备可视化搜索结果的能力,以帮助用户在他们的信息旅程。本文的目的是开发一种新的数字图书馆搜索结果可视化方法。可视化方法支持搜索结果和搜索查询的基于主题的可视化。设计/方法/方法为了实现数字图书馆中基于主题的搜索结果可视化,提出了一种新的基于主题的文档检索方法,其中每个文档也表示为主题向量。然后,使用用于信息检索的向量空间模型以及基于主题的向量,检索与用户查询相关的文档,同时通过环形图将每个文档可视化,显示每个文档和查询中的固有主题。从多个角度对基于主题的检索和可视化方法进行了评估,以放大用户意见可视化方法的影响。用户已经评估了所提出的基于主题的检索和搜索结果可视化的性能,而67%的用户更喜欢基于主题的文档检索,其中80%的用户认为所提出的可视化方法是实用的。研究局限/启示本研究为数字图书馆搜索结果可视化提供了一个基于主题的表示方案。本研究的意义可以从两个角度来看待。首先,基于主题的检索方法为用户提供了了解他们的信息需求的机会,而不是查询中的显式术语,从而产生与查询在语义上相关的结果。其次,简单的基于主题的可视化方案,帮助用户轻松地探索结果,同时让他们建立自己的知识体验。原创性/价值提出了一种新的基于主题的矢量化文档和查询表示方法。这种表示确定了数字科学图书馆中给定查询和文档之间的语义和基于主题的关系。此外,它还提供了检索文档的基于主题的表示,用户可以通过它直观地跟踪查询和检索文档之间的语义关系。
{"title":"A new subject-based retrieval and search result visualization approach for scientific digital libraries","authors":"Sayed Mahmood Bakhshayesh, A. Ahmadi, Azadeh Mohebi","doi":"10.1108/el-08-2020-0243","DOIUrl":"https://doi.org/10.1108/el-08-2020-0243","url":null,"abstract":"\u0000Purpose\u0000Many search engines in digital libraries are restricted to the terms presented in users’ queries. When users cannot represent their information needs in terms of keywords in a query, the search engine fails to provide appropriate results. In addition, most search engines do not have the ability to visualize search results for users to help them in their information journey. The purpose of this paper is to develop a new approach for search result visualization in digital libraries. The visualization approach enables subject-based visualization of search results and search queries.\u0000\u0000\u0000Design/methodology/approach\u0000To enable subject-based visualization of search results in digital libraries, new subject-based document retrieval is proposed in which each document is represented as a vector of subjects as well. Then, using a vector space model for information retrieval, along with the subject-based vector, related documents to the user’s query are retrieved, whilst each document is visualized through a ring chart, showing the inherent subjects within each document and the query.\u0000\u0000\u0000Findings\u0000The proposed subject-based retrieval and visualization approach is evaluated from various perspectives to amplify the impact of the visualization approach from users’ opinions. Users have evaluated the performance of the proposed subject-based retrieval and search result visualization, whilst 67% of users prefer subject-based document retrieval and 80% of them believe that the proposed visualization approach is practical.\u0000\u0000\u0000Research limitations/implications\u0000This research has provided a subject-based representation scheme for search result visualization in a digital library. The implication of this research can be viewed from two perspectives. First, the subject-based retrieval approach provides an opportunity for the users to understand their information needs, beyond the explicit terms in the query, leading to results, which are semantically relevant to the query. Second, the simple subject-based visualization scheme, helps users to explore the results easily, whilst allowing them to build their knowledge experience.\u0000\u0000\u0000Originality/value\u0000A new vectorized subject-based representation of documents and queries is proposed. This representation determines the semantic and subject-based relationship between a given query and documents within a digital scientific library. In addition, it also provides a subject-based representation of the retrieved documents through which users can track the semantic relationship between the query and retrieve documents, visually.\u0000","PeriodicalId":330882,"journal":{"name":"Electron. Libr.","volume":"227 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124508109","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Investigating the use of metadata record graphs to analyze subject headings in the digital public library of America 调查使用元数据记录图来分析美国数字公共图书馆的主题标题
Pub Date : 2021-08-16 DOI: 10.1108/el-11-2020-0317
M. Phillips, H. Tarver
PurposeThis study furthers metadata quality research by providing complementary network-based metrics and insights to analyze metadata records and identify areas for improvement.Design/methodology/approachMetadata record graphs apply network analysis to metadata field values; this study evaluates the interconnectedness of subjects within each Hub aggregated into the Digital Public Library of America. It also reviews the effects of NACO normalization – simulating revision of values for consistency – and breaking up pre-coordinated subject headings – to simulate applying the Faceted Application of Subject Terminology to Library of Congress Subject Headings.FindingsNetwork statistics complement count- or value-based metrics by providing context related to the number of records a user might actually find starting from one item and moving to others via shared subject values. Additionally, connectivity increases through the normalization of values to correct or adjust for formatting differences or by breaking pre-coordinated subject strings into separate topics.Research limitations/implicationsThis analysis focuses on exact-string matches, which is the lowest-common denominator for searching, although many search engines and digital library indexes may use less stringent matching methods. In terms of practical implications for evaluating or improving subjects in metadata, the normalization components demonstrate where resources may be most effectively allocated for these activities (depending on a collection).Originality/valueAlthough the individual components of this research are not particularly novel, network analysis has not generally been applied to metadata analysis. This research furthers previous studies related to metadata quality analysis of aggregations and digital collections in general.
本研究通过提供互补的基于网络的度量和见解来分析元数据记录并确定需要改进的领域,从而进一步推进元数据质量研究。设计/方法/方法元数据记录图将网络分析应用于元数据字段值;本研究评估了汇聚到美国数字公共图书馆的每个中心内学科的互联性。本文还回顾了NACO规范化的效果——模拟一致性值的修订——和分解预先协调的主题标题——模拟在国会图书馆主题标题中应用主题术语的分面应用。findsnetwork统计数据通过提供与用户可能从一个项目开始并通过共享主题值移动到其他项目的记录数量相关的上下文,来补充基于计数或值的度量。此外,通过对值进行规范化以纠正或调整格式差异,或将预先协调的主题字符串分解为单独的主题,可以增加连接性。研究限制/意义本分析侧重于精确字符串匹配,这是搜索的最低标准,尽管许多搜索引擎和数字图书馆索引可能使用不太严格的匹配方法。就评估或改进元数据中的主题的实际含义而言,规范化组件演示了在哪里可以最有效地为这些活动分配资源(取决于集合)。原创性/价值虽然本研究的个别组成部分并不是特别新颖,但网络分析并没有普遍应用于元数据分析。本研究进一步推进了以往关于聚合和数字集合的元数据质量分析的研究。
{"title":"Investigating the use of metadata record graphs to analyze subject headings in the digital public library of America","authors":"M. Phillips, H. Tarver","doi":"10.1108/el-11-2020-0317","DOIUrl":"https://doi.org/10.1108/el-11-2020-0317","url":null,"abstract":"\u0000Purpose\u0000This study furthers metadata quality research by providing complementary network-based metrics and insights to analyze metadata records and identify areas for improvement.\u0000\u0000\u0000Design/methodology/approach\u0000Metadata record graphs apply network analysis to metadata field values; this study evaluates the interconnectedness of subjects within each Hub aggregated into the Digital Public Library of America. It also reviews the effects of NACO normalization – simulating revision of values for consistency – and breaking up pre-coordinated subject headings – to simulate applying the Faceted Application of Subject Terminology to Library of Congress Subject Headings.\u0000\u0000\u0000Findings\u0000Network statistics complement count- or value-based metrics by providing context related to the number of records a user might actually find starting from one item and moving to others via shared subject values. Additionally, connectivity increases through the normalization of values to correct or adjust for formatting differences or by breaking pre-coordinated subject strings into separate topics.\u0000\u0000\u0000Research limitations/implications\u0000This analysis focuses on exact-string matches, which is the lowest-common denominator for searching, although many search engines and digital library indexes may use less stringent matching methods. In terms of practical implications for evaluating or improving subjects in metadata, the normalization components demonstrate where resources may be most effectively allocated for these activities (depending on a collection).\u0000\u0000\u0000Originality/value\u0000Although the individual components of this research are not particularly novel, network analysis has not generally been applied to metadata analysis. This research furthers previous studies related to metadata quality analysis of aggregations and digital collections in general.\u0000","PeriodicalId":330882,"journal":{"name":"Electron. Libr.","volume":"67 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-08-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132314048","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Collecting and evaluating large volumes of bibliographic metadata aggregated in the WorldCat database: a proposed methodology to overcome challenges 收集和评估汇集在WorldCat数据库中的大量书目元数据:克服挑战的建议方法
Pub Date : 2021-08-09 DOI: 10.1108/el-11-2020-0316
Vyacheslav Zavalin, Shawne D. Miksa
PurposeThis paper aims to discuss the challenges encountered in collecting, cleaning and analyzing the large data set of bibliographic metadata records in machine-readable cataloging [MARC 21] format. Possible solutions are presented.Design/methodology/approachThis mixed method study relied on content analysis and social network analysis. The study examined subject representation in MARC 21 metadata records created in 2020 in WorldCat – the largest international database of “big smart data.” The methodological challenges that were encountered and solutions are examined.FindingsIn this general review paper with a focus on methodological issues, the discussion of challenges is followed by a discussion of solutions developed and tested as part of this study. Data collection, processing, analysis and visualization are addressed separately. Lessons learned and conclusions related to challenges and solutions for the design of a large-scale study evaluating MARC 21 bibliographic metadata from WorldCat are given. Overall recommendations for the design and implementation of future research are suggested.Originality/valueThere are no previous publications that address the challenges and solutions of data collection and analysis of WorldCat’s “big smart data” in the form of MARC 21 data. This is the first study to use a large data set to systematically examine MARC 21 library metadata records created after the most recent addition of new fields and subfields to MARC 21 Bibliographic Format standard in 2019 based on resource description and access rules. It is also the first to focus its analyzes on the networks formed by subject terms shared by MARC 21 bibliographic records in a data set extracted from a heterogeneous centralized database WorldCat.
目的探讨机器可读编目[MARC 21]格式的大型书目元数据记录的收集、清理和分析所面临的挑战。提出了可能的解决方案。设计/方法/方法这种混合方法研究依赖于内容分析和社会网络分析。该研究检查了2020年在世界最大的“大智能数据”国际数据库WorldCat中创建的MARC 21元数据记录中的主题表示。所遇到的方法上的挑战和解决方案进行了审查。在这篇以方法论问题为重点的综述文章中,首先讨论了挑战,然后讨论了作为本研究一部分开发和测试的解决方案。数据的收集、处理、分析和可视化分别进行了讨论。本文给出了设计一项评估WorldCat MARC 21书目元数据的大规模研究的经验教训和相关的挑战和解决方案。对未来研究的设计和实施提出了总体建议。原创性/价值以前没有出版物以MARC 21数据的形式解决WorldCat的“大智能数据”的数据收集和分析的挑战和解决方案。这是第一个使用大型数据集系统检查MARC 21图书馆元数据记录的研究,该记录是在2019年根据资源描述和访问规则向MARC 21书目格式标准添加新字段和子字段后创建的。它也是第一个集中分析由从异构中央数据库WorldCat提取的数据集中的MARC 21书目记录共享的主题术语所形成的网络。
{"title":"Collecting and evaluating large volumes of bibliographic metadata aggregated in the WorldCat database: a proposed methodology to overcome challenges","authors":"Vyacheslav Zavalin, Shawne D. Miksa","doi":"10.1108/el-11-2020-0316","DOIUrl":"https://doi.org/10.1108/el-11-2020-0316","url":null,"abstract":"\u0000Purpose\u0000This paper aims to discuss the challenges encountered in collecting, cleaning and analyzing the large data set of bibliographic metadata records in machine-readable cataloging [MARC 21] format. Possible solutions are presented.\u0000\u0000\u0000Design/methodology/approach\u0000This mixed method study relied on content analysis and social network analysis. The study examined subject representation in MARC 21 metadata records created in 2020 in WorldCat – the largest international database of “big smart data.” The methodological challenges that were encountered and solutions are examined.\u0000\u0000\u0000Findings\u0000In this general review paper with a focus on methodological issues, the discussion of challenges is followed by a discussion of solutions developed and tested as part of this study. Data collection, processing, analysis and visualization are addressed separately. Lessons learned and conclusions related to challenges and solutions for the design of a large-scale study evaluating MARC 21 bibliographic metadata from WorldCat are given. Overall recommendations for the design and implementation of future research are suggested.\u0000\u0000\u0000Originality/value\u0000There are no previous publications that address the challenges and solutions of data collection and analysis of WorldCat’s “big smart data” in the form of MARC 21 data. This is the first study to use a large data set to systematically examine MARC 21 library metadata records created after the most recent addition of new fields and subfields to MARC 21 Bibliographic Format standard in 2019 based on resource description and access rules. It is also the first to focus its analyzes on the networks formed by subject terms shared by MARC 21 bibliographic records in a data set extracted from a heterogeneous centralized database WorldCat.\u0000","PeriodicalId":330882,"journal":{"name":"Electron. Libr.","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125659500","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
An exploratory analysis: extracting materials science knowledge from unstructured scholarly data 探索性分析:从非结构化学术数据中提取材料科学知识
Pub Date : 2021-08-09 DOI: 10.1108/el-11-2020-0320
Xintong Zhao, Jane Greenberg, V. Meschke, E. Toberer, Xiaohua Hu
PurposeThe output of academic literature has increased significantly due to digital technology, presenting researchers with a challenge across every discipline, including materials science, as it is impossible to manually read and extract knowledge from millions of published literature. The purpose of this study is to address this challenge by exploring knowledge extraction in materials science, as applied to digital scholarship. An overriding goal is to help inform readers about the status knowledge extraction in materials science.Design/methodology/approachThe authors conducted a two-part analysis, comparing knowledge extraction methods applied materials science scholarship, across a sample of 22 articles; followed by a comparison of HIVE-4-MAT, an ontology-based knowledge extraction and MatScholar, a named entity recognition (NER) application. This paper covers contextual background, and a review of three tiers of knowledge extraction (ontology-based, NER and relation extraction), followed by the research goals and approach.FindingsThe results indicate three key needs for researchers to consider for advancing knowledge extraction: the need for materials science focused corpora; the need for researchers to define the scope of the research being pursued, and the need to understand the tradeoffs among different knowledge extraction methods. This paper also points to future material science research potential with relation extraction and increased availability of ontologies.Originality/valueTo the best of the authors’ knowledge, there are very few studies examining knowledge extraction in materials science. This work makes an important contribution to this underexplored research area.
由于数字技术的发展,学术文献的产出显著增加,这给包括材料科学在内的各个学科的研究人员带来了挑战,因为不可能手动阅读和提取数百万已发表的文献中的知识。本研究的目的是通过探索材料科学中的知识提取来解决这一挑战,并将其应用于数字学术。最重要的目标是帮助读者了解材料科学中的状态知识提取。设计/方法/方法作者对22篇文章的样本进行了两部分分析,比较了材料科学奖学金的知识提取方法;然后比较了基于本体的知识抽取方法HIVE-4-MAT和命名实体识别(NER)应用程序MatScholar。本文首先介绍了知识抽取的背景,然后介绍了知识抽取的三个层次(基于本体的、NER的和关系抽取的),然后介绍了研究目标和方法。研究结果表明,研究人员需要考虑推进知识提取的三个关键需求:需要以材料科学为重点的语料库;研究人员需要确定研究的范围,需要了解不同知识提取方法之间的权衡。本文还指出了随着关系提取和本体可用性的增加,未来材料科学研究的潜力。原创性/价值据作者所知,材料科学中知识提取的研究很少。这项工作为这一尚未充分开发的研究领域做出了重要贡献。
{"title":"An exploratory analysis: extracting materials science knowledge from unstructured scholarly data","authors":"Xintong Zhao, Jane Greenberg, V. Meschke, E. Toberer, Xiaohua Hu","doi":"10.1108/el-11-2020-0320","DOIUrl":"https://doi.org/10.1108/el-11-2020-0320","url":null,"abstract":"\u0000Purpose\u0000The output of academic literature has increased significantly due to digital technology, presenting researchers with a challenge across every discipline, including materials science, as it is impossible to manually read and extract knowledge from millions of published literature. The purpose of this study is to address this challenge by exploring knowledge extraction in materials science, as applied to digital scholarship. An overriding goal is to help inform readers about the status knowledge extraction in materials science.\u0000\u0000\u0000Design/methodology/approach\u0000The authors conducted a two-part analysis, comparing knowledge extraction methods applied materials science scholarship, across a sample of 22 articles; followed by a comparison of HIVE-4-MAT, an ontology-based knowledge extraction and MatScholar, a named entity recognition (NER) application. This paper covers contextual background, and a review of three tiers of knowledge extraction (ontology-based, NER and relation extraction), followed by the research goals and approach.\u0000\u0000\u0000Findings\u0000The results indicate three key needs for researchers to consider for advancing knowledge extraction: the need for materials science focused corpora; the need for researchers to define the scope of the research being pursued, and the need to understand the tradeoffs among different knowledge extraction methods. This paper also points to future material science research potential with relation extraction and increased availability of ontologies.\u0000\u0000\u0000Originality/value\u0000To the best of the authors’ knowledge, there are very few studies examining knowledge extraction in materials science. This work makes an important contribution to this underexplored research area.\u0000","PeriodicalId":330882,"journal":{"name":"Electron. Libr.","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121908409","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Public library Twitter use during the early period of the COVID-19 lockdown in the United States 在美国COVID-19封锁初期,公共图书馆Twitter的使用情况
Pub Date : 2021-08-09 DOI: 10.1108/EL-03-2021-0067
Youngok Choi, Sung Un Kim
PurposeThe purpose of this study is to explore the Twitter use of public libraries during the early period of lockdown due to the COVID-19 pandemic to evaluate the focus of Twitter communication.Design/methodology/approachA total of 57 active, public library Twitter accounts were used for data collection and analysis. The tweets examined were a combination of original tweets (n = 1,465) and retweets (n = 516) posted from other Twitter accounts on the public libraries’ Twitter feeds. A content analysis scheme was developed to analyse topical aspects of the tweets.FindingsThe most frequent tweets were about library events, programmes and activities. However, there was a relatively low focus on sharing community information and addressing information related to the pandemic. The study suggests that public libraries could use Twitter to provide library resources and services to their patrons, whilst also acting as a virtual community centre safely keeping patrons engaged in the face of a global pandemic. By doing so, Twitter could be used as an integral part of promoting the mission of public libraries.Research limitations/implicationsThe study examined a limited number of public libraries’ Twitter posts. Whilst the study carried out a random sampling of 10% of public libraries from the five states that had the highest COVID-19 cases in the month of April 2020, the study only examined tweets of 57 public libraries being active in posting. Thus, the findings of the study are not for generalizing.Practical implicationsThe content scheme includes content types regarding library services and community information. The content category scheme is general to reflect themes of content during a normal time and any emergency. Thus, this framework could be helpful for the content development of public libraries in planning social media use.Originality/valueThe study used a new content analysis framework to examine both original tweets and retweets for information sharing of library services and community information. The approach of content analysis is distinctive to examine libraries’ communication trends on social media not only in normal times but also in times of crisis as well. The study also incorporated additional measures to assess Twitter practices including hashtags.
本研究的目的是探讨新冠肺炎大流行导致公共图书馆在封锁初期使用Twitter的情况,以评估Twitter传播的重点。设计/方法/方法总共使用了57个活跃的公共图书馆Twitter帐户进行数据收集和分析。被检查的推文是原始推文(n = 1465)和转发推文(n = 516)的组合,这些推文来自公共图书馆的推特feed上的其他推特账户。我们开发了一个内容分析方案来分析推文的主题方面。研究发现,最频繁的推文是关于图书馆的活动、项目和活动。然而,对分享社区信息和处理与大流行病有关的信息的重视程度相对较低。这项研究表明,公共图书馆可以利用Twitter向读者提供图书馆资源和服务,同时也可以作为一个虚拟社区中心,让读者在面对全球流行病时安全地参与其中。通过这样做,Twitter可以作为促进公共图书馆使命的一个组成部分。研究局限/启示这项研究调查了有限数量的公共图书馆的Twitter帖子。虽然该研究在2020年4月对COVID-19病例最多的五个州的10%的公共图书馆进行了随机抽样,但该研究只调查了57个活跃的公共图书馆的推文。因此,这项研究的结果不能一概而论。实际意义内容方案包括图书馆服务和社区信息的内容类型。内容类别方案是一般性的,以反映正常时间和任何紧急情况下的内容主题。因此,该框架可为公共图书馆规划社会媒体使用的内容开发提供帮助。独创性/价值该研究使用了一个新的内容分析框架来检查原始推文和转发推文对图书馆服务和社区信息的信息共享。内容分析的方法是独特的,不仅在正常情况下,也在危机时期,研究图书馆在社交媒体上的传播趋势。该研究还纳入了其他措施来评估推特的做法,包括标签。
{"title":"Public library Twitter use during the early period of the COVID-19 lockdown in the United States","authors":"Youngok Choi, Sung Un Kim","doi":"10.1108/EL-03-2021-0067","DOIUrl":"https://doi.org/10.1108/EL-03-2021-0067","url":null,"abstract":"\u0000Purpose\u0000The purpose of this study is to explore the Twitter use of public libraries during the early period of lockdown due to the COVID-19 pandemic to evaluate the focus of Twitter communication.\u0000\u0000\u0000Design/methodology/approach\u0000A total of 57 active, public library Twitter accounts were used for data collection and analysis. The tweets examined were a combination of original tweets (n = 1,465) and retweets (n = 516) posted from other Twitter accounts on the public libraries’ Twitter feeds. A content analysis scheme was developed to analyse topical aspects of the tweets.\u0000\u0000\u0000Findings\u0000The most frequent tweets were about library events, programmes and activities. However, there was a relatively low focus on sharing community information and addressing information related to the pandemic. The study suggests that public libraries could use Twitter to provide library resources and services to their patrons, whilst also acting as a virtual community centre safely keeping patrons engaged in the face of a global pandemic. By doing so, Twitter could be used as an integral part of promoting the mission of public libraries.\u0000\u0000\u0000Research limitations/implications\u0000The study examined a limited number of public libraries’ Twitter posts. Whilst the study carried out a random sampling of 10% of public libraries from the five states that had the highest COVID-19 cases in the month of April 2020, the study only examined tweets of 57 public libraries being active in posting. Thus, the findings of the study are not for generalizing.\u0000\u0000\u0000Practical implications\u0000The content scheme includes content types regarding library services and community information. The content category scheme is general to reflect themes of content during a normal time and any emergency. Thus, this framework could be helpful for the content development of public libraries in planning social media use.\u0000\u0000\u0000Originality/value\u0000The study used a new content analysis framework to examine both original tweets and retweets for information sharing of library services and community information. The approach of content analysis is distinctive to examine libraries’ communication trends on social media not only in normal times but also in times of crisis as well. The study also incorporated additional measures to assess Twitter practices including hashtags.\u0000","PeriodicalId":330882,"journal":{"name":"Electron. Libr.","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132044295","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Towards an entity relation extraction framework in the cross-lingual context 跨语言环境下的实体关系抽取框架
Pub Date : 2021-08-03 DOI: 10.1108/el-10-2020-0304
Chuanming Yu, Haodong Xue, Manyi Wang, Lu An
PurposeOwing to the uneven distribution of annotated corpus among different languages, it is necessary to bridge the gap between low resource languages and high resource languages. From the perspective of entity relation extraction, this paper aims to extend the knowledge acquisition task from a single language context to a cross-lingual context, and to improve the relation extraction performance for low resource languages.Design/methodology/approachThis paper proposes a cross-lingual adversarial relation extraction (CLARE) framework, which decomposes cross-lingual relation extraction into parallel corpus acquisition and adversarial adaptation relation extraction. Based on the proposed framework, this paper conducts extensive experiments in two tasks, i.e. the English-to-Chinese and the English-to-Arabic cross-lingual entity relation extraction.FindingsThe Macro-F1 values of the optimal models in the two tasks are 0.880 1 and 0.789 9, respectively, indicating that the proposed CLARE framework for CLARE can significantly improve the effect of low resource language entity relation extraction. The experimental results suggest that the proposed framework can effectively transfer the corpus as well as the annotated tags from English to Chinese and Arabic. This study reveals that the proposed approach is less human labour intensive and more effective in the cross-lingual entity relation extraction than the manual method. It shows that this approach has high generalizability among different languages.Originality/valueThe research results are of great significance for improving the performance of the cross-lingual knowledge acquisition. The cross-lingual transfer may greatly reduce the time and cost of the manual construction of the multi-lingual corpus. It sheds light on the knowledge acquisition and organization from the unstructured text in the era of big data.
目的由于标注语料库在不同语言之间分布不均,有必要弥合低资源语言和高资源语言之间的差距。从实体关系抽取的角度出发,将知识获取任务从单一语言语境扩展到跨语言语境,提高低资源语言的关系抽取性能。本文提出了一种跨语言对抗关系提取(CLARE)框架,该框架将跨语言关系提取分解为平行语料库获取和对抗适应关系提取。基于所提出的框架,本文在英汉跨语言实体关系抽取和英汉阿拉伯跨语言实体关系抽取两个任务上进行了大量实验。结果两个任务中最优模型的Macro-F1值分别为0.880 1和0.789 9,表明本文提出的CLARE框架能够显著提高低资源语言实体关系提取的效果。实验结果表明,该框架能够有效地将语料库和标注标签从英语转换为汉语和阿拉伯语。研究表明,该方法在跨语言实体关系提取方面比人工方法更有效,减少了人工劳动强度。结果表明,该方法在不同语言间具有较高的通用性。本研究结果对于提高跨语言知识习得的绩效具有重要意义。跨语迁移可以大大减少人工构建多语语料库的时间和成本。揭示了大数据时代从非结构化文本中获取知识和组织知识的思路。
{"title":"Towards an entity relation extraction framework in the cross-lingual context","authors":"Chuanming Yu, Haodong Xue, Manyi Wang, Lu An","doi":"10.1108/el-10-2020-0304","DOIUrl":"https://doi.org/10.1108/el-10-2020-0304","url":null,"abstract":"\u0000Purpose\u0000Owing to the uneven distribution of annotated corpus among different languages, it is necessary to bridge the gap between low resource languages and high resource languages. From the perspective of entity relation extraction, this paper aims to extend the knowledge acquisition task from a single language context to a cross-lingual context, and to improve the relation extraction performance for low resource languages.\u0000\u0000\u0000Design/methodology/approach\u0000This paper proposes a cross-lingual adversarial relation extraction (CLARE) framework, which decomposes cross-lingual relation extraction into parallel corpus acquisition and adversarial adaptation relation extraction. Based on the proposed framework, this paper conducts extensive experiments in two tasks, i.e. the English-to-Chinese and the English-to-Arabic cross-lingual entity relation extraction.\u0000\u0000\u0000Findings\u0000The Macro-F1 values of the optimal models in the two tasks are 0.880 1 and 0.789 9, respectively, indicating that the proposed CLARE framework for CLARE can significantly improve the effect of low resource language entity relation extraction. The experimental results suggest that the proposed framework can effectively transfer the corpus as well as the annotated tags from English to Chinese and Arabic. This study reveals that the proposed approach is less human labour intensive and more effective in the cross-lingual entity relation extraction than the manual method. It shows that this approach has high generalizability among different languages.\u0000\u0000\u0000Originality/value\u0000The research results are of great significance for improving the performance of the cross-lingual knowledge acquisition. The cross-lingual transfer may greatly reduce the time and cost of the manual construction of the multi-lingual corpus. It sheds light on the knowledge acquisition and organization from the unstructured text in the era of big data.\u0000","PeriodicalId":330882,"journal":{"name":"Electron. Libr.","volume":"47 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-08-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120967494","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Data set entity recognition based on distant supervision 基于远程监督的数据集实体识别
Pub Date : 2021-07-26 DOI: 10.1108/el-10-2020-0301
Pengcheng Li, Qikai Liu, Qikai Cheng, Wei Lu
PurposeThis paper aims to identify data set entities in scientific literature. To address poor recognition caused by a lack of training corpora in existing studies, a distant supervised learning-based approach is proposed to identify data set entities automatically from large-scale scientific literature in an open domain.Design/methodology/approachFirstly, the authors use a dictionary combined with a bootstrapping strategy to create a labelled corpus to apply supervised learning. Secondly, a bidirectional encoder representation from transformers (BERT)-based neural model was applied to identify data set entities in the scientific literature automatically. Finally, two data augmentation techniques, entity replacement and entity masking, were introduced to enhance the model generalisability and improve the recognition of data set entities.FindingsIn the absence of training data, the proposed method can effectively identify data set entities in large-scale scientific papers. The BERT-based vectorised representation and data augmentation techniques enable significant improvements in the generality and robustness of named entity recognition models, especially in long-tailed data set entity recognition.Originality/valueThis paper provides a practical research method for automatically recognising data set entities in scientific literature. To the best of the authors’ knowledge, this is the first attempt to apply distant learning to the study of data set entity recognition. The authors introduce a robust vectorised representation and two data augmentation strategies (entity replacement and entity masking) to address the problem inherent in distant supervised learning methods, which the existing research has mostly ignored. The experimental results demonstrate that our approach effectively improves the recognition of data set entities, especially long-tailed data set entities.
本文旨在识别科学文献中的数据集实体。为了解决现有研究中由于缺乏训练语料库而导致的识别能力差的问题,提出了一种基于远程监督学习的方法,从开放领域的大规模科学文献中自动识别数据集实体。设计/方法/方法首先,作者使用字典结合自举策略来创建一个标记语料库来应用监督学习。其次,采用基于BERT的双向编码器表示神经模型自动识别科学文献中的数据集实体;最后,引入实体替换和实体屏蔽两种数据增强技术,增强模型的通用性,提高数据集实体的识别能力。在缺乏训练数据的情况下,本文提出的方法可以有效地识别大规模科学论文中的数据集实体。基于bert的矢量表示和数据增强技术使命名实体识别模型的通用性和鲁棒性得到了显著提高,特别是在长尾数据集实体识别方面。原创性/价值为科学文献中数据集实体的自动识别提供了一种实用的研究方法。据作者所知,这是第一次尝试将远程学习应用于数据集实体识别的研究。作者引入了一种鲁棒的矢量化表示和两种数据增强策略(实体替换和实体屏蔽)来解决远程监督学习方法固有的问题,而现有的研究大多忽略了这一问题。实验结果表明,该方法有效地提高了对数据集实体,特别是长尾数据集实体的识别能力。
{"title":"Data set entity recognition based on distant supervision","authors":"Pengcheng Li, Qikai Liu, Qikai Cheng, Wei Lu","doi":"10.1108/el-10-2020-0301","DOIUrl":"https://doi.org/10.1108/el-10-2020-0301","url":null,"abstract":"\u0000Purpose\u0000This paper aims to identify data set entities in scientific literature. To address poor recognition caused by a lack of training corpora in existing studies, a distant supervised learning-based approach is proposed to identify data set entities automatically from large-scale scientific literature in an open domain.\u0000\u0000\u0000Design/methodology/approach\u0000Firstly, the authors use a dictionary combined with a bootstrapping strategy to create a labelled corpus to apply supervised learning. Secondly, a bidirectional encoder representation from transformers (BERT)-based neural model was applied to identify data set entities in the scientific literature automatically. Finally, two data augmentation techniques, entity replacement and entity masking, were introduced to enhance the model generalisability and improve the recognition of data set entities.\u0000\u0000\u0000Findings\u0000In the absence of training data, the proposed method can effectively identify data set entities in large-scale scientific papers. The BERT-based vectorised representation and data augmentation techniques enable significant improvements in the generality and robustness of named entity recognition models, especially in long-tailed data set entity recognition.\u0000\u0000\u0000Originality/value\u0000This paper provides a practical research method for automatically recognising data set entities in scientific literature. To the best of the authors’ knowledge, this is the first attempt to apply distant learning to the study of data set entity recognition. The authors introduce a robust vectorised representation and two data augmentation strategies (entity replacement and entity masking) to address the problem inherent in distant supervised learning methods, which the existing research has mostly ignored. The experimental results demonstrate that our approach effectively improves the recognition of data set entities, especially long-tailed data set entities.\u0000","PeriodicalId":330882,"journal":{"name":"Electron. Libr.","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126296901","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Personalized news recommendation based on an improved conditional restricted Boltzmann machine 基于改进条件受限玻尔兹曼机的个性化新闻推荐
Pub Date : 2021-07-22 DOI: 10.1108/EL-06-2020-0165
Linxia Zhong, Wei Wei, Shixuan Li
PurposeBecause of the extensive user coverage of news sites and apps, greater social and commercial value can be realized if users can access their favourite news as easily as possible. However, news has a timeliness factor; there are serious cold start and data sparsity in news recommendation, and news users are more susceptible to recent topical news. Therefore, this study aims to propose a personalized news recommendation approach based on topic model and restricted Boltzmann machine (RBM).Design/methodology/approachFirstly, the model extracts the news topic information based on the LDA2vec topic model. Then, the implicit behaviour data are analysed and converted into explicit rating data according to the rules. The highest weight is assigned to recent hot news stories. Finally, the topic information and the rating data are regarded as the conditional layer and visual layer of the conditional RBM (CRBM) model, respectively, to implement news recommendations.FindingsThe experimental results show that using LDA2vec-based news topic as a conditional layer in the CRBM model provides a higher prediction rating and improves the effectiveness of news recommendations.Originality/valueThis study proposes a personalized news recommendation approach based on an improved CRBM. Topic model is applied to news topic extraction and used as the conditional layer of the CRBM. It not only alleviates the sparseness of rating data to improve the efficient in CRBM but also considers that readers are more susceptible to popular or trending news.
由于新闻网站和应用程序的广泛用户覆盖,如果用户能够尽可能容易地访问他们喜欢的新闻,就可以实现更大的社会和商业价值。然而,新闻有一个时效性因素;新闻推荐存在严重的冷启动和数据稀疏现象,新闻用户更容易受到近期热点新闻的影响。因此,本研究旨在提出一种基于主题模型和受限玻尔兹曼机(restricted Boltzmann machine, RBM)的个性化新闻推荐方法。设计/方法/方法首先,基于LDA2vec主题模型提取新闻主题信息。然后,对隐式行为数据进行分析,并根据规则将其转化为显式评分数据。权重最高的是最近的热点新闻。最后,将话题信息和评分数据分别作为条件RBM (CRBM)模型的条件层和视觉层,实现新闻推荐。实验结果表明,在CRBM模型中使用基于lda2vec的新闻主题作为条件层,可以提供更高的预测评级,提高新闻推荐的有效性。独创性/价值本研究提出了一种基于改进的CRBM的个性化新闻推荐方法。将主题模型应用于新闻主题提取,并作为CRBM的条件层。它不仅缓解了评级数据的稀疏性,提高了CRBM的效率,而且考虑到读者更容易受到流行或趋势新闻的影响。
{"title":"Personalized news recommendation based on an improved conditional restricted Boltzmann machine","authors":"Linxia Zhong, Wei Wei, Shixuan Li","doi":"10.1108/EL-06-2020-0165","DOIUrl":"https://doi.org/10.1108/EL-06-2020-0165","url":null,"abstract":"\u0000Purpose\u0000Because of the extensive user coverage of news sites and apps, greater social and commercial value can be realized if users can access their favourite news as easily as possible. However, news has a timeliness factor; there are serious cold start and data sparsity in news recommendation, and news users are more susceptible to recent topical news. Therefore, this study aims to propose a personalized news recommendation approach based on topic model and restricted Boltzmann machine (RBM).\u0000\u0000\u0000Design/methodology/approach\u0000Firstly, the model extracts the news topic information based on the LDA2vec topic model. Then, the implicit behaviour data are analysed and converted into explicit rating data according to the rules. The highest weight is assigned to recent hot news stories. Finally, the topic information and the rating data are regarded as the conditional layer and visual layer of the conditional RBM (CRBM) model, respectively, to implement news recommendations.\u0000\u0000\u0000Findings\u0000The experimental results show that using LDA2vec-based news topic as a conditional layer in the CRBM model provides a higher prediction rating and improves the effectiveness of news recommendations.\u0000\u0000\u0000Originality/value\u0000This study proposes a personalized news recommendation approach based on an improved CRBM. Topic model is applied to news topic extraction and used as the conditional layer of the CRBM. It not only alleviates the sparseness of rating data to improve the efficient in CRBM but also considers that readers are more susceptible to popular or trending news.\u0000","PeriodicalId":330882,"journal":{"name":"Electron. Libr.","volume":"32 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124978787","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
Electron. Libr.
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1