{"title":"Application of Text Mining Techniques on Scholarly Research Articles: Methods and Tools","authors":"Khusbu Thakur, Vinit Kumar","doi":"10.1080/13614533.2021.1918190","DOIUrl":null,"url":null,"abstract":"Abstract A vast amount of published scholarly literature is generated every day. Today, it is one of the biggest challenges for organisations to extract knowledge embedded in published scholarly literature for business and research applications. Application of text mining is gaining popularity among researchers and applications are growing exponentially in different research areas. This study investigates the variety of text mining tools, techniques, sample sizes, domains and sections of the documents preferred by the text mining researchers through a systematic and structured literature review of conceptual and empirical studies. The significant findings depict that LDA and R package is the most extensively used tool and technique among the authors, most of the researchers prefer the sample size of 1000 articles for analysis, literature belonging to the domain of ICT, and related disciplines are frequently analysed in the text mining studies and abstracts constitute the corpus of the majority of text mining studies.","PeriodicalId":38971,"journal":{"name":"New Review of Academic Librarianship","volume":"28 1","pages":"279 - 302"},"PeriodicalIF":1.9000,"publicationDate":"2021-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/13614533.2021.1918190","citationCount":"12","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"New Review of Academic Librarianship","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1080/13614533.2021.1918190","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"INFORMATION SCIENCE & LIBRARY SCIENCE","Score":null,"Total":0}
引用次数: 12
Abstract
Abstract A vast amount of published scholarly literature is generated every day. Today, it is one of the biggest challenges for organisations to extract knowledge embedded in published scholarly literature for business and research applications. Application of text mining is gaining popularity among researchers and applications are growing exponentially in different research areas. This study investigates the variety of text mining tools, techniques, sample sizes, domains and sections of the documents preferred by the text mining researchers through a systematic and structured literature review of conceptual and empirical studies. The significant findings depict that LDA and R package is the most extensively used tool and technique among the authors, most of the researchers prefer the sample size of 1000 articles for analysis, literature belonging to the domain of ICT, and related disciplines are frequently analysed in the text mining studies and abstracts constitute the corpus of the majority of text mining studies.