Coconut Libtool: Bridging Textual Analysis Gaps for Non-Programmers

Faizhal Arif Santosa, Manika Lamba, Crissandra George, J. Stephen Downie
{"title":"Coconut Libtool: Bridging Textual Analysis Gaps for Non-Programmers","authors":"Faizhal Arif Santosa, Manika Lamba, Crissandra George, J. Stephen Downie","doi":"arxiv-2406.05949","DOIUrl":null,"url":null,"abstract":"In the era of big and ubiquitous data, professionals and students alike are\nfinding themselves needing to perform a number of textual analysis tasks.\nHistorically, the general lack of statistical expertise and programming skills\nhas stopped many with humanities or social sciences backgrounds from performing\nand fully benefiting from such analyses. Thus, we introduce Coconut Libtool\n(www.coconut-libtool.com/), an open-source, web-based application that utilizes\nstate-of-the-art natural language processing (NLP) technologies. Coconut\nLibtool analyzes text data from customized files and bibliographic databases\nsuch as Web of Science, Scopus, and Lens. Users can verify which functions can\nbe performed with the data they have. Coconut Libtool deploys multiple\nalgorithmic NLP techniques at the backend, including topic modeling (LDA,\nBiterm, and BERTopic algorithms), network graph visualization, keyword\nlemmatization, and sunburst visualization. Coconut Libtool is the people-first\nweb application designed to be used by professionals, researchers, and students\nin the information sciences, digital humanities, and computational social\nsciences domains to promote transparency, reproducibility, accessibility,\nreciprocity, and responsibility in research practices.","PeriodicalId":501285,"journal":{"name":"arXiv - CS - Digital Libraries","volume":"23 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Digital Libraries","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2406.05949","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

In the era of big and ubiquitous data, professionals and students alike are finding themselves needing to perform a number of textual analysis tasks. Historically, the general lack of statistical expertise and programming skills has stopped many with humanities or social sciences backgrounds from performing and fully benefiting from such analyses. Thus, we introduce Coconut Libtool (www.coconut-libtool.com/), an open-source, web-based application that utilizes state-of-the-art natural language processing (NLP) technologies. Coconut Libtool analyzes text data from customized files and bibliographic databases such as Web of Science, Scopus, and Lens. Users can verify which functions can be performed with the data they have. Coconut Libtool deploys multiple algorithmic NLP techniques at the backend, including topic modeling (LDA, Biterm, and BERTopic algorithms), network graph visualization, keyword lemmatization, and sunburst visualization. Coconut Libtool is the people-first web application designed to be used by professionals, researchers, and students in the information sciences, digital humanities, and computational social sciences domains to promote transparency, reproducibility, accessibility, reciprocity, and responsibility in research practices.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
椰子 Libtool:为非程序员缩小文本分析差距
在无处不在的大数据时代,专业人士和学生都发现自己需要执行大量文本分析任务。从历史上看,由于普遍缺乏统计专业知识和编程技能,许多具有人文或社会科学背景的人无法执行此类分析并从中充分受益。因此,我们引入了 Coconut Libtool(www.coconut-libtool.com/),它是一个开源的、基于网络的应用程序,采用了最先进的自然语言处理(NLP)技术。CoconutLibtool 可以分析来自定制文件和书目数据库(如 Web of Science、Scopus 和 Lens)的文本数据。用户可以验证他们所拥有的数据可以执行哪些功能。Coconut Libtool 在后端部署了多种算法的 NLP 技术,包括主题建模(LDA、Biterm 和 BERTopic 算法)、网络图可视化、关键词格式化和旭日可视化。Coconut Libtool 是一款以人为本的网络应用程序,旨在供信息科学、数字人文和计算社会科学领域的专业人士、研究人员和学生使用,以提高研究实践的透明度、可复制性、可访问性、互惠性和责任感。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Publishing Instincts: An Exploration-Exploitation Framework for Studying Academic Publishing Behavior and "Home Venues" Research Citations Building Trust in Wikipedia Evaluating the Linguistic Coverage of OpenAlex: An Assessment of Metadata Accuracy and Completeness Towards understanding evolution of science through language model series Ensuring Adherence to Standards in Experiment-Related Metadata Entered Via Spreadsheets
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1