价值社会学:利用自然语言分析技术建立分类学的经验

M. Kashina, S. Tkach
{"title":"价值社会学:利用自然语言分析技术建立分类学的经验","authors":"M. Kashina, S. Tkach","doi":"10.26425/2658-347x-2023-6-1-48-58","DOIUrl":null,"url":null,"abstract":"Modern research in the field of sociology of science is becoming more complicated due to the constantly growing publication activity of authors. To track trends in sectoral sociology, scientists turn to scientometric methods, but they are not enough. Trends in the development of the sociology of values as a branch of sociology are the subject of the study. The purpose of the work is an assessment of the possibilities of using natural language analysis methods (NLP/NLA) for thematic and theoretical clustering of research in the sociology of values. The design of the study was quantitative and qualitative, it was carried out in two stages. At the first stage, 121 abstracts of a scientific articles were analyzed using text mining, after which their total array was divided into clusters. At the second stage, the results of machine clustering were examined by the method of qualitative text analysis, on the basis of which the limitations and capabilities of the NLP/NLA method were identified for solving the problem of clustering scientific texts. It was found that articles with a more conservative core of theoretical categories (gender studies, migration studies, the theory of globalism) are more amenable to clustering, while theories with a loosely structured and fluid theoretical core (theories using environmental terminology, theories of inequality) are much less amenable to explicit clustering. The results obtained allow us to form a new direction of work with large arrays of scientific texts, associated with their clustering using NLP/NLA. Building clusters enables researchers to work with all texts in a given subject area, and not just with the most cited ones. This, in turn, provides the visibility of all scientific ideas, including those that have not gained popularity/notability.","PeriodicalId":52710,"journal":{"name":"Tsifrovaia sotsiologiia","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2023-04-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Sociology of values: experience of building a taxonomy by using natural language analysis technology\",\"authors\":\"M. Kashina, S. Tkach\",\"doi\":\"10.26425/2658-347x-2023-6-1-48-58\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Modern research in the field of sociology of science is becoming more complicated due to the constantly growing publication activity of authors. To track trends in sectoral sociology, scientists turn to scientometric methods, but they are not enough. Trends in the development of the sociology of values as a branch of sociology are the subject of the study. The purpose of the work is an assessment of the possibilities of using natural language analysis methods (NLP/NLA) for thematic and theoretical clustering of research in the sociology of values. The design of the study was quantitative and qualitative, it was carried out in two stages. At the first stage, 121 abstracts of a scientific articles were analyzed using text mining, after which their total array was divided into clusters. At the second stage, the results of machine clustering were examined by the method of qualitative text analysis, on the basis of which the limitations and capabilities of the NLP/NLA method were identified for solving the problem of clustering scientific texts. It was found that articles with a more conservative core of theoretical categories (gender studies, migration studies, the theory of globalism) are more amenable to clustering, while theories with a loosely structured and fluid theoretical core (theories using environmental terminology, theories of inequality) are much less amenable to explicit clustering. The results obtained allow us to form a new direction of work with large arrays of scientific texts, associated with their clustering using NLP/NLA. Building clusters enables researchers to work with all texts in a given subject area, and not just with the most cited ones. This, in turn, provides the visibility of all scientific ideas, including those that have not gained popularity/notability.\",\"PeriodicalId\":52710,\"journal\":{\"name\":\"Tsifrovaia sotsiologiia\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-04-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Tsifrovaia sotsiologiia\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.26425/2658-347x-2023-6-1-48-58\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Tsifrovaia sotsiologiia","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.26425/2658-347x-2023-6-1-48-58","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

由于作者的出版活动不断增加,现代科学社会学领域的研究变得越来越复杂。为了跟踪部门社会学的趋势,科学家们求助于科学计量学方法,但这还不够。价值社会学作为社会学的一个分支,其发展趋势是本文研究的主题。这项工作的目的是评估使用自然语言分析方法(NLP/NLA)对价值社会学研究的主题和理论聚类的可能性。本研究设计采用定量和定性相结合的方法,分两个阶段进行。第一阶段,采用文本挖掘方法对121篇科学论文的摘要进行分析,然后对其总数组进行聚类。第二阶段,采用定性文本分析的方法对机器聚类结果进行检验,在此基础上确定了NLP/NLA方法解决科学文本聚类问题的局限性和能力。研究发现,具有较为保守的理论范畴核心的文章(性别研究、移民研究、全球主义理论)更适合聚类,而具有松散结构和流动理论核心的理论(使用环境术语的理论、不平等理论)则不太适合明确的聚类。获得的结果使我们能够形成一个新的工作方向,与大量科学文本阵列相关联,并使用NLP/NLA进行聚类。构建集群使研究人员能够处理给定主题领域的所有文本,而不仅仅是被引用最多的文本。这反过来又为所有科学思想提供了可见性,包括那些尚未获得普及/知名度的科学思想。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Sociology of values: experience of building a taxonomy by using natural language analysis technology
Modern research in the field of sociology of science is becoming more complicated due to the constantly growing publication activity of authors. To track trends in sectoral sociology, scientists turn to scientometric methods, but they are not enough. Trends in the development of the sociology of values as a branch of sociology are the subject of the study. The purpose of the work is an assessment of the possibilities of using natural language analysis methods (NLP/NLA) for thematic and theoretical clustering of research in the sociology of values. The design of the study was quantitative and qualitative, it was carried out in two stages. At the first stage, 121 abstracts of a scientific articles were analyzed using text mining, after which their total array was divided into clusters. At the second stage, the results of machine clustering were examined by the method of qualitative text analysis, on the basis of which the limitations and capabilities of the NLP/NLA method were identified for solving the problem of clustering scientific texts. It was found that articles with a more conservative core of theoretical categories (gender studies, migration studies, the theory of globalism) are more amenable to clustering, while theories with a loosely structured and fluid theoretical core (theories using environmental terminology, theories of inequality) are much less amenable to explicit clustering. The results obtained allow us to form a new direction of work with large arrays of scientific texts, associated with their clustering using NLP/NLA. Building clusters enables researchers to work with all texts in a given subject area, and not just with the most cited ones. This, in turn, provides the visibility of all scientific ideas, including those that have not gained popularity/notability.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
48
审稿时长
8 weeks
期刊最新文献
From quantitative measurement to understanding public demand: Exploring non-survey methods in applied regional research Social risks of internet activity of virtual communities Ethnonational discourse of migrants in social networks on the example of Tajik and Kyrgyz virtual communities on VK Problem of spreading fake information in the Internet space analysis Changing content plan structure of companies operating in Russia and represented on VK
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1