价值社会学：利用自然语言分析技术建立分类学的经验

Tsifrovaia sotsiologiia Pub Date : 2023-04-27 DOI:10.26425/2658-347x-2023-6-1-48-58

M. Kashina, S. Tkach

{"title":"价值社会学：利用自然语言分析技术建立分类学的经验","authors":"M. Kashina, S. Tkach","doi":"10.26425/2658-347x-2023-6-1-48-58","DOIUrl":null,"url":null,"abstract":"Modern research in the field of sociology of science is becoming more complicated due to the constantly growing publication activity of authors. To track trends in sectoral sociology, scientists turn to scientometric methods, but they are not enough. Trends in the development of the sociology of values as a branch of sociology are the subject of the study. The purpose of the work is an assessment of the possibilities of using natural language analysis methods (NLP/NLA) for thematic and theoretical clustering of research in the sociology of values. The design of the study was quantitative and qualitative, it was carried out in two stages. At the first stage, 121 abstracts of a scientific articles were analyzed using text mining, after which their total array was divided into clusters. At the second stage, the results of machine clustering were examined by the method of qualitative text analysis, on the basis of which the limitations and capabilities of the NLP/NLA method were identified for solving the problem of clustering scientific texts. It was found that articles with a more conservative core of theoretical categories (gender studies, migration studies, the theory of globalism) are more amenable to clustering, while theories with a loosely structured and fluid theoretical core (theories using environmental terminology, theories of inequality) are much less amenable to explicit clustering. The results obtained allow us to form a new direction of work with large arrays of scientific texts, associated with their clustering using NLP/NLA. Building clusters enables researchers to work with all texts in a given subject area, and not just with the most cited ones. This, in turn, provides the visibility of all scientific ideas, including those that have not gained popularity/notability.","PeriodicalId":52710,"journal":{"name":"Tsifrovaia sotsiologiia","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-04-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Sociology of values: experience of building a taxonomy by using natural language analysis technology\",\"authors\":\"M. Kashina, S. Tkach\",\"doi\":\"10.26425/2658-347x-2023-6-1-48-58\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Modern research in the field of sociology of science is becoming more complicated due to the constantly growing publication activity of authors. To track trends in sectoral sociology, scientists turn to scientometric methods, but they are not enough. Trends in the development of the sociology of values as a branch of sociology are the subject of the study. The purpose of the work is an assessment of the possibilities of using natural language analysis methods (NLP/NLA) for thematic and theoretical clustering of research in the sociology of values. The design of the study was quantitative and qualitative, it was carried out in two stages. At the first stage, 121 abstracts of a scientific articles were analyzed using text mining, after which their total array was divided into clusters. At the second stage, the results of machine clustering were examined by the method of qualitative text analysis, on the basis of which the limitations and capabilities of the NLP/NLA method were identified for solving the problem of clustering scientific texts. It was found that articles with a more conservative core of theoretical categories (gender studies, migration studies, the theory of globalism) are more amenable to clustering, while theories with a loosely structured and fluid theoretical core (theories using environmental terminology, theories of inequality) are much less amenable to explicit clustering. The results obtained allow us to form a new direction of work with large arrays of scientific texts, associated with their clustering using NLP/NLA. Building clusters enables researchers to work with all texts in a given subject area, and not just with the most cited ones. This, in turn, provides the visibility of all scientific ideas, including those that have not gained popularity/notability.\",\"PeriodicalId\":52710,\"journal\":{\"name\":\"Tsifrovaia sotsiologiia\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-04-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Tsifrovaia sotsiologiia\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.26425/2658-347x-2023-6-1-48-58\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Tsifrovaia sotsiologiia","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.26425/2658-347x-2023-6-1-48-58","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

由于作者的出版活动不断增加，现代科学社会学领域的研究变得越来越复杂。为了跟踪部门社会学的趋势，科学家们求助于科学计量学方法，但这还不够。价值社会学作为社会学的一个分支，其发展趋势是本文研究的主题。这项工作的目的是评估使用自然语言分析方法(NLP/NLA)对价值社会学研究的主题和理论聚类的可能性。本研究设计采用定量和定性相结合的方法，分两个阶段进行。第一阶段，采用文本挖掘方法对121篇科学论文的摘要进行分析，然后对其总数组进行聚类。第二阶段，采用定性文本分析的方法对机器聚类结果进行检验，在此基础上确定了NLP/NLA方法解决科学文本聚类问题的局限性和能力。研究发现，具有较为保守的理论范畴核心的文章(性别研究、移民研究、全球主义理论)更适合聚类，而具有松散结构和流动理论核心的理论(使用环境术语的理论、不平等理论)则不太适合明确的聚类。获得的结果使我们能够形成一个新的工作方向，与大量科学文本阵列相关联，并使用NLP/NLA进行聚类。构建集群使研究人员能够处理给定主题领域的所有文本，而不仅仅是被引用最多的文本。这反过来又为所有科学思想提供了可见性，包括那些尚未获得普及/知名度的科学思想。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Sociology of values: experience of building a taxonomy by using natural language analysis technology

Modern research in the field of sociology of science is becoming more complicated due to the constantly growing publication activity of authors. To track trends in sectoral sociology, scientists turn to scientometric methods, but they are not enough. Trends in the development of the sociology of values as a branch of sociology are the subject of the study. The purpose of the work is an assessment of the possibilities of using natural language analysis methods (NLP/NLA) for thematic and theoretical clustering of research in the sociology of values. The design of the study was quantitative and qualitative, it was carried out in two stages. At the first stage, 121 abstracts of a scientific articles were analyzed using text mining, after which their total array was divided into clusters. At the second stage, the results of machine clustering were examined by the method of qualitative text analysis, on the basis of which the limitations and capabilities of the NLP/NLA method were identified for solving the problem of clustering scientific texts. It was found that articles with a more conservative core of theoretical categories (gender studies, migration studies, the theory of globalism) are more amenable to clustering, while theories with a loosely structured and fluid theoretical core (theories using environmental terminology, theories of inequality) are much less amenable to explicit clustering. The results obtained allow us to form a new direction of work with large arrays of scientific texts, associated with their clustering using NLP/NLA. Building clusters enables researchers to work with all texts in a given subject area, and not just with the most cited ones. This, in turn, provides the visibility of all scientific ideas, including those that have not gained popularity/notability.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Tsifrovaia sotsiologiia

自引率

0.00%

发文量

审稿时长

8 weeks