Feature Word Extraction Method for Book Difficulty
Authors: Masaaki Suzuki, F. Saitoh
DOI: 10.1109/SCISISIS55246.2022.10002116
Published: 2022-11-29
With the rapid growth of the e-commerce market in recent years, choosing among products has become increasingly difficult. Various studies have sought to support product selection, but few have focused on difficulty level. For books, however, products that meet user needs cannot be offered unless the difficulty of the content is taken into account. This study therefore uses product reviews to identify the words that characterize a book's difficulty level. To extract them, distributed representations of words are obtained with Word2Vec, compressed to two dimensions with t-SNE, and clustered with DBSCAN, so that the clustering is stable while still reflecting the meanings of words. The experimental results show that the method can extract not only feature words directly related to difficulty level but also words that indirectly affect how difficulty is evaluated.
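The clustering step named in the abstract can be illustrated with a minimal, dependency-free sketch of DBSCAN on two-dimensional points. It is not the authors' implementation: the words, their coordinates (stand-ins for t-SNE output of Word2Vec vectors), and the `eps` and `min_samples` values are all hypothetical, chosen only to show how dense groups of words become clusters while isolated words are marked as noise.

```python
from math import dist

NOISE = -1


def region_query(points, i, eps):
    """Return indices of all points within eps of points[i] (including i)."""
    return [j for j, q in enumerate(points) if dist(points[i], q) <= eps]


def dbscan(points, eps, min_samples):
    """Label each point with a cluster id (0, 1, ...) or NOISE (-1)."""
    labels = [None] * len(points)  # None means "not visited yet"
    cluster = -1
    for i in range(len(points)):
        if labels[i] is not None:
            continue
        neighbors = region_query(points, i, eps)
        if len(neighbors) < min_samples:
            labels[i] = NOISE  # may later be claimed as a border point
            continue
        cluster += 1
        labels[i] = cluster
        queue = [j for j in neighbors if j != i]
        while queue:
            j = queue.pop()
            if labels[j] == NOISE:
                labels[j] = cluster  # border point: reachable but not core
            if labels[j] is not None:
                continue
            labels[j] = cluster
            j_neighbors = region_query(points, j, eps)
            if len(j_neighbors) >= min_samples:
                queue.extend(j_neighbors)  # j is a core point: keep expanding
    return labels


# Hypothetical 2-D coordinates standing in for t-SNE output of Word2Vec vectors.
embedded = {
    "easy": (0.0, 0.0),
    "simple": (0.3, 0.1),
    "beginner": (0.1, 0.4),
    "advanced": (5.0, 5.0),
    "difficult": (5.2, 4.9),
    "expert": (4.8, 5.3),
    "shipping": (10.0, 0.0),
}
words = list(embedded)
labels = dbscan([embedded[w] for w in words], eps=1.0, min_samples=3)

# Group words by cluster label; label -1 collects the noise words.
clusters = {}
for word, label in zip(words, labels):
    clusters.setdefault(label, []).append(word)
print(clusters)
```

In practice the same step would more likely be run with an established library implementation on real embeddings; the hand-written version is shown only to make the density-based grouping explicit.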