Davi Alves Oliveira , Valter de Senna , Hernane Borges de Barros Pereira
{"title":"CohesionNet: Software for network-based textual cohesion analysis","authors":"Davi Alves Oliveira , Valter de Senna , Hernane Borges de Barros Pereira","doi":"10.1016/j.simpa.2024.100712","DOIUrl":null,"url":null,"abstract":"<div><div>Cohesion is one of the main defining characteristics of a text. CohesionNet, an R app with a Shiny interface, processes raw text to calculate network-based cohesion indices. The indices are based on stem repetition and on the analysis of synonymy and hypernymy. The app also constructs a network representation of the text that can be saved in the Pajek NET format. CohesionNet facilitates the assessment of potential applications of the indices, like text classification, automatic summarization, and readability improvement. Currently supporting English texts only, upcoming versions will include additional language support.</div></div>","PeriodicalId":29771,"journal":{"name":"Software Impacts","volume":"22 ","pages":"Article 100712"},"PeriodicalIF":1.3000,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Software Impacts","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2665963824001003","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}
引用次数: 0
Abstract
Cohesion is one of the main defining characteristics of a text. CohesionNet, an R app with a Shiny interface, processes raw text to calculate network-based cohesion indices. The indices are based on stem repetition and on the analysis of synonymy and hypernymy. The app also constructs a network representation of the text that can be saved in the Pajek NET format. CohesionNet facilitates the assessment of potential applications of the indices, like text classification, automatic summarization, and readability improvement. Currently supporting English texts only, upcoming versions will include additional language support.
内聚力是文本的主要定义特征之一。CohesionNet 是一款带有 Shiny 界面的 R 应用程序,可处理原始文本,计算基于网络的内聚力指数。这些指数基于词干重复以及同义词和超同义词分析。该应用程序还能构建文本的网络表示,并以 Pajek NET 格式保存。CohesionNet 可帮助评估指数的潜在应用,如文本分类、自动摘要和可读性改进。目前仅支持英文文本,即将推出的版本将包括更多语言支持。