Dima Alnahas, Abdullah Ateş, A. Aydin, Baris Baykant Alagöz
{"title":"重温概率关系分析:使用概率关系图对短文中的词语进行关系相似性分析","authors":"Dima Alnahas, Abdullah Ateş, A. Aydin, Baris Baykant Alagöz","doi":"10.47000/tjmcs.1240729","DOIUrl":null,"url":null,"abstract":"Relation graphs provide useful tools for structural and relational analyses of highly complex multi-component systems. Probabilistic relation graph models can represent relations between system components by their probabilistic links. These graph types have been widely used for graphical representation of Markov models and bigram probabilities. This study presents an implication of relational similarities within probabilistic graph models of textual entries. The article discusses several utilization examples of two fundamental similarity measures in probabilistic analysis of short texts. To this end, construction of probabilistic graph models by using bigram probability matrices of textual entries is illustrated and vector spaces of input word-vectors and output word-vectors are formed. In this vector space, utilization of cosine similarity and mean squared error measures are demonstrated to evaluate probabilistic relational similarity between lexeme pairs in short texts. By using probabilistic relation graphs of the short texts, relational interchangeability analyses of lexeme pairs are conducted, and confidence index parameters are defined to express reliability of these analyses. Potential applications of these graphs in language processing and linguistics are discussed on the basis of the analysis results of example texts.","PeriodicalId":506513,"journal":{"name":"Turkish Journal of Mathematics and Computer Science","volume":"38 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Revisiting Probabilistic Relation Analysis: Using Probabilistic Relation Graphs for Relational Similarity Analysis of Words in Short Texts\",\"authors\":\"Dima Alnahas, Abdullah Ateş, A. Aydin, Baris Baykant Alagöz\",\"doi\":\"10.47000/tjmcs.1240729\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Relation graphs provide useful tools for structural and relational analyses of highly complex multi-component systems. Probabilistic relation graph models can represent relations between system components by their probabilistic links. These graph types have been widely used for graphical representation of Markov models and bigram probabilities. This study presents an implication of relational similarities within probabilistic graph models of textual entries. The article discusses several utilization examples of two fundamental similarity measures in probabilistic analysis of short texts. To this end, construction of probabilistic graph models by using bigram probability matrices of textual entries is illustrated and vector spaces of input word-vectors and output word-vectors are formed. In this vector space, utilization of cosine similarity and mean squared error measures are demonstrated to evaluate probabilistic relational similarity between lexeme pairs in short texts. By using probabilistic relation graphs of the short texts, relational interchangeability analyses of lexeme pairs are conducted, and confidence index parameters are defined to express reliability of these analyses. Potential applications of these graphs in language processing and linguistics are discussed on the basis of the analysis results of example texts.\",\"PeriodicalId\":506513,\"journal\":{\"name\":\"Turkish Journal of Mathematics and Computer Science\",\"volume\":\"38 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-08-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Turkish Journal of Mathematics and Computer Science\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.47000/tjmcs.1240729\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Turkish Journal of Mathematics and Computer Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.47000/tjmcs.1240729","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Revisiting Probabilistic Relation Analysis: Using Probabilistic Relation Graphs for Relational Similarity Analysis of Words in Short Texts
Relation graphs provide useful tools for structural and relational analyses of highly complex multi-component systems. Probabilistic relation graph models can represent relations between system components by their probabilistic links. These graph types have been widely used for graphical representation of Markov models and bigram probabilities. This study presents an implication of relational similarities within probabilistic graph models of textual entries. The article discusses several utilization examples of two fundamental similarity measures in probabilistic analysis of short texts. To this end, construction of probabilistic graph models by using bigram probability matrices of textual entries is illustrated and vector spaces of input word-vectors and output word-vectors are formed. In this vector space, utilization of cosine similarity and mean squared error measures are demonstrated to evaluate probabilistic relational similarity between lexeme pairs in short texts. By using probabilistic relation graphs of the short texts, relational interchangeability analyses of lexeme pairs are conducted, and confidence index parameters are defined to express reliability of these analyses. Potential applications of these graphs in language processing and linguistics are discussed on the basis of the analysis results of example texts.