Pub Date: 2024-08-20 DOI: 10.1007/s00500-024-09666-3
Zhiguo Li, Rui Dong, Qianqian Cao, Hongwu Zhang
Complementors who provide content on platforms are increasingly threatened by the entry of platform owners. A platform owner may enter the content market by offering vertically differentiated content, either producing it in-house or hiring the complementor to produce it. We build a game-theoretic model to analyze the platform owner’s entry decision and the complementor’s response strategy, considering the effects of demand complementarity, vertical content differentiation, and consumer heterogeneity on both players’ strategies. We find that vertical content differentiation relaxes the boundary conditions of entry, and that this effect is more pronounced when the platform owner has an advantage in content value. However, we show that although the complementor may hold advantages in content value, price, or sales volume, it faces a dependent dilemma once entry happens. Further, we demonstrate that second-party cooperation may mitigate the dependent dilemma and create a “win–win” situation by leveraging the platform owner’s efficiency in marketing and the complementor’s efficiency in content production.
Title: Strategy for complementor under platform owner’s entry with vertically differentiated content. Journal: Soft Computing. URL: https://doi.org/10.1007/s00500-024-09666-3
In the social media environment, the diffusion of advertising (ad) information can effectively increase the effect of ad promotion and support product marketing. Therefore, it is essential to study the mechanism of ad information diffusion in social media. This study aims to help cope with the complexity and uncertainty of social media ad information diffusion systems by identifying the causal relationships behind ad information diffusion behavior, clarifying the feedback mechanisms among system factors, and then testing the validity of the system model through simulation. Specifically, this study combines system dynamics and uncertainty theory to construct a dynamic model of the ad information diffusion system. In particular, the uncertain effect of environmental noise on the ad information diffusion system is considered and modeled as a Liu process. The study can thus explore the diffusion mechanism of ad information more precisely, so as to better serve the ad promotion industry.
Title: A dynamic model of social media ad information diffusion in uncertain environment. Authors: Meiling Jin, Yufu Ning, Fengming Liu, Zhen Li, Haoran Zheng, Yichang Gao, Jian Zhou. Journal: Soft Computing. Pub Date: 2024-08-20. URL: https://doi.org/10.1007/s00500-024-09665-4
Pub Date: 2024-08-20 DOI: 10.1007/s00500-024-09656-5
Shuang Zhang, Xin Gao
Traditional regression analysis assumes that the observed data are accurate, but the data we can obtain in real life are often imprecise. For this reason, uncertain regression analysis, based on uncertain variables within the framework of uncertainty theory, has been developed. With imprecise observations, the data often contain outliers due to human input errors or incorrect measurements. Outliers can affect parameter estimation, producing misleading results and an inaccurate model fit. The most commonly used estimation method is least squares, but it is extremely sensitive to outliers. To solve this problem, this paper proposes an uncertain regression model based on ridge estimation, which adds a squared penalty term to the least squares estimation of the unknown parameters. The advantage of ridge estimation is that it tolerates pathological data much better than other parameter estimation methods, which can reduce the influence of outliers. In this paper, the optimal shrinkage parameter is determined by K-fold cross-validation and used to estimate the parameters of the regression model; we then conduct residual analysis and a hypothesis test on the fitted model to obtain the predicted value and the prediction confidence interval. Finally, the validity of the model is demonstrated by two numerical examples.
Title: Ridge estimation for uncertain regression model with imprecise observations. Journal: Soft Computing. URL: https://doi.org/10.1007/s00500-024-09656-5
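The combination of a squared penalty and K-fold cross-validation described in the abstract can be illustrated with a minimal sketch. It is an assumption-laden simplification: it uses ordinary (crisp) numbers rather than uncertain variables, a single predictor, an unpenalized intercept, and deterministic folds. The function names `ridge_fit`, `kfold_cv_error`, and `select_lambda` are hypothetical and do not come from the paper.

```python
def ridge_fit(xs, ys, lam):
    """Ridge estimate for y = b0 + b1*x with an unpenalized intercept.

    Minimizes sum((y - b0 - b1*x)**2) + lam * b1**2 (crisp data, one predictor).
    """
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    sxx = sum((x - x_bar) ** 2 for x in xs)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    b1 = sxy / (sxx + lam)  # the penalty lam shrinks the slope toward zero
    b0 = y_bar - b1 * x_bar
    return b0, b1


def kfold_cv_error(xs, ys, lam, k=5):
    """Mean squared prediction error averaged over k folds (fold = index mod k)."""
    fold_errors = []
    for fold in range(k):
        train = [i for i in range(len(xs)) if i % k != fold]
        test = [i for i in range(len(xs)) if i % k == fold]
        b0, b1 = ridge_fit([xs[i] for i in train], [ys[i] for i in train], lam)
        squared = [(ys[i] - (b0 + b1 * xs[i])) ** 2 for i in test]
        fold_errors.append(sum(squared) / len(test))
    return sum(fold_errors) / k


def select_lambda(xs, ys, candidates, k=5):
    """Return the candidate shrinkage parameter with the smallest CV error."""
    return min(candidates, key=lambda lam: kfold_cv_error(xs, ys, lam, k))
```

On data with outliers, a positive shrinkage parameter may win the cross-validation; on perfectly linear data the unpenalized fit (`lam = 0`) does.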
Pub Date: 2024-08-20 DOI: 10.1007/s00500-024-09662-7
Lifeng Wang, Tan Wang
Humanities and Social Sciences (HSS) play a key role in deepening human understanding and transforming the world, and provide essential value orientation for the advancement of the natural sciences. Evaluating the efficiency of HSS research in higher education institutions is crucial for optimizing the distribution of research resources and improving scientific research efficiency. Using statistical data on HSS in colleges and universities across 31 provinces (municipalities, regions) of China (excluding Hong Kong, Macao, and Taiwan) from 2017 to 2020, this study employs Data Envelopment Analysis (DEA) and Malmquist models to assess both static and dynamic scientific research efficiency. Additionally, the Theil index is used to examine regional differences. The findings reveal that: (1) HSS in colleges and universities exhibit relatively high static efficiency, with the Malmquist index indicating a general trend of improvement primarily driven by efficiency change (EC); (2) colleges and universities in China’s central region display higher static efficiency than those in the western region, yet they lack the internal driving forces for fostering research efficiency; (3) an analysis of static and dynamic efficiencies via the Theil index shows widening regional differences, with within-group differences being the main source of overall differences. These results imply that each region should adopt different countermeasures to improve scientific research efficiency.
Title: Research efficiency evaluation and regional differences analysis of humanities and social sciences of colleges and universities in China. Journal: Soft Computing. URL: https://doi.org/10.1007/s00500-024-09662-7
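The within-group versus between-group comparison mentioned in finding (3) rests on the additive decomposition of the Theil index. The sketch below shows the standard Theil T index and its decomposition; the group names and efficiency scores in the usage note are invented for illustration and are not data from the study, and the function names are hypothetical.

```python
import math


def theil_t(values):
    """Theil T index of a list of positive values."""
    mean = sum(values) / len(values)
    return sum((v / mean) * math.log(v / mean) for v in values) / len(values)


def theil_decomposition(groups):
    """Split the overall Theil T index into (within, between) components.

    `groups` maps a group label to a list of positive values. Each group is
    weighted by its share of the grand total, so within + between equals the
    Theil T index of the pooled values.
    """
    pooled = [v for vals in groups.values() for v in vals]
    total = sum(pooled)
    overall_mean = total / len(pooled)
    within = 0.0
    between = 0.0
    for vals in groups.values():
        share = sum(vals) / total
        group_mean = sum(vals) / len(vals)
        within += share * theil_t(vals)
        between += share * math.log(group_mean / overall_mean)
    return within, between
```

For example, `theil_decomposition({"east": [0.9, 0.8, 0.95], "central": [0.7, 0.75], "west": [0.5, 0.6, 0.4]})` returns two numbers whose sum equals `theil_t` of all eight scores; comparing the two shows which source dominates.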
Pub Date: 2024-08-20 DOI: 10.1007/s00500-024-09657-4
Zhuqing Liu, Yaodong Ni
In this paper, we study the optimal pricing strategy of a monopolist in a social network under a fuzzy environment. Consumers experience a nonnegative network effect that is influenced by their neighbors’ consumption levels, and the extent to which they are affected is modeled as a fuzzy variable. To derive the equilibrium solution, we establish a two-stage game model of the decision process in consumer social networks. Using backward induction, we first obtain the expected consumption equilibrium and then derive the matrix expression of the unique pricing equilibrium by maximizing the monopolist’s profit. In addition, we introduce Fuzzy Bonacich Centrality and identify the components of the price that the monopolist charges each consumer in a fuzzy network, which highlights the importance of the monopolist knowing the consumer network structure. Through numerical studies, we find that the network effect plays an essential role in determining pricing strategies in fuzzy social networks, but that fuzziness weakens this impact. For social networks in which fuzziness exists, the monopolist should choose a discriminatory pricing strategy to benefit most. The results of our model can provide valuable managerial insights to help the monopolist make pricing decisions.
Title: Optimal pricing in social networks under fuzzy environment. Journal: Soft Computing. URL: https://doi.org/10.1007/s00500-024-09657-4
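For readers unfamiliar with the centrality measure the abstract builds on, here is a minimal sketch of the classical (crisp) Bonacich centrality, b = (I - aG)^(-1) 1, computed by fixed-point iteration. It is not the Fuzzy Bonacich Centrality introduced in the paper; the function name, the decay parameter `a`, and the example networks are assumptions made for illustration.

```python
def bonacich_centrality(adjacency, a, tol=1e-12, max_iter=10_000):
    """Classical Bonacich centrality via the iteration b <- 1 + a * G b.

    `adjacency` is a square list of lists (G[i][j] = influence of j on i) and
    `a` is a decay parameter. The iteration converges when `a` is smaller than
    the reciprocal of the largest eigenvalue of G.
    """
    n = len(adjacency)
    b = [1.0] * n
    for _ in range(max_iter):
        new_b = [
            1.0 + a * sum(adjacency[i][j] * b[j] for j in range(n))
            for i in range(n)
        ]
        if max(abs(new_b[i] - b[i]) for i in range(n)) < tol:
            return new_b
        b = new_b
    raise ValueError("no convergence; a may be too large for this network")
```

In a three-node star network the hub receives a higher score than the leaves, which is the kind of structural information a price-discriminating seller would need to know.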
In 2012, the European Court of Justice introduced a ban on differentiating car insurance premiums by gender to avoid gender inequality. This paper presents a gender analysis of driving ability by investigating the relationship between gender and the relative total claim amount in Motor Third Party Liability insurance, also considering the effect of age. Leveraging a two-part model based on parametric quantile regression, we investigate both the average behaviour of drivers and their tail behaviour in order to highlight the importance of dispersion and the impact of the largest claims. The purpose of our contribution is therefore to study how gender and age can influence the entire probability distribution of the insurance claim, with a particular focus on the quantiles at high probability levels, which are very important indicators of the effective riskiness of a driver. We apply our model to an Australian insurance dataset; our results suggest that men are in general riskier in terms of both average and tail behaviour.
Title: The influence of gender and age in driving ability: an analysis of average and extreme behaviours. Authors: Fabio Baione, Davide Biancalana, Massimiliano Menzietti. Journal: Soft Computing. Pub Date: 2024-08-20. URL: https://doi.org/10.1007/s00500-024-09782-0
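The focus on high quantiles can be made concrete with the check (pinball) loss that underlies quantile regression in general. The sketch below is not the paper's two-part parametric model; it only shows, on invented claim amounts, that minimizing the pinball loss at level `tau` picks out the corresponding quantile, so a high `tau` reacts to the largest claims while the median does not. All names are hypothetical.

```python
def pinball_loss(residual, tau):
    """Check loss: tau * r for r >= 0 and (tau - 1) * r for r < 0."""
    return residual * (tau - (1.0 if residual < 0 else 0.0))


def total_loss(data, q, tau):
    """Sum of pinball losses when every observation is predicted by q."""
    return sum(pinball_loss(y - q, tau) for y in data)


def best_constant_quantile(data, tau):
    """Observation that minimizes the total pinball loss at level tau."""
    return min(sorted(data), key=lambda q: total_loss(data, q, tau))
```

With the invented claims `[1, 2, 3, 4, 100]`, the minimizer at `tau = 0.5` is the median, whereas at `tau = 0.9` it is the extreme claim, which is why tail quantiles carry information that averages hide.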
Pub Date: 2024-08-19 DOI: 10.1007/s00500-024-09906-6
Necla Kırcalı Gürsoy, Tahsin Öner, Arif Gürsoy, Alper Ülker
In this study, we introduce the Sheffer stroke L-algebra and prove some fundamental theorems, propositions and lemmas about Sheffer stroke L-algebras. The notions of filter and ultrafilter for Sheffer stroke L-algebras are studied. We define subalgebras and normal subsets of a Sheffer stroke L-algebra. Moreover, homomorphisms between Sheffer stroke L-algebras are introduced and isomorphism theorems are presented. Finally, we give three new algorithms for Sheffer stroke L-algebras. By presenting an algorithmic approach to this subject for the first time in the literature, this work contributes to researchers in different application areas.
Title: Sheffer stroke operation on L-algebras via an algorithmic approach. Journal: Soft Computing. URL: https://doi.org/10.1007/s00500-024-09906-6
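To give a flavour of what an algorithmic check on such a structure can look like, the sketch below brute-forces a finite operation table. It is not one of the three algorithms from the paper. It assumes four identities commonly used in the literature to define a Sheffer stroke, the derived implication x → y = x | (y | y), and the usual L-algebra axioms; whether the paper uses exactly these formulations is an assumption. The two-element NAND operation serves as the example.

```python
from itertools import product


def nand(x, y):
    """Sheffer stroke on the two-element Boolean algebra {0, 1}."""
    return 1 - (x & y)


def satisfies_sheffer_identities(elements, stroke):
    """Brute-force check of four commonly cited Sheffer stroke identities."""
    s = stroke
    for x, y, z in product(elements, repeat=3):
        join_xy = s(s(x, x), s(y, y))
        if s(x, y) != s(y, x):
            return False
        if s(s(x, x), s(x, y)) != x:
            return False
        if s(x, s(s(y, z), s(y, z))) != s(s(s(x, y), s(x, y)), z):
            return False
        if s(s(x, join_xy), s(x, join_xy)) != x:
            return False
    return True


def satisfies_l_algebra_axioms(elements, stroke, top):
    """Check the L-algebra axioms for the implication x -> y = x | (y | y)."""
    def imp(x, y):
        return stroke(x, stroke(y, y))

    for x, y, z in product(elements, repeat=3):
        if imp(top, x) != x or imp(x, top) != top or imp(x, x) != top:
            return False
        if imp(imp(x, y), imp(x, z)) != imp(imp(y, x), imp(y, z)):
            return False
        if imp(x, y) == top and imp(y, x) == top and x != y:
            return False
    return True
```

Both checks succeed for `nand` on `[0, 1]` with top element `1`, while replacing `nand` with plain conjunction makes the first check fail.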
Pub Date: 2024-08-19 DOI: 10.1007/s00500-024-09947-x
Hua Ke, Haoyang Li
The rapid growth of private car ownership has led to significant issues such as traffic congestion and environmental pollution. Ridesharing has emerged as a promising solution to alleviate the negative impacts associated with private car usage. This paper focuses on the stability of ridesharing systems and establishes a single-driver, multiple-rider ridesharing matching model. To solve this model, a filtering algorithm for the pre-matching set and a fast-solving algorithm for the stable matching scheme are proposed. Furthermore, we introduce the concept of a subsidy distance upper limit into the ridesharing system. Remarkably, our findings indicate that with a limit of 0.1 km, the distance savings generated by the subsidy amount to 560.5% of the total subsidy. To validate our approach, we simulate ridesharing demand data using real taxi data and design computational experiments to demonstrate the computational efficiency of the filtering algorithm and the fast-solving algorithm. The impact of various parameters on ridesharing systems is also explored.
Title: Multi-rider ridesharing stable matching optimization. Journal: Soft Computing. URL: https://doi.org/10.1007/s00500-024-09947-x
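The notion of stability used in matching models can be illustrated with the classical one-to-one deferred acceptance (Gale-Shapley) procedure. This is a deliberate simplification: the paper matches one driver with multiple riders and uses its own filtering and fast-solving algorithms, which are not reproduced here. The preference lists, names, and helper functions below are hypothetical.

```python
def stable_match(driver_prefs, rider_prefs):
    """One-to-one deferred acceptance with drivers proposing.

    Both arguments map an agent to a list of acceptable partners, most
    preferred first. Returns a dict driver -> rider.
    """
    rank = {r: {d: i for i, d in enumerate(p)} for r, p in rider_prefs.items()}
    next_choice = {d: 0 for d in driver_prefs}
    free = list(driver_prefs)
    held = {}  # rider -> driver currently held by that rider
    while free:
        d = free.pop()
        if next_choice[d] >= len(driver_prefs[d]):
            continue  # d has proposed to everyone and stays unmatched
        r = driver_prefs[d][next_choice[d]]
        next_choice[d] += 1
        if r not in held:
            held[r] = d
        elif rank[r][d] < rank[r][held[r]]:
            free.append(held[r])  # r trades up and releases the old driver
            held[r] = d
        else:
            free.append(d)  # r rejects d, who will try the next rider
    return {d: r for r, d in held.items()}


def blocking_pairs(matching, driver_prefs, rider_prefs):
    """List (driver, rider) pairs that would both rather be matched together."""
    driver_of = {r: d for d, r in matching.items()}
    pairs = []
    for d, prefs in driver_prefs.items():
        for r in prefs:
            if r == matching.get(d):
                break  # only riders ranked above d's current partner matter
            current = driver_of.get(r)
            if current is None or rider_prefs[r].index(d) < rider_prefs[r].index(current):
                pairs.append((d, r))
    return pairs
```

A matching is stable exactly when `blocking_pairs` returns an empty list, which gives a simple way to validate any candidate matching scheme on small instances.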
Pub Date: 2024-08-19 DOI: 10.1007/s00500-024-09820-x
Xiu-Yun Wu, Chun-Yan Liao, Hui-Min Zhang
The aim of this paper is to discuss algebraic rough sets and their relationships with convex spaces, rough sets and generalized neighborhood spaces. Specifically, the notion of algebraic relations is introduced and a pair of lower and upper approximation operators is presented. Then, several conditions on algebraic relations, such as seriality, reflexivity, (resp., weak, primitive) symmetry and (resp., strong) transitivity, are characterized by algebraic approximation operators. Based on this, relationships among algebraic rough sets, convex structures and generalized neighborhood systems are investigated. It is proved that the category of reflexive and transitive algebraic rough spaces is isomorphic to the category of convex spaces. In particular, the category of reflexive, weakly symmetric and transitive algebraic rough spaces is isomorphic to the category of convex matroids and the category of reflexive, weakly symmetric and transitive algebraic generalized neighborhood spaces.
Title: Algebraic rough sets via algebraic relations. Journal: Soft Computing. URL: https://doi.org/10.1007/s00500-024-09820-x
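As background for the approximation operators mentioned above, the sketch below implements the classical relation-based lower and upper approximations of rough set theory. The algebraic relations and algebraic approximation operators of the paper are a different, more general construction that is not reproduced here; the function names and the small universe in the usage note are assumptions for illustration.

```python
def lower_approximation(universe, relation, target):
    """Elements whose related set lies entirely inside `target`.

    `relation` maps each element x to the set of elements related to x.
    """
    return {x for x in universe if relation[x] <= target}


def upper_approximation(universe, relation, target):
    """Elements whose related set meets `target`."""
    return {x for x in universe if relation[x] & target}


def is_reflexive(universe, relation):
    """Every element is related to itself."""
    return all(x in relation[x] for x in universe)
```

For a reflexive relation the lower approximation is always contained in the target set and the target set in the upper approximation. With universe `{1, 2, 3, 4}`, classes `{1, 2}`, `{3}`, `{4}` and target `{1, 3}`, the lower approximation is `{3}` and the upper approximation is `{1, 2, 3}`.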
Pub Date: 2024-08-19 DOI: 10.1007/s00500-024-09934-2
Dezhi Cao, Yue Zhao, Licheng Wu
Grapheme-to-phoneme (G2P) conversion technology is currently dominated by two methodologies: knowledge-based and data-driven approaches. Knowledge-based methods struggle to adapt to extensive datasets, while data-driven methods rely heavily on high-quality data and require precise feature selection for model construction. To address these challenges, this research proposes an integrated approach that combines prior knowledge with data-driven techniques for automatic G2P conversion in the Korean language. In this work, we extract attributes based on pronunciation rules and phonetic transformations between Korean words to construct a decision tree. Subsequently, the model is trained using a data-driven approach for automated phonetic transcription. The proposed integrated model achieves more accurate alignment between input and output variables, effectively capturing phonological variations in continuous Korean speech and determining the corresponding phonemes for graphemes. Cross-validation confirms its advantage, with an average accuracy of 94.63% in grapheme-to-phoneme conversion, outperforming existing methodologies. In conclusion, this research demonstrates the effectiveness of an integrated approach combining prior knowledge and data-driven techniques for G2P conversion in Korean. Our approach can also be applied to low-resource or endangered languages that already have some linguistic research foundation, to improve the accuracy of their pronunciation lexicons.
Title: Integrating prior knowledge and data-driven approaches for improving grapheme-to-phoneme conversion in Korean language. Journal: Soft Computing. URL: https://doi.org/10.1007/s00500-024-09934-2