Pub Date : 2023-07-03DOI: 10.1007/s41060-023-00415-7
Shiyun Wa, Xinai Lu, Minjuan Wang
{"title":"Regression model and method settings for air pollution status analysis based on air quality data in Beijing (2017–2021)","authors":"Shiyun Wa, Xinai Lu, Minjuan Wang","doi":"10.1007/s41060-023-00415-7","DOIUrl":"https://doi.org/10.1007/s41060-023-00415-7","url":null,"abstract":"","PeriodicalId":45667,"journal":{"name":"International Journal of Data Science and Analytics","volume":"4 1","pages":""},"PeriodicalIF":2.4,"publicationDate":"2023-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78818765","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-07-03DOI: 10.1007/s41060-023-00411-x
Nat Pavasant, Takashi Morita, M. Numao, Ken-ichi Fukui
{"title":"Granger causality-based cluster sequence mining for spatio-temporal causal relation mining","authors":"Nat Pavasant, Takashi Morita, M. Numao, Ken-ichi Fukui","doi":"10.1007/s41060-023-00411-x","DOIUrl":"https://doi.org/10.1007/s41060-023-00411-x","url":null,"abstract":"","PeriodicalId":45667,"journal":{"name":"International Journal of Data Science and Analytics","volume":"40 1","pages":""},"PeriodicalIF":2.4,"publicationDate":"2023-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88899936","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-07-01DOI: 10.1007/s41060-023-00422-8
Kehinde Olobatuyi, Matthew R. P. Parker, Oludare Ariyo
Cluster-weighted models (CWMs) are an important class of machine learning models that are commonly used for modelling complex datasets. However, they are known to suffer from reduced computing efficiency and estimator accuracy when dealing with high-dimensional data. Previous work has proposed a parsimonious technique that can improve CWMs’ performance in the high-dimensional data paradigm. However, this method has a setback for very high-dimensional data, where the dimensionality is greater than 100. In this paper, we propose a new hybridised method that incorporates a dimensionality reduction technique called T-distributed stochastic neighbour embedding (TSNE) to enhance the parsimonious CWMs in high-dimensional space. Additionally, we introduce a novel heuristic for detecting the hidden components of the underlying mixture model, which can be used with the popular R package FlexCWM. We evaluated the performance of the proposed method using two real datasets and found that it improves clustering power when compared to both the parsimony methods and the TSNE methods combined with CWMs in the high-dimensional data setting. Our results suggest that the proposed method can improve the efficiency and accuracy of CWMs in dealing with high-dimensional data, making it a valuable tool for data scientists and statisticians.
{"title":"Cluster weighted model based on TSNE algorithm for high-dimensional data","authors":"Kehinde Olobatuyi, Matthew R. P. Parker, Oludare Ariyo","doi":"10.1007/s41060-023-00422-8","DOIUrl":"https://doi.org/10.1007/s41060-023-00422-8","url":null,"abstract":"Cluster-weighted models (CWMs) are an important class of machine learning models that are commonly used for modelling complex datasets. However, they are known to suffer from reduced computing efficiency and estimator accuracy when dealing with high-dimensional data. Previous work has proposed a parsimonious technique that can improve CWMs’ performance in the high-dimensional data paradigm. However, this method has a setback for very high-dimensional data, where the dimensionality is greater than 100. In this paper, we propose a new hybridised method that incorporates a dimensionality reduction technique called T-distributed stochastic neighbour embedding (TSNE) to enhance the parsimonious CWMs in high-dimensional space. Additionally, we introduce a novel heuristic for detecting the hidden components of the underlying mixture model, which can be used with the popular R package FlexCWM. We evaluated the performance of the proposed method using two real datasets and found that it improves clustering power when compared to both the parsimony methods and the TSNE methods combined with CWMs in the high-dimensional data setting. Our results suggest that the proposed method can improve the efficiency and accuracy of CWMs in dealing with high-dimensional data, making it a valuable tool for data scientists and statisticians.","PeriodicalId":45667,"journal":{"name":"International Journal of Data Science and Analytics","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136185133","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-07-01DOI: 10.1007/s41060-023-00405-9
Arianna Agosto, P. Cerchiello, Paolo Giudici
{"title":"Bayesian learning models to measure the relative impact of ESG factors on credit ratings","authors":"Arianna Agosto, P. Cerchiello, Paolo Giudici","doi":"10.1007/s41060-023-00405-9","DOIUrl":"https://doi.org/10.1007/s41060-023-00405-9","url":null,"abstract":"","PeriodicalId":45667,"journal":{"name":"International Journal of Data Science and Analytics","volume":"89 1","pages":""},"PeriodicalIF":2.4,"publicationDate":"2023-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"90583311","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-07-01DOI: 10.1007/s41060-023-00403-x
E. Merkurjev
{"title":"Efficient graph-based spectral techniques for data with few labeled samples","authors":"E. Merkurjev","doi":"10.1007/s41060-023-00403-x","DOIUrl":"https://doi.org/10.1007/s41060-023-00403-x","url":null,"abstract":"","PeriodicalId":45667,"journal":{"name":"International Journal of Data Science and Analytics","volume":"65 1","pages":""},"PeriodicalIF":2.4,"publicationDate":"2023-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81380059","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-06-30DOI: 10.1007/s41060-023-00406-8
A. Keyhanipour
{"title":"Graph-based comparative analysis of learning to rank datasets","authors":"A. Keyhanipour","doi":"10.1007/s41060-023-00406-8","DOIUrl":"https://doi.org/10.1007/s41060-023-00406-8","url":null,"abstract":"","PeriodicalId":45667,"journal":{"name":"International Journal of Data Science and Analytics","volume":"14 1","pages":""},"PeriodicalIF":2.4,"publicationDate":"2023-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80443400","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-06-30DOI: 10.1007/s41060-023-00421-9
Pritthijit Nath, Asif Iqbal Middya, Sarbani Roy
{"title":"Empirical assessment of transformer-based neural network architecture in forecasting pollution trends","authors":"Pritthijit Nath, Asif Iqbal Middya, Sarbani Roy","doi":"10.1007/s41060-023-00421-9","DOIUrl":"https://doi.org/10.1007/s41060-023-00421-9","url":null,"abstract":"","PeriodicalId":45667,"journal":{"name":"International Journal of Data Science and Analytics","volume":"33 1","pages":""},"PeriodicalIF":2.4,"publicationDate":"2023-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87622077","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-06-29DOI: 10.1007/s41060-023-00404-w
Shoujin Wang, Yan Wang, F. Sivrikaya, S. Albayrak, V. W. Anelli
{"title":"Data science for next-generation recommender systems","authors":"Shoujin Wang, Yan Wang, F. Sivrikaya, S. Albayrak, V. W. Anelli","doi":"10.1007/s41060-023-00404-w","DOIUrl":"https://doi.org/10.1007/s41060-023-00404-w","url":null,"abstract":"","PeriodicalId":45667,"journal":{"name":"International Journal of Data Science and Analytics","volume":"98 1","pages":"135 - 145"},"PeriodicalIF":2.4,"publicationDate":"2023-06-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85834333","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-06-27DOI: 10.1007/s41060-023-00399-4
Erich Kummerfeld, Leland Williams, Sisi Ma
{"title":"Power analysis for causal discovery","authors":"Erich Kummerfeld, Leland Williams, Sisi Ma","doi":"10.1007/s41060-023-00399-4","DOIUrl":"https://doi.org/10.1007/s41060-023-00399-4","url":null,"abstract":"","PeriodicalId":45667,"journal":{"name":"International Journal of Data Science and Analytics","volume":"196 1","pages":""},"PeriodicalIF":2.4,"publicationDate":"2023-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"72898649","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-06-21DOI: 10.1007/s41060-023-00401-z
C. Metta, Andrea Beretta, Riccardo Guidotti, Yuan Yin, P. Gallinari, S. Rinzivillo, F. Giannotti
{"title":"Improving trust and confidence in medical skin lesion diagnosis through explainable deep learning","authors":"C. Metta, Andrea Beretta, Riccardo Guidotti, Yuan Yin, P. Gallinari, S. Rinzivillo, F. Giannotti","doi":"10.1007/s41060-023-00401-z","DOIUrl":"https://doi.org/10.1007/s41060-023-00401-z","url":null,"abstract":"","PeriodicalId":45667,"journal":{"name":"International Journal of Data Science and Analytics","volume":"25 1","pages":""},"PeriodicalIF":2.4,"publicationDate":"2023-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74379265","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}