Pub Date : 2025-01-03DOI: 10.1007/s10796-024-10559-x
Ines Gasmi, Sana Neji, Salima Smiti, Makram Soui
Credit risk assessment has drawn great interests from both researcher studies and financial institutions. In fact, classifying an applicant as defaulter or non-defaulter customer helps banks to make a reasonable decision. The classification of applicants is based on a set of historical information of past loans. Data sets for analysis may include different features, many of which may be irrelevant to the decision making process. Keeping irrelevant features or leaving out relevant ones may be harmful, causing generation of poor quality patterns that may lead to confusion decision. Determining an appropriate set of predictors is an important challenge in credit risk prediction research which guarantees better decision-making. It is the task of searching the smallest subset of features that provide the highest accuracy and comprehensibility. Thus, this study proposes feature selection-based classification model on credit risk assessment. To this end, five algorithms are applied, Speed-constrained Multi-objective PSO (SMPSO), Non-dominated Sorting Algorithm (NSGA-II), Sequential Forward Selection (SFS), Sequential Forward Floating Selection (SFFS), and Random Subset Feature Selection (RSFS). The selected subset is evaluated based on three classifiers K-Nearest Neighbors (KNN), Support Vector Machine (SVM) and Artificial Neural Network (ANN). Our proposed model is validated using three real-world credit datasets. The obtained results confirm the efficiency of SMPSO-KNN model to select the most significant features and provide the highest classification accuracy compared to existing models.
{"title":"Features Selection for Credit Risk Prediction Problem","authors":"Ines Gasmi, Sana Neji, Salima Smiti, Makram Soui","doi":"10.1007/s10796-024-10559-x","DOIUrl":"https://doi.org/10.1007/s10796-024-10559-x","url":null,"abstract":"<p>Credit risk assessment has drawn great interests from both researcher studies and financial institutions. In fact, classifying an applicant as defaulter or non-defaulter customer helps banks to make a reasonable decision. The classification of applicants is based on a set of historical information of past loans. Data sets for analysis may include different features, many of which may be irrelevant to the decision making process. Keeping irrelevant features or leaving out relevant ones may be harmful, causing generation of poor quality patterns that may lead to confusion decision. Determining an appropriate set of predictors is an important challenge in credit risk prediction research which guarantees better decision-making. It is the task of searching the smallest subset of features that provide the highest accuracy and comprehensibility. Thus, this study proposes feature selection-based classification model on credit risk assessment. To this end, five algorithms are applied, Speed-constrained Multi-objective PSO (SMPSO), Non-dominated Sorting Algorithm (NSGA-II), Sequential Forward Selection (SFS), Sequential Forward Floating Selection (SFFS), and Random Subset Feature Selection (RSFS). The selected subset is evaluated based on three classifiers K-Nearest Neighbors (KNN), Support Vector Machine (SVM) and Artificial Neural Network (ANN). Our proposed model is validated using three real-world credit datasets. The obtained results confirm the efficiency of SMPSO-KNN model to select the most significant features and provide the highest classification accuracy compared to existing models.</p>","PeriodicalId":13610,"journal":{"name":"Information Systems Frontiers","volume":"28 1","pages":""},"PeriodicalIF":5.9,"publicationDate":"2025-01-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142917148","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-01-03DOI: 10.1007/s10796-024-10566-y
Mohsen Jozani, Jason A Williams, Ahmed Aleroud, Sarbottam Bhagat
Online health communities (OHCs) offer emotional and informational support to their users. However, past research has primarily treated these supports as separate, but they coexist in messages, making it essential to consider the emotional valence of text to understand the support being provided. This study examines how aligning questions and responses in OHCs reduces information gaps, and enhances support quality and perceived helpfulness. We use a labeled data set of question-response pairs to develop multimodal machine learning models to predict support interactions. Using explainable AI, we reveal the emotions within support exchanges, underscoring how emotional valence in the text determines informational support in OHCs and provide insight into the interaction between emotional and informational support. This study refines social support theory and establishes a foundation for decision aids and emotion-sensitive AI systems to deliver personalized social support tailored to users’ informational and emotional needs.
{"title":"Emotional and Informational Dynamics in Question-Response Pairs in Online Health Communities: A Multimodal Deep Learning Approach","authors":"Mohsen Jozani, Jason A Williams, Ahmed Aleroud, Sarbottam Bhagat","doi":"10.1007/s10796-024-10566-y","DOIUrl":"https://doi.org/10.1007/s10796-024-10566-y","url":null,"abstract":"<p>Online health communities (OHCs) offer emotional and informational support to their users. However, past research has primarily treated these supports as separate, but they coexist in messages, making it essential to consider the emotional valence of text to understand the support being provided. This study examines how aligning questions and responses in OHCs reduces information gaps, and enhances support quality and perceived helpfulness. We use a labeled data set of question-response pairs to develop multimodal machine learning models to predict support interactions. Using explainable AI, we reveal the emotions within support exchanges, underscoring how emotional valence in the text determines informational support in OHCs and provide insight into the interaction between emotional and informational support. This study refines social support theory and establishes a foundation for decision aids and emotion-sensitive AI systems to deliver personalized social support tailored to users’ informational and emotional needs.</p>","PeriodicalId":13610,"journal":{"name":"Information Systems Frontiers","volume":"34 1","pages":""},"PeriodicalIF":5.9,"publicationDate":"2025-01-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142917316","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-12-23DOI: 10.1007/s10796-024-10568-w
Gautam Kishore Shahi, Ali Sercan Basyurt, Stefan Stieglitz, Christoph Neuberger
As per agenda-setting theory, political agenda is concerned with the government’s agenda, including politicians and political parties. Political actors utilize various channels to set their political agenda, including social media platforms such as Twitter (now X). Political agenda-setting can be influenced by anonymous user-generated content following the Bright Internet. This is why speech acts, experts, users with affiliations and parties through annotated Tweets were analyzed in this study. In doing so, the agenda formation during the 2019 European Parliament Election in Germany based on the agenda-setting theory as our theoretical framework, was analyzed. A prediction model was trained to predict users’ voting tendencies based on three feature categories: social, network, and text. By combining features from all categories logistical regression leads to the best predictions matching the election results. The contribution to theory is an approach to identify agenda formation based on our novel variables. For practice, a novel approach is presented to forecast the winner of events.
{"title":"Agenda Formation and Prediction of Voting Tendencies for European Parliament Election using Textual, Social and Network Features","authors":"Gautam Kishore Shahi, Ali Sercan Basyurt, Stefan Stieglitz, Christoph Neuberger","doi":"10.1007/s10796-024-10568-w","DOIUrl":"https://doi.org/10.1007/s10796-024-10568-w","url":null,"abstract":"<p>As per agenda-setting theory, political agenda is concerned with the government’s agenda, including politicians and political parties. Political actors utilize various channels to set their political agenda, including social media platforms such as Twitter (now <i>X</i>). Political agenda-setting can be influenced by anonymous user-generated content following the Bright Internet. This is why speech acts, experts, users with affiliations and parties through annotated Tweets were analyzed in this study. In doing so, the agenda formation during the 2019 European Parliament Election in Germany based on the agenda-setting theory as our theoretical framework, was analyzed. A prediction model was trained to predict users’ voting tendencies based on three feature categories: social, network, and text. By combining features from all categories logistical regression leads to the best predictions matching the election results. The contribution to theory is an approach to identify agenda formation based on our novel variables. For practice, a novel approach is presented to forecast the winner of events.</p>","PeriodicalId":13610,"journal":{"name":"Information Systems Frontiers","volume":"92 1","pages":""},"PeriodicalIF":5.9,"publicationDate":"2024-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142874196","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-12-21DOI: 10.1007/s10796-024-10564-0
Igugu Tshisekedi Etienne, Muhammad Firdaus, Cho Nwe Zin Latt, Siwan Noh, Kyung-Hyune Rhee
Network slicing is a 5G concept that virtualizes the physical network infrastructure to accommodate multiple service requirements on the same network, where each slice manages diverse needs and ensures their coexistence. In this work, we leverage blockchain technology to strengthen the security of handover authentication (HA) processes in network slicing systems.The proposed system addresses the challenge of reducing latency during handovers by incorporating a hybrid on-chain/off-chain model, optimizing the balance between security and speed. It employs the Raft consensus mechanism, which offers lower latency compared to more traditional consensus protocols such as PBFT. It establishes a decentralized registry for recording transfer events, streamlining user equipment (UE) identification verification, and improving HA efficiency. Moreover, we also introduce a three-component model: network slicing, user environments, and a Hyperledger Fabric (HLF) blockchain for authentication and authorization, which enhances the user experience by minimizing delays, ensuring data privacy, and providing scalability. By leveraging edge computing in conjunction with network slicing, the system further reduces latency, making it more efficient for real-time applications in dynamic mobile environments. Performance experiments indicate satisfactory scalability and maintained service quality under increasing throughput, affirming the suitability of the HLF-based system for managing network scenarios. Furthermore, the system’s modular design ensures compatibility with existing authentication protocols, such as AKA and EAP, enabling seamless integration with legacy systems. Consequently, this work enhances network security and service quality, especially in network slicing, HA, and employing HLF for privacy and security solutions. As 5G networks continue to evolve toward 6G, this system’s scalability and flexibility offer a promising approach to addressing future challenges in secure and efficient handover authentication.
{"title":"Hyperledger Fabric-Powered Network Slicing Handover Authentication","authors":"Igugu Tshisekedi Etienne, Muhammad Firdaus, Cho Nwe Zin Latt, Siwan Noh, Kyung-Hyune Rhee","doi":"10.1007/s10796-024-10564-0","DOIUrl":"https://doi.org/10.1007/s10796-024-10564-0","url":null,"abstract":"<p>Network slicing is a 5G concept that virtualizes the physical network infrastructure to accommodate multiple service requirements on the same network, where each slice manages diverse needs and ensures their coexistence. In this work, we leverage blockchain technology to strengthen the security of handover authentication (HA) processes in network slicing systems.The proposed system addresses the challenge of reducing latency during handovers by incorporating a hybrid on-chain/off-chain model, optimizing the balance between security and speed. It employs the Raft consensus mechanism, which offers lower latency compared to more traditional consensus protocols such as PBFT. It establishes a decentralized registry for recording transfer events, streamlining user equipment (UE) identification verification, and improving HA efficiency. Moreover, we also introduce a three-component model: network slicing, user environments, and a Hyperledger Fabric (HLF) blockchain for authentication and authorization, which enhances the user experience by minimizing delays, ensuring data privacy, and providing scalability. By leveraging edge computing in conjunction with network slicing, the system further reduces latency, making it more efficient for real-time applications in dynamic mobile environments. Performance experiments indicate satisfactory scalability and maintained service quality under increasing throughput, affirming the suitability of the HLF-based system for managing network scenarios. Furthermore, the system’s modular design ensures compatibility with existing authentication protocols, such as AKA and EAP, enabling seamless integration with legacy systems. Consequently, this work enhances network security and service quality, especially in network slicing, HA, and employing HLF for privacy and security solutions. As 5G networks continue to evolve toward 6G, this system’s scalability and flexibility offer a promising approach to addressing future challenges in secure and efficient handover authentication.</p>","PeriodicalId":13610,"journal":{"name":"Information Systems Frontiers","volume":"53 1","pages":""},"PeriodicalIF":5.9,"publicationDate":"2024-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142867096","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-12-12DOI: 10.1007/s10796-024-10560-4
Baile Lu, La Ta, Hongyan Dai, Xun Xu, Wanfeng Yan, Zhiyu Zhang
With the rapid development of information technology, the gig labor marketplace is fast growing, with digital platform-based instant messaging (IM) playing an important role in raising freelancers’ orders, serving the intention for the crowdsourcing platforms to increase capacity to balance supply and demand. Using a large-scale field experiment on a crowdsourcing freight platform, this study investigates the impact of IM on freelancers’ response rate of orders. Our findings suggest the effects of IM depend on its content and information richness level. Task-relevant information in IM increases the freelancers’ response rate, especially for the priority commitment information, compared with order price information. In addition, although adding task-irrelevant information in IM decreases the freelancers’ response rate, it does not mean the less task-irrelevant information results in a weaker negative IM effect. Rather than that, including task-irrelevant information with a medium information richness level in IM harms the freelancers’ response to the most significant extent. Moreover, our findings reveal crowdsourcing platforms’ actions of IM to increase freelancers’ response rate are consistent with the actions to improve the order acceptance rate, thus demonstrating the critical role of increasing freelancers’ response rate in raising their interest in the final acceptance of the order serving. Our findings guide crowdsourcing platforms to design effective digital platform-based IMs to communicate with freelancers to arouse their response and interest in serving the orders. The capacity of crowdsourcing platforms thus can be dynamically adjusted and expanded to benefit their profitability.
{"title":"Unfreezing the Freelancers: Investigating the Strategy of Digital Platform-Based Instant Messaging Communication in Increasing Freelancers’ Response in Gig Economy","authors":"Baile Lu, La Ta, Hongyan Dai, Xun Xu, Wanfeng Yan, Zhiyu Zhang","doi":"10.1007/s10796-024-10560-4","DOIUrl":"https://doi.org/10.1007/s10796-024-10560-4","url":null,"abstract":"<p>With the rapid development of information technology, the gig labor marketplace is fast growing, with digital platform-based instant messaging (IM) playing an important role in raising freelancers’ orders, serving the intention for the crowdsourcing platforms to increase capacity to balance supply and demand. Using a large-scale field experiment on a crowdsourcing freight platform, this study investigates the impact of IM on freelancers’ response rate of orders. Our findings suggest the effects of IM depend on its content and information richness level. Task-relevant information in IM increases the freelancers’ response rate, especially for the priority commitment information, compared with order price information. In addition, although adding task-irrelevant information in IM decreases the freelancers’ response rate, it does not mean the less task-irrelevant information results in a weaker negative IM effect. Rather than that, including task-irrelevant information with a medium information richness level in IM harms the freelancers’ response to the most significant extent. Moreover, our findings reveal crowdsourcing platforms’ actions of IM to increase freelancers’ response rate are consistent with the actions to improve the order acceptance rate, thus demonstrating the critical role of increasing freelancers’ response rate in raising their interest in the final acceptance of the order serving. Our findings guide crowdsourcing platforms to design effective digital platform-based IMs to communicate with freelancers to arouse their response and interest in serving the orders. The capacity of crowdsourcing platforms thus can be dynamically adjusted and expanded to benefit their profitability.</p>","PeriodicalId":13610,"journal":{"name":"Information Systems Frontiers","volume":"28 1","pages":""},"PeriodicalIF":5.9,"publicationDate":"2024-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142809880","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-12-03DOI: 10.1007/s10796-024-10562-2
Aaron M. French, J. P. Shim
This paper provides a comprehensive examination of the ongoing debate surrounding Artificial Intelligence (AI) and its societal implications, with a particular focus on job displacement. The release of generative AI tools for public use, particularly ChatGPT, has created numerous concerns on how these tools will be used and adverse impacts on society. Augmented Intelligence has been introduced as a concept utilizing AI to enhance human capabilities but its distinction as an assistive role is ill-defined. This research provides insights into the reconceptualization of AI as Augmented Intelligence examining their differences in terms of knowledge development, decision-making, and outcomes. Through three case studies, we demonstrate the assistive role of Augmented Intelligence and how it can serve as a catalyst for job creation and cognitive enhancement. We also explore the impact of AI and IA tools as a sociotechnical system and their effect on human cognitive abilities through the theoretical lens of the Dunning Kruger Effect. We conclude with a research agenda to stimulate future directions of research.
{"title":"From Artificial Intelligence to Augmented Intelligence: A Shift in Perspective, Application, and Conceptualization of AI","authors":"Aaron M. French, J. P. Shim","doi":"10.1007/s10796-024-10562-2","DOIUrl":"https://doi.org/10.1007/s10796-024-10562-2","url":null,"abstract":"<p>This paper provides a comprehensive examination of the ongoing debate surrounding Artificial Intelligence (AI) and its societal implications, with a particular focus on job displacement. The release of generative AI tools for public use, particularly ChatGPT, has created numerous concerns on how these tools will be used and adverse impacts on society. Augmented Intelligence has been introduced as a concept utilizing AI to enhance human capabilities but its distinction as an assistive role is ill-defined. This research provides insights into the reconceptualization of AI as Augmented Intelligence examining their differences in terms of knowledge development, decision-making, and outcomes. Through three case studies, we demonstrate the assistive role of Augmented Intelligence and how it can serve as a catalyst for job creation and cognitive enhancement. We also explore the impact of AI and IA tools as a sociotechnical system and their effect on human cognitive abilities through the theoretical lens of the Dunning Kruger Effect. We conclude with a research agenda to stimulate future directions of research.</p>","PeriodicalId":13610,"journal":{"name":"Information Systems Frontiers","volume":"38 1","pages":""},"PeriodicalIF":5.9,"publicationDate":"2024-12-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142760614","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-12-02DOI: 10.1007/s10796-024-10557-z
Manisha Rathi, Adrija Majumdar, Sawan Rathi
Online reviews are effective information-sharing tools due to their word-of-mouth characteristics. The extant literature has considered reviews as independent variables that influence business performance, while the environmental factors shaping these reviews remain under-explored. We examine the impact of COVID-19-related environmental uncertainties on changes in review prosumption (production and consumption) behavior. Based on the stimulus-response theory, with COVID-19 as the stimulus and prosumption as the response, we examined the changes in the characteristics of online reviews. Using the difference-in-differences methodology, we analyze online reviews of restaurants in two US cities that experienced different levels of COVID-19 impact. On the production side, we find an increased use of contextual terms and negative sentiments. On the consumption side, we find an increase in review usefulness and a decline in funniness. The results are robust, supported by coarsened exact matching and falsification tests. We conclude with a discussion of the study’s implications and contributions.
{"title":"Unraveling Prosumption Behavior for Online Reviews during Environmental Uncertainty: A Stimulus-Response Perspective","authors":"Manisha Rathi, Adrija Majumdar, Sawan Rathi","doi":"10.1007/s10796-024-10557-z","DOIUrl":"https://doi.org/10.1007/s10796-024-10557-z","url":null,"abstract":"<p>Online reviews are effective information-sharing tools due to their word-of-mouth characteristics. The extant literature has considered reviews as independent variables that influence business performance, while the environmental factors shaping these reviews remain under-explored. We examine the impact of COVID-19-related environmental uncertainties on changes in review prosumption (production and consumption) behavior. Based on the stimulus-response theory, with COVID-19 as the stimulus and prosumption as the response, we examined the changes in the characteristics of online reviews. Using the difference-in-differences methodology, we analyze online reviews of restaurants in two US cities that experienced different levels of COVID-19 impact. On the production side, we find an increased use of contextual terms and negative sentiments. On the consumption side, we find an increase in review usefulness and a decline in funniness. The results are robust, supported by coarsened exact matching and falsification tests. We conclude with a discussion of the study’s implications and contributions.</p>","PeriodicalId":13610,"journal":{"name":"Information Systems Frontiers","volume":"79 1","pages":""},"PeriodicalIF":5.9,"publicationDate":"2024-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142758163","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Over the past few years, several efforts have been made to enable specification and enforcement of flexible and dynamic access control policies using traditional access control (such as role based access control (RBAC), etc.) and attribute based access control (ABAC). Recently, a unified framework, named MPBAC (meta-policy based access control), has been developed to enable specification and enforcement of heterogeneous access control policies such as ABAC, RBAC and a combination of policies (such as ABAC and RBAC). However, one significant limitation is that no complete administrative model has been developed for heterogeneous access control policies. In this article, we present a complete role-based administrative model (named as RAMHAC) for managing heterogeneous access control policies. We also introduce a novel methodology for analyzing heterogeneous access control policies in the presence of RAMHAC by modeling the policies through Datalog facts and using the μz tool. The administrative model includes a wide range of administrative relations, commands, pre-constraints and post-constraints. A comprehensive experimental evaluation demonstrates the scalability of the proposed approach.
{"title":"A Role-Based Administrative Model for Administration of Heterogeneous Access Control Policies and its Security Analysis.","authors":"Mahendra Pratap Singh, Shamik Sural, Jaideep Vaidya, Vijayalakshmi Atluri","doi":"10.1007/s10796-021-10167-z","DOIUrl":"10.1007/s10796-021-10167-z","url":null,"abstract":"<p><p>Over the past few years, several efforts have been made to enable specification and enforcement of flexible and dynamic access control policies using traditional access control (such as role based access control (RBAC), etc.) and attribute based access control (ABAC). Recently, a unified framework, named MPBAC (meta-policy based access control), has been developed to enable specification and enforcement of heterogeneous access control policies such as ABAC, RBAC and a combination of policies (such as ABAC and RBAC). However, one significant limitation is that no complete administrative model has been developed for heterogeneous access control policies. In this article, we present a complete role-based administrative model (named as RAMHAC) for managing heterogeneous access control policies. We also introduce a novel methodology for analyzing heterogeneous access control policies in the presence of RAMHAC by modeling the policies through Datalog facts and using the <i>μ</i>z tool. The administrative model includes a wide range of administrative relations, commands, pre-constraints and post-constraints. A comprehensive experimental evaluation demonstrates the scalability of the proposed approach.</p>","PeriodicalId":13610,"journal":{"name":"Information Systems Frontiers","volume":" ","pages":"2255-2272"},"PeriodicalIF":8.3,"publicationDate":"2024-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11981199/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48860257","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
A recent trend in data management research investigates whether machine learning techniques could improve or replace core components of traditional database architectures, such as the query optimizer or selectivity and cardinality cost estimators. The preliminary approaches leverage cost-based optimizers and cost models to avoid a cold-start as they train and build learning models. In this work, we investigate whether learning could also be beneficial in rule-based optimizers, which instead of driving query execution decisions via a cost model they rely on a set of fixed rules and pre-defined heuristics. Our experimental testbed employs MonetDB, an open-source, column-store analytics data engine, and explore whether a learning model using Graph Neural Networks (GNNs) that is trained on a cost-based engine, such as PostgreSQL, could improve MonetDB optimizer’s decisions. Our initial findings reveal that our approach could improve significantly MonetDB’s query execution plans, especially as the query complexity increases whet it involves many join operators.
{"title":"Investigating Learning Join Order Optimization Strategies for Rule-based Data Engines","authors":"Antonios Karvelas, Yannis Foufoulas, Alkis Simitsis, Yannis Ioannidis","doi":"10.1007/s10796-024-10555-1","DOIUrl":"https://doi.org/10.1007/s10796-024-10555-1","url":null,"abstract":"<p>A recent trend in data management research investigates whether machine learning techniques could improve or replace core components of traditional database architectures, such as the query optimizer or selectivity and cardinality cost estimators. The preliminary approaches leverage cost-based optimizers and cost models to avoid a cold-start as they train and build learning models. In this work, we investigate whether learning could also be beneficial in rule-based optimizers, which instead of driving query execution decisions via a cost model they rely on a set of fixed rules and pre-defined heuristics. Our experimental testbed employs MonetDB, an open-source, column-store analytics data engine, and explore whether a learning model using Graph Neural Networks (GNNs) that is trained on a cost-based engine, such as PostgreSQL, could improve MonetDB optimizer’s decisions. Our initial findings reveal that our approach could improve significantly MonetDB’s query execution plans, especially as the query complexity increases whet it involves many join operators.</p>","PeriodicalId":13610,"journal":{"name":"Information Systems Frontiers","volume":"8 1","pages":""},"PeriodicalIF":5.9,"publicationDate":"2024-11-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142684212","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-11-22DOI: 10.1007/s10796-024-10554-2
Jingyun Sun, Xinlong Chen, Kaiyuan Zheng, Yan Zan
The well-organized structure of Policy Texts (PTs) is fundamental to intelligent governance, yet most PTs lack fine-grained category labels. PTs from different domains follow different classification systems, and traditional encoder-only models cannot directly handle scenarios where the label spaces of the source and target domains differ significantly, as their output layer typically is a fixed-dimensional classification head. Therefore, we propose a Cross-Domain Policy Text Classification (CDPTC) task. We introduce a method for the task called InstructCDPTC. This method, within an instruction tuning framework, transforms the classification task into a generation task, using the decoder-only model BigBird to predict masked tokens. We wrap the original PT within an instruction template containing a task description, a label description, and a mask sequence, which serve as input to BigBird. During training, we use the names of gold categories as the prediction targets for masked positions. During inference, we determine the final predicted category by computing the semantic distance between the averaged representations of the mask predictions and each candidate label. We constructed a dataset of 20,189 labeled policy texts from five different policy domains to evaluate InstructCDPTC. Experimental results demonstrate that InstructCDPTC achieves an F1 score of 0.824 under conditions where the sample distribution and label space of the target domain are entirely unseen, surpassing other baselines.
{"title":"A Fine-grained Classification Method for Cross-domain Policy Texts Based on Instruction Tuning","authors":"Jingyun Sun, Xinlong Chen, Kaiyuan Zheng, Yan Zan","doi":"10.1007/s10796-024-10554-2","DOIUrl":"https://doi.org/10.1007/s10796-024-10554-2","url":null,"abstract":"<p>The well-organized structure of Policy Texts (PTs) is fundamental to intelligent governance, yet most PTs lack fine-grained category labels. PTs from different domains follow different classification systems, and traditional encoder-only models cannot directly handle scenarios where the label spaces of the source and target domains differ significantly, as their output layer typically is a fixed-dimensional classification head. Therefore, we propose a Cross-Domain Policy Text Classification (CDPTC) task. We introduce a method for the task called InstructCDPTC. This method, within an instruction tuning framework, transforms the classification task into a generation task, using the decoder-only model BigBird to predict masked tokens. We wrap the original PT within an instruction template containing a task description, a label description, and a mask sequence, which serve as input to BigBird. During training, we use the names of gold categories as the prediction targets for masked positions. During inference, we determine the final predicted category by computing the semantic distance between the averaged representations of the mask predictions and each candidate label. We constructed a dataset of 20,189 labeled policy texts from five different policy domains to evaluate InstructCDPTC. Experimental results demonstrate that InstructCDPTC achieves an F1 score of 0.824 under conditions where the sample distribution and label space of the target domain are entirely unseen, surpassing other baselines.</p>","PeriodicalId":13610,"journal":{"name":"Information Systems Frontiers","volume":"66 1","pages":""},"PeriodicalIF":5.9,"publicationDate":"2024-11-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142684127","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}