首页 > 最新文献

Information Systems Frontiers最新文献

英文 中文
Skyline-based Exploration of Temporal Property Graphs 基于天际线的时态属性图探索
IF 5.9 3区 管理学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-06-26 DOI: 10.1007/s10796-024-10505-x
Evangelia Tsoukanara, Georgia Koloniari, Evaggelia Pitoura

In this paper, we focus on temporal property graphs, that is, property graphs whose labeled nodes and edges as well as the values of the properties associated with them may change with time. A key challenge in studying temporal graphs lies in detecting interesting events in their evolution, defined as time intervals of significant stability, growth, or shrinkage. To address this challenge, we build aggregated graphs, where nodes are grouped based on the values of their properties, and seek events at the aggregated level. To locate such events, we propose a novel approach based on unified evolution skylines. A unified evolution skyline assesses the significance of an event in conjunction with the duration of the interval in which the event occurs. Significance is measured by a set of counts, where each count refers to the number of graph elements that remain stable, are created, or deleted, for a specific property value. Lastly, we share experimental findings that highlight the efficiency and effectiveness of our approach.

在本文中,我们重点研究时态属性图,即其标记的节点和边以及与之相关的属性值可能随时间变化的属性图。研究时间图的一个关键挑战在于检测其演变过程中的有趣事件,即显著稳定、增长或缩小的时间间隔。为了应对这一挑战,我们构建了聚合图,根据节点的属性值对节点进行分组,并在聚合层面上寻找事件。为了定位此类事件,我们提出了一种基于统一演化天际线的新方法。统一演化天际线结合事件发生的时间间隔评估事件的重要性。重要性通过一组计数来衡量,其中每个计数指的是在特定属性值下保持稳定、创建或删除的图元素数量。最后,我们将分享实验结果,以突出我们方法的效率和有效性。
{"title":"Skyline-based Exploration of Temporal Property Graphs","authors":"Evangelia Tsoukanara, Georgia Koloniari, Evaggelia Pitoura","doi":"10.1007/s10796-024-10505-x","DOIUrl":"https://doi.org/10.1007/s10796-024-10505-x","url":null,"abstract":"<p>In this paper, we focus on temporal property graphs, that is, property graphs whose labeled nodes and edges as well as the values of the properties associated with them may change with time. A key challenge in studying temporal graphs lies in detecting interesting events in their evolution, defined as time intervals of significant stability, growth, or shrinkage. To address this challenge, we build aggregated graphs, where nodes are grouped based on the values of their properties, and seek events at the aggregated level. To locate such events, we propose a novel approach based on <i>unified evolution skylines</i>. A unified evolution skyline assesses the significance of an event in conjunction with the duration of the interval in which the event occurs. Significance is measured by a set of counts, where each count refers to the number of graph elements that remain stable, are created, or deleted, for a specific property value. Lastly, we share experimental findings that highlight the efficiency and effectiveness of our approach.</p>","PeriodicalId":13610,"journal":{"name":"Information Systems Frontiers","volume":"19 1","pages":""},"PeriodicalIF":5.9,"publicationDate":"2024-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141453104","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Exploiting Shared Sub-Expression and Materialized View Reuse for Multi-Query Optimization 利用共享子表达式和物化视图重用实现多查询优化
IF 5.9 3区 管理学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-06-25 DOI: 10.1007/s10796-024-10506-w
Bala Gurumurthy, Vasudev Raghavendra Bidarkar, David Broneske, Thilo Pionteck, Gunter Saake

Querying in isolation lacks the potential of reusing intermediate results, which ends up wasting computational resources. Multi-Query Optimization (MQO) addresses this challenge by devising a shared execution strategy across queries, with two generally used strategies: batched or cached. These strategies are shown to improve performance, but hardly any study explores the combination of both. In this work we explore such a hybrid MQO, combining batching (Shared Sub-Expression) and caching (Materialized View Reuse) techniques. Our hybrid-MQO system merges batched query results as well as caches the intermediate results, thereby any new query is given a path within the previous plan as well as reusing the results. Since caching is a key component for improving performance, we measure the impact of common caching techniques such as FIFO, LRU, MRU and LFU. Our results show LRU to be the optimal for our usecase, which we use in our subsequent evaluations. To study the influence of batching, we vary the factor - derivability - which represents the similarity of the results within a query batch. Similarly, we vary the cache sizes to study the influence of caching. Moreover, we also study the role of different database operators in the performance of our hybrid system. The results suggest that, depending on the individual operators, our hybrid method gains a speed-up between 4x to a slowdown of 2x from using MQO techniques in isolation. Furthermore, our results show that workloads with a generously sized cache that contain similar queries benefit from using our hybrid method, with an observed speed-up of 2x over sequential execution in the best case.

孤立查询缺乏重复使用中间结果的潜力,最终会浪费计算资源。多查询优化(Multi-Query Optimization,MQO)通过设计一种跨查询的共享执行策略来应对这一挑战,通常使用两种策略:批处理或缓存。这些策略都被证明可以提高性能,但几乎没有研究探讨过如何将这两种策略结合起来。在这项工作中,我们探索了这样一种混合 MQO,它结合了批处理(共享子表达式)和缓存(物化视图重用)技术。我们的混合 MQO 系统合并了批处理查询结果,并缓存了中间结果,从而为任何新查询提供了先前计划中的路径,并重复使用了查询结果。由于缓存是提高性能的关键因素,我们衡量了 FIFO、LRU、MRU 和 LFU 等常用缓存技术的影响。我们的结果表明,LRU 对于我们的用例来说是最佳的,我们在随后的评估中使用了它。为了研究批处理的影响,我们改变了表示查询批处理中结果相似性的因素--可衍生性。同样,我们还改变了缓存大小,以研究缓存的影响。此外,我们还研究了不同数据库操作员对混合系统性能的影响。结果表明,与单独使用 MQO 技术相比,根据不同的操作员,我们的混合方法可以提高 4 倍的速度,降低 2 倍的速度。此外,我们的结果表明,使用我们的混合方法,缓存大小适中且包含类似查询的工作负载将受益匪浅,在最佳情况下,观察到的速度比顺序执行提高了 2 倍。
{"title":"Exploiting Shared Sub-Expression and Materialized View Reuse for Multi-Query Optimization","authors":"Bala Gurumurthy, Vasudev Raghavendra Bidarkar, David Broneske, Thilo Pionteck, Gunter Saake","doi":"10.1007/s10796-024-10506-w","DOIUrl":"https://doi.org/10.1007/s10796-024-10506-w","url":null,"abstract":"<p>Querying in isolation lacks the potential of reusing intermediate results, which ends up wasting computational resources. Multi-Query Optimization (MQO) addresses this challenge by devising a shared execution strategy across queries, with two generally used strategies: <i>batched</i> or <i>cached</i>. These strategies are shown to improve performance, but hardly any study explores the combination of both. In this work we explore such a hybrid MQO, combining batching (Shared Sub-Expression) and caching (Materialized View Reuse) techniques. Our hybrid-MQO system merges batched query results as well as caches the intermediate results, thereby any new query is given a path within the previous plan as well as reusing the results. Since caching is a key component for improving performance, we measure the impact of common caching techniques such as FIFO, LRU, MRU and LFU. Our results show LRU to be the optimal for our usecase, which we use in our subsequent evaluations. To study the influence of batching, we vary the factor - <span>derivability</span> - which represents the similarity of the results within a query batch. Similarly, we vary the cache sizes to study the influence of caching. Moreover, we also study the role of different database operators in the performance of our hybrid system. The results suggest that, depending on the individual operators, our hybrid method gains a speed-up between 4x to a slowdown of 2x from using MQO techniques in isolation. Furthermore, our results show that workloads with a generously sized cache that contain similar queries benefit from using our hybrid method, with an observed speed-up of 2x over sequential execution in the best case.</p>","PeriodicalId":13610,"journal":{"name":"Information Systems Frontiers","volume":"1 1","pages":""},"PeriodicalIF":5.9,"publicationDate":"2024-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141448351","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Economic Framework for Creating AI-Augmented Solutions Across Countries Over Time 各国随时间推移创建人工智能增强型解决方案的经济框架
IF 5.9 3区 管理学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-06-24 DOI: 10.1007/s10796-024-10487-w
Jin Sik Kim, Jinsoo Yeo, Hemant Jain

This paper examines the potential for collaboration between countries with differential resource endowments to advance AI innovation and achieve mutual economic benefits. Our framework juxtaposes economies with a comparative advantage in AI-capital and those with a comparative advantage in tech-labor, analyzing how these endowments can lead to enhanced comparative advantages over time. Through the application of various production functions and the use of Edgeworth boxes, our analysis reveals that strategic collaboration based on comparative advantage can yield Pareto improvements for both developed and developing countries. Nonetheless, this study also discusses the challenges of uneven benefit distribution, particularly the risk of “brain drain” from developing nations. Contributing to the discourse on the economics of AI and international collaboration, this study highlights the importance of thoughtful strategic planning to promote equitable and sustainable AI development worldwide.

本文探讨了具有不同资源禀赋的国家之间开展合作的潜力,以推动人工智能创新并实现共同经济利益。我们的框架将具有人工智能资本比较优势的经济体与具有技术劳动力比较优势的经济体并列,分析这些禀赋如何随着时间的推移增强比较优势。通过应用各种生产函数和埃奇沃斯箱,我们的分析揭示了基于比较优势的战略合作可以为发达国家和发展中国家带来帕累托改进。不过,本研究也讨论了利益分配不均所带来的挑战,特别是发展中国家 "人才外流 "的风险。本研究为有关人工智能经济学和国际合作的讨论做出了贡献,强调了深思熟虑的战略规划对于促进全球公平和可持续的人工智能发展的重要性。
{"title":"An Economic Framework for Creating AI-Augmented Solutions Across Countries Over Time","authors":"Jin Sik Kim, Jinsoo Yeo, Hemant Jain","doi":"10.1007/s10796-024-10487-w","DOIUrl":"https://doi.org/10.1007/s10796-024-10487-w","url":null,"abstract":"<p>This paper examines the potential for collaboration between countries with differential resource endowments to advance AI innovation and achieve mutual economic benefits. Our framework juxtaposes economies with a comparative advantage in <i>AI-capital</i> and those with a comparative advantage in <i>tech-labor</i>, analyzing how these endowments can lead to enhanced comparative advantages over time. Through the application of various production functions and the use of Edgeworth boxes, our analysis reveals that strategic collaboration based on comparative advantage can yield Pareto improvements for both developed and developing countries. Nonetheless, this study also discusses the challenges of uneven benefit distribution, particularly the risk of “brain drain” from developing nations. Contributing to the discourse on the economics of AI and international collaboration, this study highlights the importance of thoughtful strategic planning to promote equitable and sustainable AI development worldwide.</p>","PeriodicalId":13610,"journal":{"name":"Information Systems Frontiers","volume":"82 1","pages":""},"PeriodicalIF":5.9,"publicationDate":"2024-06-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141444792","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Modelling forest fire dynamics using conditional variational autoencoders 利用条件变异自动编码器建立林火动态模型
IF 5.9 3区 管理学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-06-24 DOI: 10.1007/s10796-024-10507-9
Tiago Filipe Rodrigues Ribeiro, Fernando José Mateus da Silva, Rogério Luís de Carvalho Costa

Forest fires have far-reaching consequences, threatening human life, economic stability, and the environment. Understanding the dynamics of forest fires is crucial, especially in high-incidence regions. In this work, we apply deep networks to simulate the spatiotemporal progression of the area burnt in a forest fire. We tackle the region interpolation problem challenge by using a Conditional Variational Autoencoder (CVAE) model and generate in-between representations on the evolution of the burnt area. We also apply a CVAE model to forecast the progression of fire propagation, estimating the burnt area at distinct horizons and propagation stages. We evaluate our approach against other established techniques using real-world data. The results demonstrate that our method is competitive in geometric similarity metrics and exhibits superior temporal consistency for in-between representation generation. In the context of burnt area forecasting, our approach achieves scores of 90% for similarity and 99% for temporal consistency. These findings suggest that CVAE models may be a viable alternative for modeling the spatiotemporal evolution of 2D moving regions of forest fire evolution.

森林火灾影响深远,威胁人类生命、经济稳定和环境。了解森林火灾的动态至关重要,尤其是在火灾高发地区。在这项工作中,我们应用深度网络来模拟森林火灾中烧毁面积的时空进展。我们通过使用条件变异自动编码器(CVAE)模型来解决区域插值问题的挑战,并生成关于燃烧区域演变的中间表征。我们还应用 CVAE 模型预测火灾的蔓延过程,估计不同地平线和蔓延阶段的烧毁面积。我们使用真实世界的数据对我们的方法与其他成熟技术进行了评估。结果表明,我们的方法在几何相似度指标方面具有竞争力,在生成中间表示时表现出卓越的时间一致性。在烧伤面积预测方面,我们的方法在相似性方面达到了 90%,在时间一致性方面达到了 99%。这些研究结果表明,CVAE 模型可能是林火演化过程中二维移动区域时空演变建模的可行替代方法。
{"title":"Modelling forest fire dynamics using conditional variational autoencoders","authors":"Tiago Filipe Rodrigues Ribeiro, Fernando José Mateus da Silva, Rogério Luís de Carvalho Costa","doi":"10.1007/s10796-024-10507-9","DOIUrl":"https://doi.org/10.1007/s10796-024-10507-9","url":null,"abstract":"<p>Forest fires have far-reaching consequences, threatening human life, economic stability, and the environment. Understanding the dynamics of forest fires is crucial, especially in high-incidence regions. In this work, we apply deep networks to simulate the spatiotemporal progression of the area burnt in a forest fire. We tackle the region interpolation problem challenge by using a Conditional Variational Autoencoder (CVAE) model and generate in-between representations on the evolution of the burnt area. We also apply a CVAE model to forecast the progression of fire propagation, estimating the burnt area at distinct horizons and propagation stages. We evaluate our approach against other established techniques using real-world data. The results demonstrate that our method is competitive in geometric similarity metrics and exhibits superior temporal consistency for in-between representation generation. In the context of burnt area forecasting, our approach achieves scores of 90% for similarity and 99% for temporal consistency. These findings suggest that CVAE models may be a viable alternative for modeling the spatiotemporal evolution of 2D moving regions of forest fire evolution.</p>","PeriodicalId":13610,"journal":{"name":"Information Systems Frontiers","volume":"54 1","pages":""},"PeriodicalIF":5.9,"publicationDate":"2024-06-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141444896","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Value of Original and Generated Ultrasound Data Towards Training Robust Classifiers for Breast Cancer Identification 原始和生成的超声波数据对训练用于乳腺癌鉴定的鲁棒分类器的价值
IF 5.9 3区 管理学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-06-12 DOI: 10.1007/s10796-024-10499-6
Bianca-Ştefania Munteanu, Alexandra Murariu, Mǎrioara Nichitean, Luminiţa-Gabriela Pitac, Laura Dioşan

Breast cancer represents one of the leading causes of death among women, with 1 in 39 (around 2.5%) of them losing their lives annually, at the global level. According to the American Cancer Society, it is the second most lethal type of cancer in females, preceded only by lung cancer. Early diagnosis is crucial in increasing the chances of survival. In recent years, the incidence rate has increased by 0.5% per year, with 1 in 8 women at increased risk of developing a tumor during their life. Despite technological advances, there are still difficulties in identifying, characterizing, and accurately monitoring malignant tumors. The main focus of this article is on the computerized diagnosis of breast cancer. The main objective is to solve this problem using intelligent algorithms, that are built with artificial neural networks and involve 3 important steps: augmentation, segmentation, and classification. The experiment was made using a publicly available dataset that contains medical ultrasound images, collected from approximately 600 female patients (it is considered a benchmark). The results of the experiment are close to the goal set by our team. The final accuracy obtained is 86%.

乳腺癌是导致女性死亡的主要原因之一,在全球范围内,每年每 39 名女性中就有 1 人(约 2.5%)死于乳腺癌。根据美国癌症协会的数据,乳腺癌是女性第二大致命癌症,仅次于肺癌。早期诊断是增加生存机会的关键。近年来,发病率每年增加 0.5%,每 8 名女性中就有 1 人在一生中罹患肿瘤的风险增加。尽管技术在不断进步,但在识别、描述和准确监测恶性肿瘤方面仍存在困难。本文的重点是乳腺癌的计算机诊断。其主要目的是利用人工神经网络构建的智能算法来解决这一问题,其中涉及三个重要步骤:增强、分割和分类。实验使用了一个公开的数据集,该数据集包含从大约 600 名女性患者那里收集的医学超声波图像(被视为一个基准)。实验结果接近我们团队设定的目标。最终获得的准确率为 86%。
{"title":"Value of Original and Generated Ultrasound Data Towards Training Robust Classifiers for Breast Cancer Identification","authors":"Bianca-Ştefania Munteanu, Alexandra Murariu, Mǎrioara Nichitean, Luminiţa-Gabriela Pitac, Laura Dioşan","doi":"10.1007/s10796-024-10499-6","DOIUrl":"https://doi.org/10.1007/s10796-024-10499-6","url":null,"abstract":"<p>Breast cancer represents one of the leading causes of death among women, with 1 in 39 (around 2.5%) of them losing their lives annually, at the global level. According to the American Cancer Society, it is the second most lethal type of cancer in females, preceded only by lung cancer. Early diagnosis is crucial in increasing the chances of survival. In recent years, the incidence rate has increased by 0.5% per year, with 1 in 8 women at increased risk of developing a tumor during their life. Despite technological advances, there are still difficulties in identifying, characterizing, and accurately monitoring malignant tumors. The main focus of this article is on the computerized diagnosis of breast cancer. The main objective is to solve this problem using intelligent algorithms, that are built with artificial neural networks and involve 3 important steps: augmentation, segmentation, and classification. The experiment was made using a publicly available dataset that contains medical ultrasound images, collected from approximately 600 female patients (it is considered a benchmark). The results of the experiment are close to the goal set by our team. The final accuracy obtained is 86%.</p>","PeriodicalId":13610,"journal":{"name":"Information Systems Frontiers","volume":"1 1","pages":""},"PeriodicalIF":5.9,"publicationDate":"2024-06-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141309086","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Consumers’ Financial Distress: Prediction and Prescription Using Interpretable Machine Learning 消费者的财务困境:利用可解释的机器学习进行预测和开药方
IF 5.9 3区 管理学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-06-11 DOI: 10.1007/s10796-024-10501-1
Hendrik de Waal, Serge Nyawa, Samuel Fosso Wamba

This paper shows how transactional bank account data can be used to predict and to prevent financial distress in consumers. Machine learning methods were used to identify the most significant transactional behaviours that cause financial distress. We show that Random Forest outperforms the other machine learning models when predicting the financial distress of a consumer. We obtain that Fees and Interest paid stand out as primary contributors of financial distress, emphasizing the significance of financial charges and interest payments in gauging individuals’ financial vulnerability. Using Local Interpretable Model-agnostic Explanations, we study the marginal effect of transactional behaviours on the probability of being in financial distress and assess how different variables selected across all the data point selection sets influence each case. We also propose prescriptions that can be communicated to the client to help the individual improve their financial wellbeing. This research used data from a major South African bank.

本文展示了如何利用银行账户交易数据来预测和预防消费者的财务困境。我们使用机器学习方法来识别导致财务困境的最重要的交易行为。我们发现,在预测消费者的财务困境时,随机森林模型优于其他机器学习模型。我们发现,费用和利息支付是造成财务困境的主要因素,这强调了财务费用和利息支付在衡量个人财务脆弱性方面的重要性。我们利用 "本地可解释模型"(Local Interpretable Model-agnostic Explanations)研究了交易行为对陷入财务困境概率的边际效应,并评估了在所有数据点选择集中选择的不同变量对每种情况的影响。我们还提出了可向客户传达的处方,以帮助个人改善财务状况。本研究使用了南非一家大型银行的数据。
{"title":"Consumers’ Financial Distress: Prediction and Prescription Using Interpretable Machine Learning","authors":"Hendrik de Waal, Serge Nyawa, Samuel Fosso Wamba","doi":"10.1007/s10796-024-10501-1","DOIUrl":"https://doi.org/10.1007/s10796-024-10501-1","url":null,"abstract":"<p>This paper shows how transactional bank account data can be used to predict and to prevent financial distress in consumers. Machine learning methods were used to identify the most significant transactional behaviours that cause financial distress. We show that Random Forest outperforms the other machine learning models when predicting the financial distress of a consumer. We obtain that Fees and Interest paid stand out as primary contributors of financial distress, emphasizing the significance of financial charges and interest payments in gauging individuals’ financial vulnerability. Using Local Interpretable Model-agnostic Explanations, we study the marginal effect of transactional behaviours on the probability of being in financial distress and assess how different variables selected across all the data point selection sets influence each case. We also propose prescriptions that can be communicated to the client to help the individual improve their financial wellbeing. This research used data from a major South African bank.</p>","PeriodicalId":13610,"journal":{"name":"Information Systems Frontiers","volume":"53 1","pages":""},"PeriodicalIF":5.9,"publicationDate":"2024-06-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141304342","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Mixed-Integer Formulation for the Simultaneous Input Selection and Outlier Filtering in Soft Sensor Training 软传感器训练中同时进行输入选择和离群值过滤的混合整数公式
IF 5.9 3区 管理学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-06-07 DOI: 10.1007/s10796-024-10492-z
Hasan Sildir, Onur Can Boy, Sahin Sarrafi

Soft sensors are used to calculate the real-time values of process variables which can be measured in the laboratory only or require expensive online measurement tools. A set of mathematical expressions are developed and trained from historical data to exploit the statistical knowledge between online and offline measurements to ensure a reliable prediction performance, for optimization and control purposes. This study focuses on the development of a mixed-integer optimization problem to perform input selection and outlier filtering simultaneously using rigorous algorithms during the training procedure, unlike traditional heuristic and sequential methods. Nonlinearities and nonconvexities in the optimization problem is further tailored for global optimality and computational advancements by reformulations and piecewise linearizations to address the complexity of the task with additional binary variables, representing the selection of a particular input or data. The proposed approach is implemented on actual data from two different industrial plants and compared to traditional approach.

软传感器用于计算只能在实验室测量或需要昂贵的在线测量工具的过程变量的实时值。根据历史数据开发和训练了一套数学表达式,以利用在线和离线测量之间的统计知识,确保可靠的预测性能,从而达到优化和控制的目的。与传统的启发式和顺序式方法不同,本研究侧重于开发一个混合整数优化问题,在训练过程中使用严格的算法同时执行输入选择和离群值过滤。优化问题中的非线性和非凸性通过重构和分片线性化得到进一步调整,以实现全局最优和计算进步,从而利用额外的二进制变量来解决任务的复杂性,这些变量代表特定输入或数据的选择。所提出的方法在两个不同工业工厂的实际数据中得以实施,并与传统方法进行了比较。
{"title":"A Mixed-Integer Formulation for the Simultaneous Input Selection and Outlier Filtering in Soft Sensor Training","authors":"Hasan Sildir, Onur Can Boy, Sahin Sarrafi","doi":"10.1007/s10796-024-10492-z","DOIUrl":"https://doi.org/10.1007/s10796-024-10492-z","url":null,"abstract":"<p>Soft sensors are used to calculate the real-time values of process variables which can be measured in the laboratory only or require expensive online measurement tools. A set of mathematical expressions are developed and trained from historical data to exploit the statistical knowledge between online and offline measurements to ensure a reliable prediction performance, for optimization and control purposes. This study focuses on the development of a mixed-integer optimization problem to perform input selection and outlier filtering simultaneously using rigorous algorithms during the training procedure, unlike traditional heuristic and sequential methods. Nonlinearities and nonconvexities in the optimization problem is further tailored for global optimality and computational advancements by reformulations and piecewise linearizations to address the complexity of the task with additional binary variables, representing the selection of a particular input or data. The proposed approach is implemented on actual data from two different industrial plants and compared to traditional approach.</p>","PeriodicalId":13610,"journal":{"name":"Information Systems Frontiers","volume":"16 1","pages":""},"PeriodicalIF":5.9,"publicationDate":"2024-06-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141292746","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Trust, Risk, Privacy and Security in e-Government Use: Insights from a MASEM Analysis 电子政务使用中的信任、风险、隐私和安全:来自 MASEM 分析的启示
IF 5.9 3区 管理学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-06-05 DOI: 10.1007/s10796-024-10497-8
Parul Gupta, Apeksha Hooda, Anand Jeyaraj, Jonathan J.M. Seddon, Yogesh K. Dwivedi

Despite considerable research on the factors influencing the use of e-government, citizens are apprehensive of e-government services due to the concerns primarily related to trust, risk, security and privacy. This study presents a meta-analytic structural equation modeling (MASEM) analysis of the findings reported by 68 prior empirical studies on e-government adoption. Specifically, the model examined the direct effects of trust in government, trust in internet, perceived risk, and perceived privacy and security on e-government trust, and its impact on users’ behavioral intention to use e-government. The findings bear significant theoretical and practical implications.

尽管对影响电子政务使用的因素进行了大量研究,但由于主要涉及信任、风险、安全和隐私,公民对电子政务服务心存顾虑。本研究对之前 68 项关于电子政务应用的实证研究报告结果进行了元分析结构方程模型(MASEM)分析。具体而言,该模型考察了对政府的信任、对互联网的信任、感知风险、感知隐私和安全对电子政务信任的直接影响,及其对用户使用电子政务行为意向的影响。研究结果具有重要的理论和实践意义。
{"title":"Trust, Risk, Privacy and Security in e-Government Use: Insights from a MASEM Analysis","authors":"Parul Gupta, Apeksha Hooda, Anand Jeyaraj, Jonathan J.M. Seddon, Yogesh K. Dwivedi","doi":"10.1007/s10796-024-10497-8","DOIUrl":"https://doi.org/10.1007/s10796-024-10497-8","url":null,"abstract":"<p>Despite considerable research on the factors influencing the use of e-government, citizens are apprehensive of e-government services due to the concerns primarily related to trust, risk, security and privacy. This study presents a meta-analytic structural equation modeling (MASEM) analysis of the findings reported by 68 prior empirical studies on e-government adoption. Specifically, the model examined the direct effects of trust in government, trust in internet, perceived risk, and perceived privacy and security on e-government trust, and its impact on users’ behavioral intention to use e-government. The findings bear significant theoretical and practical implications.</p>","PeriodicalId":13610,"journal":{"name":"Information Systems Frontiers","volume":"124 1","pages":""},"PeriodicalIF":5.9,"publicationDate":"2024-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141251597","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Efficiently Labeling and Retrieving Temporal Anomalies in Relational Databases 高效标记和检索关系数据库中的时间异常点
IF 5.9 3区 管理学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-05-31 DOI: 10.1007/s10796-024-10495-w
Christina Khnaisser, Hind Hamrouni, David B. Blumenthal, Anton Dignös, Johann Gamper

Time and temporal constraints are implicit in most databases. To facilitate data analysis and quality assessment, a database should provide explicit operations to identify the violation of temporal constraints. Against this background, the purpose of this paper is threefold: (1) we identify and provide a formal definition of five common anomalies in temporal databases, (2) we propose two new relational operations that allow, respectively, to label anomalous tuples in and to retrieve the anomalous tuples from a dataset, and (3) we provide three different SQL implementations of these operations for current relational database management systems. The healthcare domain is used to illustrate the usage and utility of the temporal anomalies. Finally, an experimental evaluation on real-world and synthetic data analyses the performance of the different implementations of the anomaly operators.

时间和时间限制在大多数数据库中都是隐含的。为便于数据分析和质量评估,数据库应提供明确的操作来识别违反时间约束的情况。在此背景下,本文的目的有三:(1) 我们识别并提供了时态数据库中五种常见异常的正式定义;(2) 我们提出了两种新的关系操作,分别允许在数据集中标记异常图元和从数据集中检索异常图元;(3) 我们为当前的关系数据库管理系统提供了这些操作的三种不同 SQL 实现。我们将利用医疗保健领域来说明时态异常的用法和效用。最后,在真实世界和合成数据上进行的实验评估分析了异常运算符不同实现方式的性能。
{"title":"Efficiently Labeling and Retrieving Temporal Anomalies in Relational Databases","authors":"Christina Khnaisser, Hind Hamrouni, David B. Blumenthal, Anton Dignös, Johann Gamper","doi":"10.1007/s10796-024-10495-w","DOIUrl":"https://doi.org/10.1007/s10796-024-10495-w","url":null,"abstract":"<p>Time and temporal constraints are implicit in most databases. To facilitate data analysis and quality assessment, a database should provide explicit operations to identify the violation of temporal constraints. Against this background, the purpose of this paper is threefold: (1) we identify and provide a formal definition of five common anomalies in temporal databases, (2) we propose two new relational operations that allow, respectively, to label anomalous tuples in and to retrieve the anomalous tuples from a dataset, and (3) we provide three different SQL implementations of these operations for current relational database management systems. The healthcare domain is used to illustrate the usage and utility of the temporal anomalies. Finally, an experimental evaluation on real-world and synthetic data analyses the performance of the different implementations of the anomaly operators.</p>","PeriodicalId":13610,"journal":{"name":"Information Systems Frontiers","volume":"318 1","pages":""},"PeriodicalIF":5.9,"publicationDate":"2024-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141182387","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Big Data Analytics Adoption in Manufacturing Companies: The Contingent Role of Data-Driven Culture 制造企业对大数据分析的采用:数据驱动文化的权变作用
IF 5.9 3区 管理学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-05-27 DOI: 10.1007/s10796-024-10491-0
Priveena Thanabalan, Ali Vafaei-Zadeh, Haniruzila Hanifah, T. Ramayah

The objective of this paper is to investigate the factors that influence the adoption of Big Data Analytics (BDA) in manufacturing companies and examine the impact of BDA adoption on performance, while also considering the moderating effect of data-driven culture. An online questionnaire survey was conducted with medium and large manufacturing companies in Malaysia, resulting in a total of 267 responses collected through non-probability purposive sampling. The results show that technology complexity, perceived relative advantage, top management support, IT infrastructure and capabilities, normative pressure, and mimetic pressure are significant determinants of BDA adoption. Moreover, the adoption of BDA has a positive impact on financial and market performance, with data-driven culture moderating the relationship between BDA adoption and financial performance. This study highlights the critical factors that contribute to BDA adoption and its outcomes, providing manufacturing companies with awareness on this topic.

本文旨在调查影响制造企业采用大数据分析(BDA)的因素,并研究采用 BDA 对绩效的影响,同时考虑数据驱动文化的调节作用。我们对马来西亚的大中型制造企业进行了在线问卷调查,通过非概率目的性抽样,共收集到 267 份回复。结果显示,技术复杂性、感知相对优势、高层管理支持、IT 基础设施和能力、规范压力和模仿压力是采用 BDA 的重要决定因素。此外,采用 BDA 对财务和市场绩效有积极影响,而数据驱动文化则调节了采用 BDA 与财务绩效之间的关系。本研究强调了有助于采用 BDA 及其结果的关键因素,为制造企业提供了对这一主题的认识。
{"title":"Big Data Analytics Adoption in Manufacturing Companies: The Contingent Role of Data-Driven Culture","authors":"Priveena Thanabalan, Ali Vafaei-Zadeh, Haniruzila Hanifah, T. Ramayah","doi":"10.1007/s10796-024-10491-0","DOIUrl":"https://doi.org/10.1007/s10796-024-10491-0","url":null,"abstract":"<p>The objective of this paper is to investigate the factors that influence the adoption of Big Data Analytics (BDA) in manufacturing companies and examine the impact of BDA adoption on performance, while also considering the moderating effect of data-driven culture. An online questionnaire survey was conducted with medium and large manufacturing companies in Malaysia, resulting in a total of 267 responses collected through non-probability purposive sampling. The results show that technology complexity, perceived relative advantage, top management support, IT infrastructure and capabilities, normative pressure, and mimetic pressure are significant determinants of BDA adoption. Moreover, the adoption of BDA has a positive impact on financial and market performance, with data-driven culture moderating the relationship between BDA adoption and financial performance. This study highlights the critical factors that contribute to BDA adoption and its outcomes, providing manufacturing companies with awareness on this topic.</p>","PeriodicalId":13610,"journal":{"name":"Information Systems Frontiers","volume":"34 1","pages":""},"PeriodicalIF":5.9,"publicationDate":"2024-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141156682","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Information Systems Frontiers
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1