Pub Date : 2025-12-01Epub Date: 2025-02-11DOI: 10.1080/24725854.2024.2445776
Yu Ding, Virend K Somers, Bing Si
The increasing availability of health data from resources such as large biobanks, electronic healthcare records, medical tests, and wearable sensors, has set the stage for the development of novel machine learning (ML) models for multi-modal mixed-type data to capture the complexity of human health and disease. Clustering is a type of ML model that aims to identify homogenous subgroups from heterogeneous data, providing a data-driven solution to targeted, subgroup-specific study and intervention. While such data contain diverse and complementary information to facilitate decision making and improve population health, clustering of high-dimensional multi-modal mixed-type data poses major challenges to existing ML and statistical models. We propose a novel Multi-modal Mixed-type Structural Equation Model (M2-SEM) with structured sparsity to cluster heterogeneous health data for precise subgroup discovery. To accommodate a mix of continuous and categorical data modalities, we developed a novel Gauss-Hermite-enabled Expectation-Majorization-Minimization (GH-EMM) algorithm that integrates the GH quadrature and the Majorization Maximization (MM) algorithm within the Expectation Maximization (EM) framework for efficient model estimation. The proposed M2-SEM and GH-EMM are first tested in extensive simulation studies in comparison with benchmarks, and then applied to identify subgroups of individuals with low- and high-risk of developing adverse cardiometabolic (CM) outcomes based on a full spectrum of CM risk factors such as poor nutrition and mental health, physical inactivity, and sleep deprivation. These findings shed light on the promise of using multi-modal mixed-type health data for early identification and targeted intervention of at-risk individuals for health promotion at the population level.
{"title":"Multi-modal mixed-type structural equation modeling with structured sparsity for subgroup discovery from heterogeneous health data.","authors":"Yu Ding, Virend K Somers, Bing Si","doi":"10.1080/24725854.2024.2445776","DOIUrl":"10.1080/24725854.2024.2445776","url":null,"abstract":"<p><p>The increasing availability of health data from resources such as large biobanks, electronic healthcare records, medical tests, and wearable sensors, has set the stage for the development of novel machine learning (ML) models for multi-modal mixed-type data to capture the complexity of human health and disease. Clustering is a type of ML model that aims to identify homogenous subgroups from heterogeneous data, providing a data-driven solution to targeted, subgroup-specific study and intervention. While such data contain diverse and complementary information to facilitate decision making and improve population health, clustering of high-dimensional multi-modal mixed-type data poses major challenges to existing ML and statistical models. We propose a novel Multi-modal Mixed-type Structural Equation Model (M2-SEM) with structured sparsity to cluster heterogeneous health data for precise subgroup discovery. To accommodate a mix of continuous and categorical data modalities, we developed a novel Gauss-Hermite-enabled Expectation-Majorization-Minimization (GH-EMM) algorithm that integrates the GH quadrature and the Majorization Maximization (MM) algorithm within the Expectation Maximization (EM) framework for efficient model estimation. The proposed M2-SEM and GH-EMM are first tested in extensive simulation studies in comparison with benchmarks, and then applied to identify subgroups of individuals with low- and high-risk of developing adverse cardiometabolic (CM) outcomes based on a full spectrum of CM risk factors such as poor nutrition and mental health, physical inactivity, and sleep deprivation. These findings shed light on the promise of using multi-modal mixed-type health data for early identification and targeted intervention of at-risk individuals for health promotion at the population level.</p>","PeriodicalId":56039,"journal":{"name":"IISE Transactions","volume":"57 12","pages":"1497-1511"},"PeriodicalIF":2.3,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12519570/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145304660","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-12-04DOI: 10.1080/24725854.2024.2417258
Shiva Afshar, Yinghan Chen, Shizhong Han, Ying Lin
Combining multiple predictors obtained from distributed data sources to an accurate meta-learner is promising to achieve enhanced performance in lots of prediction problems. As the accuracy of each predictor is usually unknown, integrating the predictors to achieve better performance is challenging. Conventional ensemble learning methods assess the accuracy of predictors based on extensive labeled data. In practical applications, however, the acquisition of such labeled data can prove to be an arduous task. Furthermore, the predictors under consideration may exhibit high degrees of correlation, particularly when similar data sources or machine learning algorithms were employed during their model training. In response to these challenges, this paper introduces a novel structured unsupervised ensemble learning model (SUEL) to exploit the dependency between a set of predictors with continuous predictive scores, rank the predictors without labeled data and combine them to an ensembled score with weights. Two novel correlation-based decomposition algorithms are further proposed to estimate the SUEL model, constrained quadratic optimization (SUEL.CQO) and matrix-factorization-based (SUEL.MF) approaches. The efficacy of the proposed methods is rigorously assessed through both simulation studies and real-world application of risk genes discovery. The results compellingly demonstrate that the proposed methods can efficiently integrate the dependent predictors to an ensemble model without the need of ground truth data.
{"title":"Ranking and Combining Latent Structured Predictive Scores without Labeled Data.","authors":"Shiva Afshar, Yinghan Chen, Shizhong Han, Ying Lin","doi":"10.1080/24725854.2024.2417258","DOIUrl":"10.1080/24725854.2024.2417258","url":null,"abstract":"<p><p>Combining multiple predictors obtained from distributed data sources to an accurate meta-learner is promising to achieve enhanced performance in lots of prediction problems. As the accuracy of each predictor is usually unknown, integrating the predictors to achieve better performance is challenging. Conventional ensemble learning methods assess the accuracy of predictors based on extensive labeled data. In practical applications, however, the acquisition of such labeled data can prove to be an arduous task. Furthermore, the predictors under consideration may exhibit high degrees of correlation, particularly when similar data sources or machine learning algorithms were employed during their model training. In response to these challenges, this paper introduces a novel structured unsupervised ensemble learning model (SUEL) to exploit the dependency between a set of predictors with continuous predictive scores, rank the predictors without labeled data and combine them to an ensembled score with weights. Two novel correlation-based decomposition algorithms are further proposed to estimate the SUEL model, constrained quadratic optimization (SUEL.CQO) and matrix-factorization-based (SUEL.MF) approaches. The efficacy of the proposed methods is rigorously assessed through both simulation studies and real-world application of risk genes discovery. The results compellingly demonstrate that the proposed methods can efficiently integrate the dependent predictors to an ensemble model without the need of ground truth data.</p>","PeriodicalId":56039,"journal":{"name":"IISE Transactions","volume":" ","pages":""},"PeriodicalIF":2.3,"publicationDate":"2024-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12345621/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144979971","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-07-22DOI: 10.1080/24725854.2024.2381727
Daoheng Zhang, Hasan Hüseyin Turan, Ruhul Sarker, Daryl Essam
This comparative study, constituting Part B of our extensive investigation into robust optimization (RO) in inventory management, builds on the foundational insights from Part A’s survey. It conduc...
本比较研究是我们对库存管理中稳健优化(RO)广泛调查的 B 部分,它建立在 A 部分调查的基础上。它将对库存管理中的鲁棒性优化(RO)进行...
{"title":"Robust Optimization Approaches in Inventory Management: Part B - The Comparative Study","authors":"Daoheng Zhang, Hasan Hüseyin Turan, Ruhul Sarker, Daryl Essam","doi":"10.1080/24725854.2024.2381727","DOIUrl":"https://doi.org/10.1080/24725854.2024.2381727","url":null,"abstract":"This comparative study, constituting Part B of our extensive investigation into robust optimization (RO) in inventory management, builds on the foundational insights from Part A’s survey. It conduc...","PeriodicalId":56039,"journal":{"name":"IISE Transactions","volume":"73 1","pages":""},"PeriodicalIF":2.6,"publicationDate":"2024-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141770340","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-07-22DOI: 10.1080/24725854.2024.2381713
Daoheng Zhang, Hasan Hüseyin Turan, Ruhul Sarker, Daryl Essam
This work, the first part (Part A) of a comprehensive study, presents a survey on Robust Optimization (RO) in inventory management, highlighting its role in addressing uncertainties. This survey re...
{"title":"Robust Optimization Approaches in Inventory Management: Part A - The Survey","authors":"Daoheng Zhang, Hasan Hüseyin Turan, Ruhul Sarker, Daryl Essam","doi":"10.1080/24725854.2024.2381713","DOIUrl":"https://doi.org/10.1080/24725854.2024.2381713","url":null,"abstract":"This work, the first part (Part A) of a comprehensive study, presents a survey on Robust Optimization (RO) in inventory management, highlighting its role in addressing uncertainties. This survey re...","PeriodicalId":56039,"journal":{"name":"IISE Transactions","volume":"41 1","pages":""},"PeriodicalIF":2.6,"publicationDate":"2024-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141738022","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-07-09DOI: 10.1080/24725854.2024.2376650
Michael Biehler, Jing Li, Jianjun Shi
Traditional high-fidelity imaging techniques, such as X-ray computer tomography (CT), excel in capturing intricate shape details through high-resolution two-dimensional (2D) images. However, the ex...
传统的高保真成像技术,如 X 射线计算机断层扫描 (CT),擅长通过高分辨率二维 (2D) 图像捕捉复杂的形状细节。然而,二维图像的...
{"title":"FUSION3D: Multimodal Data Fusion for 3D Shape Reconstruction - A Soft Sensing Approach","authors":"Michael Biehler, Jing Li, Jianjun Shi","doi":"10.1080/24725854.2024.2376650","DOIUrl":"https://doi.org/10.1080/24725854.2024.2376650","url":null,"abstract":"Traditional high-fidelity imaging techniques, such as X-ray computer tomography (CT), excel in capturing intricate shape details through high-resolution two-dimensional (2D) images. However, the ex...","PeriodicalId":56039,"journal":{"name":"IISE Transactions","volume":"36 1","pages":""},"PeriodicalIF":2.6,"publicationDate":"2024-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141610817","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-06-18DOI: 10.1080/24725854.2024.2367224
Xianjian Xie, Xiaochen Xian, Dan Li, Andi Wang
The Internet of Federated Things (IoFT) represents a network of interconnected systems with federated learning as the backbone, facilitating collaborative knowledge acquisition while ensuring data ...
联合物联网(IoFT)是一个以联合学习为骨干的互联系统网络,在促进协作知识获取的同时确保数据...
{"title":"Adversarial Client Detection via Non-parametric Subspace Monitoring in the Internet of Federated Things","authors":"Xianjian Xie, Xiaochen Xian, Dan Li, Andi Wang","doi":"10.1080/24725854.2024.2367224","DOIUrl":"https://doi.org/10.1080/24725854.2024.2367224","url":null,"abstract":"The Internet of Federated Things (IoFT) represents a network of interconnected systems with federated learning as the backbone, facilitating collaborative knowledge acquisition while ensuring data ...","PeriodicalId":56039,"journal":{"name":"IISE Transactions","volume":"30 1","pages":""},"PeriodicalIF":2.6,"publicationDate":"2024-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141510662","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-06-05DOI: 10.1080/24725854.2024.2363316
Noam Goldberg, Mark Langer, Shimrit Shtern
Radiotherapy treatment planning is a challenging large-scale optimization problem plagued by uncertainty. Following the robust optimization methodology, we propose a novel, spatially based uncertai...
{"title":"Robust Radiotherapy Planning with Spatially Based Uncertainty Sets","authors":"Noam Goldberg, Mark Langer, Shimrit Shtern","doi":"10.1080/24725854.2024.2363316","DOIUrl":"https://doi.org/10.1080/24725854.2024.2363316","url":null,"abstract":"Radiotherapy treatment planning is a challenging large-scale optimization problem plagued by uncertainty. Following the robust optimization methodology, we propose a novel, spatially based uncertai...","PeriodicalId":56039,"journal":{"name":"IISE Transactions","volume":"99 1","pages":""},"PeriodicalIF":2.6,"publicationDate":"2024-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141510663","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-05-30DOI: 10.1080/24725854.2024.2361460
Zhen-Yu Chen, Minghe Sun
Supply chain members can intelligently learn their decisions based on historical data by using machine-learning (ML) algorithms. To coordinate the supply chain, the data-driven contract design prob...
{"title":"Data-driven Contract Design for Supply Chain Coordination with Algorithm Sharing and Algorithm Competition","authors":"Zhen-Yu Chen, Minghe Sun","doi":"10.1080/24725854.2024.2361460","DOIUrl":"https://doi.org/10.1080/24725854.2024.2361460","url":null,"abstract":"Supply chain members can intelligently learn their decisions based on historical data by using machine-learning (ML) algorithms. To coordinate the supply chain, the data-driven contract design prob...","PeriodicalId":56039,"journal":{"name":"IISE Transactions","volume":"26 1","pages":""},"PeriodicalIF":2.6,"publicationDate":"2024-05-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141550981","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-05-29DOI: 10.1080/24725854.2024.2360619
Madi Arabi, Xiaolei Fang
Most prognostic methods require a decent amount of data for model training. In reality, however, the amount of historical data owned by a single organization might be small or not large enough to t...
{"title":"A Federated Data Fusion-Based Prognostic Model for Applications with Multi-Stream Incomplete Signals","authors":"Madi Arabi, Xiaolei Fang","doi":"10.1080/24725854.2024.2360619","DOIUrl":"https://doi.org/10.1080/24725854.2024.2360619","url":null,"abstract":"Most prognostic methods require a decent amount of data for model training. In reality, however, the amount of historical data owned by a single organization might be small or not large enough to t...","PeriodicalId":56039,"journal":{"name":"IISE Transactions","volume":"94 1","pages":""},"PeriodicalIF":2.6,"publicationDate":"2024-05-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141510664","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-05-29DOI: 10.1080/24725854.2024.2359991
Sumin Shen, Zhiyang Zhang, Ran Jin, Xinwei Deng
In various data science applications, the relationship between predictor variables and the response is dynamic in the sense that the corresponding model parameters are varying coefficients. Estimat...
在各种数据科学应用中,预测变量与响应之间的关系是动态的,即相应的模型参数是变化的系数。估计...
{"title":"Efficient Estimation and Selection for Regularized Dynamic Logistic Regression","authors":"Sumin Shen, Zhiyang Zhang, Ran Jin, Xinwei Deng","doi":"10.1080/24725854.2024.2359991","DOIUrl":"https://doi.org/10.1080/24725854.2024.2359991","url":null,"abstract":"In various data science applications, the relationship between predictor variables and the response is dynamic in the sense that the corresponding model parameters are varying coefficients. Estimat...","PeriodicalId":56039,"journal":{"name":"IISE Transactions","volume":"322 1","pages":""},"PeriodicalIF":2.6,"publicationDate":"2024-05-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141510665","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}