Marcin Czajkowski, Krzysztof Jurczuk, Marek Kretowski
{"title":"利用进化多测试树和相对表达增强组学数据分析的透明性","authors":"Marcin Czajkowski, Krzysztof Jurczuk, Marek Kretowski","doi":"10.1016/j.eswa.2025.127131","DOIUrl":null,"url":null,"abstract":"<div><div>This paper presents a novel classification algorithm for multi-omics data, called Evolutionary Multi-Test Tree with Relative Expression (EMTTree+RX). The innovation lies in the model’s design, which integrates multi-test decision nodes with Relative Expression Analysis (RXA). Each decision node combines traditional univariate tests and top-scoring pair (TSP) comparisons, allowing the algorithm to capture complex relationships between features without relying solely on absolute values. This approach enables the proposed method to detect subtle patterns across various omics layers while maintaining a high level of interpretability, a feature crucial for clinical and bioinformatics applications. The tree structure is induced through Evolutionary Algorithms (EA), optimizing both the global architecture and local multi-test nodes to balance classification accuracy, test diversity, and feature cost. Applied to large-scale multi-omics datasets, where conventional decision tree methods often struggle with underfitting or overfitting, the proposed method consistently outperforms traditional models in terms of accuracy and transparency. This makes it a valuable tool for precision medicine and multi-modal data integration.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"276 ","pages":"Article 127131"},"PeriodicalIF":7.5000,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Enhancing transparency of omics data analysis with the Evolutionary Multi-Test Tree and Relative Expression\",\"authors\":\"Marcin Czajkowski, Krzysztof Jurczuk, Marek Kretowski\",\"doi\":\"10.1016/j.eswa.2025.127131\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>This paper presents a novel classification algorithm for multi-omics data, called Evolutionary Multi-Test Tree with Relative Expression (EMTTree+RX). The innovation lies in the model’s design, which integrates multi-test decision nodes with Relative Expression Analysis (RXA). Each decision node combines traditional univariate tests and top-scoring pair (TSP) comparisons, allowing the algorithm to capture complex relationships between features without relying solely on absolute values. This approach enables the proposed method to detect subtle patterns across various omics layers while maintaining a high level of interpretability, a feature crucial for clinical and bioinformatics applications. The tree structure is induced through Evolutionary Algorithms (EA), optimizing both the global architecture and local multi-test nodes to balance classification accuracy, test diversity, and feature cost. Applied to large-scale multi-omics datasets, where conventional decision tree methods often struggle with underfitting or overfitting, the proposed method consistently outperforms traditional models in terms of accuracy and transparency. This makes it a valuable tool for precision medicine and multi-modal data integration.</div></div>\",\"PeriodicalId\":50461,\"journal\":{\"name\":\"Expert Systems with Applications\",\"volume\":\"276 \",\"pages\":\"Article 127131\"},\"PeriodicalIF\":7.5000,\"publicationDate\":\"2025-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Expert Systems with Applications\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0957417425007535\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/3/10 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems with Applications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0957417425007535","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/3/10 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
摘要
本文提出了一种新的多组学数据分类算法——emtree +RX (Evolutionary Multi-Test Tree with Relative Expression)。创新之处在于模型的设计,将多测试决策节点与相对表达分析(Relative Expression Analysis, RXA)相结合。每个决策节点结合了传统的单变量测试和最高得分对(TSP)比较,允许算法捕捉特征之间的复杂关系,而不依赖于绝对值。这种方法使所提出的方法能够检测跨不同组学层的细微模式,同时保持高水平的可解释性,这是临床和生物信息学应用的关键特征。通过进化算法(EA)生成树结构,优化全局架构和局部多测试节点,以平衡分类精度、测试多样性和特征成本。应用于大规模多组学数据集,传统的决策树方法经常与欠拟合或过拟合作斗争,该方法在准确性和透明度方面始终优于传统模型。这使其成为精准医疗和多模态数据集成的重要工具。
Enhancing transparency of omics data analysis with the Evolutionary Multi-Test Tree and Relative Expression
This paper presents a novel classification algorithm for multi-omics data, called Evolutionary Multi-Test Tree with Relative Expression (EMTTree+RX). The innovation lies in the model’s design, which integrates multi-test decision nodes with Relative Expression Analysis (RXA). Each decision node combines traditional univariate tests and top-scoring pair (TSP) comparisons, allowing the algorithm to capture complex relationships between features without relying solely on absolute values. This approach enables the proposed method to detect subtle patterns across various omics layers while maintaining a high level of interpretability, a feature crucial for clinical and bioinformatics applications. The tree structure is induced through Evolutionary Algorithms (EA), optimizing both the global architecture and local multi-test nodes to balance classification accuracy, test diversity, and feature cost. Applied to large-scale multi-omics datasets, where conventional decision tree methods often struggle with underfitting or overfitting, the proposed method consistently outperforms traditional models in terms of accuracy and transparency. This makes it a valuable tool for precision medicine and multi-modal data integration.
期刊介绍:
Expert Systems With Applications is an international journal dedicated to the exchange of information on expert and intelligent systems used globally in industry, government, and universities. The journal emphasizes original papers covering the design, development, testing, implementation, and management of these systems, offering practical guidelines. It spans various sectors such as finance, engineering, marketing, law, project management, information management, medicine, and more. The journal also welcomes papers on multi-agent systems, knowledge management, neural networks, knowledge discovery, data mining, and other related areas, excluding applications to military/defense systems.