{"title":"基于集成学习的Hi-C数据分辨率改进方法","authors":"Zhaoheng Ai, Hao Wu","doi":"10.1109/ICAIS56108.2023.10073678","DOIUrl":null,"url":null,"abstract":"The multi-level spatial structure of chromosomes allows remote regulatory elements in the linear coordinate space to closely regulate the expression level of the target genes in the three-dimensional structural space, so, the efficient analysis will be essential. Especially, this paper focuses on the Hi-C data resolution improvement method based on ensemble learning. Hi-C data standardization is used to remove the systematic bias between samples introduced by the various unavoidable nonrandom factors, hence, the accuracy is essential. Therefore, this study utilizes the stacking integration model to achieve the ensemble task, the designed model can avoid the problems of low prediction accuracy and the poor model robustness. Similarly, the multi-objective regression evolved based on the idea of multi-label classification. After testing the designed model on the public data sets, the accuracy can reach more than 99%. Compared with the traditional tools, our designed algorithm reaches better results.","PeriodicalId":164345,"journal":{"name":"2023 Third International Conference on Artificial Intelligence and Smart Energy (ICAIS)","volume":"129 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-02-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Hi-C Data Resolution Improvement Method based on Ensemble Learning\",\"authors\":\"Zhaoheng Ai, Hao Wu\",\"doi\":\"10.1109/ICAIS56108.2023.10073678\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The multi-level spatial structure of chromosomes allows remote regulatory elements in the linear coordinate space to closely regulate the expression level of the target genes in the three-dimensional structural space, so, the efficient analysis will be essential. Especially, this paper focuses on the Hi-C data resolution improvement method based on ensemble learning. Hi-C data standardization is used to remove the systematic bias between samples introduced by the various unavoidable nonrandom factors, hence, the accuracy is essential. Therefore, this study utilizes the stacking integration model to achieve the ensemble task, the designed model can avoid the problems of low prediction accuracy and the poor model robustness. Similarly, the multi-objective regression evolved based on the idea of multi-label classification. After testing the designed model on the public data sets, the accuracy can reach more than 99%. Compared with the traditional tools, our designed algorithm reaches better results.\",\"PeriodicalId\":164345,\"journal\":{\"name\":\"2023 Third International Conference on Artificial Intelligence and Smart Energy (ICAIS)\",\"volume\":\"129 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-02-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 Third International Conference on Artificial Intelligence and Smart Energy (ICAIS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICAIS56108.2023.10073678\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 Third International Conference on Artificial Intelligence and Smart Energy (ICAIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICAIS56108.2023.10073678","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Hi-C Data Resolution Improvement Method based on Ensemble Learning
The multi-level spatial structure of chromosomes allows remote regulatory elements in the linear coordinate space to closely regulate the expression level of the target genes in the three-dimensional structural space, so, the efficient analysis will be essential. Especially, this paper focuses on the Hi-C data resolution improvement method based on ensemble learning. Hi-C data standardization is used to remove the systematic bias between samples introduced by the various unavoidable nonrandom factors, hence, the accuracy is essential. Therefore, this study utilizes the stacking integration model to achieve the ensemble task, the designed model can avoid the problems of low prediction accuracy and the poor model robustness. Similarly, the multi-objective regression evolved based on the idea of multi-label classification. After testing the designed model on the public data sets, the accuracy can reach more than 99%. Compared with the traditional tools, our designed algorithm reaches better results.